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PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

FIELD OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides encoded thereby. 

5 BACKGROUND OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides encoded therefrom. 
More specifically, the invention relates to nucleic acids encoding cytoplasmic, nuclear, 
membrane bound, and secreted polypeptides, as well as vectors, host cells, antibodies, and 
recombinant methods for producing these nucleic acids and polypeptides. 

10 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences encoding 
novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as NOVX, 
or NOV1-99 nucleic acids and polypeptides. These nucleit acids and polypeptides, as well as 

1 5 derivatives, homologs, analogs and fragments thereof, will hereinafter be collectively 
designated as "NOVX" nucleic acid or polypeptide sequences. 

In one aspect, the invention provides an isolated NOVX nucleic acid molecule 
encoding a NOVX polypeptide that includes a nucleic acid sequence that has identity to the 
nucleic acids disclosed in SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, . In 

20 some embodiments, the NOVX nucleic acid molecule will hybridize under stringent 

conditions to a nucleic acid sequence complementary to a nucleic acid molecule that includes 
a protein-coding sequence of a NOVX nucleic acid sequence. The invention also includes an 
isolated nucleic acid that encodes a NOVX polypeptide, or a fragment, homolog, analog or 
derivative thereof For example, the nucleic acid can encode a polypeptide at least 80% 

25 identical to a polypeptide comprising the amino acid sequences of SEQ ID NOS:2n, wherein n 
is an integer between 1 and 162. The nucleic acid can be, for example, a genomic DNA 
fragment or a cDNA molecule that includes the nucleic acid sequence of any of SEQ ID 
NOS:2n-l , wherein n is an integer between 1 and 162. 

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which 

30 includes at least 6 contiguous nucleotides of a NOVX nucleic acid (e.g., SEQ ID NOS:2n-l, 
wherein n is an integer between 1 and 162) or a complement of said oligonucleotide. 
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Also included in the invention are substantially purified NOVX polypeptides (SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162). In certain embodiments, the NOVX 
polypeptides include an amino acid sequence that is substantially identical to the amino acid 
sequence of a human NOVX polypeptide. 
5 The invention also features antibodies that immunoselectively bind to NOVX 

polypeptides, or fragments, homologs, analogs or derivatives thereof. 

In another aspect, the invention includes pharmaceutical compositions that include 
therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically- 
acceptable carrier. The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, 
10 or an antibody specific for a NOVX polypeptide. In a further aspect, the invention includes, in 
one or more containers, a therapeutically- or prophylactically-effective amount of this 
pharmaceutical composition. 

In a further aspect, the invention includes a method of producing a polypeptide by 
culturing a cell that includes a NOVX nucleic acid, under conditions allowing for expression 
15 of the NOVX polypeptide encoded by the DNA. If desired, the NOVX polypeptide can then 
be recovered. 

In another aspect, the invention includes a method of detecting the presence of a 
NOVX polypeptide in a sample. In the method, a sample is contacted with a compound that 
selectively binds to the polypeptide under conditions allowing for formation of a complex 
20 between the polypeptide and the compound. The complex is detected, if present, thereby 
identifying the NOVX polypeptide within the sample. 

The invention also includes methods to identify specific cell or tissue types based on 
their expression of a NOVX. 

Also included in the invention is a method of detecting the presence of a NOVX 
25 nucleic acid molecule in a sample by contacting the sample with a NOVX nucleic acid probe 
or primer, and detecting whether the nucleic acid probe or primer bound to a NOVX nucleic 
acid molecule in the sample. 

In a further aspect, the invention provides a method for modulating the activity of a 
NOVX polypeptide by contacting a cell sample that includes the NOVX polypeptide with a 
30 compound that binds to the NOVX polypeptide in an amount sufficient to modulate the 

activity of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic 
acid, peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon 
containing) or inorganic molecule, as further described herein. 

2 
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Also within the scope of the invention is the use of a therapeutic in the manufacture of 
a medicament for treating or preventing various disorders or syndromes described below. 

The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, or a NOVX- 
specific antibody, or biologically-active derivatives or fragments thereof. 
5 For example, the compositions of the present invention will have efficacy for treatment 

of patients suffering from the diseases and disorders disclosed above and/or other pathologies 
and disorders of the like. The polypeptides can be used as immunogens to produce antibodies 
specific for the invention, and as vaccines. They can also be used to screen for potential 
agonist and antagonist compounds. For example, a cDNA encoding NOVX may be useful in 

10 gene therapy, and NOVX may be useful when administered to a subject in need thereof. By 
way of non-limiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from the diseases and disorders disclosed above and/or other 
pathologies and disorders of the like. 

The invention further includes a method for screening for a modulator of disorders or 

15 syndromes including, e.g., the diseases and disorders disclosed above and/or other pathologies 
and disorders of the like. The method includes contacting a test compound with a NOVX 
polypeptide and determining if the test compound binds to said NOVX polypeptide. Binding 
of the test compound to the NOVX polypeptide indicates the test compound is a modulator of 
activity, or of latency or predisposition to the aforementioned disorders or syndromes. 

20 Also within the scope of the invention is a method for screening for a modulator of 

activity, or of latency or predisposition to disorders or syndromes including, e.g., the diseases 
and disorders disclosed above and/or other pathologies and disorders of the like by 
administering a test compound to a test animal at increased risk for the aforementioned 
disorders or syndromes. The test animal expresses a recombinant polypeptide encoded by a 

25 NOVX nucleic acid. Expression or activity of NOVX polypeptide is then measured in the test 
animal, as is expression or activity of the protein in a control animal which recombinantly- 
expresses NOVX polypeptide and is not at increased risk for the disorder or syndrome. Next, 
the expression of NOVX polypeptide in both the test animal and the control animal is 
compared. A change in the activity of NOVX polypeptide in the test animal relative to the 

30 control animal indicates the test compound is a modulator of latency of the disorder or 
syndrome. 

In yet another aspect, the invention includes a method for determining the presence of 
or predisposition to a disease associated with altered levels of a NOVX polypeptide, a NOVX 
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the 
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amount of the NOVX polypeptide in a test sample from the subject and comparing the amount 
of the polypeptide in the test sample to the amount of the NOVX polypeptide present in a 
control sample. An alteration in the level of the NOVX polypeptide in the test sample as 
compared to the control sample indicates the presence of or predisposition to a disease in the 
5 subject. Preferably, the predisposition includes, e.g., the diseases and disorders disclosed 

above and/or other pathologies and disorders of the like. Also, the expression levels of the new 
polypeptides of the invention can be used in a method to screen for various cancers as well as 
to determine the stage of cancers. 

In a further aspect, the invention includes a method of treating or preventing a 

10 pathological condition associated with a disorder in a mammal by administering to the subject 
a NOVX polypeptide, a NOVX nucleic acid, or a NOVX-specific antibody to a subject (e.g., a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.g., the diseases and disorders disclosed above 
and/or other pathologies and disorders of the like. 

15 In yet another aspect, the invention can be used in a method to identity the cellular 

receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
affinity purification, co-precipitation with antibodies or other specific-interacting molecules. 
Unless otherwise defined, all technical and scientific terms used herein have the same 

20 meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or equivalent to those described herein can 
be used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references . 
mentioned herein are incorporated by reference in their entirety. In the case of conflict, the 

25 present specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 



30 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel nucleotides and polypeptides encoded thereby. 
Included in the invention are the novel nucleic acid sequences and their encoded polypeptides. 
The sequences are collectively referred to herein as "NOVX nucleic acids" or "NOVX 

4 
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polynucleotides" and the corresponding encoded polypeptides are referred to as "NOVX 
polypeptides" or "NOVX proteins." Unless indicated otherwise, "NOVX" is meant to refer to 
any of the novel sequences disclosed herein. Table A provides a summary of the NOVX 
nucleic acids and their encoded polypeptides. 
5 TABLE A. Sequences and Corresponding SEQ ID Numbers 



NOVX 
Assignment 


Internal Identification 


SEQ ID 

NO 
(nucleic 

acta) 


SEO ID NO 
(polypeptide) 


Homology 


la 


CG56592-01 


l 


2 


Claudin 6- like 


lb 


CG56586-01 


3 


4 


Claudin-3-like 


lc 


CG56592-03 


5 


6 


Claudin- 6 -like 


Id 


CG56592-02 


7 


8 


Claudin 6- like 


2 


CG56596-01 


9 


10 


Protein Serine Kinase -like 


3a 


CG56594-01 


11 


12 


Claudin- 19 -like 


3b 


CG56594-02 


13 


14 


Claudin- 19 -like 


3C 


CG57576-01 


15 


16 


Claudin-19-like 


4a 


CG56589-01 


17 


18 


Claudin- 6- like 


4b 


CG565B9-01 


19 


20 


^-JL ciU.tXX.ll o — xixte 


4C 


CG56589-02 


21 


22 


Claudin- 6- like 


5a 


CG56635-01 


23 




riuiiuL aruoAy x a L e b rauapor t cr 
(MCT3)-like 


5b 


CG56635-02 


25 


26 


nuiiw uctLiJUAyxauc ailopox Ccr 

3 -like 


5c 


CG56635-03 


27 


28 


rtull<Jl».e±X.XJiJJty AdL.c LXculopUX UcX 

3 -like 


5d 


CG56635-04 


29 


30 


3-like 


5e 


CG56635-05 


31 


32 


3-like 


6 


CG56674-01 


33 


34 


Nitrilase-l-like 


7a 


CG56613-01 


35 


36 


Cleavage Signal -l Protein- 
Like 


7b 


CG56613-02 


37 


38 


Cleavage Signal -1 Protein- 
Like 


7c 


CG56613-D3 


39 


40 


Cleavage Signal -1 Protein- 
Like 


7d 


174307820 


41 


42 


Cleavage Signal -1 Protein- 
Like 


7e 


167474749 


323 


324 


Cleavage Signal-l Protein- 
Like 


8 


153472451 


43 


44 


Matriptase -like 


9a 


CG56554-01 


45 


46 


Neuropeptide Y/ Peptide YY 
receptor -like 


9b 


CG56554-02 


47 


48 


Neuropeptide Y/ Peptide YY' 
receptor -like 


10 


CG55964-01 


49 


50 


G- Protein Coupled Receptor- 
like 


11 


CG55966-01 


51 


52 


G- Protein Coupled Receptor- 
like 


12 


CG56003-01 


53 


54 


Neuromodul in- 1 ike 


13a 


CG56021-01 


5 5 


56 


G- Protein Coupled Receptor- 
like 


13b 


CG56021-02 


57 


58 


G- Protein Coupled Receptor- 
like 


14 


CG56023-01 


59 


60 


G- Protein Coupled Receptor - 
like 


15a 


CG56065-01 


61 


62 


G- Protein Coupled Receptor- 
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like 




pfic c. n 6 ^ - no 




fid 


like 


J. Oct 


CG56067-01 


6R 




vjr riuLciii vuupicu KCLcptui " 

like 


16b 


CG56753-02 




68 


vj rxutciu uuupxcu nc^cptui 

like 


17a 


CG56657-01 


69 


70 


w riULCXU ^XJU^JXCU RCCCpLUl 

like 


17b 


CG56657-02 


71 


72 


like 


17c 

X I w 


CG56659-01 


73 


74 


p-rot* "i n Potrnl Rpr!^rvt"OT~— 

VJ FiwLC XXX UUUjJl C; IX l\CLC^LUi 

like 


17d 


CG56659 02 


75 


76 


G— Protein CoirDled Receiptor— 
like 


18a 


CG56663-01 


77 


78 


G- Protein Coimled ReceDtor- 

rjtvbcxu vvu^xcu tvuwc^/bvii 

like 


18b 


CG56663-02 


79 


80 


like 


19a 


CG56665-01 


81 


82 


w tiULCXH VVUpXCU ACUCyUvl 

like 


19b 


CG56665-02 


83 


84 


fl— Pr*ot" *a "t n Pntml pH PpppnlriT* 

u riULClll \«UUUJ.CU ivCLvULUl 

like 


20 


CG56667-01 


85 


86 


VJ ri-WLCAll ^uupicu IVCUCpLUJb " ■ 

like 


21a 


CG56639-01 


87 


88 


Protease-Like 


21b 


CG56639-02 


89 


90 


AUlCUCll OC^lCLUl y OCL1UC 

Protease-Like 


22a 


CG56643-01 


91 


9? 
-7 « 


2VHT"*»n^T flprrpfftrv Qpti no 
wm ciiqx ocuicuuiy ociiue 

Protease-Like 


22b 


CG56643-02 


93 


94 


Adrenal Secretory Serine 
Protease -Like 


22c 


CG56643-03 


95 


96 


Adrenal Secretory Serine 
Protease-Like 


23a 


CG56647-02 


97 


98 


Serine Protease DESCl-like 


23b 


CG56647-03 


99 


100 


Serine Protease DESCl-like : 


23c 


CG56647-01 


101 


102 


Serine Protease DESCl-like 


24a 


CG56155-01 


103 


104 


Parchorin-like 


24b 


CG56155-02 


105 


106 


Parchorin- like 


25 


CG56457-01 


107 


108 


Protein Phosphatase -liJce 


26a 


CG56461-01 


109 


110 


GAGE- 7- like 


26b 


CG56461-02 


111 


112 


GAGE- 7- like ; 


27a 


CG56645-01 


113 


114 


PoH *i nm— Gl nrnep 

Cot ransporter- like 


27b 


CG56645-02 


115 


116 


Sodium- Glucose 
Cotransporter- 1 ike 


27c 


191828203 


117 


118 


Sodium-Glucose 

Cotransporter- like 1 


28 


CG56185-01 


119 


120 


MYD-l-like 


29a 


CG56187-01 


121 


122 


CRAL- TRIO - like 


29b 


CG56187-03 


123 


124 


CRAL-TRIO- like 


29c 


CG56189-01 


125 


126 


CRAL- TRIO- like 


30 


CG56191-01 


127 


128 


Ryudocan- like 


31 


CG56392-01 


129 


130 


Sulfur- rich Keratin- like 


32 


CG56686- 01 


131 


132 


DNMT1 associated protein- 1 
(DMAP) 


33 


CG56688-01 


133 


134 


Notchl-like 


34 


CG56715-01 


135 


136 


Olfactory Receptor- like 


35 


CG56718-01 


137 


138 


Olfactory Receptor-like 


Joa 




139 


140 


cacuie r x n 1 1 - 1 1 Ke 


36b 


CG56729-02 


141 


142 


Cadherin ll-like 


37 


CG56733-01 


143 


144 


Ten-M2-like 


38 


CG56737-01 


145 


146 


Activin Beta C Chain- like 


39a 


CG56737-02 


147 


14 8 


Activin Beta C Chain-like 
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39b 


CG56637-03 


149 


150 


Inhibin Beta E Chain- like 


40 


CG56097-01 


151 


152 


UDP Glycosyl transf erase - 
like 


41a 


CG56680-01 


153 


154 


Sodium/ Hydrogen Exchanger 
4 -like 


41b 


CG56680-02 


155 


156 


Sodium/ Hydrogen Exchanger 
4 -like 


42a 


CG56682-01 


157 


158 


Kupffer Cell Receptor- like 


42b 


CG56682-02 


159 


160 


Kupffer Cell Receptor- like * 


42C 


CG56682-03 


161 


162 


Kupffer Cell Receptor- like 


42d 


CG56682-04 


163 


164 


Kupffer Cell Receptor- like 


43 


CG56690-01 


165 


166 


P2Y Purinoceptor -like 


44 


CG56692-01 


167 


168 


G Protein Coupled Receptor - 
like 


45 


CG56694-01 


169 


170 


Mas Pro to - Oncogene - 1 ike 


46a 


CG56696-01 


171 


172 


Mas Proto-Oncogene -like 


4 6b 


CG56696-02 


173 


174 


Mae-Dai a r* DynhAi m 

Coupled Receptor -like 


46c 




175 


J. / D 


Mas Proto-Oncogene- like 


46d 


CG56698-01 


177 


178 


Mas Proto-Oncogene -like 


47 


CG56700-01 


179 


180 


Pep t idyl- Prolyl Cis- Trans 
Isomerase-like 


48a 


CG56743-01 


181 


182 


Phospholipase C Delta- 4- 
like 


48b 






184 


Phospholipase C Delta- 4- 
like 


49 


CG56739-01 


185 


186 


Leukotriene-B4 Omega - 
Hy droxy 1 ase-like 


50a 


CG56771-01 


187 


188 


riULCJ.il «i (jXllinc JN — 

Methyltransferase 2-like 


50b 


CG56771-02 


189 


190 


Protein Arginine N- 

Me thy ltransf erase 2-like 


51 


CG56759-01 


191 


192 


Olfactory Receptor- like 


52 


CG56731-01 


193 


194 


xl.O c> O — X J. ri-ti 


53 


CG56745-01 


195 


196 


Uracil 

Pho spho r xbosy 1 1 rans f e r a s e - 
Like 


54a 


CG5677*}- 01 




1 QC 


Protein Phosphatase 2C-like 


54b 


CG5677T- no 




<5UU 


Protein Phosphatase 2C-like 


55 




om 


o no 


Heparan Sulfate 6- 
Sulfo transferase 3 -like 


56a 


CG56816-01 




O f\A 


N - Hy droxya ry 1 amine 
Sulf ot rans f erase- 1 ike 


56b 


CG56816-02 


205 


206 


N - Hydroxy aryl am ine 
Sul fo trans f erase - 1 ike 


57 


CG56829-01 


207 


208 


Testis Specific Serine 
Kinase- 3 -like 


58a 


CG56315-01 


209 


210 


Gap Junction Beta-5-like 


58b 


CG56315-02 


211 


212 


* Connexin-like 


59 


CG56633-01 


213 


214 


Translation Initiation 
Factor 5- like 


60a 


CG56894-01 


215 


OIK 


juynxx - 1 i jce 


60b 


.CG56894-02 


217 


218 


Lynxl-like 


61 


CG56453-01 


219 


220 


Adlican-like 


62 


CG56781-01 


221 


222 


Neurops in Precursor- 1 ike 


63 


CG53054-02 


223 


224 


Wnt-14 Precursor- like 


64 


CG56884-01 


225 


226 


Dipeptidyl peptidase -like 


65a 


CG56651-01 


227 


228 


Protein phosphatase - 1 ike 


65b 


CG56651-02 


229 


230 


Protein phosphatase- like 


66 


CG56313-01 


231 


232 


Olfactory receptor -like 


67 


CG56571-01 


233 


234 


Olfactory Receptor-Like 
Protein OLF3-like 


68 


CG56844-01 


235 


236 


Endoglin (CD105 antigen) - 
like 
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69a 


CG56950-01 


237 


238 


Interleukin l Epsilon-like 


69b 


CG56136-02 


239 


240 


Interleukin l Epsilon-like 


70a 


CG56878-01 


241 


242 


OS-9-like 


70b 


CG56878-04 


243 


244 


OS-9-like 


71 


CG56906-01 


245 


246 


Sodium/ Hydrogen Exchanger 
6- like 


72 


CG56910-01 


247 


248 


Ubiqui tin- Specific 
Protease- like 


73 


CG56822-01 


249 


250 


Sulf otransf erase-1 ike 


74 


CG56775-01 


6ji 


252 


Phosphatase- 1 ike 


75 


CG56783-01 


253 


254 


Phospha tase-like 


76a 


CG56789-01 


255 


256 


Phosphatase - 1 ike 


76b 




£3 r 


258 


uual opeciLlClLy 

Phospha tase-like 


77 


CG56804-01 


259 


260 


fuai opcciiicxLy 
Phosphatase- like 


78 


CG56810-01 


261 


262 


UuaJi opcciliciwy 

Phosphatase- like 


79 


CG56862-01 


263 


264 


Dual Specificity 
Phospha t as e- 1 ike 


80 


CG56882-01 


265 


266 


Dual Specificity 
Phospha tase- 1 ike 


81a 


CG56283-01 


267 


268 


Beta-1,3- 

Galactosvl transferase -like 


81b 


CG56283-02 


269 


270 


Beta- 1,3- 

Gal ac to svX trans feiras© - 1 ilce* 


82 


CG56983-01 


271 


272 


Peptide YY-like' 


83 




£ 1 i 


& I*k 


G Protein- Coupled Receptor 
Kinase GRK7-like 


84 


V»V33 O 27 X^ UI 






Phospholipase C delta 1- 
like 


85 




9*7*7 


T>*7 Q 
& / 0 


b i ras e - ac c lva c nig protein- 
like 


86 


V»V99 O 7 3 / w X 






GTPase-Act ivating Protein- 
like 


87a 


CG56886- 01 


X 


npo 


Rho - GTP as e - Ac t iva t ing 
Protein- like 


87b 


CG56886- 02 


283 




Kuo - <j i c sls e - ac u i va c ing 
Protein- like 


86 j 


CG563 94-01 


285 




bjLytciui — j -iriiospnace 
Dehydrogenase- like 


89 


CG56396-01 


287 


288 


v»xyctSX70X — J - r'ilOopilci Lc 

Dehydrogenase - 1 ike 


90 


CG56888-01 


289 


290 


ociiuc/ x iix tsuiixiic- rx uucxn 

Kinase PAK 2- like 


91 


CG56779-01 


291 


292 


uuLjai.xxX7uiHc lautoiUciaSc 

like 


92 


CG56904-01 


293 


294 


OCWicLCU ICULlHc ll l_.il 

repeat (LRR)-like • 


93 


CG56277-01 


295 




inosine-3 -rionopnospnace 
Dehydrogenase - like 


94 


CG56261-01 


297 


298 


Male- Specific Lethal 3 -Like 
1-like 


95 


CG56975-01 


299 


300 


Cysteine Conjugate Beta- 
Lyase-like 


96a 


CG56918-01 


301 


3 02 


Monocarboxyla t e 
transporter- 1 ike 


96b 


CG56918-02 


303 


304 


Monocarboxylate 
transporter- like 


96C 


CG56918-03 


305 


306 


Sugar Transporter -like 


97a 


CG57070-01 


307 


308 


Carboxypeptidase Al-like 


97b 


CG57070-02 


309 


310 


Carboxypeptidase Al-like 


97c 


CG57070-03 


311 


312 


Carboxypeptidase Al-like 
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97d 


CG57070-04 


313 


314 


Carboxypeptidase Al-like 


97e 


CG57070-05 


315 


316 


Carboxypeptidase Al-like 


97f 


CG57070-06 


317 


318 


carboxypeptidase Al-like 


98 


CG55939-01 


319 


320 


Agrin-like 


99 


CG57010-01 


321 


322 


SNC73-like 



NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according to the 
invention are useful as novel members of the protein families according to the presence of 
5 domains and sequence relatedness to previously described proteins. Additionally, NOVX 
nucleic acids and polypeptides can also be used to identify proteins that are members of the 
family to which the NOVX polypeptides belong. 

NOV1, NOV3, and NOV4 are homologous to a Claudin-like family of proteins. Thus, 
the NOV1, NOV3, and NOV4 nucleic acids, polypeptides, antibodies and related compounds 
10 according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions. 

NOV2 is homologous to the Protein Serine Kinase-like family of proteins. Thus 
NOV2 nucleic acids, polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
1 5 pathologies or conditions. 

NOV5 is homologous to a family of Monocarboxylate transporter-like proteins. Thus, 
the NOV5 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
20 NOV6 is homologous to the nitrilase-1 -like family of proteins. Thus, NOV6 nucleic 

acids, polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions. 

NOV7 is homologous to the Cleavage Signal-l-like family of proteins. Thus NOV7 
25 nucleic acids, polypeptides, antibodies and related compounds according to the invention will 
be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions. 

NOV8 is homologous to the Matripase-like family of proteins. Thus NOV8 nucleic 
acids, polypeptides, antibodies and related compounds according to the invention will be 
30 useful in therapeutic and diagnostic applications implicated in, various pathologies or 
conditions. 
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NOV9 is homologous to members of the Neuropeptide Y/Peptide YY receptor-like 
family of proteins. Thus, the NOV9 nucleic acids, polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
implicated in various pathologies or conditions. 

NOVslO through 20, , NOV43, NOV44, and NOV83 are homologous to the G-Protein 
Coupled Receptor-like family of proteins. Thus, these nucleic acids and polypeptides, 
antibodies and related compounds according to the invention will be useful in therapeutic and 
diagnostic applications implicated in various pathologies or conditions. 

NOV21 and NOV22 are homologous to the Adrenal; secretory serine protease like 
growth factor binding protein-like family of proteins. Thus, NOV21 and NOV22 nucleic acids 
and polypeptides, antibodies and related compounds according to the invention will be useful 
in therapeutic and diagnostic applications implicated in various pathologies or conditions. 

NOV23 is homologous to the Serine Protease DESC-l-like family of proteins. Thus, 
NOV23 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in various pathologies or conditions. 

NOV24 is homologous to the Parchorin-like family of proteins. Thus, NOV24 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
disorders. 

NOV25 is homologous to theProtein Phosphatase-like family of proteins. Thus, 
NOV25 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions. 

NOV26 is homologous to the GAGE7-like family of proteins. Thus, NOV26 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies/disorders. 

NOV27 is homologous to the Sodium-Glucose Cotransporter-like family of proteins. 
Thus, NOV27 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV28 is homologous to the MYD-l-like family of proteins. Thus, NOV28 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 
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NOV29 is homologous to the CRAL-TRIO-iike family of proteins. Thus, NOV27 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

5 NOV30 is homologous to the Ryudocan-like family of proteins. Thus, NOV30 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV31 is homologous to the Sulfur-rich Keratin-like family of proteins. Thus, 
1 0 NOV3 1 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV32 is homologous to the DMNT1 associated protein-like family of proteins. 
Thus, NOV32 nucleic acids and polypeptides, antibodies and related compounds according to 
15 the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV33 is homologous to the Notch 1 -like family of proteins. Thus, NOV33 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
20 conditions 

NOV34, NOV35, NOV51, NOV66, and NOV67 are homologous to the Olfactory 
Receptor-like family of proteins. Thus, NOV34, NOV35, NOV51, NOV66, and NOV67 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
25 conditions 

NOV36 is homologous to the Cadherin 1 1-like family of proteins. Thus, NOV36 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies' or 
conditions 

30 NOV37 is homologous to the Ten-M2-like family of proteins. Thus, NOV33 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 
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NOV38 and NOV39 are homologous to the Activin/Inhibin-like family of proteins. 
Thus, NOV38 and NOV39 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 
5 NOV40 is homologous to the UDP Glycosyltransferase-Iike family of proteins. Thus, 

NOV40 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV4 1 is homologous to the Sodium/Hydrogen Exchanger 4-like family of proteins. 
10 Thus, NOV41 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV42 is homologous to the Kupffer Cell Receptor-like family of proteins. Thus, 
NOV42 nucleic acids and polypeptides, antibodies and related compounds according to the 
15 invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV45 and NOV46 is homologous to the Mas Proto-Oncogene-like family of 
proteins. Thus, NOV45 and NOV46 nucleic acids and polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
20 implicated in various pathologies or conditions 

NOV47 is homologous to the Peptidyl-Prolyl Cis-Trans Isomerase-like family of 
proteins. Thus, NOV47 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 
25 NOV48 is homologous to the Phospholipase C Delta-4-like family of proteins. Thus, 

NOV48 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV49 is homologous to the Leukotriene-B4 Omega Hydroxylase-like family of 
30 proteins. Thus, NOV49 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV50 is homologous to the Protein Arginine N-Methyltransferase 2-like family of 
proteins. Thus, NOV50 nucleic acids and polypeptides, antibodies and related compounds 
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according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV52 is homologous to the H326-like family of proteins. Thus, NOV52 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
5 useful in therapeutic and diagnostic applications implicated in various pathologies or ' 
conditions 

NOV53 is homologous to the Uracil Phosphoribosyltransferase-like family of proteins. 
Thus, NOV53 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
1 0 pathologies or conditions 

NOV54 is homologous to the Protein Phosphatase 2C-like family of proteins. Thus, 
NOV54 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
15 NOV55 is homologous to the Heparan Sulfate 6-Sulfotransferase 3-like family of 

proteins. Thus, NOV55 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV56 is homologous to the N-Hydroxyarylamine Sulfotransferase 3-like family of 
20 proteins. Thus, NOV52 nucleic acids and polypeptides, antibodies and related compounds 

according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV57 is homologous to the Testis Specific Serine Kinase-3-like family of proteins. 
Thus, NOV57 nucleic acids and polypeptides, antibodies and related compounds according to 
25 the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV58 is homologous to the Gap Junction Beta-5-like family of proteins. Thus, 
NOV58 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
30 pathologies or conditions 

NOV59 is homologous to the Translation Initiation Factor 5-like family of proteins. 
Thus, NOV59 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologi es or conditions 

13 
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NOV60 is homologous to the Lynxl-like family of proteins. Thus, NOV60 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

5 NOV61 is homologous to the Adlican-like family of proteins. Thus, NOV61 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV62 is homologous to the Neuropsin Precursor-like family of proteins. Thus, 
10 NOV62 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV63 is homologous to the Wnt-14-Iike family of proteins. Thus, NOV63 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
15 useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV64 is homologous to the Dipeptidyl peptidase-like family of proteins. Thus, 
NOV64 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
20 pathologies or conditions 

NOV65 is homologous to the Protein phosphatase-like family of proteins. Thus, 
NOV65 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

25 NOV68 is homologous to the Endoglin (CD105 antigen>like family of proteins. Thus, 

NOV68 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

* 

NOV69 is homologous to the Interleukin 1 Epsilom-like family of proteins. Thus, 
30 NOV69 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV70 is homologous to the OS-9-like family of proteins. Thus, NOV70 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
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useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV71 is homologous to the Sodium/Hydrogen Exchanger 6-like family of proteins. 
Thus, NOV71 nucleic acids and polypeptides, antibodies and related compounds according to 
5 the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV72 is homologous to the Ubiquitin Specific Protease-like family of proteins. 
Thus, NOV72 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
10 pathologies or conditions 

NOV73 is homologous to the Sulfotransferase-like family of proteins. Thus, NOV73 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

15 NOV74, NOV75, NOV76, NOV77, NOV78, NOV79, and NOV80 are homologous to 

the Dual Specificity Phosphatase-like family of proteins. Thus, NOV74, NOV75, NOV76, 
NOV77, NOV78, NOV79, and NOV80 nucleic acids and polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
implicated in various pathologies or conditions 

20 NOV81 is homologous to the Beta-1, 3-Galactosyltransferase-like family of proteins. 

Thus, NOV81 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV82 is homologous to the Peptide YY-like family of proteins. Thus, NOV82 

25 nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV84 is homologous to the Phospholipase C delta 1 -like family of proteins. Thus, 
NOV84 nucleic acids and polypeptides, antibodies and related compounds according to the 
30 invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV85, NOIV86, and NOV87 are homologous to the GTPase-Activating Protein-like 
family of proteins. Thus, NOV85, NOIV86, and NOV87 nucleic acids and polypeptides, 
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antibodies and related compounds according to the invention will be useful in therapeutic and 
diagnostic applications implicated in various pathologies or conditions 

NOV88 and NOV89 are homologous to the Glyceroil-3-Phosphate Dehydrogenase- 
like family of proteins. Thus, NOV88 and NOV89 nucleic acids and polypeptides, antibodies 
5 and related compounds according to the invention will be useful in therapeutic and diagnostic 
applications implicated in various pathologies or conditions 

NOV90 is homologous to the Serine/Threonine-Protein Kinase PAK 2-like family of 
proteins. Thus, NOV90 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
10 in various pathologies or conditions 

NOV91 is homologous to the D-Dopachrome Tautomerase family of proteins. Thus, 
NOV91 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
15 " NOV92 is homologous to the Secreted leucine-rich repeat (LRR)-like family of 

proteins. Thus, NOV92 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV93 is homologous to the Inosme-5'-Monophosphate Dehydrogenase-like family 
20 of proteins. Thus, NOV93 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV94 is homologous to the Male-Specific Lethal 3-like family of proteins. Thus, 
NOV94 nucleic acids and polypeptides, antibodies and related compounds according to the 
25 invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV95 is homologous to the Cysteine Conjugate Beta Lyase-like family of proteins. 
Thus, NOV95 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
30 pathologies or conditions 

NOV96 is homologous to the Monocarboxylate transporter-like family of proteins. 
Thus, NOV96 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
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NOV97 is homologous to the Carboxypeptidase Al-like family of proteins. Thus, 
NOV97 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
5 NOV98 is homologous to the Agrin-like family of proteins. Thus, NOV98 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV99 is homologous to the SNC73-like family of proteins. Thus, NOV99 nucleic 
1 0 acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

The NOVX nucleic acids and polypeptides can also be used to screen for molecules, 
which inhibit or enhance NOVX activity or function. Specifically, the nucleic acids and 
1 5 polypeptides according to the invention may be used as targets for the identification of small 
molecules that modulate or inhibit, e.g., neurogenesis, cell differentiation, cell proliferation, 
hematopoiesis, wound healing and angiogenesis. 

Additional utilities for the NOVX nucleic acids and polypeptides according to the 
invention are disclosed herein. 

20 NOV1 

NOV1 includes three novel human 1 Clandin-like proteins disclosed below. The 
disclosed sequences have been named NOVla, NOVlb, NOVlc, NOVld, NOVle, NOVlf, 
andNOVlg. 

25 NOVla 

A disclosed NOVla nucleic acid of 687 nucleotides (also referred to as CG56592-02) 
encoding a novel human Claudin 6-like protein is shown in Table 1 A An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 6-8 and ending with a 
TAG termination codon at nucleotides 678-680. The start and stop codons are in bold letters 

30 in Table 1 A, and the 5* and 3' untranslated regions are underlined. 
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Table 1A. NOVla nucleotide sequence (SEQ ID NO:l). 

TGACTA TGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCT 

GCTCCTGTGTTACCACCATCCTGCCCCAGTGGAAGACTCTTAATCTGGAACTGAACGAGATGGAGACCTGGA 
TCATGGGGATTTGGGAGGTCTGCGTGGATCGAGAGGAAGTCGCCACTGTGTGCAAGGCCTTTGAATCCTTCT 
TGTCTCTGCCCCAGGAGCTCCAGGTAGCCCG^^ 
TTTTGCTCTGCAGCTTTGGGTCTGAATGCTTCCAGTTT 

GTCTCCTGGGAAGGACTTTGGAGGCATCCGCTTCAGCCACrACCCTCCTTCCAGTCTCCTGGGTGGCCCATG 

CCACAATCCAAGACTTCTGGGATGACAGCATCCCTGACATC^^ 

TACTTGGGCTGGGCTGCTGGTATTTTCCTGGCT 

AAAGAAGATGTGCCTTTTCCTTTGATGGCTGGTCCCAC^ 

GATGGCTCCTTCCACCTCATGCTAAGACCTAGGAACCTG 



In a search of public sequence databases, the NOVla nucleic acid sequence, located on 
chromsome 12 has 337 of 534 bases (63%) identical to a gb:GENBANK- 
ID:HSA249735|acc:AJ249735.1 mRNA from Homo sapiens (CLDN6 gene for claudin-6). 
5 In all BLAST alignments herein, the "E-value" or."Expecf value is a numeric 

indication of the probability that the aligned sequences could have achieved their similarity to 
the BLAST query sequence by chance alone, within the database that was searched. For 
example, the probability that the subject ("Sbjct") retrieved from the NOVla BLAST analysis, 
e.g., Homo sapiens CLDN6 gene for claudin-6, matched the Query NOVla sequence purely 
10 by chance is 1.4e" 15 . The Expect value (E) is a parameter that describes the number of hits one 
can "expect" to see just by chance when searching a database of a particular size. It decreases 
exponentially with the Score (S) that is assigned to a match between two sequences. 
Essentially, the E value describes the random background noise that exists for matches 
between sequences. 

1 5 The Expect value is used as a convenient way to create a significance threshold for 

reporting results. The default value used for blasting is typically set to 0.0001. In BLAST 2.0, 
the Expect value is also used instead of the P value (probability) to report the significance of 
matches. For example, an E value of one assigned to a hit can be interpreted as meaning that 
in a database of the current size one might expect to see one match with a similar score simply 

20 by chance. An E value of zero means that one would not expect to see any matches with a 
similar score simply by chance. See, e.g., 

ht^>://www.ncbi.iilm.iiih.gov/Education/BLASTinfoA Occasionally, a string of X's or N*s 
will result from a BLAST search. This is a result of automatic filtering of the query for low- 
complexity sequence that is performed to prevent artifactual hits. The filter substitutes any 
25 low-complexity sequence that it finds with the letter ,I N" in nucleotide sequence (e.g., 

"NNNNNNNNNNNNN") or the letter "X w in protein sequences (e.g., "XXXXXXXXX"). 
Low-complexity regions can result in high scores that reflect compositional bias rather than 
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significant position-by-position alignment. (Wootton and Federhen, Metbods Enzymol 
266:554-571, 1996). 

The disclosed NOV1 a polypeptide (SEQ ID NO:2) encoded by SEQ ID NO:l has 229 
amino acid residues and is presented in Table IB using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOVla has no signal peptide and is likely to be 
localized the plasma membrane with a certainty of 0.6400. Alternatively, NOVla also may 
localize to the Golgi body with acertainty of 0.4600, the endoplasmic reticulum (membrane) 
with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with a certainty of 0.1000. 
The most likely cleavage site for a NOVla peptide is between amino acids 24 and 25, at: 
VCS-CV. 



Table IB. Encoded NOVla protein sequence (SEQ ID NO:2). 

LPQELQVARILMVASHGLGLLGLLI^SFGSECFQFHRIRWVFKW^GLIiGRTLEASASATTLLPVSWVflHAT 
IQDFWDDS I PDI I PRWEFGGALYLGWAAGI FLALGGLLLI FSACJjGKEDVPFPLMAGPTVPLSCAPVEESDG 
SFHLMLRPRNIiVI • 

A search of sequence databases reveals that the NOVla amino acid sequence has 81 of 
207 amino acid residues (39%) identical to, and 1 1 1 of 207 amino acid residues (53%) similar 
to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 
(Mouse) (Claudin-6) (E = 2.7e" 27 ). 

NOVla is predicted to be expressed in Bone Marrow, Brain, Liver, Placenta, and 

Lung. 

NOVlb 

A disclosed NOVlb nucleic acid of 687 nucleotides (also referred to as CG56586-01) 
encoding a human Claudin-3-like protein is shown in Table 1C. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 6-8 and ending with a TAG 
codon at nucleotides 678-680. Putative untranslated regions upstream from the initiation 
codon, and downstream from the termination codon, if any, are underlined in Table 1C. The 
start and stop codons are in bold letters. 
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Table 1C NOVlb nucleotide sequence (SEQ ID NO:3). 

TGACTATGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCTACTTCTCTCCCTCCTTGGCTGGGTCT 

GCTCCTGTGTTACCACCATCCTGCCCCAGTGGAAGACTCTTAATCTGGAACTGAACGAGATGGAGACCTG^ 

TCATGGGGATTTGGGAGGTCTGCGTCK^TCGAGAGGAAGTCGC^^ 

TGTCTCTGCCCCAGGAGCTCCAGGTAGCCCGCATCC^ 

TTTTGCTCTGCAGCTTTGGGTCTGAATGCTTC 

GTCTCCTGGGAAGGACTTTGGAGGCATCCGCITCAGCCACTACCCTCCTTCCAGTCTC 

CCACAATCCAAGACTTCTGGGATGAC^GC^TCCCTC 

TACTTGGGCTGGGCTGCTGGTATTTTCCT 

AAAGAAGATGTGCCTTTTCCTTTGATGGCTGGTCCCA 

GATGGCTCCTTCCACCTCATGCTAAGACCTA GGAACCTG 



In a search of public sequence databases, the NOVlb nucleic acid sequence, located on 
chromsome 11 is 338 of 534 bases (63%) identical to a gb:GENBANK- 
ID:HSA249735|acc:AJ249735.1 mRNA from Homo sapiens (CLDN6 gene for claudin-6) (E = 
5 2.8e 16 ). 

The disclosed NOVlb polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 has 224 
amino acid residues and is presented in Table ID using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOVlb has a signal peptide and is likely to be 
localized in the plasma membrane with a certainty of 0.4600. Alternatively, NOVlb may also 
10 localize to the microbody (peroxisome) with acertainty of 0.3200, the endoplasmic reticulum 
(membrane) with a certainty of 0.1000 or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1000. The most likely cleavage site for a NOVlb peptide is between amino acids 
24 and 25, at: VCS-CV. 



Table ID. Encoded NOVlb protein sequence (SEQ ID NO:4). 

MAWS FRAKVQLGGLLLS LLGWV CS CVTT I L PQWKTLNLELNEMETW I MGI WEVCVDRE EVATVCKAFES FLS 
LPQELQVARI LMVASHGLGLLGLLLCS FGS ECFQFHRI RWVFKRRI^LLGRTLEAS ASATTLLPVSWVAHAT 
IQDETODSIPDIIPSVGVWRCPLLGIiGCVreFPGSWWA 

LLPPHAKT 

15 

A search of sequence databases reveals that the NOVlb amino acid sequence has 50 of 
149 amino acid residues (33%) identical to, and 83 of 149 amino acid residues (55%) similar 
to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q63400 protein from Rattus 
norvegicus (Rat) (Claudin-3 (Ventral Prostate.l Protein) (RVP1)) (E = 0.0). 
20 NOVlb is predicted to be expressed in Bone Marrow, Brain, Liver, Placenta, and 

Lung. 

NOVlc 

A disclosed NOVlc nucleic acid of 642 nucleotides (also referred to as CG56592-03) 

encoding a novel Claudin-6-like protein is shown in Table IE. An open reading frame was 
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identified beginning with a ATG initiation codon at nucleotides 6-8 and ending with a TAG 
codon at nucleotides 609-61 1 . The start and stop codons are in bold letters, and the 5' and 3 9 
untranslated regions are underlined. 

Table IE. NO Vic Nucleotide Sequence (SEQ ID NO:5) 

TGACTATGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCTACTTCTCTCCCTCCTTGGCT 

TGCTC CTGTGTTACCACCATCCTGCCCCAGTGGAAGACT CTTAATCTGGAACTGAACGAGATGGAGACCTG • 

GATCATGGGGATTTGGGAGGTCTGCGTGGATCGAGAGGAAGTCGCCACTGTGTGCAAGGCCTTTGAATCCT 

TCTTGTCTCTGCCCCAGGAGCTCCTGTTTCACAGGAT 

GGAAGGACTTTGGAGGCATCCGCTTCAGCCACTACCCTCCTTCCAGTCTCCTGGGTGGCCCATGCCACAAT 
CCAAGACTTCTGGGATGACAGCATCCCTGACATCATACCTCGGTGGGAGTTTGGAGGTGCCCTCTACTTGG 
GCTGGGCTGCTGGTATTTTCCTGGCTCTTGGTGGGCTACTCCTCATCTTCTCGGCCTGCCTGGGAAAAGAA 
GATGTGCCTTTTCCTTTGATGGCTGGTCX2CACAGTCCCCCTATCCTGTGCTCCAGTGGAGGAGTCAGATGG 
CTCCTTCCACCTCATGCTAAGACCTAGGAACCTGGTCATCTAGGACTGGCTTCTGCCAAGGATCTCTGGAA 



5 The disclosed NOVlc nucleic acid sequence maps to chromosome 12 and has 144 of 

220 bases (65%) identical to a gb:GENBANK-ID:HSA249735|acc:AJ249735.1 mRNA from 
Homo sapiens (CLDN6 gene for claudin-6) (E = 0.0). 

A disclosed NOVlc protein (SEQ ID NO:6) encoded by SEQ ID NO:5 has 201 amino 
acid residues, and is presented using the one-letter code in Table IF. Signal P, Psort and/or 

1 0 Hydropathy results predict that NOVlc does have a signal peptide, and is likely to be localized 
to the plasma membrane with a certainty of 0.4600. In other embodiments NOVlc is also 
likely to be localized to the microbody (peroxisome) with a certainty of 02651, to 
endoplasmic reticulum (membrane) with a certainty of 0.1000, or to the endoplasmic 
reticulum (lumen) with a certainty of 0.1000. The most likely cleavage site for NOVlc is 

1 5 between positions 24 and 25, (VCS-CV). 



Table IF. Encoded NOVlc protein sequence (SEQ ID NO:6). 

MAWSFRAKVQLGGLLLSLLGOTCSCVTTILPQWK 

SLPQELQFHRIRWVFKRRLGLLGRTLEASASATTIiLPVSWVAHATIQDFWDDSIPDIIPRWEFGGALYLGW ' 
AAG I FLALG GLLL I FSACLGKEDVPFFLMAGPTVPLS CAP VEESDGSFHLMLRPRNLVT 

The disclosed NOVlc amino acid has 55 of 94 amino acid residues (58%) identical to, 
and 62 of 94 amino acid residues (65%) similar to, the 220 amino acid residue 
ptnr:SPTREMBL-ACC:Q9D7U6 protein from Mus musculus (Mouse) (221 0404 A22RIK 
Protein) (E-3.1e^ 7 ). 

In addition, NOVlc is predicted to be expressed in Bone Marrow, Brain, Liver, 
Placenta, and Lung. 
NOVld 
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A disclosed NOVld nucleic acid of 726 nucleotides (also referred to as CG56592-02) 
encoding a novel Claudin 6-like protein is shown in Table 1G. An open reading frame was 
identified beginning with an ATG codon at nucleotides 6-8 and ending with a TAG codon at 
nucleotides 693-695. The start and stop codons are in bold letters and the 5' and 3' 
5 untranslated regions are underlined in Table 1G. 



Table 1G. NOVld nucleotide sequence (SEQ ID NO:7). 

TGACTA TGGCCTGGAGTTTCCGTGCAAAA^ 
GTTCCTGTGTTACCACCATCCTGCCCCAGTGGAAG 

TCATGGGGATTTGGGAGGT CTGCGTGGATCGAGAGGAAGTCGCCACTGTGTG CAAGGCCTTTGAATCCTTCT 
TGTCTCTGCCCCAGGAGCTCCAGGTAGCC^ 

TTTTGCTOTGCAGCTTTGGGTCTGAATGCTTCCAGTTTCACAGGATCAGATG 

GTCTCCTGGGAAGGACTTTGGAGGCATCCGCTTCAGCCACTACCCTCCT 

CCACAATCCAAGACTTCTGGGATGA^ 

ACTTGGGCTGGGCTGCTGGTATTTTCCTGGCTCTTGGTGGGCTACTCCTGATC 
AAGAAGATGTGCCTTTTCCTTTGA^ 

ATGGCTCCTTCCACCTCATGCTAAGACCTAGGAACCTGGTCATCTA GGACTGGCCT 
GAATAA 

In a search of public sequence databases, the NOVld nucleic acid sequence, located on 
chromsome 12 has 336 of 534 bases (62%) identical to a gb:GENBANK- 
10 ID:HSA249735|acc:AJ249735.1 mRNA from Homo sapiens (CLDN6 gene for claudin-6) (E = 
6.5e 16 >. 

The disclosed NOVld polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 has 229 
amino acid residues and is presented in Table 1H using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOVld has no signal peptide and is likely to be 
15 localized the the plasma membrane with a certainty of 0.6400. Alternatively, NOVld may also 
localize to the Golgi body with acertainty of 0.4600, the endoplasmic reticulum (membrane) 
with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with a certainty of 0.1000. 
The most likely cleavage site for a NOVld peptide is between amino acids 24 and 25, at: 
VCS-CV. 

20 

Table 1H. Encoded NOVld protein sequence (SEQ ID NO:8). 

l^AWSFRAKVQLGGlI^ 
LPQELQVARIL1WASHGIX3LLGLLLCSFGS 

IQDFWDDS I PDI I PRWEFGGALYLGWAAGI FLALGGLLLZ FSACLGKEDVPFPLMAGPTVPLSCAPVEESDG 
SFHLMLRPRNLVI , 

A search of sequence databases reveals that the NOVld amino acid sequence has 81 of 
207 amino acid residues (39%) identical to, and 111 of 207 amino acid residues (53%) similar 
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to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 
(Mouse) (Claudin-6) (E = 2.8e~ 27 ). 

Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of NOVld. The sequence is predicted to be expressed in Bone 
5 Marrow, Brain, Liver, Placenta, and Lung. 

Homologies to any of the above NOV1 proteins will be shared by the other NOV1 
proteins insofar as they are homologous to each other as shown below. Any reference to 
NOV1 is assumed to refer to all four of the NOV1 proteins in general, unless otherwise noted. 

The disclosed NOVla polypeptide has homology to the amino acid sequences shown 
10 in the BLASTP data listed in Table II. 



Table 11. BLAST results for NOVla 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Po 
sitives 
(%) 


Expect 


gi| 17458947 |ref |XP 
061964.1 | 
(XM_061964) 


similar to 
putative {H. 
sapiens) [Homo 
sapiens] 


229 


229/229 
(100%) 


229/229 
(100%) 


e-109 


>gi | 17437506 |ref|XP 
_068031.l| 
(XM_068031) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] . 


220 


99/172 
(57%) 


125/172 
(72%) 


4e-50 


gi j 17437504 |ref|XP 
068030.1 | 
(XM_06803G) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


220 


99/172 
(57%) 


126/172 
(72%) 


4e-43 


gi | 12843248 | dbj | BAB 
25914. l| (AK008821) 


PMP- 

2 2 /EMP/MP2 0 /Claud 

in family 

containing 

protein-data 

source :Pf am, 

source 

key :PF00822, 

evidence : ISS-^puta 

tive [Mus 

musculus] 


220 


104/188 
(55%) 


131/188 
(69%) 


3e-40 


gi | 7710002 |ref|NP o 
57883. l| 
(NM 016674) 


claudin l [Mus 
musculus] 


211 


67/194 
(34%) 


99/194 
(50%) 


2e-20 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1 J. In the ClustalW alignment of the NOV1 proteins, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
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can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 1J. ClustalW Analysis of NOV1 



1) Novel NOVia (SEQ ID NO: 2) 

2) Novel NOVlb (SEQ ID NO: 4) 

3) Novel NOV1C (SEQ ID NO: 6) 

4) Novel NOVld (SEQ ID NO: 8) 

5) gi 1 17458947 | ref |XP_061964 . 1 1 (XM_061964) similar to putative (H. 
sapiens] (SEQ ID NO: 1383) 

6) gi 1 17437506 | ref |XP_068031 . l | (XM_068031) similar to putative (H. 
sapiens] (SEQ ID NO: 1384) 

7) gi 1 17437504 | ref |XP_068030.l| (XMJ>68030) similar to putative (H. 
sapiens) (SEQ ID NO:325) 

8) gi 1 12843248 I dbj |BAB25914.l| (AK008821) PMP-22/EMP/MP20/Claudin family containing 
protein-data source :Pf am, source key:PF00822, evidence :ISS~putative [Mus musculus] 
(SEQ ID NO:326) 

9) gi | 7710002 | ref |NP_057883 . 1 1 (NMJ116674) claudin 1 [Mus musculus) (SEQ ID NO:327) 



sapiens) [Homo 
sapiens) [Homo 
sapiens) [Homo 



NOVla 


1 


NOVlb 


1 


NOVlc 


1 


NOVld 


1 


gi 


17458947 


1 


gi 


17437S06 


1 


gi 


17437504 


1 


gi 


12843248 


1 


gi 


7710002) 


i 



NOVla 
NOVlb 
NOVlc 
NOVld 



gi 
gi 
gi 
gi 
gi 



17458947] 
17437506) 
17437504) 
12843248) 
7710002) 



NOVla 
NOVlb 
NOVlc 
NOVld 



gi 
gi 
gi 
gi 
gi 



17458947) 
17437506) 
17437504) 
12843248 j 
7710002) 



NOVla 
NOVlb 
NOVlc 
NOVld 

gi | 17458947 | 
gi | 17437506 




lYSYAGDNIVTAgAJYE 
100 110 




120 



130 



140 



150 



160 



fhri: 

FHRlj 
jgFHRlEl 
■FpPHRIj 3 " 
^FHRI 
EIGESJQRDL 
IGES$RD1 
iGETgEGL 
ICLEDDl ~ 

170 




180 




190 



200 

| | )....) | 

178 SWWATPHLItGLPGKRRCAFS FDGWSHSE 



210 



220 



230 





kedjSpfp: 

KEDj^PFP: 
•KEDgPFPLl 
fKEDwPFPi 
SHAP|ks< 




--PILCSSGGVR1 



PTVQLSCAPVEES 
.SCAPVEES 
SCAPVEES 
CAPVEES 

AQTQ- 
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gi 
g± 



17437504| 178 ^CffiftgC^SSHAPpALGHYgVAQMQ- TQCPXReDGTADPQV 220 



12843248| 178 ^C^JhCaSwSPAPAASSHY§VAGPE BHQQgJ^KQASpEI 220 

7710002| 177 j^vjgsCgCPRKTTSYPTPRPYPKPTE SSGKDYV- 211 



The claudins are a family of integral membrane proteins that are major components of 
tight junction (TJ) strands. When claudins are introduced into cells that lack tight junctions, 
networks of strands and grooves form at cell-cell contact sites that closely resemble native 
tight junctions. There are at least 17 members of this family in mammals. Claudin family 
1 0 members share -38% amino acid identity, and are predicted to have four transmembrane (TM) 
domains, which is reminiscent of occludin, although they share no sequence similarity with it. 
Multiple sequence alignment reveals their sequences to be fairly well conserved in the first 
and fourth putative TM domains, and in the first and second extracellular loops, but they 
diverge in the second and third TM domains. Although the sequences of their C-terminal 
15 cytoplasmic domains vary, the known family members share a common motif of -Y-V. This 
has been postulated as a possible binding motif for PDZ domains of other tight junction- 
associated peripheral membrane proteins, such as ZO-1. 

The disclosed NOV1 nucleic acid of the invention encoding a Human Claudin -like 
protein includes the nucleic acid whose sequence is provided in Table 1 A ,1C, IE, 1G, or a 
20 fiagment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 1A, 1C, IE, or 1G while 
still encoding a protein that maintains its Human Claudin-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
25 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
30 stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV1 protein of the invention includes the Human Claudin-like protein 
whose sequence is provided in Table IB, ID, IF, or 1H. The invention also includes a mutant 
35 or variant protein any of whose residues may be changed from the corresponding residue 
shown in Table IB, 1 D, IF, or 1H while still encoding a protein that maintains its Human 
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Claudin-Iike activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 66 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 
5 The above disclosed information suggests that this Human Claudin-like protein 

(NOV1) is a member of a "Human Claudin family". Therefore, the NOV1 nucleic acids and 
proteins identified here may be useful in potential therapeutic applications implicated in (but 
not limited to) various pathologies and disorders as indicated below. The potential therapeutic 
applications for this invention include, but are not limited to: protein therapeutic, small 

1 0 molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 

antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV 1 nucleic acids and proteins of the invention are useful in potential 

1 5 therapeutic applications implicated in cancer including but not limited to various pathologies 
and disorders as indicated below. For example, a cDNA encoding the Human Claudin-like 
protein (NOV1) may be useful in gene therapy, and the Human Claudin -like protein (NOV1) 
may be useful when administered to a subject in need thereof. By way of nonlimiting 
example, the compositions of the present invention will have efficacy for treatment of patients 

20 suffering from Von Hippel-Lindau (VHL) syndrome, Cirrhosis, Transplantation, Hemophilia, 
hypercoagulation, Idiopathic thrombocytopenic purpura, autoimmume disease, allergies, 
immunodeficiencies, transplantation, Graft vesus host, Alzheimer's disease, Stroke, Tuberous 
sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, 

25 Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, Systemic lupus 

erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, allergy, and 
Cancer, or other pathologies or conditions. The NOV 1 nucleic acid encoding the Human 
Claudin-like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 

30 assessed. 

NOV1 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immuno-specifically to the novel NOV1 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
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15 



20 



section below. The disclosed NOV1 proteins have multiple hydrophilic regions, each of 
which can be used as an immunogen. These novel proteins can be used in assay systems for 
functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 

NOV2 

A disclosed NOV2 nucleic acid of 1361 nucleotides (also referred to as CG56596-01) 
encoding a novel Protein Serine Kinase-like protein is shown in Table 2A. An open reading 
frame was identified beginning with an ATG initiation codon at nucleotides 20-22 and ending 
with a TAA codon at nucleotides 1268-1270. A The start and stop codons are in bold letters 
in Table 2A. 



Table 2A. NOV2 nucleotide sequence (SEQ ID NO:9). 



CGGCGC&CGTGTTGCOOGT&TG^TGCCrere^ 

AATAGAATTGGCATCCAAAGTGGAACCCAGAAATGGAACAAAGAATGATCTCTATAAATTTTTTTATTATAC 
TTTAAGTTCTACTCCTCCCTGCCCTCTGCCACTCCCCTCIACTACCCCAGTGCCCCCTCCCTCCTTGCCCTGG 
GCCCGAGGCGGCGGCCCAGGCGGCGCAGAGGATACAGGTGGCTCGCTTCCGAGCCAAGTTCGACCCCCGGGT 
CCTTGCCAGATATGACATCAAAGCTCTTATTGGGACAGGCAGTTTCAGCAGGGTTGTCAGGGTAGAGCAGAA 
GACCACCAAGAAACCTTTTGCAATAAAAGTGATGGAAACCAGAGAGAGGGAAGGTAGAGAAGCGTGCGTGTC 
TGAGCTGAGCGTCCTGCGGCGGGTTAGCCATCGTTACATTGTCCAGCTCATGGAGATCTTTGAGACTGAGGA 
TCAAGTTTACATGGTAATGGAGCTGGCTACCGGAGGGGAGCTCTTTGArCGACTCATTGCTCAGGGATCCTT 
TACAGAGCGGGATGCCGTCAGGATCCTCCAGATGGTTGCTGATGGGATTAGGTATTTGCATGCGCTGCAGAT 
AACTCATAGGAATCTAAAGCCTGAAAACCTCTTATACTATCATCCAGGTGAAGAGTCGAAAATTTTAATTAC 
AGATTTTGGTTTGGCATACTCCGGGAAAAAAAGTGGTGACTGGACAATGAAGACACTCTGTGGGACCCCAGA 
GTACATAGCTCCTGAGGTTTTGCTAAGGAAGCCTTATACCAGTGCAGTGGACATGTGGGCTCTTGGTGTGAT 
CACATATGCTTTACTTAGCGGATTCCTGCCTTTTGATGATGAAAGCCAGAGAAGGCTTTACAGGAAGATTCT 
GAAAGG CAAATATAATTATACAGGAGAGCCTTGGC CAAGCATTTCCCACTTGGCGAAGGACTTTATAGACAA 
ACTACTGATTTTGGAGGCTGGTCATCGCATGTCAGCTGGCCAGGCCCTGGACCATCCCTGGGTGATCACCAT 
GGCTGCAGGGTCTTCCATGAAGAATCTCCAGAGGGCCATATCCCGAAACCTCATGCAGAGGGCCTCTCCCCA 
CTCTCAGAGTCCTGGATCTGCACAGTCTTCTAAGTCACATTATTCTCACAAATCCAGGCATATGTGGAGCAA 
GAGAAACTTAAGGATAGTAGAATCGCCACTGTCTGCGCTTTTGTAAGCAGATGACCTCTAAAACTATTTTTG 
CCTATTTTAGGACCATTTCATCATGATTAGGGCACCCTCAAGCTCCAAAGACACGGGACTCCATG 



The disclosed NOV2 nucleic acid sequence, localized to the q2 1.3-22 region of 
chromsome 18, has 685 of 997 bases (68%) identical to a gb:GENBANK- 
ID:HSA272212|acc:AJ272212.1 mRNA from Homo sapiens (mRNA for protein serine kinase 
(PSKHlgene))(E = 6.1e" 8s ). 

A NOV2 polypeptide (SEQ ID NO:10) encoded by SEQ ID NO:9 has 416 amino acid 
residues and is presented using the one-letter code in Table 2B. Signal P, Psort and/or 
Hydropathy results predict that NOV2 contains no signal peptide and is likely to be localized 
to the endoplasmic reticulum (membrane) with a certainty of 0.5500. Alternatively, NOV2 
may also localize to the lysosome (lumen) with a certainty of 0.2403, the plasma membrane 
with a certainty of 0,1900, or the microbody (peroxisome) with a certainty of 0.1 1 1 1 . 
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Table 2B. Encoded NOV2 protein sequence (SEQ ID NO: 10). 

MGCGASRKWPGPPKILVIEIASKVEPRNGTKNDLYKFFYYTLSSTPPCPLPLPSLPQCPLPPCPGPEAAAQ 
AAQRI QVARFRAKFDPRVLARYD IKAL I GTGSFS RWRVEQKTTKKP FAI KVMETREREGREACVS EI*S VLR 
RVSHR Y IVQLMEI FETEDQVYMVMEL ATGGELFDRLI AQG S FTERDAVRI LQMVADGI RYLHALQ I THRNLK 
PENLL YYHPGEES KILITDFGLAYSGKKSGDWTMKTLCGTPE YI APEVLLRKP YTSAVDMWALGVI TYALLS 
GFLPFDDESQTRLYRKILKGKYNYTGEPWPSISHLAKDFIDKLLILEAGHRMSAGQALDHPWITMAAGSSM 
KNXiQRAISRNLMQRASPHSQSPGSAQSSKSHYSHKSRHMWSKRNLRIVESPLSAIiL 

The disclosed NOV2 amino acid sequence has 267 of 412 amino acid residues (64%) 
identical to, and 332 of 412 amino acid residues (80%) similar to, the 424 amino acid residue 
ptnr.SPTREMBL-ACC:Q9NYl 9 protein from Homo sapiens (Human) (Protein Serine 
5 Kinase) (E=Ue 138 ). 

NOV2 is predicted to be expressed in Kidney, Lymph node, Pancreas, Salivary Glands, 
Brain, and Placenta because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:HSA272212|acc:AJ272212.1) a closely related Homo sapiens mRNA for protein serine 
kinase (PSKH1 gene) homolog. 
10 In addition, the sequence is predicted to be expressed in keratinocytes because of the 

expression pattern of (GENBANK-ID: gb:GENBANK-ID:HSPI1371 l|acc:AJ00 1696.2) a 
closely related Homo sapiens mRNA for hurpin, clone R7-1.1 homolog. 

NOV2 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 2C. 

15 



Table 2C. BLAST results for NOV2 



Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 14916455 |ref|NP_ 
149117.1 | 
(NM_033126) 


serine/ threo 
nine kinase 
PSKH2 [Homo 
sapiens) 


385 


369/416 
(88%) 


372/416 
(88%) 


0.0 


gi | 17530179 |gb | AAL4 
0735.1) (AF416988) 


protein 
serine 

kinase/ lucif 
erase fusion 
protein 


975 


257/391 
(65%) 


318/391 
(80%) 


e-149 


gi | 14776113 |ref|XP 
043047. l| 
(XM_043047) 


hypothetical 
protein 
XP_043047 
[Homo 
sapiens] 


424 


257/391 
(65%) 


318/391 
(80%) 


e-145 


gi|l5963448|gb|AALl 
1033. 1| (AF236367) 


protein 
serine 

kinase Pskhl 
[Mus 

musculus] 


424 

V 


254/386 
(65%) 


311/386 
(79%) 


e-144 
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gi|2136035|pir| |I38 


protein- 


319 


209/320 


258/320 


e-115 


138 


serine 
kinase (EC 
2.7.1.-) 
PSK-Hl - 
human 
(fragment) 




(65%) 


(80%) 





The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 2D. 



Table 2D. ClustalW Analysis of NOV2 

<NM_033126) serine/threonine kinase PSKH2 [Homo 



1) NOV2 (SEQ ID NO: 10) 

6) gi 1 14916455 | ref |NP__149117 . 1 1 
sapiens] (SEQ ID NO: 328) 

7) gi | 17530179 |gb | AAL40735 . 1 | (AF416988) protein serine kinase/lucif erase fusion 
protein (SEQ ID NO: 32 9) 

8) gi | 14776113 | ref |XP_043047 . 1 1 (XM_043047) hypothetical protein XP 043047 [Homo 
sapiens] (SEQ ID NO: 330) 

9) gi) 15963448 |gb|AALH033.i| (AF236367) protein serine kinase Pskhl [Mus musculus] 

(SEQ ID NO: 331) 

10) gi|2i36035|pir| |I38138 protein-serine kinase (EC 2.7.1.-) PSK-Hl - human 
(fragment) (SEQ ID NO: 332) 



NOV2 


1 


gi 


14916455 


1 


gi 


17530179 


1 


gi 


14776113 


1 


gi 


15963448 


1 


gi 


2136035) 


1 


NOV2 


61 


gi 


14916455 


33 


gi 


17530179 


60 


gi 


14776113 


60 


gi 


15963448 


60 


gi 


2136035) 


60 


NOV2 


115 


gi 


14916455 


84 


gi 


17530179 


119 


gi 


14776113 


119 


gi 


15963448 


119 


gi 


2136035) 


119 


NOV2 


175 


gi 


14916455 


144 


gi 


17530179 


179 


gi 


14776113 


179 


gi 


15963448 


• 179 


gi 


2136035) 


179 



NOV2 




190 



200 



210 



220 
..I .. 



230 



240 



ELFDRgJASGSFTERDAgRgLQMVgDGHRYLHALWITHRgLKPEiVLLYYHPGjgSKI^I^ 
ELFDRSli^GSFTERDAaRaLQI^gDGyRYLHALSlTHRlLKPENLLYYHPG^SKIlll 1 
ELFDRIIAKGSFTERDATRVLQMVLDGVRYLHALGITHRDLKPENLLYYHPGTDSKIIIT 
ELFDRI I AKGSFTERDATRVLQMVLDGVRYLHALGITHRDLKPEltajLYYHPGTDSKI 1 1 T ; 
ELFDR I IAKGS FTERDATRVLQMVLDG VR YLHALG ITHRDLKPENLL Y YHPGTDS K 1 1 1 T: 

elfdriiakgsfterdatrvlqmvldgvrylhalgithrdlkpenllyyhpgtdskiiitI 



234 
203 
238 
238 
238 
238 



250 



260 



270 



280 



290 



300 



235 



CGTPEYIAPEVI 



294 
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10 



15 



20 



25 



30 



gi 


| 14916455 


| 204 


gi 


[17530179 


239 


gi 


14776113 


| 239 


gi 


15963448 


| 239 


gi| 2136035| 


239 


NOV2 


295 


gi 


14916455 


264 


gi 


17530179 


299 


gi 


14776113 


299 


gi 


15963448 


299 


gi 


2136035) 


299 


NOV2 


355 


gi 


14916455 


324 


gi 


17530179 


359 


gi 


14776113 


359 


gi 


15963448 


359 


gi 


2136035] 


319 



NOV2 

gi 
gi 
gi 
gi 
gi 



14916455 | 
17530179 | 
14776113 | 
15963448) 
2136035| 




310 



320 



330 



d IS^ TRLYRMIlSgKyISy* 

ddnrtrlyrqilrgkysyj 
ddnrtrlyrqilrgkysy? 
ddnrtrlyrqilrgkysyS 
ddnrtrlyrqi lrgkys y£ 



GEPWPSnSjjjLAKDFIDjjjLL 
GEPWPSMSgLAKDFIDgLL 

gepwpsSsSlakdfidSll 

GSPWPsSsgLAKDFIDgLL; 

gepwpsSsSlakdfidsl 



sSsSlakdfidISll! 




370 



380 



390 




430 



440 



450 



460 



470 



480 



415 LL 416 

384 LL - - 385 

419 QQQYNGANEDAKNIKKGPAPFYPLEDGTAGEQLHKAMKRYALVPGTIAFTDAHIEVDITY 478 

419 QQQYNG -- 424 

419 QQQYNG 424 

319 - 319 



35 



40 



490 



NOV2 


416 


gi) 14916455 


385 


gi) 17530179 


479 


gi) 1477 6113 


424 


gi) 15963448 


424 


gi j 2136035 | 


319 



500 
|....|....|. 



510 



520 

..)....) 



530 



540 



416 
385 
538 
424 
424 
319 



45 



50 



NOV2 



gi 
gi 
gi 
gi 
gi 



14916455 
17530179 
14776113 
15963448 
2136035 | 



550 560 570 580 590 600 

4 " — - 416 

385 - - 385 

539 LIJISMSISQPTWFVSKKGLQKILNVQKKLPIIQKIIIMDSKTDYQGFQSMYTFVTSHLP 598 

424 — 424 

424 " 424 

319 — - 319 



55 



60 



65 



70 



NOV2 


416 


gi 


14916455 | 


385 


gi 


17530179| 


599 


gi 


14776113) 


424 


gi 


15963448 | 


424 


gi 


2136035) 


319 



N0V2 

gi 1 14916455 | 
gi 1 17530179 | 
gi j 14776113 | 
gi j 15963448 j 
gi [2136035 | 



610 
|....|....|. 



620 



630 



640 



650 



660 



416 
385 
658 
424 
424 
319 



670 



680 



690 



700 



710 



720 



416 - 416 

385 - 385 

659 DTA1LSVVPFHHGFGMFTTLGYLICGFRVVLMYRFEEELFLRSLQDYKIQSALLVPTLFS 718 

424 - 424 

424 ~- 424 

319 - — - - - 319 



730 



740 



750 

30 



760 



770 



780 
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30 



35 



40 



NOV2 

gi 1 14916455 | 
gi j 17530179 | 
gi 1 14776113 j 
gi 1 15963448 j 
gi j 2136035 | 



NOV2 



gi 
gi 
gi 
gi 
gi 



14916455 
17530179 
14776113 
15963448 
21360351 



NOV2 



gi 
gi 
gi 
gi 
gi 



14916455 
17530179 
14776113 
15963448 
21360351 



NOV2 



gi 
gi 
gi 
gi 
gi 



14916455) 
1753 0179) 
14776113) 
15963448| 
2136035) 



NOV2 



gi 

gi 

gi 
gi 
gi 



14916455) 
17530179 j 
1477 6113) 
15963448) 
2136035) 



416 ^ 

385 - 3g5 

719 FFAKSTLIDKYDLSNLHEIASGGAPLSKEVGEAVAKRFHLPGI RQGYGLTETTSAILI TP 778 
424 424 

424 — 424 

319 

X * 319 

790 800 810 820 830 840 

779 EGDDKPGAVGKWPFFEAKVVDLDTGKTLGWQRGELCVRGPMIMSGYVNNPEATNALID 838 

424 — — 424 

424 * A 

424 

319 

^ 319 

850 860 870 880 890 900 

« M :: 1 • ■ • • 1 • • ■ - ' 41S 

385 415 

JO J 

839 KDGWLHSGDI AYWDEDEHFFI VDRLKS L I KYKGYQVAPAELES I LLQHPNI FDAGVAGLP 898 
424 

424 

424 

424 

319 

" " — 319 

910 920 930 940 950 960 

416 - - ' — ' — » — i — " — i — i — i 
ass I::::::::::::::::::::::::::::::::::: tit 

899 DDDAGELPAAVWLEHGKTMT EKE I VD YVAS Q VTTAKKLRGGVVFVDE VPKGLTGKLDAR 958 
424 . _ . 

424 424 

319 - — " — " 424 

" - 319 

970 

....|....|....|.. 

416 4X6 

385 — - — 385 

959 KIREILIKAKKGGKIAV 975 

424 424 

424 424 

319 --- 319 



45 



50 



55 



The presence of identifiable domains in NOV2, as well as all other NOVX proteins, 
was determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interpro). 
DOMAIN results for NOV2 as disclosed in Tables 2E-2G, were collected from the Conserved 
Domain Database (CDD) with Reverse Position Specific BLAST analyses. This BLAST 
analysis software samples domains found in the Smart and Pfam collections. For Table 2K 
and all successive DOMAIN sequence alignments, fully conserved single residues are 
indicated by black shading or by the sign (|) and "strong" semi-conserved residues are 
indicated by grey shading or by the sign (+). The "strong" group of conserved amino acid 
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residues may be any one of the following groups of amino acids: STA, NEQK, NHQK, 
NDEQ, QHRK, MILV, MILF, HY, FYW. 

Tables 2E-G lists the domain description from DOMAIN analysis results against 
NOV2. This indicates that the NOV2 sequence has properties similar to those of other 
5 proteins known to contain this domain. 



Table 2E Domain Analysis of NOV2 

gnl | Smart | smart 00 220, S_TKc, Serine/Threonine protein kinases, 
catalytic domain; Phosphotransferases. Serine or threonine -specif ic 
kinase subfamily.. (SEQ ID NO: 799) 
CD-Length « 256 residues, 100.0% aligned 

Score ° 261 bits (668), Expect = 4e-71 





NOV 2: 


94 


10 


Sbjct: 


1 




NOV 2: 


152 


15 


Sbjct: 


61 




NOV 2: 


212 




Sbjct: 


121 


20 


NOV 2: 


272 




Sbjct: 


177 


25 


NOV 2: 


332 




Sb j Ct : 


237 



I++ ++I l+l +1 III IIII++ + ++ II + 1+ +I+++ I I 

YELLE VLGKGAFGKVYLARDKKTGKLVAI KVIKKEKLKKKXRERILREI KILKKLDHPNI 60 

VQLME IFETEDQVYMVMEIATGGELFT)RLIAQGSFTERDAVRILQMVADG IRYLHALQIT 211 
| + | ++ || +|++|+||l ||+||| | +| + | +| + + + |||+ | 
VKLYDVFEDDDKLYLVMEYCEGGDLFDIiLKKRGRLSEDEARFYARQILSALEYLHSQGI I 120 

HRNLKPENLLYYHPGEESKILITDFGLAYSGKKSGDWTMKTLCGTPEYIAPEVLLRKPYT 271 

ii+imii+i i + + inn ii ♦ i iiiii+iiiiii i i 

HRDLKPENILLDSDGH VKLADFGLA- KQLDS GGTLLTT FVGT P EYMAPEVLLGKG YG 176 

S AVDMWALGVITYALLSGFLP FDDESQTRLYRKILKGKYNYTGEPWPS I SHLAKDF IDKL 331 

111*1+1111 I IM 11*1 I * I II III I II 

KAVDI WSLG VILYELLTGKPP FPGDDQIiLALFKKIGKPPPP FP PPEWKI S PEAKDLI KKL 236 
L I LEAGHRMS AGQALDHPWV 351 

l+:+ I++I +11+11+ 



Table 2F Domain Analysis of NOV2 

gnl|Pfam|pfam00069, pkinase, Protein kinase domain (SEQ ID NO: 800) 
CD-Length = 256 residues, 100.0% aligned 
Score - 230 bits (586) , Expect = le-61 



30 


NOV 2: 


94 




Sbjct: 


1 




NOV 2: 


153 


35 


Sbjct: 


61 




NOV 2: 


212 


40 


Sbjct: 


121 




NOV 2: 


272 




Sbjct: 


178 


45 


NOV 2: 


332 




Sbjct: 


237 



YD1KALIGTGSFSRWRVEQKTTKKPFAIKVMETREREGREACV- SEL S VLRRVSHRYI V 152 
|++ + | + | + | +| + + | | + IM+++ I ++ 1+ +IM + II || 

YELGEKLGSGAFGKVYKGKHKDTGEIVAIKILKKRSLSEKKKRFLREIQILRRLSHPNIV 60 

QLMEI FETEDQVYMVMELATGGELFDRLIAQGS - FTERDAVRILQMVADGIRYLHALQIT 211 
+ 1 + +11 +1 +I+III ll+lll I I +I++I +| + |+ |||+ | 
RLLGVFEEDDHLYL VME YMEGGDL FDYLRRNGLLLSEKEAKKI ALQILRGLEYLHS RG I V 12 0 

HRNLKPENLLYYHPGEESKILITDFGLAYSGKKSGDWTMKTLCGTPEYIAPEVLLRKPYT 271 

imiih+i i ♦ i iiiii +i * i iiiimim ♦ i* 

HRDLKPENILLDENGT VKIADFGLARKLESSSYEKLTTFVGTPEYMAPEVLEGRGYS 177 

SAVDMWALGVITYALLSGFLPFDDESQTRLYRKILKGKYNYTGEPWPS ISHIAKDFIDKL 331 
I -11 + 1 + 111! I ll + l III +| + 1+ I II I I 

SKVDVWSliGVILYELLTGKLPFPGlDPLEELFRIKERPR- LRLPLPPNCSEELKDLIKKC 236 

L I LEAGHRM S AGQ ALDHP WV 351 
I + I +1 + l+lll 



32 



WO 02/068649 



PCT/US02/02785 



10 



Table 2G Domain Analysis of NOV2 

gnl|Sraart|smart002l9, TyrKc, Tyrosine kinase, catalytic domain; 
Phosphotransferases. Tyrosine -specif ic kinase subfamily. (SEQ ID 
NO: 801) 

CD-Length = 258 residues, 83.7% aligned 
Score = 117 bits (292), Expect = 2e-27 



NOV 2: 100 IGTGS FSRWR - - - VEQKTTKKPFAI KVM - ETREREGREACVSELSVLRRVSHRYI VQLM 155 

ov . +11 + 11+ + + l + l + I +1+1 ++I++ | || + | + 

Sb]Ct: 7 LGEGAFGEVYKGTLKGKGGVEVEVAVKTLKEI)ASEQQIEEFIiREARlJ<RKI^HPNIVKLL 66 

NOV 2: 156 EIFETEDQVY^^/MEIATGGELFDI^IAQG--SFTERDAVRILQMVAEX5IRYiHALQITHR 213 

+ 1+ + +11 I ll + l I I + . | + +|. |+ || + || 

Sbjct: 67 GVCTEEEPLMIVMEYMEGGDLLDYLRKNRPKELSLSDLLSFALQIARGMEYLESKNFVHR 126 

NOV 2: 214 NLKPENIiLyYHPGEESKILITOFGLAYSGKKSGDWTMKTLCGTP-EYIAPEVLIjRKPYTS 272 

eK , , „ +1 M II + I I I I I I +| I ++IH I +|| 

Sb 3 ct: 127 DLAARNCLV OBNKTVKIADFGLARDLYDDDYYRKKKSPRLPIRWMAPESLKDGKFTS 183 



NOV 2: 273 AVDMWALGVITYALLS -GFLPFDDESQTRLYRKILKGKY 310 
15 . 1+1+ 11+ + + + I |+ | - + || 

Sbjct: 184 KSDVWSFGVLLWEIFTLGESPYPGMSNEEVLEYLKKGYR 222 

Protein phosphorylation is a fundamental process for the regulation of cellular 
functions. The coordinated action of both protein kinases and phosphatases controls the levels 

20 of phosphorylation and, hence, the activity of specific target proteins. One of the predominant 
roles of protein phosphorylation is in signal transduction, where extracellular signals are 
amplified and propagated by a cascade of protein phosphorylation and dephosphorylation 
events. Eukaryotic protein kinases are enzymes that belong to a very extensive family of 
proteins which share a conserved catalytic core common with both serine/threonine and 

25 tyrosine protein kinases. There are a number of conserved regions in the catalytic domain of 
protein kinases. In the N-terminal extremity of the catalytic domain there is a glycine-rich 
stretch of residues in the vicinity of a lysine residue, which has been shown to be involved in 
ATP binding. In the central part of the catalytic domain there is a conserved aspartic acid 
residue which is important for the catalytic activity of the enzyme. 

30 The disclosed NOV2 nucleic acid of the invention encoding a Protein Serine Kinase- 

like protein includes the nucleic acid whose sequence is provided in Tables 2A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Tables 2A while still encoding a protein 
that maintains its Protein Serine Kinase -like activities and physiological functions, or a 

35 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 

33 



WO 02/068649 



PCT/US02/02785 



chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
5 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 32 percent of the bases may be so changed. 

The disclosed NOV2 protein of the invention includes the Protein Serine Kinase -like 
protein whose sequence is provided in Tables 2B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

1 0 in Table 2B while still encoding a protein that maintains its Protein Serine Kinase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 35 percent of the residues may be so changed. 

The NOV2 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Diabetes, Von Hippel-Lindau (VHL) syndrome , 

1 5 Pancreatitis, Obesity, Lymphedema , Allergies, Alzheimer's disease, Stroke, Tuberous 

sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, 
Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, Diabetes, Autoimmune 
disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney 

20 disease, Systemic lupus erythematosus, Renal tubular acidosis, IgA nephropathy, and/or other 
pathologies and disorders. 

NOV2 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 

25 art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV2 protein has multiple hydrophilic regions, each of which 
can be used as an immunogen. These novel proteins can be used in assay systems for 
functional analysis of various human disorders, which are useful in understanding of 
pathology of the disease and development of new drug targets for various disorders. 

30 

NOV3 

NOV3 includes three novel human 1 Claudin-Iike proteins disclosed below. The 
disclosed sequences have been named NOV3a, NOV3b, and NOV3c. 



34 



WO 02/068649 



PCT/US02/02785 



NOV3a 

A disclosed NOV3a nucleic acid of 695 nucleotides (designated CuraGen Acc. No. 
CG56594-01) encoding a novel Claudin-19-like protein is shown in Table 3A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 53-55 and 
5 ending with a TGA codon at nucleotides 662-664. A putative untranslated region downstream 
from the termination codon is underlined in Table 3A, and the start and stop codons are in 
bold letters. 



Table 3A. NOV3a Nucleotide Sequence (SEQ ID NO:ll) 

GCACCCTGG CCCAQCTCTGAGTCCTGGGACCCTCGGTCCTCTCTCCTGGGCCA TGGCCAACTCAGGC'rTr 
CAGCTCCTGGGCTACTTCTTGGCCCTG^ 

GGAAGCAGTCTTCCTACGCAGGCGACGCCATCATC^CTGCCGTGGGCCTCTATGAAGGGCTCTGGATGTC 

CTG CGCCTCCCAGAGCACTGGGCAAGTGCAGTGCAAGCTCTACGACT CGCTGCTCGCCCTGGACGGTAGG 

CCCCAGGCCGCGCGGGCCCTGATGGTGGTGGCCGTGCTCCTGGGCTTCGTGGCCATGGTCCTCAGCGTAG 

TTGGCATGAAGTGTACGCGGGTGGGAGA(^GCAACCCCATTGCCAAGGGCCGTGTTGCCATCGCCGGGGG 

AGCCCTCTTCATCCTGGCAGGCCTCTGCACTTTGACTGCTGTCTCGTGGTATGCCACCCrr 

GAGTTCTTCAACCCAGAATTTGGCCCAGCCCTGTTCGTGGGCTGGGCCTCAGCTGGCCTGGCCGTGCTGG 

GCGGCTCCTTCCTCTGCTGCACATGCCCGGAGCCAGAGAGACCCAACAGCAGCCCACAGCCCTATCGGCC 

TGGACCCTCTGCTGCTGCCCGAGAGTACGTCTGAGCTCCGCCTGCCCTGGCCAGCCCCCCACCCA 



10 The nucleic acid sequence, localized to chromosome 1, has 402 of 482 bases (83%) 

identical to a gb:GENBANK-ID:AF249889|acc:AF249889.1 mRNA from Mus musculus 
(claudin-19 mRNA, partial cds) (E = l.le" 67 ). 

A NOV3a polypeptide (SEQ ID NO: 12) encoded by SEQ ID NO: 1 1 is 203 amino acid 
residues and is presented using the one letter code in Table 3B. Signal P, Psort and/or 

15 Hydropathy results predict that NOV3a has no signal peptide and is likely to be localized at 
the endoplasmic reticulum (membrane) with a certainty of 0.6850. Alternatively, NOV3a may 
also localize to the plasma membrane with a certainty of 0.6400, the Golgi body with a , 
certainty of 0.4600, or the endoplasmic reticulum (lumen) with a certainty of 0.1000. The most 
likely cleavage site for NOV3a is between positions 23 and 24: IIA-ST. 



Table 3B. NOV3a protein sequence (SEQ ID NO:12) 

MANSGLQLLGY FLALGGWVGI I ASTALPQWKQSS YAGDAI ITAVGLYEGLWMS CASQSTGQVQCKL YDS LLALD 
GRPQAARAIJVTWAVLLGFVAMVLSWGM^ 

FNPEFGPALFVGWASAGIiAVLGGSFLCCTCPEPERPNSSPQPYRPGPSAAAREYV 

The full amino acid sequence of the protein of the invention was found to have 174 of 
193 amino acid residues (90%) identical to, and 178 of 193 amino acid residues (92%) similar 
to, the 193 amino acid residue ptnr:TREMBLNEW-ACC:AAF98323 protein from Mus 
25 musculus (Mouse) (CLAUDIN-1 9) (E = 5.7e~* 9 ). 
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NOV3a is predicted to be expressed in at least the Spinal cord. 
NOV3b 

A disclosed NOV3b nucleic acid of 695 nucleotides (also referred to as CG56594-01) 
encoding a novel Claudin-19-like protein is shown in Table 3C. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 53-55 and ending with a 
TGA termination codon at nucleotides 662-664. The start and stop codons are in bold letters 
in Table 3C, and the 5' and 3' untranslated regions are underlined. 



Table 3C. NOV3b nucleotide sequence (SEQ ID NO:13). 



GCACCCTGGCCCAGCTCTGAGTCCTGGGACCCTCGGTCCTCTCTCCTGGGCCA TGGCCAACTCAGGCCTCCA 
GCTCCTGGGCTACTTCTTGGCCCTGGGTGGCTGGGTGGGCATCATTGCTAGCACAGCCCTGCCACAGTGGAA 
GCAGTCTTCCTACGCAGGCGACGCCATCATCACTGCCGTGGGCCTCT 

CTCCCAGAGCACTGGGCAAGTGC^GTGCAAGCTCrACGACTCGCTGCTCGCCCTGGACGGTAGGCCCCAGGC 
CGCGCGGGCCCTGATGGTGGTGGCCGTGCTCCTGGGCTTCGTGGCCATGGTCCTCAGCGTAGTTGGCATGAA 
GTGTACGCGGGTGGGAGACAGCAACCCCATTGCCAAGGGCCGTGTTGCCATCGCCGGGGGAGCCCTCTTCAT 
CCTGGCAGGCCTCTGCACTTTGACTGCTGTCTCGTGGTATGC 

AGAATTTGGCCCAGCCCTGTTCGTGGGCTGGGCCTCAGCTGGCCTGGCCGTGCTGGGCGGCTCCTTCCTCTG 



CCGAGAGTACGTCTGAGCTCCGCCTGCCCTGGCCAGCCCCCCACCCA 



,10 In a search of public sequence databases, the NOV3b nucleic acid sequence, located on 

chromsome 1 has 402 of 482 bases (83%) identical to a gbrGENBANK- 
ID:AF249889|acc:AF249889.1 mRNA from Mus musculus (claudin-19 mRNA, partial cds) (E 
^l.le" 67 ). 

The disclosed NO V3b polypeptide (SEQ ID NO: 1 4) encoded by SEQ ID NO: 1 3 has 
15 203 amino acid residues and is presented in Table 3D using the one-letter amino acid code. 
, Signal P, Psort and/or Hydropathy results predict that NOV3b has a signal peptide and is 
likely to be localized the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
Alternatively, NOV3b may also localize to the plasma membrane with acertainty of 0.6400, 
the Golgi body with a certainty of 0.4600 or in the endoplasmic reticulum (lumen) with a 
20 certainty of 0.1000. The most likely cleavage site for NOV3b is between positions 23 and 24: 
IIA-ST. 



Table 3D. Encoded NOV3b protein sequence (SEQ ID NO:14). 



MANSGLQLLGYFIiALGGWVGI IASTALPQWKQSS YAGDAI I TAVGLYEGLWMSCASQSTGQVQCKLYDSLLA 
IJXSRPQAARALMVVAVLLGFVAMVL^ 

TQEFFNPEFGPALFVGWASAGLAVLGGSFLCCTCPEPERPNSSPQPYRPGPSAAAREYV 



25 



A search of sequence databases reveals that the NOV3b amino acid sequence has 174 
of 193 amino acid residues (90%) identical to, and 178 of 193 amino acid residues (92%) 
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similar to, the 193 amino acid residue ptnnTREMBLNE W-ACC : AAF98323 protein from Mus 
musculus (Mouse) (Claudin-19) (E = 5.7e* 89 ). 

NOV3b is predicted to be expressed in at least the Spinal cord. 

NOV3c 

5 A disclosed NOV3c nucleic acid of 690 nucleotides (also referred to as CG57576-01) 

encoding a novel Claudin 19-like protein is shown in Table 3E. An open reading frame was 
identified beginning with an ATG codon at nucleotides 51-53 and ending with a TGA codon at 
nucleotides 684-686. The start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 31. Because the start codon is not a traditional 
10 initiation codon, NOV3c could be a partial reading frame. NOV3c could extend further in the 
5' direction. 



Table 3E. NOV3c nucleotide sequence (SEQ ID NO:15). 

ACCCTGGCCCAGCT CTGAGTCCTGGGACCCTCGGT^ 

TCCTGGGCTACTTCTTGGCCCTGGGTGGCTGGGTGGGCATCATTGCTAGCACAGCCCTGCCACAGTGGAAGC 

AGTCTTCCTACGCAGGCGACGCCATCATCACTGCCGTGGGCCTCTATGAAGGGCTCTGGATGTCCTGCGCCT 
CCCAGAGCACTGGGCAAGTGCAGTGCAAGCTCTACGACTC^ 

CGCGGGCCCTGATGGTGGTGGCCGTGCTCCTGGGCTTCGTGGCCATGGTCCTCAGCGTAGTTGGCATGAAGT 
GCACACCTGTCAATGCCAGGTATGAATTT 

TGCTGGGCGGCTCCTTCCTCTCCTGCAC^TGCCCGGAGCCAGAGAGACCCAACAGCAGCCCACAGCCCTATC 
GGCCTGGACCCTCTGCTGCTGCCCGAGAGTACGTCTGAGCTC 

In a search of public sequence databases, the NOV3c nucleic acid sequence, located on 
15 chromsome 1 has 445 of 671 bases (66%) identical to a gb:GENBANK- 

ID:HSA011497|acc:AJ011497.1 mRNA from Homo sapiens (mRNA for Claudin-7) (E - 5 3e~ 
46 ). 

The disclosed NOV3c polypeptide (SEQ ID NO: 1 6) encoded by SEQ ID NO: 1 5 has 
21 1 amino acid residues and is presented in Table 3F using the one-letter amino acid code. 

20 Signal P, Psort and/or Hydropathy results predict that NOV3c has no signal peptide and is 
likely to be localized the the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
Alternatively, NOV3c may also localize to the plasma membrane with acertainty of 0.6400, 
the Golgi body with a certainty of 0.4600 or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1 000. The most likely cleavage site for a NOV3c peptide is between amino acids 

25 23 and24,at:IIA-ST. 
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Table 3F. Encoded NOV3c protein sequence (SEQ ID NO:16). 

MANSGLQLLGYFLAIjGGWVGI I ASTALPQWKQS SYAGDAI ITAVGLYEGLWMS CASQSTGQVQCXliYDSLLA 
LDGHI QS ARALMWAVLLGFVAMVLS VVGMKCTRVGDSNP I AKGRVAI AGGALF I LAGLCTLTAVS W YATL V 
TQEFFNPSTPVNARYEFGPALFVGWASAGLAVLGGSFLCCTCPEPERPNSSPQPYRPGPSAAAREYV 



A search of sequence databases reveals that the NOV3c amino acid sequence has 121 
of 211 amino acid residues (57%) identical to, and 159 of 21 1 amino acid residues (75%) 
similar to, the 21 1 amino acid residue ptnr:SWISSNEW-ACC:095832 protein from Homo 
5 sapiens (Human) (Claudin- 1 (Senescence-Associated Epithelial Membrane Protein)) (E = 

9.6C 66 ). 

NOV3c is predicted to be expressed in at least Spinal cord. 
NOV3a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 3G. 

10 



Table 3G. BLAST results for NOV3a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 9789476 |gb (AAF98 
323.1 | (AF249889) 


claudin- 19 [Mus 
musculus] 


193 


174/193 
(90%) 


178/193 
(92%) 


le-84 


gi (17489134 |ref |XP_ 
060892. l| 
(XM_060892) 


similar to 
claudin-19 (H. 
sapiens) [Homo 
sapiens] 


309 


126/137 
(91%) 


127/137 
(91%) 


3e-59 | 


gi | 12654455 | gb | AAH0 

1055.l|AAH01055 

(BC001055) 


claudin 7 [Homo 
sapiens] 


211 


112/211 
(53%) 


149/211 
(70%) 


2e-55 


gi (10835008 |ref |NP_ 
001298. l| 
(NM_001307) 


claudin 7; 

Clostridium 

perfringens 

enterotoxin 

receptor-like 2; 

claudin 9 [Homo 

sapiens] 


211 


111/211 
(52%) 


148/211 
(69%) 


7e-55 


gi (7710002 |ref|NP 0 
57883. 1| 
(NM 016674) 


claudin 1 [Mus 
musculus] 


211 


112/212 
(52%) 


149/212 
(69%) 


8e-55 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 3H. 



Table 3H ClustalW Analysis of NOV3 

1) NOV3a (SEQ ID NO: 12) 

2) NOV3b (SEQ ID NO: 14) 

3) NOV3C (SEQ ID NO: 16) 

4) gi|9789476|gb|AAF98323.l| (AF249889) claudin-19 [Mus musculus] (SEQ ID NO: 333) 

5) gijl7489l34lreflXP_060892.il (XM_060892) similar to claudin-19 (H. sapiens) 
[Homo sapiens] (SEQ ID NO: 3 34) 

6) gi|l2654455|gb|AAH01055.l|AAH01055 (BC001055) claudin 7 [Homo sapiens] (SEQ ID 
NO:335) 

7) gi 1 10835008 | ref |NP_001298. l| (NM_001307) claudin 7; Clostridium perfringens 
enterotoxin receptor- like 2; claudin 9 [Homo sapiens] (SEQ ID NO: 336) 
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6) gi|7710002|ref |NP_057883.l| (NM_01G674) claudini [Mus musculus] (SEQ ID NO:327) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



NOV3A 


1 


NOV3b 


1 


NOV3C 


1 


gi 


9789475 | 


1 


gi 


17489134 


1 


gi 


12654455 


1 


gi 


10835008 


1 


gi 


7710002) 


1 



NOV3A 


61 


NOV3b 


61 


NOV3c 


61 


0i 


9789476 | 


.51 


gi 


17489134 | 


61 


gi 


12654455) 


61 


gi 


10835008) 


61 


gi 


7710002) 


61 



NOV3A 


74 


MOV3b 


74 


NOV3c 


74 


gi 


9789476) 


64 


gi 


17489134) 


121 


gi 


12654455) 


74 


gi 


10835008 j 


74 


gi 


7710002 ( 


74 



NOV3A 


83 


NOV3b 


83 


NOV3C 


83 


gi 


9789476) 


73 


gi 


17489134 


181 


gi 


12654455 


83 


gi 


10835008 


83 


gi 


7710002 | 


83 



NOV3A 
NOV3b 
NOV3C 



gi 
gi 
gi 
gi 
gi 



9789476 | 

17489134) 

12654455| 

10835008) 

77100021 



NOV3A 
NOV3b 
NOV3c 



gi 
gi 
gi 
gi 
gi 



9789476) 
17489134 
12654455 
10835008 
77100021 





80 



90 



100 



110 



120 
••I 



74 
74 
74 
64 

PPTKPEGTLELPSAAWEDGPSPRGSPQARKQQQSESYYRKSLQG 120 

74 
74 
74 



130 



140 
..|.. 



150 



•|....|. 



160 
..|.. 



170 



180 



rvengvdsegdshphtlsscpgchsslsctcqkaptqppspparlsvippt: 




190 



200 



210 



220 



230 



240 




143 
143 
143 
133 
241 

143 QjjfVTjp 
143 
143 



310 
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Table 31 lists the domain description from DOMAIN analysis results against NOV3. 
This indicates that the NOV3 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 31 Domain Analysis of NO V3 

gnl | Pf am | pf 311100822, PMP22_Claudin, PMP-22/EMP/MP20 /Claudin family 
(SEQ ID NO: 802) 

CD-Length - 162 residues, 99.4% aligned 
Score a 80.5 bits (197), Expect = 9e-17 



NOV 3: 5 GLQLLGYFLALGGWVG - 1 IASTALPQWKQSS YAGDAI ITAVGLYEGLWMSCASQS- TGQV €2 

+ I II + +1 III I I I I IN +| +|| IM + 

Sb^ct: 2 LVLLLGFI VSHI AWVI LLFVATITDQWKVSRYVGAAA SAGLWRNCTTQSCTGQI 55 

10 00 ° 

NOV 3: 63 QCKLYDSLIJU^DGRPQAAEAIAfi^ 122 

11+ 11+ II +III+++++U +++++ +11 +1 
Sbjct: 56 SCKV LELNDALQAVQALMILSI ILGI ISLIVFFFQLFTMRKGGRFKLA 103 

15 NOV 3: 123 GALF JLAGLCTLTAVS WYATLVTQE FFNP EFGPALFVGWAS AGLAVLGGS FL 174 

I +I+++III I I I + + +1 II II + +11+ II +11 

Sb}Ct: 104 GI IFLVSGLCVLVGAS I YTSRIATDFGNPFTPNRKYSFGY SFILGWVAFALAFIGGVLY 162 

The claudins are a family of integral membrane proteins that are major components of 

20 tight junction (TJ) strands. When claudins are introduced into cells that lack tight junctions, 
networks of strands and grooves form at cell-cell contact sites that closely resemble native 
tight junctions. There are at least 17 members of this family in mammals. Claudin family 
members share -38% amino acid identity, and are predicted to have four transmembrane (TM) 
domains, which is reminiscent of occludin, although they share no sequence similarity with it. 

25 Multiple sequence alignment reveals their sequences to be fairly well conserved in the first 
and fourth putative TM domains, and in the first and second extracellular loops, but they 
diverge in the second and third TM domains. Although the sequences of their C-terminal 
cytoplasmic domains vary, the known family members share a common motif of -Y-V. This 
has been postulated as a possible binding motif for PDZ domains of other tight junction- 

30 associated peripheral membrane proteins, such as ZO-1 . 

The disclosed NOV3 nucleic acid of the invention encoding a Claudin- 19 -like protein 
includes the nucleic acid whose sequence is provided in Table 3 A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 3A while still encoding a protein that maintains 

35 its Claudin- 19 -like activities and physiological functions, or a fragment of such a nucleic acid. 
The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
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just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
5 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 17 percent of the 
bases may be so changed. 

The disclosed NOV3 protein of the invention includes the Claudin-19 -like protein 

1 0 whose sequence is provided in Table 3B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
3B while still encoding a protein that maintains its Claudin-19 -like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 48 percent of the residues may be so changed. 

1 5 The protein similarity information, expression pattern, and map location for the 

Claudin-19-like protein and nucleic acid (NOV3) disclosed herein suggest that this NOV3 
protein may have important structural and/or physiological functions characteristic of the 
Ciaudin-19 family. Therefore, the NOV3 nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications. These include serving as a specific 

20 or selective nucleic acid or protein diagnostic and/or prognostic marker, wherein the presence 
or amount of the nucleic acid or the protein are to be assessed, as well as potential therapeutic 
applications such as the following: (i) a protein therapeutic, (ii) a small molecule drug target, 
(iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) a 
nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a composition 

15 promoting tissue regeneration in vitro and in vivo. 

The NOV3 nucleic acids and proteins of the invention are useful in potential diagnostic 
and therapeutic applications implicated in various diseases and disorders described below. For 
example, the compositions of the present invention will have efficacy for treatment of patients 
suffering from Von Hippel-Lindau (VHL) syndrome, Cirrhosis, Transplantation, Hemophilia, 

10 hypercoagulation, Idiopathic thrombocytopenic purpura, autoimmume disease, allergies, 

immunodeficiencies, transplantation, Graft vesus host, Alzheimer's disease, Stroke, Tuberous 
sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, 
Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, Systemic lupus 
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erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, allergy, and 
Cancer, and/or other pathologies. The NOV3 nucleic acids, or fragments thereof, may further 
be useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. 
5 NOV3 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. These novel proteins can be used in assay systems for functional analysis of 
10 various human disorders, which will help in understanding of pathology of the disease and 
development of new drug targets for various disorders. 

NOV4 

NOV4 includes three novel human 1 Claudin-like proteins disclosed below. The 
1 5 disclosed sequences have been named NOV4a, NOV4b, and NOV4c. 
NOV4a 

A disclosed NOV4a nucleic acid of 694 nucleotides (also referred to as CG56589-01) 
encoding a novel Claudin-6-like protein is shown in Table 4A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 1 1-13 and ending with a 
20 TAA codon at nucleotides 671-673. Putative untranslated regions upstream from the initiation 
codon and downstream from the termination codon are underlined in Table 4A, and the start 
and stop codons are in bold letters. 



Table 4A. NOV4a Nucleotide Sequence (SEQ ID NO:17) 

ACCTCTC^ 

TCCTGGGATGGATTTTATCCATTATTACAACTTATTTGCCACACTGGAAGAACCTCAACCTG 
GACTTAAATGAAATGGAAAACTGGACCATGGGACTCTGGCAAACCTC 

AGTGGGGATGCAATGCAAGGACTTTGACTCCTTCCTGGCTTTGCCTGCTGAACTCAGGGTCT 
CCAGGATCTTAATGTTTCTGTCAAATGGGCTGGGATTTCTGGGCCTGCTGGTCTCTGGGTTT 
GGCCTGGACTGTTTGAGAATTGGAGAGAGTCAGAGAGATCTCAAGAGGCGACTGCTCATTCT 
GGGAGGAATTCTGTCCTGGGCCTCGGGAATCACAGCCCTGGTTCCCGTCTCTTGGGTTGCCC 
ACAAGACGGTTCAGGAGTTCTGGGATGAGAACGTCCCAGACTTTGTCCCCAGGTGGGAGTTT 
GGGGAGGCCCTGTTTCTGGGCTGGTTTGCTGGACTTTCTCTTCTGCTAGGAGGGTGTCTGCT 
CAACTGCGCAGCCTGCTCCAGCCACGCTCCCCTAGCrTTGGGC 

TGCAAACTCAGTGTCCCTACCTGGAAGATGGGACAGCAGATCCTCAAGTGTA AGACTCCGAC 
AAGGCCAGAGAT 
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The NOV4a nucleic acid was identified on chromosome 4 and has 330 of 556 bases 
(59%) identical to a gb:GENBANK-ID:AFl 34 160|acc:AF 134 160.1 mRNA from Homo 
sapiens (claudin-l (CLDN1) mRNA, complete cds) (E = 2.9e" 9 ). 

A disclosed NOV4a polypeptide (SEQ ID NO: 18) encoded by SEQ ID NO: 17 is 220 
5 amino acid residues and is presented using the one-letter code in Table 4B. Signal P, Psort 
and/or Hydropathy results predict that NOV4a has no signal peptide and is likely to be 
localized in the plasma membrane with a certainty of 0.6400. Alternatively, NOV4a may also 
localize to the Golgi body with acertainty of 0.4600, the endoplasmic reticulum (membrane) 
with a certainty of 0.3700, or the enoplasmic reticulum (lumen) with a certainty of 0.1000. The 
1 0 most likely cleavage site for NOV4a is between positions 24 and 25: ILS-II. 



Table 4B. Encoded NOV4a protein sequence (SEQ ID NO:18) 



M^IFRTTWQSVGLLLSIAGWILSIITTYLPHWKOTJ^ 

LPAELRVSRILMFliSNGLGFLGIiLVSGFGIJJCLRIGESQRDLKEIQiLILGGILSWASGITALVPVSWVAHKT 
VQEFWDI^PDFVPRWEFGEALFLGV^AGLSLLLGGCLLNCAACSSHAPIJ^GHYAVAQMQTQCPYLEDGTA 



The disclosed NOV4a amino acid sequence has 84 of 204 amino acid residues (41%) 
identical to, and 1 19 of 204 amino acid residues (58%) similar to, the 219 amino acid residue 
15 ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus (Mouse) (Claudin-6) (E = 1 le 
32 )- 

NOV4a is predicted to be expressed in at least Brain. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
20 sources. 

In addition, the sequence is predicted to be expressed in Adrenal Gland/Suprarenal 
gland, Brain, Bronchus, Brown adipose, Cervix, Colon, Coronary Artery, Epidermis, Gall 
Bladder, Heart, Hippocampus, Islets of Langerhans, Kidney, Liver, Lung, Lung Pleura, 
Mammary gland/Breast, Oesophagus, Ovary, Oviduct/Uterine Tube/Fallopian tube, Parotid 
25 Salivary glands, Peripheral Blood, Placenta, Prostate, Proximal Convoluted Tubule, 
Respiratory Bronchiole, Skin, Stomach, Substantia Nigra, Thymus, Thyroid, Trachea, 
Umbilical Vein, Uterus, and Vulva. ; 
NOV4b 

A disclosed NOV4b nucleic acid of 694 nucleotides (also referred to as CG56589-01) 
30 encoding a novel Claudin-6-like protein is shown in Table 4C. An open reading frame was 

identified beginning with an ATG codon at nucleotides 1 1-13 and ending with a TAA codon at 
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nucleotides 671-67 j. ±uc start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 4C. Because the start codon is not a traditional 
initiation codon, NOV4b could be a partial reading frame. NOV4b could extend further in the 
5' direction. 

5 

Table 4C. NOV4b nucleotide sequence (SEQ ID NO:19). 

ACCTGTCGCAA TGGCTTTAATCT^ 

GATTTTATCCATTATTACAACT^ 

CTGGACCATGGGACTCTGGCAAACCT 

CTTCCTGGCTTTGCCTGCTGAACTCAGGGT 

GGGCCTGCTGGTCTCTGGGTTTGGCCTGGACTGTTTGAGAAOT 

ACTGCTCATTCTGGGAGGAATTCTGTCCTGGGCCTC 

CCACAAGACGGTTCAGGAGTTCTGGGATGAGAACGTCCCAGACTTTGTCCCCAGGTGG 
CCTGTTTCTGGGCTGGTTTGCTGGACTTTCTCTTCTGCT 
CAGCCACGCTCCCCTAGCTTTGGGCCACTATGCACT 
TGGGACAGCAGATCCTCAAGTGTA AGACTCCGACAAGGCCAGAGAT 

In a search of public sequence databases, the NOV4b nucleic acid sequence, located on 
chromsome 4 has 330 of 556 bases (59%) identical to a gb:GENBANK- 
ID:AF134160|acc:AF134160.1 mRNA from Homo sapiens (claudin-1 (CLDNl)mRNA, 

10 complete cds) (E = 2.9e' 09 ). 

The disclosed NOV4b polypeptide (SEQ ID NO:20) encoded by SEQ ID NO:19 has 
220 amino acid residues and is presented in Table 4D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV4b has no signal peptide and is 
likely to be localized the the plasma membrane with a certainty of 0.6400. Alternatively, 

1 5 NOV4b may also localize to the Golgi body with acertainty of 0.4600, the endoplasmic 

reticulum (membrane) with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with 
a certainty of 0.1000. The most likely cleavage site for a NOV4b peptide is between amino 
acids 24 and 25, at: ELS-H 



Table 4D. Encoded NOV4b protein sequence (SEQ ID NO:20). 

MALIFRTAMQSVGLLLSLLGWILSII . 
LPAELRVSRI LMFLSNGLGFLGLLVSGFGLDCn^ TALVPVSWVAHKT 
VQEFWDENVPDFVPRVHSFGEALFLGWFAGLSL^ 
DPQV 

20 ^ ' —— ~ - - ~- ~ ~ ~ ~" 

A search of sequence databases reveals that the NOV4b amino acid sequence has 84 of 
204 amino acid residues (41%) identical to, and 119 of 204 amino acid residues (58%) similar 
to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein fromM^ musculus 
(Mouse) (Claudin-6) (E = Lie 32 ). 
25 NOV4b is predicted to be expressed in at least Brain. 
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In addition, ^v.-n-iS predicted to be expressed in Adrenal Giana/suprarenal gland, 
Brain, Bronchus, Brown adipose, Cervix, Colon, Coronary Artery, Epidermis, Gall Bladder, 
Heart, Hippocampus, Islets of Langerhans, Kidney, Liver, Lung, Lung Pleura, Mammary 
gland/Breast, Oesophagus, Ovary, Oviduct/Uterine Tube/Fallopian tube, Parotid Salivary 
glands, Peripheral Blood, Placenta, Prostate, Proximal Convoluted Tubule, Respiratory 
Bronchiole, Skin, Stomach, Substantia Nigra, Thymus, Thyroid, Trachea, Umbilical Vein, 
Uterus, and Vulva. 

NOV4c 

A disclosed NOV4c nucleic acid of 694 nucleotides (also referred to as CG56589-02) 
encoding a novel Claudin 6-like protein is shown in Table 4E. An open reading frame was 
identified beginning with an ATG codon at nucleotides 11-13 and ending with a TAA codon at 
nucleotides 671-673. The start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 4E. 



Table 4E. NOV4c nucleotide sequence (SEQ ID NO:21). 

^CCTGTCGCAATGGClTTAATCTTTAAAAC&GCAATGra 1 
GATTTTATCCATTATTACAACTTATTTGCCACACTGGAAGAACCTC^ 

CTGGACCATGGGACTCTGGCAAACCTGTGTCATCCAAGAGGAAGTGGGGATGCAATGCAAGGACTTTGACTC 
CTTCCTGGCTTTGCCTGCTGAACT<^GGGTCTCCAGGATCTTA 

GGGCCTGCTGGTCTCTGGGTTTGGCCTGGACTGTTTGAGAATTGGAGAGAGTCAGAGAGATCTCAAGAGGCG 
CCACAAGACGGTTCAGGAGTTCTGGGATGAGAACGTCCCAGAOT 

CCTGTTTCTGGGCTGGCTTGCraaCTTTCTCTTCTGCTAGGAGGGTGTCTGCTCAACTGCXSCAGCCTGCTC 

CAGCCACGCTCCCCTAGCTTTGGGCCACTATGCAGTGGCGCAAATC 

TGGGACAGCAGATCCTCAAGTGTAAGACTCCGACAAGGCCAGAGAT 

In a search of public sequence databases, the NO V4c nucleic acid sequence, located on 
chromsome 4 has 33 1 of 556 bases (59%) identical to a gbrGENB ANK- 
ID:AF134160|accAF134160.1 mKNA from Homo sapiens (claudin- 1 (CLDN1) mRNA, 
complete cds) (E = 32e" 9 ). 

The disclosed NOV4c polypeptide (SEQ ID NO:22) encoded by SEQ ID NO:21 has 
220 amino acid residues and is presented in Table 4F using the. one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV4c has no signal peptide and is 
likely to be localized the the plasma membrane with a certainty of 0.6400. Alternatively, 
NOV4c may also localize to the Golgi body with acertainty of 0.4600, the endoplasmic 
reticulum (membrane) with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with 
a certainty of 0.1000. The most likely cleavage site for a NOV4c peptide is between amino 
acids 24 and 25, at: ELS-IL 
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Table 4F. Encoded NOV4c protein sequence (SEQ ID NO:22). 

MALIFKTiU^QSVGLLLSFLGWILSIITTYLPHWKNLNLDLNEMENW 

LPAEI^VSRILMFLSNGLGFLGLLVSGFGLDCLRIGES 

VQEFWDENVPDFVPRVTCFGEALFIX3WIAGLSL^^ 

DPQV 



A search of sequence databases reveals that the NOV4c amino acid sequence has 83 of 
204 amino acid, residues (40%) identical to, and 118 of 204 amino acid residues (57%) similar 
to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 
5 (Mouse) (Claudin-6) (E = 9.6e 66 ). 

The sequence is predicted to be expressed in the following tissues : Adrenal 
Gland/Suprarenal gland, Brain, Bronchus, Brown adipose, Cervix, Colon, Coronary Artery, 
Epidermis, Gall Bladder, Heart, Hippocampus, Islets of Langerhans, Kidney, Liver, Lung, 
Lung Pleura, Mammary gland/Breast, Oesophagus, Ovary, Oviduct/Uterine Tube/Fallopian 
10 tube, Parotid Salivary glands, Peripheral Blood, Placenta, Prostate, Proximal Convoluted 

Tubule, Respiratory Bronchiole, Skin, Stomach, Substantia Nigra, Thymus, Thyroid, Trachea, 
Umbilical Vein, Uterus, and Vulva. 

NOV4 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 4G. 

15 



Table 4G. BLAST results for NOV4 



Gene Index/ - 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17437504 |ref |XP 
068030.1) 
(XM_06803 0) 


similar to 
putative (H. 
sapiens) [Hojno 
sapiens) 


220 


220/220 
(100%) 


220/220 
(100%) 


e-105 


gi)l7437506|ref |XP 
068031. 1| 
(XM_068031> 


similar to 
putative (H. 
sapiens) [Hojho 
sapiens] 


220 


192/212 
(90%) 


198/212 
(92%) 


9e-96 


gi| 12843248 |dbj |BAB 
25914. l| (AK008B21) 


PMP- 

22/EMP/MP20/Claud 
in family- 
containing 
protein-data 
source :Pf am, 
source 
key:PF00822, 
evidence : ISS-puta 
tive [Mus 
musculus] 


220 


158/220 
(71%) 


182/220 
(81%) 


3e-70 


gi | 17458947 |ref|XP 
061964.1 | 
(XM_061964) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


229 


108/188 
(57%) 


137/188 
(72%) 


2e-45 


gi | 7710002 |ref|NP 0 

57883.1) 

(NM 016674) 


claudin 1 [MUs 
musculus] 


211 


72/181 
(39%) 


105/181 
(57%) 


le-27 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 4H. 



Table 4H Clustal W Sequence Alignment 



10 



15 



20 



25 



30 



1> NOV4a (SEQ ID NO: 18) 

2) NOV4b (SEQ ID Np:20) 

3) NOV4C (SEQ ID NO: 22) 

4) gi | 17437504 | ref | XP_O68030 . 1 | 
sapiens] (SEQ ID NO: 325) 

5) gi | 17437506 | ref |XP_06803l.l| 
sapiens] (SEQ ID NO:324) 

6) gi 1 12843248 | dbj'j BAB25914 . 1 1 
protein-data source :Pf am, source 

(SEQ ID NO:326) 

7) gi | 17458947 | ref |XP_061964 . 1 | 
sapiens] (SEQ ID NO: 323) 

8 ) gi | 7710002 | ref |NP_057883 . 1 | 



(XM_068030) similar to putative (H. sapiens) [Homo 

<XM_06B03l) similar to putative (H. sapiens) [Homo 

(AK008821) PMP-22/EMP/MP20/Claudin family containing 
Jcey:PF00822, evidence-. I SS -putative [Mus rousculus] 

(XM_061964) similar to putative (H. sapiens) [Homo 

NMJ>16674) claudin l [Mus musculus] (SEQ ID NO:327) 



NOV4a 


1 


NOV4b 


1 


NOV4C 


1 


gi 


17437504) 


1 


gi 


17437506) 


1 


gi 


12843248) 


1 


gi 


17458947 j 


1 


gi 


7710002) 


1 



NOV4a 


58 


NOV4b 


58 


NOV4c 


58 


gi 


17437504 


58 


gi 


17437506 


58 


gi 


12843248 


58 


gi 


17458947 


58 


gi 


7710002) 


58 



NOV4a 


118 


NOV4b 


118 


NOV4c 


118 


gi 


17437504) 


118 


gi 


17437506) 


118 


gi 


12843248) 


118 


gi 


17458947) 


118 


gi 


7710002) 


118 



NOV4a 


178 


NOV4b 


178 


NOV4c 


178 


gi| 17437504 


178 


gij 17437506 


178 


gij 12843248 


178 


gij 17458947 


178 


gij 7710002) 


' 177 



50 



EMENWTMGLWQTCVIQ 
EMENWTMGLWQTCV I Q 
EMENWTMGLWQTCVI Q 
EMENWTMGLWQTCVI Q 
EMENWTMGLWQTCVI Q 
EMENWTMGLWi8E|cVIQ 

meRwRmg^Sc 

GLwSjSr 



57 
57 
57 
57 
57 
57 
57 
57 
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Table 41 lists the domain description from DOMAIN analysis results against NOV4. 
This indicates that the NOV4 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 41 Domain Analysis of NOV4 

gnl|pfam|pfam00822, PMP22_Claudin, PMP-22/EMP/MP20/Claudin family. 
(SEQ ID NO: 802) 

CD-Iiength = 162 residues, 67.3% aligned 
Score =35.0 bits (79), Expect = 0.004 



NOV 4: 49 GLWQTCVTQEEVGM- QCKDFDSFLALPAELRVSRI LMFLSNGLGFLGLLVSGFGLDCLRI 107 

111+ II I II I 1+ + II II II + M I I *l 

Sbjct: 41 GLWRNCTTQSCTGQISCKVL ELNDALQAVQALMILSI ILGI I SLI VFFFQLFTMRK 96 

10 NOV 4: 108 GESQlUJLKMIXILGGXLSWASGITALVPVSWVAEnCTVQEFWDENVPDFVPRWEFGEA^ 167 

I I ||+ 11+ II I ♦ +1 I ++ II + 

Sbjct: 97 GGR FKLAGI I FLVSGLCVLVGAS I YTSRI ATDF- -GNPFTPNRKYSFGYSFI 146 

NOV 4: 168 LGW 170 

15 III 

Sbjct: 147 LGW 149 

The claudins are a family of integral membrane proteins that are major components of 
tight junction (TJ) strands. When claudins are introduced into cells that lack tight junctions, 

20 networks of strands and grooves form at cell-cell contact sites that closely resemble native 
tight junctions. There are at least 17 members of this family in mammals. Claudin family 
members share -38% amino acid identity, and are predicted to have four transmembrane (TM) 
domains, which is reminiscent of occludin, although they share no sequence similarity with it. 
Multiple sequence alignment reveals their sequences to be fairly well conserved in the first 

25 and fourth putative TM domains, and in the first and second extracellular loops, but they 
diverge in the second and third TM domains. Although the sequences of their C-terminal 
cytoplasmic domains vary, the known family members share a common motif of -Y-V. This 
has been* postulated as a possible binding motif for PDZ domains of other tight junction- 
associated peripheral membrane proteins, such as ZO-1. 

30 The disclosed NOV4 nucleic acid of the invention encoding a Claud in-6 -like protein 

includes the nucleic acid whose sequence is provided in Table 4A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 4A while still encoding a protein that maintains 
its Claudin-6 -like activities and physiological functions, or a fragment of such a nucleic acid. 

35 The invention further includes nucleic.acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
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complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 41 percent of the 
bases may be so changed. 

The disclosed NOV4 protein of the invention includes the Claudin-6 -like protein 
whose sequence is provided in Table 4B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
4B while still encoding a protein that maintains its Claudin-6-Iike activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 61 
percent of the residues may be so changed. 

The NOV4 nucleic acids and proteins of the invention are useful in potential 
1 5 therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, Cirrhosis, 
Transplantation, Hemophilia, hypercoagulation, Idiopathic thrombocytopenic purpura, 
autoimmume disease, allergies, immunodeficiencies, transplantation, Graft vesus host, 
Alzheimer's disease, Stroke, Tuberous sclerosis, hypercalcemia, Parkinson's disease, 
Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, 
Ataxia-telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, Systemic lupus erythematosus , Autoimmune disease, Asthma, Emphysema, 
Scleroderma, allergy, and Cancer, and/or other pathologies and disorders of the like. The 
NOV4 nucleic acid, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

NOV4 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. For example the disclosed NOV4 protein have multiple hydrophilic regions, 
each of which can be used as an immunogen. This novel protein also has value in 
development of powerful assay system for functional analysis of various human disorders, 
which will help in understanding of pathology of the disease and development of new drug 
targets for various disorders. 



20 
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NOV5 

NOV5 includes three novel Monocarboxylate transporter (MCT3)-like proteins 
disclosed below. The disclosed sequences have been named NOV5a, NOV5b, NOV5c s 
NOV5d, and NOV5e. 
5 NOVSa 

A disclosed NOV5a nucleic acid of 1502 nucleotides (also referred to as CG56635-01) 
encoding a novel Monocarboxylate transporter (MCT3)-like protein is shown in Table 5a. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 24- 
26 and ending with a TGA codon at nucleotides 1365-1367. The start and stop codons are in 
1 0 bold letters in Table 5 A. 



Table 5 A. NOVSa Nucleotide Sequence (SEQ ID NO:23) 

GTTTCCCCACCCCGCAGACGGCGA TGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGGCTGGGT 
GGTGGCGGCCGCAGCCTTCGCGATAAACGGGCTGTCCTACGGGCTGCT 

CCTGACCTTGCCGAGCACTTTGACCX5AAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCTTGGCCG 
TGCAGCAGGCAGCCAGCCCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGTGATGGT 
TGGGGGCGTCCTCGCCTCGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCTGCATCTCTACCTC 
GGCCTGGGCCTCCTCGCTGGCTTTGGTTGGGCCCTGGTGTTC 

GGCGCCCGCCTTGCAGCTTCTTCTCGATACTTTCGGCTGGCGGGGCGCTCTGCTCCTCCTCGGCGCGATC 

CGCGTAGTCCCCTAGCTGCCCTCGGCCAGAGTCTGTTCACACGCCGGGC 
CACAGCCCTGGTTGGGGGCGGGTACTTCGTTCCTTACGTGCACTTGG 

CTGGGGGGATACGGAGCAGCGCTGGTGGTGGCCGTGGCTGCGATGGGGGATGCGGGCGCCCGGCTGGTCT 

GCGGGTGGCTGGCAGACCAAGGCTGGGTGCCCCTCCCGCGGCTCCrrGGCCGTATTCGGGGCTCTGACTGG 

GCTGGGGCTGTGGGTGGTGGGGCTGGTGCCCGTGGTGGGCGGCGAAGAGAGCTGGGGGGGTCCCCTGCTG 

GCOSCGGCTGTGGCCTATGGGCTGAGCGCGGGGAGTTACGCCCCGCTGGTTTTCGGTGTACTCCCCGGGC 

TGGTGGGCGTCGGAGGTGTGGTGCAGGCCACAGGGCTGGTGATGATGCTGATGAGCCTCGGGGGGCTCCT 

GGGCCCTCCCCTGTCAGGCTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTCCT 

TCTTTGATCCTCTCCGGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGGTCCAGC^ 

CCCCTCCAGCCACGCCTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTGCTGTCCCC 

AGGAGGCCCTGGCTCCACTCTGGACACCACTTGTTQA TTATTTTCTTGTTTGAGCCCCT 

GAATTTTTATCGGGTTTTCCTGAAACCTCCAACTGTTCACCAATCTAGGACCCTGAAAATATTCTACATA 

AGACAGCCAGAAAGGCTGGTTCAAAGGAACAG 



The disclosed NOVSa nucleic acid sequence , located on chromosome 17, has 672 of 
1 1 10 bases (60%) identical to a gb:GENBANK-ID:AF132610|acc:AF132610.1 mRNA from 
1 5 Homo sapiens (monocarboxylate transporter MCT3 mRNA, complete cds) (E - 1 .6e* 29 ). 

A disclosed NOV5a polypeptide (SEQ ID NO:24) encoded by SEQ ID NO:23 is 447 
amino acid residues and is presented using the one-letter amino acid code in Table 5B. Signal 
P, Psort and/or Hydropathy results predict that NOVSa contains no signal peptide and is likely 
to be localized in the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
20 Alternatively, NOVSa is also likely to be localized to the plasma membrane with a certainty of 
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0.6400, to the Golgi body with a certainty of 0.4600, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000 



10 



15 



20 



Table 5B. Encoded NOV5a protein sequence (SEQ ID NO:24). 



MTPQPAGPPDGGWGWWAAAAFAINGLSYGLLRSLGIJiPPDLAEHFDRSAQDTAWISALAIiAVQQAASPVGSAIiS 
THWGAHPVVMVGGVI^SLGFVFSAFASDLLHLYIiGLGLLAGFGWALVFAPALGTLSRYFSRREVIiAVGLALTGNG 
ASSLLIiAPALQLLLDTFGWRGALLLIX5AITLHLTPCGALLLPLVLPGDPPAPPRSPLAALGQSLFTRRAFSIFAL 
GTALVGGGYFVPYVHIAPHAIXIRGIX^YGAALW^^ 

WGLVPWGGEBSWGGPIJIJ^AAVAYGLSAGSYAPLVPGVLPGLVGVGGWOATGLV^I^lIlMSIlGGLLGPPLSGF^lR 
DETGDFTAS FLLSGSLILSGSFIYIGI,PRALPSCGPASPPATPPPETGELI,PAPOAVIJ,SPGGPGSTIJ>TTC 



The disclosed NOV5a amino acid sequence has 96 of 198 amino acid residues (48%) 
identical to, and 122 of 198 amino acid residues (61%) similar to, the 504 amino acid residue 
ptnr:SPTREMBL-ACC:O95907 protein from Homo sapiens (Human) (DJ1039K5.2 (Similar 
To Monocarboxylate Transporter (MCT3))) (E = 1.2c" 67 ). 

NOV5a is predicted to be expressed in at least Adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, retina, salivary gland, skeletal 
muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. 

NOV5b 

A disclosed NOV5b nucleic acid of 61 1 nucleotides (also referred to as CG56635-02) 
encoding a novel Monocarboxylate transporter 3-like protein is shown in Table 5C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 6-8 and 
ending with a TGA codon at nucleotides 500-502. The start and stop codons are in bold letters 
in Table 5B. 



Table 5C. NOV5b Nucleotide Sequence (SEQ ID NO:25) 



ACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGGCTGGGTGGTGGCGGCCGCAGCCT 
TCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTTGCCTTCCCTGACX^rTGCCGAGCA 
CTTTGACCGAAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCCTGGCCGTGCAGCAGGCAGCCAGC 
CCCGTGGGCAGGGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGTGATGGTTGGGGGCGTCCTCGCCT 
CGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCTGCATCTCTACCTCGGCCTGGGCCTCCTCGC 
TGGCTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTCCTGTCTGGTTCTTTGATCCTCTCC 
GGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGGTCCAGCCTCCCCTCCAGCCACGC 
CTCCCCCAGAGACGGGGGAGCTGCrrCCCGCTCCCCAGGCAGTCTTGCTGTCCCCAGGAGGCCCTGGCTC 
CACTCTGGACACCACTTGTTGATTATTTTCTTGTTTGAGCCCCTCCCCCAC 



The disclosed NOV5b nucleic acid sequence , located on chromosome 17, has 323 of 
520 bases (62%) identical to a gb:GENBANK-ID:AFl 326 10|acc:AF 1326 10.1 mRNA from 
Homo sapiens (monocarboxylate transporter MCT3 mRNA, complete cds) (E = 3.2e' 18 ). 
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A disclosed NOV5b polypeptide (SEQ ID NO:26) encoded by SEQ ID NO:25 is 191 
amino, acid residues and is presented using the one-letter amino acid code in Table 5D. Signal 
P, Psort and/or Hydropathy results predict that NOV5b contains no signal peptide and is likely 
to be localized in the endoplasmic reticulum (membrane) with a certainty of 0.9325. 
Alternatively, NOV5b is also likely to be localized to the plasma membrane with a certainty of 
0.4960, to the microbody (peroxisome) with a certainty of 0.3200, or to the Golgi body with a 
certainty of 0.1900 The most likely cleavage site for NOV5b is between positions 38 and 39: 
GLA-FP. 



Table 5D. Encoded NOV5b protein sequence (SEQ ID NO:26). 

OTPQPAGPPDGGWGWVVAAAAFAINGLSYGLLRSLGLAFPDIjAEHFDRS 
TRWGARPVVMVGGVLASLGFVFSAFASDL 

PSCGPASPPATPPPETGEIiLPAPQAVLIiSPGGPGSTLiyrTC 

The disclosed NOV5b amino acid sequence has 53 of 1 10 amino acid residues (48%) 
identical to, and 72 of 1 10 amino acid residues (65%) similar to, the 504 amino acid residue 
ptnr:SPTREMBL-ACC:Q9UBE2 protein from Homo sapiens (Human) (Monocaiboxylate 
Transporter MCT3) (E - 2.9e~ 28 ). 

NOV5b is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. 

NOVSc 

A disclosed NOVSc nucleic acid of 704 nucleotides (also referred to as CG56635-03) 
encoding a novel Monocarboxylate transporter 3-like protein is shown in Table 5E. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 28-30 and 
ending with a TGA codon at nucleotides 673-675. The start and stop codons are in bold letters 
in Table 5E. 



Table 5E. NOVSc Nucleotide Sequence (SEQ ID NO:27) 

CGAGCAGCCAGAGGCTGG&TCTC^ 

ACAGATGTTTCCCCACCCCGCAGACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGC 

CTGGGTGGTGGCGGCCGCAGCCHTCGCG&TAAA 

GCCrreCCCTGACCTTGCCGAGCACTTTGA 

TGGCCGTGCAGCAGGCAGCCAGCCCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGT 
GATGGTTGGGGGCGTCCTCGCCTC&CTGGGCCT 
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TACCTCGGCCTGGGCCTCCTCGCTGGCTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTCC 
TGTCTGGTTCTTTGATCCTCTCCGGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGG 
TCC^GCCTCCCCTCC^GCCACGCCTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTG 
CTGTCCCCAGGAGGCCCrGGCTCCACTCTGGACACCACTTGTTG ATTATTTTCTTGTTTGAGC CCCTCCC 
CCAC ' 



The disclosed NOV5c nucleic acid sequence , located on chromosome 17, has 340 of 
547 bases (62%) identical to a gb:GENBANK-ID:AF01 9 1 1 l|acc:AF0191 1 1.2 mRNA from 
Mus musculus (monocarboxylate transporter 3 (MCT3) mRNA, complete cds) (E = 2.4e" 15 ). 
5 A disclosed NOV5c polypeptide (SEQ ID NO:28) encoded by SEQ ID NO:27 is 2 1 5 

amino acid residues and is presented using the one-letter amino acid code in Table 5F. Signal 
P, Psort and/or Hydropathy results predict that NOV5c contains no signal peptide and is likely 
to be localized in the endoplasmic reticulum (membrane) with a certainty of 0.8500. 
Alternatively, NOV5c is also likely to be localized to the microbody (peroxisome) with a 
1 0 certainty of 0.6400, to the plasma membrane with a certainty of 0.4400, or to the nucleus with 
a certainty of 0.3000 



Table 5F. Encoded NOV5c protein sequence (SEQ ID NO:28). 

MPAPQRKHRRGGFSHRCFPTPQTAMTPQPAGPPDGGWGWVVAAAAFAINGLSYGLLRSLGIjAFPDIJ^EHFDRSAQ 
DTAWISAIAIAVMAASPVGSALSTRWGARPVVMVGGVLASLGFVFSAFAS 

ASFLLSGSLILSGSFIYIGLPRALPSCGPASPPATPPPETGELLPAPQAVLLSPGGPGSTLDTTC 

The disclosed NOV5c amino acid sequence has 53 of 1 10 amino acid residues (48%) 
1 5 identical to, and 72 of 1 1 0 amino acid residues (65%) similar to, the 504 amino acid residue 
ptar:SPTREMBL-ACC:Q9UBE2 protein from Homo sapiens (Human) (Monocarboxylate 
Transporter MCT3) (E = 2.9e" 28 ). 

NOV5c is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
20 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 

NOV5d 

25 A disclosed NOV5d nucleic acid of 1513 nucleotides (also referred to as CG56635-04) 

encoding a novel Monocarboxylate transporter 3-like protein is shown in Table 5G. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 28-30 and 
ending with a TGA codon at nucleotides 1444-1446. The start and stop codons are in bold 
letters in Table 5G. 
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Table 5G. NOVSd Nucleotide Sequence (SEQ ID NO:29) 

CGAGCAGCCAGAGGCTGGATCTCAGGG ATGCCAGCTCCCCAGCGGjAGCACAGGCGTGGAGGCTTCTCTC 

ACAGATGTTTCCCCACCCCGCAGACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGG 

CTGGGTGGTGGCGGCCGCAGCCTTCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTT 

GCCTTCCCTGACCTTGCCGAGCACTTTGACCGAAGCGCCC^GGACACTGCGTGGATCAGCGCCCTGGCCC 

TGGCCGTGCAGCAGGCAGCCAGTCCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGT 

GATGGTTGGGGGCGTCCTCGCCTCGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCT 

TACCTCGGCCTGGGCCTCCTCGCTGGTTTTGGTTGGGCCCTGGTGTTCGCCCCCGCCCTAGGCACCCTCT 

CGCGTT ACTTCTCCCGCCGTCGAGTCTTGGCGGTGGGGCTGGCGCTCACCGGC AACGGGGCCT CCT CGCT 

GCTCCTGGCGCCCGCCTTGCAGCTTCTTCTCGATACTTTCGGCTGGCGGGGCGCTCTGCTCCTCCTCGGC 

GCGATC^CCCTCCACCTC^CCCCCTGTGGCGCCCTGCTGCTACCCCrGGTCCTTCCTGGAGACCCCCCAG 

CCCC^CCGCGTAGTCCCCTAGCTGCCCTCGGC^ 

TCTAGGCACAGCCCTGGTTGGGGGCXSGGTACTTCGTTCCTT^ 

CGGGGCCTGGGGGGATACGGAGCAGCGCTGGTGGTGGCCGTGGCTGCGATGGGGGATGCGGGCGCCCGGC 
TGGTCTGCGGGTGGCTGGCAGACCAAGGCTGGGTGCCCCTCCCGCGGCTGCTGGCCGTATTCGGGGCTCT 
GACTGGGCTGGGGCTGTGGGTGGTGGGGCTGGTGCCCGTGGTGGGCGGCGAAGAGAGCTGGGGGGGTCCC 
CTGCTGGCCGCGGCTGTGGCCTATGGGCTGAGCGCGGGGAGTTACGCCCCGCTGGTTTTCGGTGTACTCC 
CCGGGCTGGTGGGCGTCGGAGGTGTGGTGCAGGCCACAGGGCTGGTGATGATGCTGATGAGCCTCGGGGG 
GCTCCTGGGCCCTCCCCTGTCAGGTAAGTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTC 
CTGTCTGGTTCTTTGATCCTCTCCGGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTG 
GTCCAGCCTCCCCTCCAGCCACGCCTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTT 
GCTGTCCCCAGGAGGCCCTGGCTCCACTCTGGACACCACTTGTTG ATTATTTTCTTGTTTGAGCCCCTCC 
CCCAATAAAGAATTTTTATCGGGTTTTCCTGAAACCTCCAACT 



The disclosed NOV5d nucleic acid sequence , located on chromosome 17, has 567 of 
940 bases (60%) identical to a gb:GENBANK-ID:HSU81800|acc:U8 1800.1 mRNA from 
5 Homo sapiens (monocarboxylate transporter (MCT3) mRNA, complete cds) (E = 6.5e" 30 ). 

A disclosed NOV5d polypeptide (SEQ ID NO:30) encoded by SEQ ID NO:29 is 472 
amino acid residues and is presented using the one-letter amino acid code in Table 5H. Signal 
P, Psort and/or Hydropathy results predict that NOV5d contains no signal peptide and is likely 
to be localized in the plasma membrane with a certainty of 0.6000. Alternatively, NOVSd is 
10 also likely to be localized to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000 



Table 5H. Encoded NOVSd protein sequence (SEQ ID NO:30). 

MPAPQRKHRRGGFSHRCFPTPQTAMTPQPAGPPDGGWGWW 

DTAWISAIAIAVQQAASPVGSALSTRWGARPVVMVGGVIASLG 

LGTLSRYFSRRRVLAVGLALTGNGASSLLIAPAL 

PPRS PLAALGLSLFTRRAFS I FALGTALVGGGYFVP YVHLAPHALDRGI^G YGAAL WAVAAMGDAGAI^VCGWL 
ADQGOTPLPRLLAVFGALTGLGLWWGLVPWGGEESWGGPLLAAAVAYGLSAGSYAPLVFGVLPGLVGVGGVVQ 
ATGLVWLMSLGGLLGPPIiSGKFLRDETGDFTASFLLSGSLILSGSFIYIGLPRALPSCGPASPPATPPPETGEL 
LPAPQAVLLS PGGPGSTLDTTC 



1 5 The disclosed NOV5d amino acid sequence has 96 of 1 98 amino acid residues (48%) 

identical to, and 122 of 198 amino acid residues (61%) similar to, the 504 amino acid residue 
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ptnr:SPTREMBL-ACC:O95907 protein from Homo sapiens (Human) (DJ1039K5.2 (Similar 
To Monocarboxyiate Transporter (MCT3))) (E = 7.9c 6 *). 

NOV5d is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
5 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 
NOV5e 

1 0 A disclosed NOV5e nucleic acid of 465 nucleotides (also referred to as CG56635-05) 

encoding a novel Monocarboxyiate transporter 3-like protein is shown in Table 51. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 7-9 and 
ending with a TGA codon at nucleotides 436-438. The start and stop codons are in bold letters 
in Table 51., and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 51. NOV5e Nucleotide Sequence (SEQ ID NO:31) 



ACGGCGATQACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGGCTGGGTGGTGGCGGCCGCAGCCT 
TCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCrTGCCTTCCCTGTCCTTGCCGAGCA 
CTTTGACCGAAGCGCCC^GGACACTGCGTGGATCAGCGCCCTGGCCCTGGCCGTGCAGCAGGCAGCCAGC 
TTCCTAAGGGATGAGACAGGAGAOTCACCGCCTCTTTCCTCCTGTCTGGTTCTTTGATCCTCTCCGGCA 
GCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGGTCCAGCCTCCCCTCCAGCCACGCCTCC 
CCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTGCTGTCCCCAGGAGGCCCTGGCTCCACT 
CTGGACACCACTTGTTGATTATTTTCTTGTTTGAGCCCCTCCCCC 



The disclosed NOV5e nucleic acid sequence , located on chromosome 17, has 351 of 
434 bases (80%) identical to a gb:GENBANK-ID:AX083362|acc:AX083362.1 mRNA from 
Homo sapiens (Sequence 54 from Patent WO01 12660) (E = 1.6e" 53 ). 

20 A disclosed NOV5e polypeptide (SEQ ID NO:32) encoded by SEQ ID NO:3 1 is 1 43 

amino acid residues and is presented using the one-letter amino acid code in Table 5J. Signal 
P, Psort and/or Hydropathy results predict that NOV5e contains no signal peptide and is likely 
to be localized extracellularly with a certainty of 0.5040. Alternatively, NOV5e is also likely 
to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.1000, to the 

25 endoplasmic reticulum (lumen) with a certainty of 0. 1 000, or to the lysosome (lumen) with a 
certainty of 0.1000. The most likely cleavage site for NOV5e is between positions 43 and 44: 
VLA-EH. 
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Table 5J. Encoded NOV5e protein sequence (SEQ ID NO:32). 

MTPQPAGPPDGGWGWWAAAAFAINGLSYGLDRSLGLAFPVLflEHFDRSAQDTAWISALAIiAVQQAASFLRDETG 
DFTASFLLSGSLILSGSFIYIGLPRALPSCGPASPPATPPPBTGEIJ.PAPQAVI.LSPGGPGSTLDTTC 

The disclosed NOV5e amino acid sequence has 67 of 68 amino acid residues (98%) 
identical to, and 67 of 68 amino acid residues (98%) similar to, the 375 amino acid residue 
ptnr:REMTREMBL-ACC:CAC332 85 protein from Homo sapiens (Human) (Sequence 54 
5 from Patent WO01 12660) (E = 2.9e- 31 ). 

NOV5e is predicted to be expressed in at least Mammalian Tissue, Parathyroid Gland, 
Mammary gland/Breast, Prostate. . 

NOV5a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 5K. 
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Table 5K. BLAST results for NOV5a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 7670446 (dbj ( BAA9 
5074. 1| (AB041591) 


unnamed protein 
product [Mas 
musculus] 


290 


252/288 
(87%) 


263/288 
(90%) 


le-86 


gi| 17491104 |ref |XP 

064368.1) 

(XM_064368) 


similar to solute 
carrier family 16 
(monocarboxyl i c 
acid 

transporters) , 
member 8 (H. 
sapiens) [Homo 
sapiens] 


427 


196/398 
(49%) 


257/398 
(64%) 


6e-74 


gi|2497855|sp|Q6334 
4|MOT2_RAT 


MONOCARBOXYIATE 
TRANSPORTER 2 
(MCT 2) 


489 


142/420 
(33%) 


220/420 
(51%) 


6e-53 


gi 1 1432167 |.gb | AAB04 
023. l| (U62316) 


monocarboxyl at e 
transporter 2 
(Rattus 
norvegicus] 


489 - 


143/420 
(34%) 


220/420 * 
(52%) 


6e-53 


gi | 6755536 |ref|NP 0 
35521. l| 
(NM 011391) 


solute carrier 

family 16 

( monocarboxyl i c 

acid 

transporters) , 
member 7 {Mus 
musculus] 


484 


142/421 
(33%) 


221/421 
(51%) 


2e-52 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 5 J. 



Table 5J Information for the ClustalW proteins 

1) NOV5a (SEQ. ID NO:24) 

2) NOV5b (SEQ ID NO: 26) 

3) NOV5C (SEQ ID NO: 28) 

4) NOV5d (SEQ ID NO:30) 

5) NOV5e (SEQ ID NO: 32) 

6) gi | 7670446 |dbj|BAA95074.l| (AB041591) unnamed protein product [Mus musculus) (SEQ 
ID NO:337) 
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35 



40 



45 



50 



55 



60 



65 



7) gi | 17491104 | ref |XP_064368 . 1 1 (XM_064368) similar to solute carrier family 16 
(monocarboxylic acid transporters), member 8 (H. sapiens) [Homo sapiens] (SEO ID 

NO:338) 

8) gil2497855|sp|Q63344|MOT2_RAT MONOCARBOXYLATE TRANSPORTER 2 (MCT 2) (SEQ ID 
NO: 33 9) 

9) gi | 1432167 |gb|AAB04G23.l| (U62316) monocarboxylate transporter 2 {Rattus 
norvegicus] (SEQ ID NO: 340) 

10) . gi | 6755536 | ref |NP_035521 . l | (NW_011391) solute carrier family 16 
(monocarboxylic acid transporters), member 7 [Mus musculus) (SEQ ID NO: 341) 
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NOV5a 
NOVSb 
NOVSc 
NOVSd 
NOV5e 



gi 
gi 
gi 
gi 
gi 



7670446) 
17491104 | 
2497855 
1432167 
6755536 



NOV5a 
NOV5b 
NOVSc 
NOVSd 
NOVSe 

gi|7670446| 
gi | 17491104 | 
gi|2497855| 
gi 1 1432167 j 
gi (6755536 



1 
1 
1 
1 
1 
1 
1 
1 
1 
1 



|....|. 



20 



30 



40 



50 



60' 



' 1 

----------- i 

---------- - - l 

~" - 1 

" 1 

MARRT EPPDGGWGWXVVVLSAFFQSALVFGVLRSFGVFFVEFVAAFEE 48 

MPSESSVKATAAPPPFPLPPDGGWGWVVVCAS - FISIGFSYAFPKAVTVFFNDIKDIFKT 59 
MPSESSVKATAAPPPFPLPPDGGWGWWVCAS - FISIGFSYAFPKAVTVFFNDIKDIFKT 5 9 
MPSEP - — S - - APLPQPLPPDGGWGWVWCAS - FISIGFS YAFPKAVTVFFKDIQEIFNT 54 



70 



80 



90 



100 



110 



120 



«|WT 

3fshrcHptpqta!t 



2 
2 

26 



1 " &JPAPQRKHRRL _ 

* - MPAPQRKHRRGgFSHRCgPTPQTAMT 26 



1 

49 
60 
60 
55 



" MVGGVLTS: 

QAARVSWIASIGIAVQQFGSPVGSALSTKFGPRP^VMTGGILAALl 
TSSQIAWISSIMLATOYAGGPISSVLVNNYGSRPfelVGGLLCi 
TSSQIAWISSIMLAVMYAGGPISSVLVJ^GSRpji^IVGGLLCl 
TSSQIAWISSirOiAVMYAGGPISSVLVNNYGSRP^IVGGLLCClj 




24 

.TSLTH&Y 108 
SSVIeSy 119 
SSVIeSy 119 
k 114 



13 0 



150 



NOVSa 


3 


PQPAGPPD( 


NOVSb 


3 


PQPAGPPD< 


NOVSc 


.27 


PQPAGPPD( 


NOV5d 


27 


PQPAGPPD( 


NOV5e 


3 


PQPAGPPD< 


gi 


7670446 


25 


LGLGLLAGi 


gi 


17491104) 109 


LSIGLLSG. 


gi 


2497855 


120 


LTVGFIGG3 


gi 


1432167 


120 


LTVGFIGG3 


gi 


6755536 


115 


LTVGFIGGJ 




'GLLRSI 
YGLXjE 

glErsi 




[G- 
[G- 

fe- 

_ JG 

T5sk 

[PTLACpSCYj _ 

.TlBGKYFYRKRP] 

ihg] 
ti0gkyfyrrrp: 




160 170 180 

3RSAQ13t| 54 
DRSAQDTB 54 
DRSAq|tB 78 
DRSAQpTg 78 
DRSAQDtS 5 * 
CjTGNGASSLIijSPALQFLLDTFG 84 
LTGVGLSSFT^PFgQWLLSHYg 168 
kGSPVFLSTflgPFNQFLFNSYG 179 
\GS PVFLSTjjgPFNQFLFNS YG 179 
3SPVFLSTffipFNQYLFN§YG 174 




190 



200 



NOV5a 


55 


NOVSb 


55 


NOVSc 


79 


NOVSd 


79 


NOVSe 


55 


gi 


7670446 


85 


gi 


17491104] 169 


gi 


2497855 


180 


gi 


1432167 


180 


gi 


6755536 


175 



210 



i-STRWG---] 
STRWG-- 

STRWG j 

STRWG 



NOVSa 
NOVSb 
NOVSc 
NOVSd 
NOV5e 





230 



240 



92 
92 
116 
116 
68 



LALS GDgLAPPRTPLgS 125 

PSLA ED§- -A^GPrBq 2 07 

|C^f*RPVGPSPRApSKSK^RQ^ 239 
|Cl|toVGPSPRA^KSKRfeRQDBsTKRLSKVSTAEKINR 239 
SPKPTKgSKSK^SRHD§rLKKASKVSTAQKVNR 234 



270 



280 



290 



300 



FGWALVFAPALGTLSRYFSRRRVLAVGLALT 147 
t 115 

^|gfgwalvfapalgtlsryi^srrrvlavglalt 171 

- 68 



57 



WO 02/068649 



PCT7US02/02785 



gi | 7670446 | 
gi j 17491104 | 
gi|2497855| 
gi j 1432167 j 
gi |6755536 | 



ELHHGP FLRYT^ALTI^XNtByfI P YLHLV - - 
GFLilYLSGNVVfflFffS^FSP 1 1 FLAP - 
GFLirYLSGNVvfflF^IFBpiIFLAP- 
Flij YLSOJVi3f2§I FgP 1 1 FLAQ - 




158 
239 
274 
274 
269 



NOVSa 
NOV5b 
HOV5C 
NOV5d 
NOV5e 



gi 
gi 
gi 
gi 



7670446 | 

17491104) 

2497855 

1432167 

6755536 



310 320 330 340 350 360 

....]....)... 

148 GNGASST »I>L APALQLLLDT FGWRGALLLLGAI TLHLTPCGALLLPL VL PGDP PAP PRS P L 207 

115 - us 

139 - - -- 139 

172 GNGAS SLLLAPALQLLLDTFGWRGALLLLGAI TLHLTPCGALLLPLVL PGD PP AP PRSPL 231 



68 

158 

239 

274 

274 

269 



68 

158 

239 

274 

274 

269 



NOVSa 
NOV5b 
NOV5C 
NOVSd 
NOV5e 



gi 
gi 
gi 
gi 
gi 



7670446| 
17491104 | 
2497855 | 
1432167) 
6755536| 



370 380 390 400 410 420 

....) | | | |..-.| | | |....| |."..-| 

208 AALGQSLFTRRAFS I FALGTAL VGG GY FVP YVHLAPHALDRGLGG YGAAL VVAVAAMGD A 267 

115 — - 115 

139 139 

232 AALGLSLFTRRAFS I FALGTALVGGGTYFVP YVHLAPHALDRGLGG YGAAL WAVAAMGDA 291 

68 — 68 

158 158 

239 - 239 

274 -- - 274 

274 - - 274 

269 - 269 



430 
..|.. 



440 



450 



460 



NOV5a 


268 


NOV5b 


115 


NOV5e 


139 


NOVSd 


292 


NOV5e 


68 


gi 


7670446 


158 


gi 


17491104) 239 


gi 


2497855 


274 


gi 


1432167 


274 


a*. 


6755536 


269 



470 
..|.. 



480 



115 

139 

351 

68 

158 

239 

274 

274 

269 



490 



500 



510 



520 



530 



540 



NOVSa 


328 


NOV5b 


115 


NOVSc 


139 


NOVSd 


352 


NOVSe 


68 


gi 


7670446 


158 


gi 


17491104) 239 


gi 


2497855 


274 


gi 


1432167 


274 


gi 


6755536 


269 



NOVSa 
NOVSb 
NOVSc 
NOVSd 
NOVSe 

gi | 7670446 | 
gi | 17491104 
gi | 2497855 
gi j 1432167 
gij 6755536 




-AHLQDLgWDPLPgAjg 255 
--YAKDKGVDgY^jSAffl 289 



550 



560 



570 



580 



-YAKDKG 
-YAKHIGV 

590 




600 




387 
131 

155 FgGSLi 
412 ^GSLJ - 
83 BgGSljj- 

175 ,vavSa;vgdac 

256 fflSvVA0SDLVGRivSGW£rGDAVPGP^ 

290 BvmaftdmfarpBvgiBan- tsBi: 

290 BvMAFTDMFACT^GLflAN- TStl i: 
285 EvMAFIDMFARP§VGLaAN-Tsiii: 



WTTLJjG VSLgLFgVAQAPTJ 

|lQYIiFSvBlMFTG ICHLLC s LAHSYT2 

QYLFSV0IMFTG ICHLLC | LAHSYTJ 

jlQYLFSSgllFTG ICHLLCQLATTYSJ 

58 
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308 
344 
344 
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NOVSa 


426 


NOV5b 


170 


NOV5c 


194 


NOVSd 


451 


NOV5e 


122 


gi 


7670446 


! 235 


gi 


174911041 309 


gi 


2497855 


345 


gi 


1432167 


345 


gi 


6755536 


340 




630 



640 



YAPLVFGVLPGLVs 

'LAFSVLPELI 
ISSLLFECLMDQ 1 
ISSLLFECLMDQ 1 




650 



660 



447 
191 
215 
472 
143 



.VMMLMSLGGLLGPPLSGKG 290 

'RRIYCGLGLLQMIES IGGLLGPPLSGYLRDVT 368 
"RFSSAVGLVTIVECCPVLFGPPLAGKLLDIT 404 
RFSSAVGLVTIVECCPVLPGPPLAGKLLDIT 404 
'SSAVGLTTIVECCPVLFGPPLAGKLLDIT 399 



670 



680 



690 



710 



720 



NOVSa 
NOVSb 
NOV5c 
NOVSd 
NOVSe 



gi 
gi 
gi 
gi 
gi 



7670446] 
17491104 | 
2497855) 
1432167 | 
6755536) 



700 
■|....|.... 

' 447 

igi 

215 

472 

143 

-- 290 

— — — — — — ~— — — — — _______ p'Q'j'TTSG 404 

405 GQYKYLYI ASG I WLS SG I YLL I CNAINYRLLEKERKREKARRKKS ASQAS KEMEALSRS 464 
405 GQYKYLYIASGI WLSSGI YLLI CNAI2TCRIXEKERKREKARRKKSAS QAS KEMEALSRS 464 
400 GEYKYLYIASGTVVLVSGTYLLIGNAINYRLLDICERKREP^ 459 



447 
191 
215 
472 
143 
290 

369 GNYTAS FWAGAFLLSGSG ILLTLPHFFC - 



730 



NOV5a 
NOVSb 
NOV5c 
NOVSd 
NOVSe 

gi | 7670446 | 



447 
191 
215 
472 
143 
290 



|....|. 



740 



447 
191 
215 
472 
143 
290 



gi 1 17491104 | 405 PQDLVTEALDTKVPLPKEGLEED- - 427 

gi 1 2497855 465 KQDDVTVKVSNTHNPPSDRDKESSI 489 

gi 1 1432167 465 KQDDVTVKVSNTHNPPSDRDKESSI 489 

gi | 6755536 460 KQDEVTVKASNAHNPPSDRDKESNI 484 



Monocarboxylates such as lactate and pyruvate play a central role in cellular 
metabolism and metabolic communication between tissues. Essential to these roles is their 
rapid transport across the plasma membrane, which is catalysed by a recently identified family 
of proton-linked monocarboxylate transporters (MCTs). Nine MCT-related sequences have so 
far been identified in mammals, each having a different tissue distribution, whereas six related 
proteins can be recognized in Caenorhabditis elegans and 4 in Saccharomyces cerevisiae. 
Direct demonstration of proton-linked lactate and pyruvate transport has been demonstrated 
for mammalian MCT1-MCT4, but only for MCT1 and MCT2 have detailed analyses of 
substrate and inhibitor kinetics been described following heterologous expression in Xenopus 
oocytes. MCT1 is ubiquitously expressed, but is especially prominent in heart and red muscle, 
where it is up-regulated in response to increased work, suggesting a special role in lactic acid 
oxidation. By contrast, MCT4 is most evident in white muscle and other cells with a high 
glycolytic rate, such as tumour cells and white blood cells, suggesting it is expressed where 
lactic acid efflux predominates. MCT2 has a ten-fold higher affinity for substrates than MCT1 



59 



WO 02/068649 



PCT/US02/02785 



and MCT4 and is found in cells where rapid uptake at low substrate concentrations may be 
required, including the proximal kidney tubules, neurons and sperm tails. MCT3 is uniquely 
expressed in the retinal pigment epithelium. MCT1 and MCT4 have been shown to interact 
specifically with OX-47 (CD147), a member of the immunoglobulin superfamily with a single 
5 transmembrane helix. This interaction appears to assist MCT expression at the cell surface 

The disclosed NOV5 nucleic acid of the invention encoding a Monocarboxylate 
transporter (MCT3)-like protein includes the nucleic acid whose sequence is provided in Table 
5 A, 5C, 5E, 5G, 51 or a fragment thereof. The invention also includes a mutant or variant 
nucleic acid any of whose bases may be changed from the corresponding base shown in Table 

1 0 5A, 5C, 5E, 5G, or 51 while still encoding a protein that maintains its Monocarboxylate 

transporter (MCT3)-like activities and physiological functions, or a fragment of such a nucleic 
acid. The invention further includes nucleic acids whose sequences are complementary to 
those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 

15 fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of nonlimiting example, modified bases, and nucleic acids 
whose sugar phosphate backbones are modified or derivatized. These modifications are 

carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 

j 

that they may be used, for example, as antisense binding nucleic acids in therapeutic 
20 applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 40 percent of the bases may be so changed. 

The disclosed NOV5 protein of the invention includes the Monocarboxylate 
transporter (MCT3)-like protein whose sequence is provided in Table 5B, 5D, 5F, 5H, or 5 J. 
The invention also includes a mutant or variant protein any of whose residues may be changed 

25 from the corresponding residue shown in Table 5B, 5D, 5F, 5H, or 5 J while still encoding a 
protein that maintains its Monocarboxylate transporter (MCT3)-like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 67 percent of the residues may be so changed. 

NOV5 nucleic acid and polypeptide show homology to the Monocarboxylate 

30 transporter (MCT3) familyof proteins. Accordingly, to the NOV5 nucleic acid and polypeptide 
may function as members of this family. The NOV5 nucleic acids and proteins identified here 
may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
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invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
5 those defined here. 

The nucleic acids and proteins of NOV5 are useful in metabolic disorders such as salla . 
disease, infantile sialic acid storage disease, symptomatic deficiency in lactate transport, 
subnormal erythrocyte lactate transport, muscle injuries, cystinosis, streptozotocin-induced 
diabetes, hypoxia, cardiac arrest or stroke, neuronal disorders, retinal angiogenesis, and/or 

10 other pathologies and disorders. 

NOV5 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 

1 5 section below. For example the disclosed NOV5 protein have multiple hydrophilic regions, 
each of which can be used as an immunogen. This novel protein also has value in 
development of powerful assay system for functional analysis of various human disorders, 
which will help in understanding of pathology of the disease and development of new drug 
targets for various disorders. 

20 

NOV6 

A disclosed NOV6 nucleic acid of 1336 nucleotides (also referred to CG56674-01) 
encoding a novel Nitrilase-l-like protein is shown in Table 6A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 77-79 and ending with a 
25 TAA codon at nucleotides 1058-1060. In Table 6A, the 5' and 3' untranslated regions are 
underlined and the start and stop codons are in bold letters. 
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Table 6A. NOV6 Nucleotide Sequence (SEQ ID NO:33) 

GCCC^CTCGCTGCGGCCTATCTGGCTCCAGACCGCCCTCCGGATCGGACCCTGCGAATGGTTTTGGCTATA 
TCTTCA TGCTGGGCTTCATCACCAGGCCTCCTCACAGATTCCTGTCCCTTCTGTGTCCTGGACTCCGGATA 
CCTCMCTCTCTGGGGAAGGTGCTCAGCCCAGGCCCAGAGCCATGGCTA^ 

GCCCCTGGTGGCTGTGTGCCAGGTAACATCGACGCCAGACAAGCAACAGAACTTTAAAACATGTGCTGAGC 
TGGTTCGAGAGGCTGCCAGACTGGGTGCCTGCCTGGCTTTCCTGCCTGAGGCATTTGACTTCATTGCACGG 
GAC CC TGCAGAGACG CT ACAC CT GT CTGAAC CACT GGGT GGGAAACTTTT GGAAG AAT ACAC C CAG CTT G C 
CAGGGAATGTGGACTCTGGCTGTCCTTGGGTGGTTTCCATGAGCGTGGCCAAGACTGGGAGCAGACTCAGA 
AAATC TACAATTGT CACGTGC TG CTGAACAG CAAAGGGG CAGTAGTG GCCATTTACAGGAAGACACATCTG 
TGTG ACG T AG AGATTCCAGGG CAGGGGCC T ATGTGTG AAAG C AACTCTAC CATGC CTGGGCCCAGT CTTGA 
GTCACCTGTCAGCACACCAGCAGGCAAGATTGGTCTAGCTGTCTGCTATGACATGCGGTTCCCTGAACTCT 
CTCTGGCATTGGCTCAAGCTGGAAC&GAGATACTTACCTATC 

GCCCACTGGGAGGTGTTGCTGCGGGCCCGTGCTATCGAAACCCAGTGCTATGTAGTGGCAGCAGCACAGTG 

TGGACGCCACCATGAGAAGAGAGC^GTTATGGCC^C^GCATGGTGGTAGACCCCTGGGGAACAGTGGTGG 

CCCGCTGCTCTGAGGGGCCAGGCCTCTGCCTTGCCCGAATAGACCTCAACTATCTGCGA 

CACCTGCCTGTGTTCCAGCACCGCAGGCCTGACCTCTATGGCAATCTGGGTCACCCACTGTCTTAA GACTT 

GACTT CTGT G AGTTT AGAC CTGC C C CT C C CAC C C C CAC C CTG CCACT ATGAGCTAGTG CTCATGTG ACTTG 

GAGGCAGGATCCAGGCACAGCTCCCCTCACTTGGAGAACCT 

CTTGGGAAAGAAACTTTCACCTGAGCTTCACCTGAGGTCAGACTGCAGTTTCAGAAAGGTGGAATTTTATA 
TAGTCATTGTTTATTTCATGGAAACTGAAGCTCTGCTGAGGGCTGAGCACCTTCCCCA 



The disclosed NOV6 nucleic acid sequence, localized to the pi 4.2 region of 
chromosome 3, has 1 3 1 9 of 1 329 bases (99%) identical to a gb:GENBANK- 
ID:AF069987|acc:AF069987.1 mRNA from Homo sapiens (nitrilase 1 (NIT1) mRNA, 
5 complete cds) (E = 3.1 e 290 ). 

, A disclosed NOV6 polypeptide (SEQ ID NO:34) encoded by SEQ ID NO:33 is 327 
amino acid residues and is presented using the one-letter amino acid code in Table 6B. Signal 
P 5 Psort and/or Hydropathy results predict that NOV6 has a signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.4500. Alternatively, NOV6 is also likely to be 
1 0 localized to the microbody (peroxisome) with a certainty of 0.3000, to the lysosome (lumen) 
with a certainty of 0.2021, or to the mitochondrial matrix space with a certainty of 0.1000. The 
most likely cleavage site for NOV6 is between positions 27 and 28: LSG-EG 



Table 6B. Encoded NOV6 protein sequence (SEQ ID NO:34). 

MliGFITRPPHRFLSLLCPGLRIPQLSGEGAQPRPRAMAISSSSCELPLVAVCQVTSTPDKQQNFKTCAELV 

REAARLGACIiAFLPEAFDFIARDPAETLHLSEPLGGKLLEEYTQIiARECG 

YNCHVLLNSKGAWAIYRKTHLCDVEIPGQGPMCES 

AIAQAGTEILTYPSAFGSITGPAHWEVLLRARAIOTQCYWAAA^ 

CSEGPGLC1ARIDLNYLRQLRRHLPVFQHRRPDLYGNLGHPLS 



1 5 The disclosed NOV6 amino acid sequence has 322 of 327 amino acid residues (98%) 

identical to, and 322 of 327 amino acid residues (98%) similar to, the 327 amino acid residue 
ptnr:SPTREMBL-ACC:O76091 protein from Homo sapiens (Human) (Nitrilase Homolog 1) 
(E = 4.5e 176 ). 



62 



WO 02/068649 PCT/US02/02785 

NOV6 also has homology to the amino acid sequence shown in the BLASTP data 
listed in Table 6C. 



Table 6C. BLAST results for NOV6 



Gene Index/ 
Identifier ' 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 5031947 | ref |NP_00 
5591.1 | (NM_005600) 


nitrilase 1 [Homo 
sapiens] 


327 


322/327 
(98%) 


322/327 


0.0 


gi | 3242980 |gb| AAC401 
84. 1| (AF069985) 


nitrilase homolog 
1 [Mus musculus] 


323 


272/327 
(83%) 


298/327 
(90%) 


e-154 


gi | 6754856] ref |NP 03 
6179. 1| (NM012049) 


nitrilase 1 [Mus 
musctilus] 


323 


272/327 
(83%) 


297/327 
(90%) 


e-153 


gi| 18204913 |gb|AAH21 
634.1 |AAH21634 
(BC021634) 


Unknown (protein 
for MGC:13825) 
[Mus musculus] 


323 


271/327 
(82%) 


296/327 
(89%) 


e-153 


gi | 12836591 |dbj|BAB2 
3723.1 | (AK004988) 


data source :MGD, 
source 

key:MGI: 1350916, 
evidence : ISS-nitr 
ilase 1 -putative 
[Mus musculus] 


290 


251/288 
(.87%) 


272/288 
(94%) 


e-145 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 6D. 

Table 6D. Information for the ClustalW proteins 

1) NOV6 (SEQ ID NO: 34) 

2) gi 1 5031947 | ref |NP_005591 . 1 ) (NM 005600) nitrilase 1 [Homo sapiens] (SEQ ID 
NO;342) 

3) gi|3242980|gb|AAC40184.l| (AF069985) nitrilase homolog 1 [Mus muscufus] (SEQ ID 
N0:343) 

4) gi 1 6754856 | ref |NP_036179 . 1 1 (NM_012049) nitrilase 1 [Mus musculus] (SEQ ID 
NO: 344) 

5) gi 1 18204913 | gb | AAH21634 . 1 | AAH21634 (BC021634) Unknown (protein for MGC: 13825) 
[Mus musculus] (SEQ ID NO: 345) 

6) gi | 12836591 |dbj|BAB23723.l| (AK004988) data source:MGD, source key :MGI : 1350916, 
evidence : iSS-nitrilase 1 -putative [Afus musculus] (SEQ ID NO: 346) 
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NOV6 




1 


MLGFITRPPH 


RFLS 


gi| 5031947 




1 


MLGFITRPPH 


RFLS 


gij 3242980 




1 


MLGFITRFPK 


— Q 


gij 6754856 




1 


MLGFITRPPH 




gi | 18204913 | 


1 


MLGFITRPPH 




gij 12836591) 


1 








N0V6 



gi 
gi 
gi 
gi 
gi 



5031947 
3242980 
6754856 
18204913) 
12836591 | 



NOV6 



gi 
gi 
gi 



5031947 
3242980 
6754856 



70 



80 



I 



90 



100 



I 



110 

..I.. 



120 
..I 



61 
61 
57 
57 
57 
24 



«§nfktcaelvj^eaarlgaclaflpeafdfiar^ 

q^fktcaelv|eaarlgaclaflpeafdfiar|paetl2lsepl§g5ll^Y;|qlarec 
qenfktcaelvqeaarlgaclaflpeafdfi arnpaetllls e plngdllgq ysqlarec 
qenfktcaelvqeaarlgaclaflpeafdfiarnpaetlllseplngdllgqysqlared 
qenfktcaelvqeaarlgaclaflpeafdfiarnpaetlllseplngdllgqysqlarec 
qenfktcaelvqeaarlgaclaflpeafdfiarnpaetlllse plngdllgq ysqlarec 



120 
120 
116 
116 
116 
83 



130 



180 




G^JLSLGGFHERGQDWEGgQKIYNCHVLLNSKGgWA4jYRKTHLCDVEIPGQGPMgESNS 
GyjjWLSLGGFHERGQDWEQ|)QKI YNCHVLLNS KG^WAgjYRKTHLCD VE I PGQGPM§ESN§ 
GIWLSLGGFHERGQDWEQNQKIYNCHVLLNSKGSWASYRKTHLCDVEIPGQGPMRESNY 
GIWLSLGGFHERGQDWEQNQKI YNCHVLLNS KGS WAS YRKTHLCDVE I PGQGPMRE S NY 
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gi 


1 18204913 | 


117 


gi 


12836591 | 




NOV6 


181 


gi 


5031947) 


181 


gi 


3242980) 


177 


gi 


6754856) 


177 


gi 


18204913 | 


177 


gi 


12836591) 


144 



GIWLSLGGFHERGQDWEQNQKIYNCHVLLNSKGSVVASYRKTHLCDVEIPGQGPMRESNY 
GIVTLSLGGFHERGQDWEQNQKIYNCHVLLNSKGSWASYRKTHLCDVEIPGQGPMRES 



176 
Jgj 143 



190 



200 



210 



220 



230 



240 




250 



260 



15 



20 



N0V6 


241 


gi 


5031947 | 


241 


gi 


3242980) 


237 


gi 


6754856) 


237 


gi 


18204913) 


237 


gi 


12836591) 


204 



270 



280 



290 



LLRARAI ES Q C YVffiAAAQ CGRHHEgRAS Y GH SM WDPVJ GT WARC SEG P GL CL ARI DLf 
LLRARAI EyQ C YVgjAAAQ CGRHHEjJrA S YGHS MWD P WGT WARCS EGPGLCLAR I DL{ 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMVVDPV7GTWARCSEGPGLCLARIDLHF 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMWDPWGTWARCSEGPGLCLARIDLHF 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMWDPWGTWARCSEGPGLCLARIDLHF 
LLRARAI ESQCY VI AAAQCGRHHETRASYGHSMVVDPVIGT WARC SEGPGLCLARIDLHF 



300 

d 

T% 300 



300 
296 
296 
296 
263 
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30 



NOV6 


301 


gi 


5031947) 


301 


gi 


3242980) 


297 


gi 


6754856) 


297 


gi 


18204913) 


297 


gi 


12836591) 


264 



310 
..I.. 



320 



LgQgRgHLPVFQHRRPDLYGgLGHPLS 
L§Q2RgHLPVFQHRRPDLYG§ LGH PLS 
LQQMRQHLPVFQHRRPDLYGSLGHPLS 
LQQMRQHLPVFQHRRPDLYGSLGHPLS 
LQQMRQHLPVFQHRRPDLYGSLGHPLS 
LQQMRQHLPVFQHRRPDLYGSLGHPLS 



327 
327 
323 
323 
323 
290 



35 



Tables 6E list the domain description from DOMAIN analysis results against NOV6. 
This indicates that the NOV6 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 6E. Domain Analysis of NOV6 



gnl | Pfamjpf am00795, CN_hydrolase , Carbon -nitrogen hydrolase. This 
family contains hydrolases that break carbon -nitrogen bonds. The 
family includes: Nitrilase EC-.3.5.5.1, Aliphatic amidase EC:3.5.1.4, 
Biotidinase EC: 3. 5. 1.12, Beta-ureidopropionase EC: 3. 5. 1.6. (SEQ ID 
NO: 803) 

CD-Length a 267 residues, 100. 0% aligned 
Score « 273 bits (698), Expect = le-74 



40 



45 



50 



NOV 6: 


51 


Sbjct: 


1 


NOV 6: 


107 


Sbjct: 


61 


NOV 6: 


167 


Sbjct: 


116 


NOV 6: 


224 


Sbjct: 


166 


NOV 6: 


279 



VCQVTSTP-DKQQNFKTCAELVREAARLGACIiAFIiPEAFDFI ARDPAETIiHLSEPLG 106 

i iii+ii* 111+ ii i mi + n +i + 

AVQAEPVPEDUVftOTjQKAEEliIEEAAKAGAEIi\7VFPEAFIPGYPYCKSDAEyYENAEAID 60 



1+ + ++III+ 1+ + II 1+ l+ll II++ I ++ III || 

GEETQFLSRLARKNGIVIVLGVSEREGEG KLYNTAVLIDPDGKLIGKYRKIHLFT 115 



I I II 11+11 +III+IIIII+ III I III 

-GGSGFPVFDTPVGKLGLI.ICYDIRFPELARALALKGABIL 165 



+IIIII II +III+I llllll ll+l || | I + 



-SYGHSMWDPW 278 
IIIII++II 
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10 



i i + i i ii + i in- + ♦ i+ + . mini 

Sbjct: 226 GKVLAAPAEEEEGLIIADIDLSRIAEARQKMDFLGHRRPDIiY 267 

The tumor suppressor gene FHIT encompasses the common human chromosomal 
fragile site at 3pl4.2 and numerous cancer cell biallelic deletions. In human and mouse, the 
nitrilase homologs and Fhit are encoded by two different genes: FHIT and NIT1, localized on 
chromosomes 3 and 1 in human, and 14 and 1 in mouse, respectively. 

Bacterial and plant nitrilases are enzymes that cleave nitriles and organic amides to the 
corresponding carboxylic acids plus ammonia. The NIT1 gene is expressed as alternatively 
spliced transcripts. The major NIT1 transcript encodes a deduced 327-amino acid protein that 
shares 90% amino acid sequence identity with mouse Nitl, 58% identity with the nitrilase 
domain of C. eiegans NitFhit, and 53% identity with the nitrilase domain of Drosophila 
NitFhit. The NIT1 gene spans approximately 3.2 kb and contains 7 exons. Northern blot 
15 analysis detected NIT1 transcripts of approximately 1 A and 2:4 kb in all adult tissues 

examined, namely heart, brain, lung, liver, pancreas, kidney, skeletal muscle, and placenta. An 
approximately 1 .2-kb NIT1 transcript was found in skeletal muscle and heart. 

The loss of Fhit expression in several common human cancers is well documented. 
The disclosed NOV6 nucleic acid of the invention encoding a Nitrilase- 1 -like protein 
20 includes the nucleic acid whose sequence is provided in Table 6A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 6A while still encoding a protein that maintains 
its Nitrilase-l-Iike activities and physiological functions, or a fragment of such a nucleic acid. 
The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1 percent of the 
bases may be so changed. 

The disclosed NOV6 protein of the invention includes the Nitrilase- Mike protein 
35 whose sequence is provided in Table 6B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
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6B while still encoding a protein that maintains its Nitrilase-l-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 1 8 
percent of the residues may be so changed. 

The protein homology information, expression pattern, and map location for the 
5 Nitrilase-l-like protein and nucleic acid (NOV6) disclosed herein suggest that NOV6 may 
have important structural and/or physiological functions characteristic of the Nitrilase-l-like 
family. Therefore, the NOV6 nucleic acids and proteins of the invention are useful in 
potential diagnostic and therapeutic applications. These include serving as a specific or 
selective nucleic acid or protein diagnostic and/or prognostic marker, wherein the presence or 

10 amount of the nucleic acid or the protein are to be assessed, as well as potential therapeutic 
applications such as the following: (i) a protein therapeutic, (ii) a small molecule drug target, 
(iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) a 
nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a composition 
promoting tissue regeneration in vitro and in vivo. 

1 5 The NOV6 nucleic acids and proteins of the invention are useful in potential diagnostic 

and therapeutic applications implicated in various diseases and disorders described below 
and/or other pathologies. For example, the compositions of the present invention will have 
efficacy for treatment of patients suffering from cancer, muscle conditions, disorders and 
diseases, longevity, and/or other pathologies/disorders. The NOV6 nucleic acid, or fragments 

20 thereof may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

NOV6 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 

25 art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. For example the disclosed NOV6 protein have multiple hydrophilic regions, 
each of which can be used as an immunogen. This novel protein also has value in 
development of powerful assay system for functional analysis of various human disorders, 
which will help in understanding of pathology of the disease and development of new drug 

30 targets for various disorders. 
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NOV7 

NOV7 includes three novel cleavage signal- 1 protein-like proteins disclosed below. 
The disclosed sequences have been named NOV7a, NOV7b, NOV7c, and NOV7d. 

NOV7a 

5 A disclosed NOV7a nucleic acid of 1 822 nucleotides (also referred to as CG56613-0 1) 

encoding a novel cleavage signal-1 protein-like protein is shown in Table 7A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 98-100 
and ending with a TAA codon at nucleotides 839-841. A putative untranslated region 
upstream from the initiation codon is underlined in Table 7A. The start and stop codons are in 
10 bold letters. 



Table 7A. NOV7a nucleotide sequence (SEQ ID NO:35). 

GC^GCTGACGCAGCATTGCCAATTCTAAATCCATGATT^ 

CCAGGGTTTGAGAAATTCAGTCCGAATGGAACTTCAGGACCTGGAACTGC^GCTGGAr^AnrnrrT^r^r!^ 
CCTGGAGGAGCAGCTTCGTGCTGTGCGCATGCCTT 

CAGTAGAAGCACTGATAACTTGTCATGCCCTT C TCCAT TGAATGTAATGGAAC CAGT CA CTGAACTGATGCA 

GGAGCAGTCATACCTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGA 

AAGCTCAGAATCTGTTTTTTCCAAGCAACGATCAGAATCATCTTCTATATGTTCTGGTCCCTCTCATGCTAA 

CAGAAGAACTGGAGTACCTTCTACTGCCTCAGTGGGCAAATCCAAAACCCCATTAGTGGC^ 

GTTCCGAGCATCGGTGGCTCTAACGCCAAC^GCTCCrTCrrAGAACAGGCTCTGTGCAGA 

GGAAAGTTC TGAGGAAGTTGATG CAG CTGAAGGAGCCC CAGAAGTT GTAGG ACC TAAAT CTGAAGTGGAAGA 

AGGGCAT GGAAAACT CC CAT CAATG C CAGCTG CT G AGGAAATGCAT AAAAATGTGGAGCAAGATG AGTTGCA 

GCAAGTCATACGGGAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTTTTGGC 

AGCAGTATCTTCAAGTAAAGCGTCTAATTCTAAGCAAGATTATC^TTA AACAGAAATTATAGGTTGGCATGG 

ATCCTATTAGCTGTGTAATACTGGAATTATCAATGATATGCACTGGTGGAGGTGTTATTTGTGCTTTAGAj^ 

ATACTTGCTGTTGAGCTGGGCTACTGTATAC^^ 

ACGTACAT AGAAACTTAGGCACTTTGCTATTC 

TAATATTTATTTGTATGTCAATATTTTTCATTTGATTCCCTATTAGAATTAATTTTAAAACTTGAAGACTTC 

CAGACTTATCCAACTTATAAATAACATATTTCTTCAGACTAACATCTTAAAACACTGACCTCTATGAGG1M 

TTACTGTG CAATAACTGATTCATTTTTTTCAGAGCTTGAAGCATCCAATGATTTTTCCCT 

ATTAATGTCACTTCCAAGAAGAAAAACTGTTCTGTTGTAAAAAATATAATTGCTCTTAATTCTTGGGGAGGT 

TACTAATAGC&GTAGGATAGAATTTTATGAGGTCACCTACA 

TTG CTTTA CCCAAGACAAATGTAATTTTATCATTGCTTATGTAGTATTTTT CTTTTGGAAATGTG CCTTATG 

TTAAAGACT ATGTACTTTTACTTTTTGCATTGTCCAGACTTCTTTATTAGATGGAGATGTTTCCT 

CTT CTAGACTAAATAGAGTATCATCCAAATAATGGGG CCTATGACTTGAATGAATAGAAATGAATAAGCTGG 

TGTTTGTT TTTTCAAAATGGAAGTAATTTAGATTTGTTCTCCTCATACATAAAA 

AACCAGTGAAAACTTTGTTTTTATGAAAAAAAAGGAAAATGGTTTCCCATTTGGTTTTATATGTGTTAAATA 
AATGTGTAAAGTAACCACCCCC " ~ 



The disclosed NOV7a nucleic acid sequence, localized to chromosome 2, has 1 822 of 
1 828 bases (99%) identical to a gb:GENBANK-ID:HUMCSlPA|acc:M61 199.1 mRNA from 
15 Homo sapiens (Human cleavage signal 1 protein mRNA, complete cds) (E = 0.0). 

The disclosed NOV7a polypeptide (SEQ ID NO:36) encoded by SEQ ID NO:35 has 

247 amino acid residues and is presented in Table 7B using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV7a has a signal peptide and is likely 

to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV7A may also 
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localize to the mitochondrial matrix space with a certainty of 0.1 000, or the lysosome (lumen) 
with a certainty of 0.1000. 



Table 7B. Encoded NOV7a protein sequence (SEQ ID NO:36). 

MELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGM 

LGLGLGEMGFEIPPGESSESVFSKQRSESSSICSGPSHANWITGVPSTASVGKSKTPLVARKKVFRASVALT • 
P T AP SRTG S VQT P PDLE S S EE VD AAEGAP EWGP K.SEVEEGHGKLPSM P AAEEMHKNV EQDELQQVI RE I KE 
S I VGEIRREI VSGLLAAVSSSKASNSKQDYH 

5 A search of sequence databases reveals that the NOV7a amino acid sequence has 247 

of249 amino acid residues (99%) identical to, and 247 of 249 amino acid residues (99%) 
similar to, the 249 amino acid residue ptnr:SWISSPROT-ACC:P28290 protein from Homo 
sapiens (Human) (Sperm-Specific Antigen 2 (Cleavage Signal- 1 Protein) (CS-1)) (E = 6.1e* 

124 } 

10 NOV7a is predicted to be expressed in at least adrenal gland, bone marrow, brain - 

amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Aorta, Ascending 

15 Colon, Bone, Cervix, Cochlea, Colon, Dermis, Gall Bladder, Hypothalamus, Islets of 
Langerhans, Liver, Lung, Lymphoid tissue, Ovary, Parathyroid Gland, Parotid Salivary 
glands, Pineal Gland, Retina, Right Cerebellum, Skin, Tonsils, Umbilical Vein, Vein, Whole 
Organism. . 

20 NOV7b 

In the present invention, the target sequence identified previously, NOV7a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 

25 examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, ot by translated homology of the predicted exons to closely 

30 related human sequences sequences from other species. These primers were then employed in 
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PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
5 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 

10 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated Accession Number NOV7b (6 aminoacid 
different from NOV7a) and NOV7c (2 aminoacid different from NOV7a). 

A disclosed NOV7b nucleic acid of 806 nucleotides (also referred to as CG56613-02) 

1 5 encoding a novel cleavage signal-1 protein-like protein is shown in Table 7C. An open 

reading frame was identified beginning with an ATG initiation codon at nucleotides 21-23 and 
ending with a TAA codon at nucleotides 762-764. A putative untranslated region upstream 
from the initiation codon is underlined in Table 7C. The start and stop codons are in bold 
letters, and the 5' and V untranslated regions, if any, are underlined. 



20 



Table 7C. NOV7b nucleotide sequence (SEQ ID NO:37). 



GTTTGAGAAATTCAGTCCGAATGGAACTTCAGGACCTGGAACTGCAGCTGGAGGAGCGCCTGCTGGGCCTGG 

AGGAGCAGCTTCGTGCTGTGCGCATGCCTTCACCCTTCCGCTCCTCCGCACTCATGGGAATGTGTGGCAGTA 

GAAGCGCTGATAACTTGTCATGCCCTTCTCCATTGAATGTAATGGAACCAGTCACTGAACTGATGCAGGAGC 

AGTCATACCTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCT 

CAGAATCTGTTTTTTCCCAAGCAACATCAGAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCTAACAGAA 

GAACTGGAGTACCTTCTACTGTCTCAGTGGGCAAATCCAAAACCCCATTAGTGGCAAGGAAGAAAGTGTTCC 

^^ TCOTTG ^ TCTMraCC ^^ GCTCCWCTAGMmOTCTC ^TGCAGACACCTCCAGATTTGGAAA 

GCTCTGAGGAAGTTGATGCAGCTGAAGGAGCCC^GAAGTTGTAGGACCTAAATCTGAAGTGCSMGAAGGGC 

ATGGAAAACTCCCATCAATGCCAGCTGTTGAGGAAATGCATJiAAAATGTGGAGCAAGATGAGTO 

^^^ A ° ATO ^^ GTCTA ^ GTOG ^^TCAGACGGGAAATTGTAAGTGGACTTTTGGCAGCAG 

TATCTTCAAGTAAAGCGTCTAATTCTAAGCAAGATTATCATTAAACAGAAATTATAGGTTGGCATryiATr'rT 

ATTAGCTGTGTAAT " — " 



The disclosed NOV7b nucleic acid sequence, localized to chromosome 2, has 801 of 
812 bases (98%) identical to a gb:GENBANK-ID:HUMCSlPA|acc:M61 199.1 mRNA from 
Homo sapiens (Human cleavage signal 1 protein mRNA, complete cds) (E = 7.6e 171 ). 
25 The disclosed NOV7b polypeptide (SEQ ID NO:38) encoded by SEQ ID NO:37 has 

247 amino acid residues and is presented in Table 7D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV7b has no signal peptide and is 
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likely to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV7b may 
also localize to the mitochondrial matrix space with a certainty of 0.1000, or the lysosome 
(lumen) with a certainty of 0. 1 000. 



Table 7D. Encoded NOVTb protein sequence (SEQ ID NO:38). 

LGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPSTVSVGKSKTPLVARKKVFRASVALT 
PTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEVEEGHGKLPSMPAVEEMHKNVEQDELQQVIREIKE 
5 1 VGE I RRE I VSGLLAAVS S SKASNS KQD YH 

5 

A search of sequence databases reveals that the NOV7b amino acid sequence has 240 
of 249 amino acid residues (96%) identical to, and 242 of 249 amino acid residues (97%) 
similar to, the 249 amino acid residue ptnr:SWISSNEW-ACC:P28290 protein from Homo 
sapiens (Human) (Sperm-Specific Antigen 2 (Cleavage Signal-1 Protein) (CS-1)) (E = 9.7e" 
10 ™). 

NOV7b is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
15 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Aorta, Ascending 
Colon, Bone, Cervix, Cochlea, Colon, Dermis, Gall Bladder, Hypothalamus, Islets of 
Langerhans, Liver, Lung, Lymphoid tissue, Ovary, Parathyroid Gland, Parotid Salivary 
glands, Pineal Gland, Retina, Right Cerebellum, Skin, Tonsils, Umbilical Vein, Vein, Whole 
Organism. . 

20 

NOV7c 

A disclosed NOV7c nucleic acid of 806 nucleotides (also refen-ed to as CG566 13-03) 
encoding a novel cleavage signal-1 protein-like protein is shown in Table 7E. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 21-23 and 
25 ending with a TAA codon at nucleotides 762-764. A putative untranslated region upstream 
from the initiation codon is underlined in Table 7E. The start and stop codons are in bold 
letters, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 7E. NOV7c nucleotide sequence (SEQ ID NO:39). 



GTTTGRGAAATTCTVGTCCGAA TGGRACTTCAGnAPrTnna Rr^ararerTrena rtr; a ans^n^C, r TGCCCCTCC 

AGGAGCAGCTTCGTGCTGTGCGCATGCCTT^ 

G^GCGCTGATAACTTGTCATGCCCT^^ 

AGTCATACCTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCT 
CAGAATCTGTTTTTTCCCAAGCAACATC^GAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCT 
GAACTGGAGTACCTTCTACTGCCTCAGTGGGCAAATCCAAAACCCCATTA 
GAGCATCGGTGGCTCTAACGCCAAGAGCTCCTTCTAGM 

GTTCTGAGGAAGTTGATGCAGCTGAAGGAGCCCCAGAAGTTGTAGGACCTAAATCTGAAGTGGAAGAAGGGC 
ATGGAAAACTCCCATCAATGCCAGCTGCTGAGGAAATGCATAAAAATGTGGAGC^ 
TCATACGGGAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGG 
TATCTTCAAGTAAAGCGTCTAATTCTAAGCAA^ 

ATTAGCTGTGTAAT 



The disclosed NOV7c nucleic acid sequence, localized to chromosome 2, has 803 of 
812 bases (98%) identical to a gb:GENBANK-ID:HUMCSlPA|acc:M61 199.1 mRNA from 
Homo sapiens (Human cleavage signal 1 protein mRNA, complete cds) (E = 1 .2e~ 171 ). 

The disclosed NOV7c polypeptide (SEQ ID NO:40) encoded by SEQ ID NO:39 has 
247 amino acid residues and is presented in Table 7F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV7c has no signal peptide and is 
likely to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV7f may 
also localize to the mitochondrial matrix space with a certainty of 0.1000, or the lysosome 
(lumen) with a certainty of 0.1000. 



Table 7R Encoded NOV7c protein sequence (SEQ ID NO:40) 



MELQDLELQLEERLLGLEEQL^ 

LGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPST^ 

PTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEVE^ 

SI VGE IRRE I VSGIiLAAVSSSKASNSKQDYH 



A search of sequence databases reveals that the NOV7c amino acid sequence has 242 
of 249 amino acid residues (97%) identical to, and 244 of 249 amino acid residues (97%) 
similar to, the 249 amino acid residue ptnr:SWESNEW-ACC J>28290 protein from Homo 
sapiens (Human) (Sperm-Specific Antigen 2 (Cleavage Signal-1 Protein) (CS-1)) (E = 1.4e" 

12I } 

NOV7c is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Aorta, Ascending 
Colon, Bone, Cervix, Cochlea, Colon, Dermis, Gall Bladder, Hypothalamus, Islets of 



71 



WO 02/068649 PCT/US02/02785 

Langerhans, Liver, Lung, Lymphoid tissue, Ovary, Parathyroid Gland, Parotid Salivary 
glands, Pineal Gland, Retina, Right Cerebellum, Skin, Tonsils, Umbilical Vein, Vein, Whole 
Organism- . 
NOV7d 

5 A disclosed NOV7d nucleic acid of 705 nucleotides (also referred to as 174307820) 

encoding a novel cleavage signal-1 protein-like protein is shown in Table 7G. An open 
reading frame was identified beginning with an AGA initiation codon at nucleotides 1-3 and 
ending with nucleotides 703-705. The start codon is in bold letters, and the 5* and 3' 
untranslated regions, if any, are underlined. Because the start codon is not a traditional 

1 0 initiation codon, and there is no stop codon, NOV7d could be a partial open reading frame 
extending further in the 5 * and 3 ' directions. 



Table 7G. NOV7d nucleotide sequence (SEQ ID NO:41). 

AGATCTCCCACCATGGAACTTCAGGACCT 

CTTCGTGCTGTGCGCATGCCTTCACCCTTC^ 

GATAACTTGTCATGCCCTTCTCGATO 

CTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGG 

GTTTTTTCCCAAGCAACATCAGAATCATCT 

GTACCTTCTACTGCCTGAGTGGGCAAATCCAAAA 

GTGGCTCTAACGCCAACAGCTCCTTCTAGAACAGGCT 

GAAGTTGATGCAGCTGAAGGAGCCCCAGAAGTTGTAGGACCTAAATCTGAAGTGGAAG^ 
CTCCCATCAATGCCAGCTGCTGAGGAAATGCAT^ 

GAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTCGAG 

The disclosed NOV7d polypeptide (SEQ ID NO:42) encoded by SEQ ID NO:41 has 
1 5 235 amino acid residues and is presented in Table 7H using the one-letter amino acid code. 

Table 7H. Encoded NOV7d protein sequence (SEQ ID NO:42). 

RSPTMEIiQDLELQLEERLLGLE 

LKSELGJ^LGEMGFEIPPGESSESVFSQATSESSSVCSGPSHAN^ 

VALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEVEEGHGKLPSMPAAEEMHKNVEQDELQQ . 
E I KES IVGEIRRE I VSGLE 

NOV7e 

A disclosed NOV7e nucleic acid of 759 nucleotides (also referred to as 174307820) 
encoding a novel cleavage signal-1 protein-like protein is shown in Table 71. An open reading 
20 frame was identified beginning with an AGA initiation codon at nucleotides 1-3 and ending 
with nucleotides 757-759. The start codon is in bold letters, and the 5' and 3' untranslated 
regions, if any, are underlined. Because the start codon is not a traditional initiation codon, and 
there is no stop codon, NOV7e could be a partial open reading frame extending further in the 
5* and 3* directions. 

25 
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Table 7L NOV7e nucleotide sequence (SEQ ID NO:323). 

CTTCGTGCTGTGCGCATGCCTTC^CCCTTCCGCTCCTCCG^ 
GATAACTTGTCATGCCCTTCTCCAT^ 

CTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCTCAGAATCT 

GTTTTTTCCCAAGCAACATCAGAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCT 

GTACCTTCTACTGCCTraGTGGGCAAATC^ 

GTGGCTCTAACGCCAAC^GCTCCTTCTAGAACAGGCTCTGTGCAGA^ 

GAAGTTGATGCAGCTGAAGGAGCCCCAGAAGTTGTAGGACCTAAATCTGAAGTGGAAGAAGGGC^TGGAAM 
CTCCCATCAATGCCTIGCTGCTGAGGAAATGCATAAAAATGTGGAGCAAGAT^ 
GAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACT 
AGTAAAGCGTCTAATTCTAAGCAAGATTATCATCTCGAG 



The disclosed NOV7e polypeptide (SEQ ID NO:324) encoded by SEQ ID NO:323 has 
253 amino acid residues and is presented in Table 7J using the one-letter amino acid code. 



Table 7 J. Encoded NOV7e protein sequence (SEQ ID NO:324). 

RSPTMELQDLELQLEERIjLGLEEQLI^VR 

kKSELGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGW^ 
VALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEVEEGHGKLPSMPAAEE>MKNVEQD 
EIKES IVGEIRREIVSGLLAAV5SSK7VSNSKQDYHLE 



NOV7a also has homology to the amino acid sequence shown in the BLASTP data 
listed in Table 7K. 



Table 7K. BLAST results for NOV7a 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 15620913 | db j | BAB6 
7820.1 | (AB067514) 


KIAA1927 protein 
[Homo sapiens] 


772 


242/247 
(97%) 


, 244/247 
(97%) 


e-109 


gi 1 16159686 |ref|XP 0' 
57458.1) (XMJ>57458) 


sperm specific 
antigen 2 [Homo 
sapiens] 


727 


242/247 
(97% 


244/247 
(97%) 


e-108 


gi| 15277922 jgb|AAH12 

947.1|AAH12947 

(BC012947) 


Unknown (protein 
for MGC: 21202) 
[Homo sapiens] 


267 


242/247 
(97%) 


244/247- 
(97%) 


e-102 


gi | 5803179 |ref|NP 00 
6742. 1| (NM_006751) 


sperm specific 
antigen 2; 
KIAA1927 protein 
[Homo sapiens] 


249 


247/249 
(99%) 


247/249 
(99%) 


e-102 


gi | 18017599 |ref|NP 5 
42125. 1| (NM_080558) 


sperm specific 
antigen 2 iMus 
musculus] 


264 


197/248 
(79%) 


212/248 
(85%) 


9e-81 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
10 in Table 7L. 

Table 7L. Information for the ClustalW proteins 

1) NOV7a (SEQ ID N0:36) 

2) NOV7b (SEQ ID N0:38) 

3) NOV7C (SEQ ID NO:40) 
15 4) NOV7d (SEQ ID NO: 42) 

5) NOV7e (SEQ ID NO: 324) 

5) gi|l5620913|dbj|BAB67820.l| (ABO 6 75 14) KIAA1927 protein [Homo sapiens] (SEQ ID 
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6) gi | 16159686 |ref|XP_057458.l| (XM_057458) sperm specific antigen 2 [Homo sapiens] 
(SEQ ID NO:348) 

7) gi|l5277922|gb|AAH12947.l|AAHl2947 (BC012947) Unknown (protein for MGC:21202) 
[Homo sapiens] (SEQ ID NO: 349) 

5 8) gi | 5803179 |ref|NP_006742.l| (NM_00675l) sperm specific antigen 2; KIAA1927 
protein [Homo sapiens] (SEQ ID NO: 350) 

9) gi | 18017599 | ref |NP_542125 . 1 | (NM_080558) sperm specific antigen 2 [Mus musculus] 
(SEQ ID NO:35l) 



10 



15 



20 



N0V7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 

gi | 15620913 
gi|l6159686 
gi (15277922 
gi 1 5803179 1 
gi | 18017599 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|.. ..)....).... )....) 

1 1 

1 - - 1 

1 - 1 

1 - " -- 1 

1 — - - 1 

1 YSHI FLFIDHLLRTASQHSDS SGFAEDSTDCLSLNHLQVQESLQAMGSSADSCDSETTVT 60 

1 MGSSADSCDSETTVT 15 

1 - 1 

1 - 1 

1 1 



25 



30 



35 



NOV7a 
HOV7b 
NOV7C 
NOV7d 
NOV7e 



gi 
g± 
gi 
gi 
gi 



15620913) 
16159686 | 
15277922) 
5803179 | 
18017599 | 



70 80 90 100 110 120 

. ... | .... | | .... | | .... | .... | .. . . J .... | .... | .... | .... | 

1 — " - 1 

1 1 

1 " - 1 

1 " 1 

1 - - 1 

61 SLGEDLATPTAQDQPYFNESEEESLVPLQKGLEKAAAVADKRKSGSQDFPQCNTIENTGT 120 

16 SIX3EDIATPTAQDQPYFNESEEESLVPLQKGLEKAAAVM)KRKSGSQDFPQCNTIENTGT 75 

1 -. 1 

1 " " 1 

1 1 



40 



45 



50 



55 



60 



65 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 



gi 
gi 
gi 
gi 
gi 



15620913 
16159686 
15277922 
5803179 | 
18017599 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
N0V7e 



gi 
gi 
gi 
gi 
gi 



15620913 
16159686 
15277922 
5803179) 
18017599 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 



130 140 150 160 170 180 

| | | .... | | | | | | | | | 

1 i 

1 " 1 

1 1 

1 — - 1 

1 " 1 

121 KQSTCSPGDHIIEITEVEEDLFPAETVELLREASAESDVGKSSESEFTQYTTHHILKSLA 180 

76 KQSTCSPGDHIIEITEVEEDLFPi^TVELLI^ASAESDVGKSSESEFTQYTTHHILKSLA 135 

1 " 1 

1 " - 1 

1 - - — 1 

190 200 210 220 230 240 

| | | |....|....| |.. ..(.... | | | | 

1 - -- 1 

1 - - 1 

1 — " -- 1 

1 " "-- - -- 1 

1 " 1 

181 SIEAKCSDMSSENTTGPPSSMDRVNTALQI^QMKVCSLSNQRMGRSLLKSKDLIiKQRYLF 240 

136 SIEAKCSDMSSENTTGPPSSMDRVOTALQRAQMKVCSLSNQRMGRSLLKSKDLLKQRYLF 195 

1 " - - 1 

! _ 1 

! „ _ ! 

250 260 270 . 280 290 300 

1 - - i 

i i 

i - - i 

i - i 

i — — i 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



gi | 15620913 | 
gi (16159686 j 
gi I 15277922 | 
gi ( 5803179 I 
gi j 18017599 1 



NOV7a 
NOV7b 
NOV7c 
N0V7d 
NOV7e 

gi 1 15620913 
(16159686 
(15277922 
| 5803179 | 
(18017599 



0±| 
Si I 

si I 

Si I 



NOV7a 
N0V7b 
M0V7c 
NOV7d 
H0V7e 



gi 
gi 
gi 
gi 
gi 



15620913 
16159686 
15277922 
5803179 | 
18017599 | 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 



gi 
gi 
gi 
gi 
gi 



15620913 
16159686 
15277922 
5803179] 
18017599 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 

gi | 15620913 | 
gi (16159686 | 
gi | 15277922 j 
gi 1 5803179 | 
gi j 18017599 | 



NOV7a 
KOV7b 
NOV7c 
N0V7d 
NOV7e 



gi 
gi 
gi 
gi 
gi 



15620913 
16159686 
15277922 
5803179) 
18017599 



241 AKAGYPLRRSQSLPTTLLSPVRWSSVNVRLSPGKETRCSPPSFTYKYTPEEEQELEKRV 300 
196 AKAG YPLRRS QSLPTTLLS PVR WSS VNVRLS PGKETRCS P PS FT YKYTPEEEQELEKR V 255 

31° 320 330 340 3S0 3S0 

! ' j 

301 MEHTCQSLVKSTIFISPSSVKKEEAPQSEAPRVEECHHGRTPTCSRLAPPPMSQSTCSLH 360 
256 MEHDGQSLVKSTIFISPSSVKKEEAPQSEAPRVEECHHGRTPTCSRLAPPPMSQSTCSLH 315 

370 380 390 400 410 420 
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The cleavage signaM protein (CS-1), a doublet antigen comprised of approximately 
14-kDa and 1 8-kDa proteins has been shown to be present on the surface of sperm of various 
mammalian species including humans. Polyclonal antibodies to CS-1 inhibit the early 
cleavage of fertilized eggs without apparently affecting sperm penetration and pronuclear 
formation. The human CS-1 cDNA has been cloned and expressed in vitro to obtain the 
recombinant protein (reCS-1) molecule. The CS-1 cDNA clone has been isolated by 
immunological screening of a human testis lambda gtl 1 cDNA library with mono-specific 
polyclonal antibody against CS-1 . The cDNA is 1 828 bp long; the start codon assigned to the 
first ATG (bp 98-100) encodes a protein with 249 amino acid residues terminating at TAA (bp 
845-847). 

XCS-1 is a maternally expressed gene product that is the Xenopus homologue of the 
human cleavage signal protein (CS-1). XCS-1 may play an important role in regulating mitosis 
during early embryogenesis in Xenopus laevis. XCS-1 transcripts have been detected in 
oocytes. During development the XCS-1 protein has been detected on the membrane and in 
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the nucleus of blastomeres. It has also been detected on the mitotic spindle in mitotic cells and 
on the centrosomes in interphase cells. Overexpression of myc-XCS-1 in Xenopus embryos 
results in abnormal mitoses with increased numbers of centrosomes, multipolar spindles, and 
abnormal distribution of chromosomes. Incomplete cytokinesis resulting in multiple nuclei 
5 residing in the same cytoplasm with the daughter nuclei in different phases of the cell cycle 
has been observed. The phenotype depended on the presence of the N terminus of XCS-1 (aa 
1-73) and a consensus NIMA kinase phosphorylation site (aal59-167). Mutations in this site 
affect the ability of the overexpressed XCS-1 protein to produce the phenotype. 

The disclosed NOV7 nucleic acid of the invention encoding a Cleavage signal-1 
1 0 protein-like protein includes the nucleic acid whose sequence is provided in Table 7A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 7A while still encoding a 
protein that maintains its Cleavage signal-1 protein-like activities and physiological functions, 
or a fragment of such a nucleic acid. The invention further includes nucleic acids whose 

1 5 sequences are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

20 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 1 percent of the bases may be so changed. 

The disclosed NOV7 protein of the invention includes the Cleavage signal-1 protein- 

25 like protein whose sequence is provided in Table 7B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 2 while still encoding a protein that maintains its Cleavage signal-1 protein-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 21 percent of the residues may be so changed. 

30 The invention further encompasses antibodies and antibody fragments, such as F ab or 

(Fab)2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Cleavage signal-1 protein-like 
protein (NOV7) is a member of a "Cleavage signal-1 protein family". Therefore, the NOV7 
nucleic acids and proteins identified here may be useful in potential therapeutic applications 
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implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
5 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV7 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in regulation of the cell cycle during early embryogenesis, 
and therefore may have potential application in the management of embryonic defects. 

10 Additionally, this antigen may also be involved in human immunoinfertility and therefore may 
have application in the treatment of infertility, and/or other diseases or pathologies. 

NOV7 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immuno-specifically to the novel NOV7 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 

1 5 art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV7 protein has multiple hydrophilic regions, each of which 
can be used as an immunogen. These novel proteins can be used in assay systems for 
functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 

20 

NOV8 

A disclosed NOV8 nucleic acid of 2838 nucleotides (also referred to as 1 5347245 1) 
encoding a novel Matriptase-like protein is shown in Table 8A. An open reading frame was 
identified beginning with an TAG initiation codon at nucleotides 8-10 and ending with a TGA 
25 codon at nucleotides 2279-2281 . The start and stop codons are in bold letters in Table 8 A, and 
the 5* and 3 5 untranslated regions, if any, are underlined. 



Table 8A. NOV8 nucleotide sequence (SEQ ID NO:43). 

GGGGACCA TGGGGAGCGATCGGGCCCGCAAGGGCGGAGGGGGCCCGAA^ACTTC^mCt^ACTPAAGTA 

CAACTCCCGGCACGAGAAAGTGAATGGCTTGGAGGAAGGCGTGGAGTTCCT 

GGTGGAAAAGCATGGCCCGGGGCGCTGGGTGGTGCTGGCAGCCGTGCTGAT 

GGCCGAGCGCGTCATGGCCGAGGAGCGOSTAGTCATG^ 

GGTCACCTCAGTGGTGGCTTTCCCGACGGACTCCAAAAC^ 

TGGCCTGCACCCCCGCGGTGTGGAGCTGATGCGCTTCACCACGCCCGGCTTCCCTGACAGCCCCTACCCCGC 
TCATGCCCGCTGCCAGTGGGCCCTGCGGGGGGACGCCGACTCAGTGCTGAGCCTCACCTTCCGCAGCTTT^ 

CGCCCTGGTGCAGTTGTGTGGCACCTACCCTCCCTCCTACAACCTGACCTTCCACTCCTCCCAGAACGTCCT 
GCTCATCACACTGATAACCAACACTGAGCGGCGGCATCCCGGCTT^ 
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GATGAGCAG CTGTGGAGGCCGCTTACGTAAAGCCCAGGGGACATTCAACAG CCCCTACTACCCAGGCCACTA 

CCCACCCAACATTGACTGCACATGGAACATTGAGGTGCCCAACAACCAGCATGTGAAGGTGAGCTTCAAATT 

CTTCTACCTGCTGGAGCCCGGCGTGCCTGCGGGCACCTGCCCCAAGGACTACGTGGAGATCAATGGGGAGAA 

ATACTGCGGAGAGAGGTCCCAGTTCGTCGTCACCAGCAACAGC^^CAAGATCACAGTTCGCTTCCACTCAGA 

TCAGTCCTACACCGACACCGGCTTCITAGCTGAATACCTCTCCTACGACTCCAGTGACCCATGCCCGGGGC^ 

GTTCACGTGCCGCACGGGG CGGTGTATCCGGAAGGAGCTGCG C TGTGATGG CTGGGCCGACTGCACCGAC CA 

CAGCGATGAGCTCAACTGCAGTTGCGACGCCGGCCACCAGTTCACGTGCAAGAACAAGTTCTGCAAGCCCCT 

CTTCTGGGTCTGCGACAGTGTGAACGACTGCGGAGACAACAGCGACGAGCAGGGGTGCAGTTGTCCGGCCCA 

GACCTT CAGGTGTT CCAATGGGAAGTGCCT CTCGAAAAGCCAG CAGTGCAATGGGAAGGACGACTGTGGGGA 

CGGGTCCGACGAGGCCTCCTGCCCQ^GGTGAACGTCGTCACTTGTACCAAAC^CACCTACCGCTGCCTC^ 

TGGGCTCTGCTTGAGCAAGGGCAACCCTGAGTGTGACGGGAAGGAGGACTGTAGCGACGGCTCAGATGAGAA 

GGACTGCGACTGTGGGCTGCGGTCATTCACGAGACAGGCTCGTGTTGTTGGGGGCACGGATGCGGATGAGGG 

CGAGTGGCCCTGGCAGGTAAGCCTGCATGCTCTGGGCCAGGGCCACATCTGCGGTGCTTCCCTCATCTCTCC 

CAACTGGCTGGTCT CTGCCG CACACTGCTACATCGATGACAGAGGATTC AGGTACTCAG ACCCCACG CAGTG 

GACGGCCTTCCTGGGCTTGCACGACCAGAGCCAGCGCAGCGCCCCTGGGGTGCAGGAGCGCAGGCTCAAGCG 

CATCATCTCCCACCCCTTCTTCAATGACTOCACCTTCGACTATGACATCGCGCTGCTGGAGCTGGAGAAACC 

GGCAGAGTACAGCTCCATGGTGCGGCCCATCTGCCTGCCGGACGCCTCCCATGTCTTCCCTGCCGGCAAGGC 

CATCTGGGTCACGGGCTGGGGACACACCCAGTATGGAGGCACTGGCGCGCTGATCCTGCAAAAGGGTGAGAT 

CCGCGTC^TC^CC^GACCACCrGCGAGAACCrCCTGCCGCAGCAGATCACGCroCGCA^ 

CTTCCTCAGCGGCGGCGTGGACTCCTGCCAGGGTGATTCCGGGGGACCCCTGTCCAGCGTGGAGGCGGATGG 

GCGGATCTTCCAGGCCGGTGTGGTGAGCTGGGGAGACGGCTGCGCTCAGAGGAACAAGCCAGGCGTGTACAC 

AAGGCTCCCTCTGTTTCGGGACTGGATCAAAGAGAACACTGGGGTATAGGGGCCGGGGCCACCCAAATGTGT 

ACACCTGCGGGGCCACCCATCGTCCACCCCAGTGTGCACGCCTGCAGGCTGGAGACTGGACCGCTGACTGCA 

CCAGCGCCCCCAGAACATACACTGTGAACTCAATCTCCAGGGCTCCAAATCTGCCTAGAAAACCTCTCGCTT 

CCTCAGCCTCCAAAGT GGAGCTGGGAGGTAGAAGGGGAGGACACTGGTGGTTCTACTGACCCAACTGGGGGC 

AAAGGTTT GAAGAC^CAGCCTCCCCCGCC^GCCCCAAGCTGGGCCGAGGCGCGTTTGTGTATATCTGCCTCC 

CCTGTCTGTAAGGAGCAGCGGGAACGGAGCTTCGGAGCCTCCTCAGTGAAGGTGGTGGGGCTGCCGGATCTG 

GGCTGTGGG GCCCTTGGGCCACGCTCTTGAGGAAGCCCAGGCTCGGAGGACCCTGGAAAACAGACGGGTCTG 

AGACTGAAATTGTTTTACC AGCTCCCAGGGTGGACTTCAGTGTGTGTATCT 

TATTTCTTTTTAAAAAAAAAAAAAAAAAAA ' ' " ~ 



The disclosed NOV8 nucleic acid sequence has 2644 of 2678 bases (98%) identical to 
a gb:GENBANK-ID:AFl 18224|acc:AFl 18224.2 mRNA from Homo sapiens (matriptase 
mRNA, complete cds) (E = 0.0). 
5 The disclosed NOV8 polypeptide (SEQ ID NO:44) encoded by SEQ ID NO:43 has 

757 amino acid residues is presented in Table 8B using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOV8 has a signal peptide and is likely to be 
localized in the plasma membrane with a certainty of 0.81 10. Alternatively, NOV8 is 
predicted to be localized to the Golgi body with a certainty of 0.3000, to the endoplasmic 
1 0 reticulum (membrane) with a certainty of 0.2000, or to the microbody (peroxisome) with a 
certainty of 0.1527. The most likely ceavage site for NOV8 is between positions 8 and 9, 
ARK-GG. 



Table 8B. Encoded NOV8 protein sequence (SEQ ID NO:44). 

MGSDRARKGGGGPKDFGAGLKYNSRHEKVNGLEEGVEF^ 

RVMAEERVVMLPPRARSLKSFWTSW^ 

RCQWALRGDADSVLSLTFRSFDLASCDERGSDLVT^ 

TLI TNTERRHPGFEATFFQLPRMS SCGGRLRKAQGTFNS P YYPGHYPPNI DCTWNIEVPNNQHVKVSFKFFY 
LLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTSNSNKI 

CRTGRCIRKEIjRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTF 
RCSNGKCLSKSQQC^GKDDCGTCSDEASCP 

DCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGAStilSPNWLVSAAHCYIDDRGFRYSDPTQWTA 
FLGLHDQSQRS APGVQERRLKRI I SHPFFITOFTFDYDIALLELEKPAEYSSMVRP I CLPDASHVFPAGKAI W 
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VTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPFMMCVGFLSGGVDSCQGDSGGPLSSVEADGRI 
FQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 

A BLASTX of NOV8 shows that it has 699 of 729 amino acid residues (95%) identical 
to, and 702 of 729 amino acid residues (96%) similar to, the 855 amino acid residue 
ptnr:SPTREMBL-ACC:Q9Y5Y6 protein from Homo sapiens (Human) (Matriptase) (E = 0.0). 
5 NOV8 is predicted to be expressed in at least the following tissues: Adrenal 

Gland/Suprarenal gland, Aorta, Ascending Colon, Bone Marrow, Brain, Bronchus, Cartilage, 
Colon, Duodenum, Gall Bladder, Heart, Islets of Langerhans, Kidney, Kidney Cortex, Lung, 
Mammary gland/Breast, Ovary, Pancreas, Parathyroid Gland, Parotid Salivary glands, 
Peripheral Blood, Pituitary Gland, Placenta, Prostate, Small Intestine, Stomach, 
1 0 Thymus/Thyroid, Tonsils, Uterus, Vulva, Whole Organism. 

In addition, NOV8 is predicted to be expressed in breast cancer, according to NOV8 
nucleic acids, polypeptides, and antibodies Accordingly to the invention will have diagnostic 
and therapeutic applications for the detection of breast cancer. 

The disclosed NOV8 polypeptide has homology to the amino acid sequences shown in 
1 5 the BLASTP data listed in Table 8C. 



Table 8C BLAST results for NOV8 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 10257390 |gb | AAG1 
5395.1|AF057145 1 
(AF057145) 

i 


serine protease 
TADG15 [Homo 
sapiens] 


855 


691/691 
(100%) 


691/691 
(100%) 


0.0 


gi | 11415040 |ref |NP 
068813. l| 
(NM_021978) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) ,* 
suppression of 
tumorigenicity 14 
(colon 
carcinoma) ; 
matriptase [Homo 
sapiens] 


855 


690/691 
(99%) 


690/691 
(99%) 


0.0 


gi| 12249015 |dbj |BAB 
20376. l| (AB030036) 


prostamin [Homo 
sapiens] 


855 


689/691 
(99%) 


689/691 
(99%) 


0.0 


gi| 7363445 |ref |NP 0 

35306.2) 

(NM 011176) 


protease, serine, 
14 (epithin) [Mus 
musculus] 


855 


573/691 
(82%) 


633/691 
(90%) 


0.0 


gi | 16758444 |ref)NP 
446087. l| 
(NMJJ53635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) [Rattus 
norvegicus] | 


855 


571/691 
(82%) 


632/691 
(90%) 


0.0 
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The homology between these and other sequences is shown graphically in the 
CiustalW analysis shown in Table 8D. In the ClustalW alignment of the NOV8 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (/.e., regions that may be required to preserve structural or 
5 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 8D. ClustalW Analysis of NOV8 



10 1) Novel NOV8 (SEQ ID N0:44) 

2) gi 1 10257390 | gb | AAG15395 . 1 1 AF057145_1 (AF057145) serine protease TADG15 [Homo 
sapiens] (SEQ ID NO: 352) 

3) gi 1 11415040 1 ref |NP_068813 . 1 1 (NM_021978) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) ; suppression of tumorigenicity 14 (colon carcinoma); 

15 matriptase [Homo sapiens] (SEQ ID NO:353) 

4) gi|l2249015ldbj|BAB20376.l| (AB030036) prostamin [Homo sapiens] (SEQ ID NO: 354) 

5) gi | 7363445 1 ref |NPJ}35306 . 2 j (NM_011176) protease, serine, 14 (epithin) [Mus 
muSCUlus] (SEQ ID NO: 355) 

6) gi 1 16758444 | ref |NP_446087 .1 1 (NM_053635) suppression of tumorigenicity 14 (colon 
20 carcinoma, matriptase, epithin) [Rattus norvegicus] (SEQ ID NO:356) 
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NOV8 
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gi 1 12249015 | 
gi I 7363445 | 
gi j 16758444 | 




NOV8 


66 
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gi 
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121 


gi 
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121 


NOV8 
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181 
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181 
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181 


gi 


16758444 


181 
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|....| 
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••I 



KgALKLLYSgVPgLGPYHKgSAVTAFSEGSVIAYYWSEFSIPg HLVEEAERVM AEERWM 
KSALKLLYSgV PgLGP YHKgS AVTAFS EGS VI AYYWS EFS I pS HLVEEAERVMAEE RWM 
Kj^KLLYS^PgLGPYHKjgSAVTAFS 

KgpaKLLYffly PfflliGPYHKBs AVTAFSEGS VI AYYWSEFS ipfi HL0EE^agMAj5ERVV' 
T 0»ALKL5YsBvpBLGPYHK§si?VTAFSEGSVIAYYWSEFSIpS HlSeeSSrBmaSeRW 
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NOV8 


143 


gi) 10257390) 


241 


gi 1 11415040 | 


241 


gi) 12249015) 


241 


gi|7363445| 


241 


gi (1575 8444 | 


241 


NOV8 


2 03 


gi)l0257390| 


301 


gi| 11415040) 


3 01 


gij 12249015) 


301 


gij 7363445) 


301 


gij 16758444) 


301 




N0V8 
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gi 


10257390 


361 


gi 


11415040 


361 


gi 


12249015 


361 


gi 


7363445) 


361 


gi 


16758444 


361 


Nove 


323 


gi 


10257390 


421 


gi 


11415040 


421 


gi 


12249015 


421 


gi|7363445 | 


421 


gij 167 58444 


421 
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380 



390 



400 



410 



420 



PPNIDCTWNIEVPNNQHVKVgFKFFYLLEPGVPAGTCPKDYVElNGEiCYCGERSQFWTS 
PPNIDCTWNIEVPNKQHVKVgFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 
PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTS 
PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTS 

PPtflScTWNIiWPlsnflS 



322 
420 
420 
420 
420 
420 



430 



440 



450 



460 



470 

..|.- 



480 



nsnkitvrfhsdqsytdtgflaeylsydssdpcpgqftcrtgrcirkelrcdgwadctdh 
wsnkitvrfhsdqsytdtgflaeylsydssdpcpgqftcrtgrcirkelrcdgwadctdh 
nsnkitvrfhsdqsytdtgflaeylsydssdpcpgqftcrtgrcirkelrcdgwadctd:! 
snkitvrfhsdqsytdtgflaeylsydssdpcpgqftcrtgrcirkelrcdgwadctdh, 

S|KITVgFHSDgSYTDTGFLAEYLSYDS^DPCPGjiF&gTGRCIRKELRCDGWADCraD 

'sSkitvRfhsdi«sytdtgfIiAEylsydsSdpcpgSfBcS^ 
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480 
480 
480 
480 
480 
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N0V8 

gi | 10257390) 
gi | 11415040 | 
gij 12249015 | 
gi (7363445 | 
gij 16758444 | 



N0V8 

gi | 10257390 | 
gi 1 11415040 j 
gij 12249015 j 
gij7363445| 
gij 16758444 | 
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gi 
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630 



640 



650 



I. 



660 



503 
601 
601 
601 
601 N 
601 jS 



DCDCGLRSFTRQARVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYID 

dcdcglrsftrqarwggtdadegewpwqvslhalgqghicgaslispnwlvsaahcyid) 
dcdcglrsftrqarwggtdadegewpwqvslhalgqghicgaslispnwlvsaahcyidi 
dcdcglrsftrqarwggtdadegewpwqvslhalgqghicgaslispntvlvsaahcyidI 

CDCGJiRSFTgQARWGGO^ADEGEWPWQVSLHALGQGHfflCGASLIS 

cdcglrsftBqarwggtSadegewpwqvslhalgq 



562 
660 
660 
660 
660 
660 



N0V8 



gi 
gi 
gi 
gi 
gi 



10257390) 
11415040) 
12249015 | 
7363445) 
16758444) 



563 
661 
661 
661 
661 
661 



670 

■ .I.. 



680 



690 



700 



710 



I. 



720 
..I 



DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFKDFTFDYDIALLELEKP 
DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 
DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 
DRGFRYSDPTQWTjjFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLEIiEKP 
D^FjgYSDKTjWTAFLGLjpQSSRSASGVQETOLKRI IjfapEjFNDFTFDYDIALLELEKj 

fS^dSot^aflglSdqsBrsaBgvq 



622 
720 
720 
720 
720 
720 
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730 



10 



15 



20 



25 



NOV8 


623 


gi 


10257330 


| 721 


gi 


11415040 


| 721 


gi 


12249015 


| 721 


gi 


7363445| 


721 


gi 


16758444 | 721 


NOV8 


683 


gi 


10257390 


781 


gi 


11415040 


781 


gi 


12249015 


781 


gi 


7363445) 


781 


gi 


16758444) 781 


NOV8 


743 


gi 


10257390 


841 


gi 


11415040 


841 


gi 


12249015 


841 


gi 


7363445 | 


841 


gi 


16758444 


841 



740 



750 
■ •I.. 



760 



I 



770 
..I.. 



780 



aeyss1wrpiclpdashvfpagkaiwvtgwghtqyggtgalilqkgeirvinqttcenll! 
ae ys s mvr p i clp d as hvfp agkai wvtg wghtq yggtg al il qkge ir vinqttcenll 
aeyssmvrpiclpdashvfpagkaiwtgwghtqyggtgalilqkgeirvinqttcenll 
aeyssivltopiclpdashvfpagkaiwvtgwghtqyggtgalilqkgeirvinqttcenll 
seysgvrpiclpdayhvfpagkaiwvtgwghtgggtgalilqkgeirvinqttcegl" 
aeysSS to p i clpdS3hvfpagkaiwvtgwght™ggtgalilqkgeirvinqttce1ll! 



682 
780 
780 
780 
780 
780 



790 
..|.. 



800 



810 



820 



830 



840 
. - I 



pqqitprmmcvgflsggvdscqgdsggplssveadgrifqagvvswgdgcaqrnkpgvyt 
pqqitprmmcvgflsggvdscqgdsggplssveadgrifqagwswgdgcaqrnkpgvyt 
pqqitprr^lmcvgflsggvdscqgdsggplssveadgrifqagwswgdgcaqrnkpgvyt 
pqqitprmmcvgflsggvdscqgdsggplssveadgrifqagwswgdgcaqrnkpgvyt 
pqqitprmyicvgflsggtoscqgdsggplssgebdgrjjfqagvvswgjsgcaqrnkpgvyt 
pqqitprmmcvgflsggv dscqgdsggplssveBdgrifqagwswgSgcaqrnkpgvyt 



850 




Tables 8E-8R list the domain descriptions from DOMAIN analysis results against 
30 NOV8. This indicates that the NOV8 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 8E. Domain Analysis of NOV8 

gnl| Smart |smart00020, TrypjSPc, Trypsin-like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO; 804) 

CD-Length =230 residues, 100.0% aligned 
Score « 259 bits (662) , Expect = 4e-70 



35 



40 



45 



NOV 8: 


516 


Sbjct: 


1 


NOV 8: 


576 


Sbjct: 


55 


NOV 8: 


636 


Sbjct: 


113 


NOV 8: 


693 


Sbjct: 


173 



l+IIM* I +HIIM I I || Mill I+++MH' 



II IN I I +* ++| || +| |+| HUM + | | Hum 



III l+lll I | + || + + +++ || 



II 1*1 



i i M miiiiiiii . i i i mm in* iiimm ♦ m 
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Table 8F. Domain Analysis of NO V8 

gnl|Pfam|pfam00089, trypsin, Trypsin. Proteins recognized include all 
proteins in families Si, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*). (SEQ ID NO:805) 
CD-Length = 217 residues, 100.0% aligned 
Score = 201 bits (510) , Expect = 2e-52 



10 



15 



NOV 8: 


517 


Sbjct: 


1 


NOV 8: 


577 


Sbjct: 


52 


NOV 8: 


637 


Sbjct: 


109 


NOV 8: 


697 


Sbjct: 


168 



♦III +1 I +IIIIII + II II Mil II+++IIII 

IVGGREAQAGSFPWQVSLQ-VSSGHFCGGSLISENWVLTAAHCV- 



- SGASSVRV 51 



ii i+ i+ ii +i i 1 1 1 1 i+i+ 1 mini 

VLGEHNLGTTEG - TEQKFDVKKI I VHPNYNPDT - -ND I ALLKLKS P VTLGDTVRP I CLPS 108 



II II' l+lll 1+ II + 11+ + ++++ II + 



l+llll III I IIIIII+ + III 
- VCSDGEL VGI VS WGYGCAVGNY PGVYTRVSRYLDWI 2 17 



Table 8G. Domain Analysis of NOV8 

gnl|Pfam|pfam00431, CUB, COB domain (SEQ ID NO: 806) 
CD-Length = 110 residues, 100.0% aligned 
Score =99.0 bits (245), Expect = 9e-22 



NOV 8: 242 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVSFKFFYLLEPGVPAGTCPK 301 
20 III I ++ 1+ +11 ii mi +1 I I I I+++I+ II I • 

Sb j ct : 1 CGGVLTESSGS I SSPNYPNDYPPNKECVWTIRAPPGYRVELTFQDFDL - - - -EDHTGCRY 56 



25 



NOV 8: 302 DYVEI 
Mill 

Sbjct: 57 



NGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEY 346 

Mill l + ll + I + II++I++I 111+ II I I 

DYVEIRDGDGSSSPLLGKFCGSGPPEDIVSSSNRMTIKFVSDASVSKRGFKATY 110 



Table 8H. Domain Analysis of NOV8 

gnl |Pfam|pfam00431, CUB, CUB domain (SEQ ID NO: 806) 
CD-Length = 110 residues, 90.9% aligned 
Score =62.4 bits (150), Expect = 9e-ll 



30 



NOV 8: 


129 


RFTTPGFPDSPYPAHARCQWALRGDiUOSVLSLTFRSFDIiASCDERGSDLVTVYNTLSPME 


188 






++I +1+ II + 1 1 +1 + 111+ III 1 1 + + 




Sbjct: 


11 


SISSPNYPN-DYPPNKECVWTIRAPPGYRVELTFQDFDLEDHTGCRYDYVEIRDGDGSSS 


69 


NOV 8: 


189 


PHALVQLCGTYPPS YNLTFHSSQNVLLI TLITNTERRHPGFEATF 233 








1 1 + 11+ II II 1 + 1 +++ 11+11+ 




Sbjct: 


70 . 


PL - LGKFCGSGPP - - - EDI VSSSNRMTIKFVSDASVSKRGFKATY 110 
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Table 81. Domain Analysis of NOV8 

gnl| Smart |smart00042, CUB, Domain first found in Clr, Cls, uEGF, and 

bone morphogenetic protein.; This domain is found mostly among 

developmentally-regulated proteins. Spermadhesins contain only this 

domain. (SEQ ID NO: 80 7) 

CD-Length = 114 residues, 99.1% aligned 

Score - 97.4 bits (241), Expect = 3e-21 



NOV 8: 242 CGGRLIOCAQGTFNSPyYPGHYPPNIDCTWNIEVPNNQHVKVSFKFFYLLEPGVPAGTCPK 301 

HI I + II II II II I++I III m| || + | 

Sb] ct : 1 CGGTLTASSGTITSPNYPNS YPNNLNCVWTISAPPGYRIELKFTDFDLE SSDNCTY 56 

NOV 8: 302 DYVEI-NGE KYCG-ERSQFWTSNSNKITTOFHSDQSYTDTGFLAEYLS 348 

HMI + l ++ll I +++MI +11 I II | II I | + 

Sb^Ct : 57 DYVEIYDGPSTSSPLLGRFCGSELPPPIISSSSNSMTVTFVSDSSVQKRGFSARYSA 113 



10 



15 



Table 8 J. Domain Analysis of NOV8 

gnl|Smart|smart00042, COB, Domain first found in Clr, Cls, uEGF, and 

bone morphogenetic protein.; This domain is found mostly among 

developmentally-regulated proteins. Spermadhesins contain only this 

domain. (SEQ ID NO: 807) 

CD-Length » 114 residues, 89.5% aligned 

Score = 58.5 bits (140), Expect = le-09 



NOV 8: 129 RFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPME 188 

M + 1 + 1 11+11+ + I I III I i I I +1+ | 

Sb}Ct : 11 TITSPNYPNS - YPNNLNCVWTISAPPGYRIELKFTDFDLESSDNCTYDYVEI YDGPSTSS €9 

NOV 8: 189 PHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFF 234 

I I + 11+ I II I + +1 ++++ +111+ 

SbjCt: 70 PL- LGRFCGSELP- - PPI ISSSSNSMTVTFVSDSSVQKRGFSARYS 112 



Table 8K. Domain Analysis of NOV8 

gnl|Smart |smart00l92, LDLa , Low-density lipoprotein receptor domain 
class A; Cysteine- rich repeat in the low -density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N-terminal type A repeats in LDL receptor bind the 
lipoproteins, other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor- related protein/alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 808) 
CD-Length = 38 residues, 94.7% aligned 
Score * 58.5 bits (140), Expect = le-09 



20 NOV 8: 427 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCP 

II M H+I+ I l + l IIMIIIII +11 
SX>J Ct : 2 CPPGEFQCKNGRCIPLSWVCDGVDDCGDGSDEENCP 



462 



37 
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Table 8L. Domain Analysis of NOV8 

gnl | Smart | smart 00192, LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N- terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low- density lipoprotein receptor and the LDL 
receptor- related protein/ alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of . 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 808) 
CD-Length = 38 residues, 92.1% aligned 
Score = 52.0 bits (123), Expect = le-07 



NOV 8: 356 PGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSC 390 

in i + mi in n i in ii 

Sbjct: 4 PGEFQCKNGRCIPLSWVCDGVDDCGDGSDEENCPS 38 

5 



Table 8M. Domain Analysis of NOV8 

gnl (Smart | smart 00 192, LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N- terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor- related protein/alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 808) 
CD- Length =38 residues, 89.5% aligned 
Score = 52.0 bits (123), Expect = le-07 



NOV 8: 3 94 HQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSC 427 

+ i iii i ii mi kiin mi- i 

Sbjct: 5 GEFQCKNGRC IPLSWVCDGVDDCGDGSDEENCPS 38 

10 



Table 8N. Domain Analysis of NO V8 

gnl | Smart | smart 00 192, LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N-terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 808) 
CD-Length = 38 residues, 94.7% aligned 
Score « 45.1 bits (105), Expect = le-05 



NOV 8: 468 TCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDC 504 

M ++I II 1+ III +11 IIIII++I 

Sb j Ct : 1 TCPPGEFQCKNGRC1PLSWV- CDGVDDCGDGS DEENC 3 6 

15 
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Table 80. Domain Analysis of NOV8 



gnl|Pfam|pfam00057, ldl_recept_a, Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length =39 residues, 92.3% aligned 
Score = 53.1 bits (126), Expect = 5e-08 



NOV 8: 427 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCP 462 

, i 1+1 +1*1+ i i+i ii inn +i 

SJtrjCt: 3 CGPNEFQCGSGEC IPMSWVCDGDPDCEDGSDEKNCA 38 



Table 8P. Domain Analysis of NOV8 



gnl|pfam|pfam00057, ldl_recept_a , Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length = 39 residues, 87.2% aligned 
Score = 47.4 bits (111), Expect = 3e-06 



10 



NOV 8: 356 PGQFTCRTGRC IRKELRCDGWADCTDHSDELNCS 

I +1 I +1 II Ml II I III 11 + 

Sb^ct: 5 PNEFQCGS GECI PMSWVCDGDPD CEDGSDEKNCA 



389 



38 



Table 8Q. Domain Analysis of NOV8 



gnl|Pfam|pfam00057, ldl_recept_a, Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length s 39 residues, 84.6% aligned 
Score = 44.3 bits (103), Expect « 3e-05 



15 



NOV 8: 394 HQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCS 426 

CK . „ e ++i i + i i+ mi ii i mi* i + 

Sb]Ct: 6 NEFQCGSGECIPMSWVCDGDPDCEDGSDEKNCA 38 



Table 8R. Domain Analysis of NOV8 



gnl|Pfam|pfam00057, ldl_recept_a, Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 8 09) 
CD-Length = 39 residues, 92.3% aligned 
Score = 42.0 bits (97), Expect « le-04 



20 



25 



NOV 8: 468 TCTKHTYRCLNGLCLS KGNPECDGKEDCSDGSDEKDC 504 

II + ++ l + l l + + HI II llllll+l 
Sb} Ct ; 2 TCGPNEFQCGSGECIPM- SWVCDGDPDCEDGSDEKNC 3 7 

The predicted sequence described here belongs to the leucine-rich repeat protein 
family. It is homologous to insulin like growth factor binding protein (IGFBP) and RP105, a 
novel B cell surface molecule. It contains five leucine-rich repeat domains. Leucine-rich 
repeats (LRRs) are relatively short motifs (22-28 residues in length) found in a variety of 
cytoplasmic, membrane and extracellular proteins (1) . A common property of this protein 
family involves protein-protein interaction. Other functions of LRR-containing proteins 
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include, for example, binding to enzymes and vascular repair (1) . LRRs form elongated non- 
globular structures and are often flanked by cysteine rich domains. The circulating insulin-like 
growth factors (IGF-I and -II) occur largely as components of a 140kDa protein complex with 
IGF binding protein-3 and the acid-labile subunit (ALS). This ternary complex regulates the 
5 metabolic effects of the serum IGFs by limiting their access to tissue fluids. 

Because of the presence of the Leucine rich repeat domains and the homology to the 
IGFBP and RP105, we anticipate that the novel sequence described here will have useful 
properties and functions similar to these genes. 

The NOV8 nucleic acid and polypeptide contain structural motifs (i.e. leucine rich 

10 repeat domains) that are characteristics of proteins belonging to the leucine-rich repeat protein 
family. Accordingly, the various NOV8 nucleic acids and polypeptides of the invention are 
useful, inter alia, as novel members of this protein family. 

The disclosed NOV8 nucleic acid of the invention encoding a Insulin like growth 
factor binding protein-like protein includes the nucleic acid whose sequence is provided in 

1 5 Table 8A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 8 A while 
still encoding a protein that maintains its Insulin like growth factor binding protein-like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

20 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

25 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acid, up to about 2 percent of the bases may be so changed. 

The disclosed NOV8 protein of the invention includes the Insulin like growth factor 
binding protein-like protein whose sequence is provided in Table 8B. The invention also 

30 includes a mutant or variant protein any of whose residues may be changed from the 

corresponding residue shown in Table 8B while still encoding a protein that maintains its 
Insulin like growth factor binding protein-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 18 percent of the 
residues may be so changed. 
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The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab>2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Insulin like growth factor binding 
protein-like protein (NOV8) is a member of a "Insulin like growth factor binding protein 
5 family". Therefore, the NOV8 nucleic acids and proteins identified here may be useful in 
potential therapeutic applications implicated in (but not limited to) various pathologies and 
disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 

1 0 marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 
and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The NOV8 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in diabetes, obesity, Von Hippel-Lindau (VHL) syndrome, 
Alzheimer's disease, stroke, tuberous sclerosis, hypercalcemia, Parkinson's disease, 

1 5 Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 
ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 
neuroprotection, cirrhosis, transplantation, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, autoimmume disease, allergies, immunodeficiencies, graft versus 
host disease (GVHD), lymphaedema, and other diseases, disorders and conditions of the like. 

20 NOV8 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immuno-specifically to the novel NOV8 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobic^ charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV8 protein has multiple hydrophilic regions, each of which 

25 can be used as an immunogen. These novel proteins can be used in assay systems for 

functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 



NOV9 

NOV9 includes three novel Neuropeptide Y/Peptide YY receptor -like proteins 
disclosed below. The disclosed sequences have been named NOV9a, and NOV9b. 

NOV9a 
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A disclosed NOV9a nucleic acid of 2276 nucleotides (also referred to as CG56554-01) 
encoding a novel Neuropeptide Y/Peptide YY receptor -like protein is shown in Table 9A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 370- 
372 and ending with a TAA codon at nucleotides 1549-1 55 1 . A putative untranslated region 
upstream from the initiation codon and downstream from the termination codon is underlined 
in Table 9A. The start and stop codons are in bold letters. 



Table 9A. NOV9a nucleotide sequence (SEQ ID NO:45). 



GGCCAGAAGGCGGGGAGCCAGAGGCGGCAGGACCCTAGCGTGGCGCTCCAGCACCCCAGACCGTGGCGGCGC 
CTCGCCTTAGGGAAGAGCAAGGGAAGAACTTTATTTGAACCGCGAACATTTTTTGGTCACTGAGATCGAGTC 
T CC CAGTGCTTTGG CTTCCCGCCTCTTTATCGTGGGTTTGATCCCTGAGCTGCT CTCCTTTCCCGAACCTCC 
CGGGGTGCAGCCTAGAGCCCTCCCGCGCGGCTGACTCCAGAGTAGAGGAAGGGAGGCGGCCTCCGGCTGGTC 
CCCCGAAGCCCTCGCTGCCCCGCAGATGCGGATGGCCAGCCAGTAGCGGGCGGTGGCCCCGCGTCCCGGGAG 
CGCACAGCAA TGCAGGCGCTTAACATTACCCCGGAGCAGTTCTCTCGGC?TGPTfl CGflnarra PR arrrria rv* 

CGGGAGCAGTTC&TCGCTCTGTACCGGCTGCGACCGCTCGTCT^ 

CTGGCCCTCGTGCTCACCGGCGTGCTCAT^ 

GTGACCCGCAGCJ^GGCC^TGCGCACCGTCACCAACATCTTO 

ATCACCTTCTTCTGCATTCCCGTC^CCATGATCCAGAACATTTCCGACAACTGGCTGGAGGGTGCTTTCATT 

TGCAAGATGGTGCCATTTGTCCAGTCTACCGCTGTTGTGACAGAAATCCTCACTATGACCTGCATTGCTGTC 

GAAAGGCACCAGGGACTTGTGCATCCTTTTAAAATGAAGTGGCAATAC^ 

CTAGGTGTGGTCTGGCTGGTGGCAGTCATCGTAGGATC^ 

TATGACTTCCTATATGAAAAGGAACACATCTGCTGCTTAGAA 

TACJVCCACCTTCOTCCTTGTCATCCTCrTCCTCCT^ 

GTGACAGTGGTGGCTCTCTTTGCTGTGTGCTGGGCACCATTCCATGTTGTCCATATGATGATTGAATACAG^ 
AAT TTTG AAAAGGAATATGATG ATGTCACAATCAAGATGATTTTTG C TATCGTGCAAATT ATTGGATTTT CC 
AACTCCATCTGTAATCCCATTGTCTATGCATTrATGAATGAAA^ 

TGT TAT TG CATAGT AAATAAAACCTTCTCT C CAGC ACAAAGG CATG GAAATTCAGGAATTACAATGATGCGG 

AAGAAAGCAAAGTTTTCCCTCAGAGAGAATCCAGTGGAGGAAACCA^G 

ATTGAAGTCAAATTGTGTGAACAGACAGAGGAGAAGAAAAAGCTCAAA 

GAACTGGCTGAGAATTCTCCTTTAGACAGTGGGCATTAATTATAACAATATCTTCATAATTAATGCCCITC^ 
GATTGTAACCCAAAGAGAAAATTATTTTGAGCAAAGGTCAAATACT 



AAGAAAACAAATCATGTTTCCATTAAAAAATCACACGAGGCTAGTCCJ^ 
TGATCACAATCATTTAAGAGATTTCTGTGTTCCTTCTCA^ 
GCAACATGGAAGGCCAGGCACGGTGGCTCATGCCTGTAATCCCAGCA 
CACGAGGTCAGGAGATCAAAACCATCCTGGCTAACACGGTGAAACCCCATCTCT^ 



AGCCGGGCGTGGTGGCGGGCACCTGTAGTCCC^GCTACTTGGGAGCCTCAGGCGGGAGAATGGTGTGAACCC 
GGGAGGCGGAGCTTGCAGTGATCCGAGATCATGCC^CrGCACTCCAGCCTGGGCGAAAGAGCGAGACTCCCC 
GTCTCAAAAAAAATTTTTTTGAAAAATTCGTAAACCATACTTTTAAGATTATTTCAGTGGATTTTTAA 
CTTGTACAGAAATCAGGGTTCTTAGCTj^ 

C T AG ATTGAAT AAG AAAAT AAAATAAT AT CTTCTT CCTTG AAAA 



10 



15 



In a search of public sequence databases, the NOV9a nucleic acid sequence, localized 
to chromosome 4, has 372 of 434 bases (85%) identical to a gb:GENBANK- 
ID:HSA400877|acc:AJ400877.1 mRNA from Homo sapiens (ASCL3 gene, CEGP1 gene, 
CI lorfl4 gene, CI lorfl5 gene, Cllorfl6 gene and CI lorfl7 gene) (E = 2.5e* 1 ). 

The disclosed NOV9a polypeptide (SEQ ID NO:46) encoded by SEQ ID NO:45 has 
393 amino acid residues and is presented in Table 9B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV9a has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV9a may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
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reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV9a is between positions 64 and 65: 
GNA-LV. 



Table 9B. Encoded NOV9a protein sequence (SEQ ID NO:46). 

MQALNITPEQFSRLLRDHlSnLTREQFIAL^ 

SKAMRTVTNIFICSIiALSDIJjITFFCIPVTMIQNISDNWLEGAFICKMVPFVQSTAW 
QGLVHPFKMKWQYTNRRAFTMLGWWLVAVI VGS PMWHVQQLEI KYDFLYEKEHICCLEEWTS PVHQKI YTT 
FILVILFLLPLMEKKRAVIMMVTWALFAV^^ 

COTIVYAFMNENFKKNVLSAVCYCIVNKTFSPAQRHGNSGITMMRKKA^ 
KLCEQTEEKKKLKRHIiALFRSELAENSPLDSGH 

5 ^ ~ — - — - 

A search of sequence databases reveals that the NOV9a amino acid sequence has 63 of 
184 amino acid residues (34%) identical to, and 107 of 184 amino acid residues (58%) similar 
to, the 377 amino acid residue ptnr:SPTREMBL-ACC:073733 protein from Brachydanio 
rerio (Zebrafish) (Zebra danio) (Neuropeptide Y/Peptide YY Receptor YA) (E = 0.0). 
10 NOV9a is predicted to be expressed in at least kidney. This information was derived by 

determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. 

In addition, the sequence is predicted to be expressed in lower small intestine, colon, 
1 5 and pancreas, brain, hypothalamus because of SAGE tags identifed for AI308 1 24 and 

AI307658, ESTs which match to the sequence of the invention: pancreatic cancer, prostate, 
prostate cancer, brain, glioblastoma, astrocytoma, normal human luminar mammary epithelial 
cells, breast cancer, ovary, cystadenoma. The SAGE data is reproduced in Example 5. The 
sequence is also predicted to be expressed in the following tissues because of the expression 
20 pattern of related genes in the Neuropeptide Y/Peptide YY/ Orexin/ Galanin/ Cholecystokinin 
receptor family. 
NOV9b 

In the present invention, the target sequence identified previously, NOV9a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 

25 starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 

30 silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
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sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
5 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

10 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV9b. This differs from the previously 

15 identified sequence (NOV9a) in having 38 less amino acids and 3 different ones. 

A disclosed NOV9b nucleic acid of 1472 nucleotides (also referred to as CG56554-02) 
encoding a novel Neuropeptide Y/Peptide YY receptor -like protein is shown in Table 9C. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 42- 
44 and ending with a TAA codon at nucleotides 1335-1337. A putative untranslated region 

20 upstream from the initiation codon and downstream from the termination codon is underlined 
in Table 9C. The start and stop codons are in bold letters. 



Table 9C. NOV9b nucleotide sequence (SEQ ID NO:47). 

CAGTAGCGGGCGGTGGCCCCGCGTCCCGGGAGCGCACAGCA ATGCAGGCGCTTAACATTACCCCGGAGCAGT 

TCTCTCGGCTGCTGCGGGACCACAACCTGACGCGGGAGCAGTTCATCGCTCTGTACCGGCTGCGACCGCTCG 

TCTACACCCCAGAGCTGCCGGGACGCGCCAAGCTGGCCCTCGTGCTCACCGGCGTGCTCATCTTCGCCCTGG 

CGCTCTTTGGCAATGCTCTGGTGTTCTACGTGGTGACCC^ 

TTATCTGCTCCTTGGCGCTCAGTGACCTGCTCATGACCTTCTT 

TTTCCGACAACTGGCTGGGGGGTGCTTTCATTTGCAAGATGGTGCCATTTGTCCAGTCTACCGCTGTTGTGA 

CAGAAATCCTCACTATGACCTGCATTGCTGTGGAAAGGCACCA 

GGCAATACACCAACCGAAGGGCTTTCACAATGCTAGGTGTGG 

CCATGTGGCACGTGCAACAACTTGAGATCAAATATGACT^ 

AAGAGTGGACCAGCCCTGTGCACCAGAAGATCTACACGACCTTCATCCTTGT 

TT ATGGTGATG CTT ATT CTGTACAGTAAAATTGGTT AT GAACTTTGGAT AAAGAAAAGAGTTGGGGATGGT T 
CAGTGCTT CGAACT ATT CATGGAAAAGAAATGTCC AAAATAG C CAGGAAGAAGAAAC GAG CTGTCATTATG A 
TGGTGACAGTGGTGGCTCTCTTTGCTGTGTGCTGGGCACCATT^ 

GTAATTTTGAAAAGGAATATGATGATGTCACAATCAAGATGATTTTTGCTATCGTGCAAATTATTGGATTT^ 

CCAACTCCATCTGTAATCCCATTGTCTATGCATTTATGAATGAAAACTTCAAAAAAAATC 

TTTGTTATTGCATAGTAAATAAAACCTTCTCT^ 

GGAAGAAAG CAAAG TTTT CCCTCAGAG AGAATCCAGTGG AGGAAAC CAAAGGAGAAG CAT TCAGTGATGG CA 
ACATTGAAG T CAAATTGTGTGAACAG ACAGAGG AGAAG AAAAAGCT CAAACGACATC TTGCT CTCTT TAGGT 
CTGAACTGG C TGAGAATT CTCCTTT AGACAGTGGG CAT T AATTAT AAC AAT ATCTTC AT AAT TAATGC CCT T 
CAGATTGT AACCCAAAGAG AAAATT ATTTTGAG CAAAGGTCAAATACTCTT TTT ATT CTTAAGATGATGAC A 
AG AAG AAAACAAAT ATGT TTCATTAAAAATGA 
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In a search of public sequence databases, the NOV9b nucleic acid sequence, localized 
to chromosome 4, has 403 of 656 bases (61%) identical to a gb:GENBANK- 
ID:AB040103|acc:AB040103.1 mRNA from Rattus norvegicus (Rattus norvegicus OT7T022 
5 mRNA for RFamide-related peptide receptor, complete cds) (E = 7.8e" 13 ). 

The disclosed NOV9b polypeptide (SEQ ID NO:48) encoded by SEQ ID NO:47 has 
393 amino acid residues and is presented in Table 9D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV9b has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0,6000. Alternatively, 
10 NOV9b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV9b is between positions 64 and 65: 
GNA-LV. 



Table 9D. Encoded NOV9b protein sequence (SEQ ID NO:48). 

SKAMRTVTNIFICSIJU^SDLIiITFFC^ 

QGLVHPFKMKWQYTNR31AFTMLGVVWLVAVI VGSPMWHVQQLEI KYDFL YEKEHICCLEEWTS PVHQKI YTT 
FI LVI LFLLPLMEKKRAVIMMVTWALFAVCW I KMI FAI VQ 1 1 GFSNS I 
CNP IVYAFMNENFKKNVIiS AVCYCI VNKTFS PAQRHGNS GI TMMRKKAKFSLRENPVEETKGEAFSDGNIE V 
KLCEQTEEKKKLKRHLALFRSELAENSPLDSGH 

15 

A search of sequence databases reveals that the NOV9b amino acid sequence has 108 
of315 amino acid residues (34%) identical to, and 180 of 315 amino acid residues (57%) 
similar to, the 522 amino acid residue ptnr:SWISSNEW-ACC:Q9Y5X5 protein from Homo 
sapiens (Human) (Neuropeptide Ff Receptor 2 (Neuropeptide G Protein-Coupled Receptor) 

20 (G-Protein-Coupled Receptor HLWAR77)) (E « 5.2e^ 6 ). 

NOV9b is predicted to be expressed in at least the following tissues: lower small 
intestine, colon, and pancreas, brain, hypothalamus, kidney, pancreatic cancer, prostate, 
prostate cancer, glioblastoma, astrocytoma, normal human luminar mammary epithelial cells, 
breast cancer, ovary, cystadenoma. . 

25 The disclosed NOV9a polypeptide has homology to the amino acid .sequences shown 

in the BLASTP data listed in Table 9E. 
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Table 9E. BLAST results for NOV9a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%) 


Positives 
(%> 


Expect 


gi| 16566347 |gb|AAL2 
6488.1|AF411117 1 
(AF411I17) 


G protein -coup led 
receptor [Homo 
sapiens] 


455 


382/393 
(97%) 


384/393 
(97%) 


0.0 


gi | xju^i /4 Jo (ret |NP 
076470. l| 
(NM_023980) 


neuropeptide FF 
receptor 2 
[Rattus 
nbrvegicus] 


417 


99/314 
(31%) 


157/314 
(49%) 


3e-37 


gi|4106397|gb|AAD02 
833. l| (AF073925) 


neuropeptide 
Y/peptide YY 
receptor Yb 

fGadus moT")iiial 


374 


90/320 
(28%) 


169/320 
(52%) 


4e-37 


gi|4758820|ref |NP 0 

04876.1) 

(NMJ)04885) 


neuropeptide G 
protein- coupled 
receptor; 
neuropeptide FF 2 
[Homo sapiens] 


522 


98/317 
(30%) 


159/317 
(49%) 


4e-37 


gi| 13878604 |sp|Q9Y5 
X5|NFF2_HDMAN 


NEUROPEPTIDE FF 
RECEPTOR 2 
(NEUROPEPTIDE G 
PROTEIN- COUPLED 
RECEPTOR) (G- 
PROTEIN- COUPLED 
RECEPTOR HLWAR77) 


522 


98/317 
(30%) 


159/317 
(49%) 


4e-37 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 9F. In the ClustalW alignment of the NOV9 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 9F. ClustalW Analysis of NOV9 

1) Novel NOV9a (SEQ ID NO:46) 

1) Novel NOV9b (SEQ ID NO: 48) 

2) gi | 16566347 | gb | AAL26488 . 1 | AF411ll7_i (AF411117) G protein -coupled receptor [Homo 
sapiens] (SEQ ID N0:357) 

15 3) gi | 13027438 | ref |NP_076470.1 | (NMJ)239B0) neuropeptide FF receptor 2 [Rattus 
norvegicus] (SEQ ID NO: 358) 

4) gi|4i06397|gb|AAD02833.l| (AF073925) neuropeptide Y/peptide YY receptor Yb 
[Gadus morhua] (SEQ ID NO: 359) 

5) gi 1 4758820 1 ref |NP_004876. 1 | (NMJJ04885) neuropeptide G protein- coupled receptor; 
20 neuropeptide FF 2 [Homo sapiens] (SEQ ID NO: 360) 

6) gi ) 13878604 | sp | Q9Y5X5 1 NFF2_HUMAN NEUROPEPTIDE FF RECEPTOR 2 (NEUROPEPTIDE G 
PROTEIN- COUPLED RECEPTOR) (G- PROTEIN- COUPLED RECEPTOR HLWAR77) (SEQ ID NO.-361) 



25 



10 20 30 40 50 60 

NOV9b 1 x 

gi 1 16566347 | 1 _ MI CCS ALS PRI HLS FHRSL 19 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



gi 

Si 

gi 
gi 



13027438 | 1 1 

4X06397 | 1 . x 

4758820 j 1 MNSFFGTPAASWCIAESDVSSAPDKEAGRERRALSVQQRGGPAWSGSLEWSRQSAGDRRR 60 

13878604 | 1 MNSFFGTPAASWCLLESDVSSAPDKEAGRERRALSVQQRGGPAWSGSLEWSRQSAGDRRR 60 



NOV9a 


1 


NOV9b 


1 


gi 


16566347| 


20 


gi 


13027438 j 


1 


gi 


4106397 | 


1 


gi 


4758820) 


61 


gi 


13878604| 


61 



Novsa 

NOV9b 



gi 
gi 
gi 
gi 
gi 



16566347 | 
13027438] 
4106397| 
4758820 j 
138786041 



NOV9a 
NOV9b 



gi 
gi 
gi 
gi 
gi 



16566347 

13027438 

4106397 

4758820 

13878604 



NOV9a 
NOV9b 



gi 
gi 
gi 
gi 
gi 



16566347) 
13027438) 
4106397 | 
4758820 j 
13878604) 



NOV9a 
NOV9b 

gi ) 16566347 | 



70 80 90 100 110 120 

..| | | )....) | | | | J | 

_ - -MQALNITPEQFSRLLRD 17 

MQALNITPEQFSRLLRP 17 

TGI VLANS SLD I VLHDTYYWAHCGGNVRRLHCGGPAS RERTAMQALNI T PEQ FS RLLRp 7 9 

_ MGKRWBSNS SG S WDHjjwS 18 

--MEEAFDQFMME 11 

LGLS RQTAKS S WS RSRDRTCC CRRAWWI LVPAADRARRERFIMN^KWDTNS SENWHPHWN 12 0 
LGLSRQTAKSSWSRSRDRTCCCRRAWWILVPAADRARM^ 12 0 



130 



140 



NOV9a 


18 


NOV9b 


18 


gi 


16566347) 


80 


gi 


13027438| 


19 


gi 


4106397| 


12 


gi 


4758820 j 


121 


gi 


13878604 | 


121 



NOV9a 


138 C 


NOV9b 


138 C 


gi 


16566347) 


200 C 


gi 


13027438 j 


138 A 


gi 


4106397) 


130 L 


gi 


4758820) 


240 A 


gi 


13878604 | 


240 A 




310 
I 

195 E&EHIC 

195 EKEHIC 

257 EjfEHIC 

196 SHNKT 
188 FPGbDH--; 
298 SjQNjKTSP 1 
298 SdNKTSF 



350 

227 
227 
289 

JFT^TSAHSTG 255 
RRRK-DMVE 244 
F^IAAVPHTG 357 
(FgAAVPHTG 357 



370 

— I — | — l 

227 ME r 

227 ME 

289' WK-- 

256 gQRLEQWHVS- 

245 rardssrdnrakgsrr: 

358 j^KNQEQWHWSItK^kli: 
430 



380 



277 AIVQIXGgSiMIO] 
277 AIVQlfesHsffll 
33 9 AIVQliGHsSSl 




410 



420 



FEKEYDdS^TI KMHf 276 

^EKEYDD^IKMMF 276 

jFEKEYDDfclKM|Jl| 338 

'LSPNKSRVlSlYiyi 314 

-HEI^VSCQHNLRp 302 

LSPNEMIINIYHx 417 

SPNEgQilblYHY 417 

470 



Ir^gnsgi 1 

'RjHGNSGI' 

Ignsgp 
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gi | 13027438 | 
gi | 4106397 | 
gi j 4758820 j 
gi j 13878604 | 




NOV9a 


337 


KF 


NOV9b 


337 


KF 


gi 


16566347) 


399 


KF 


gi 


13027438) 


367 




gi 


4106397 | 


349 




gi 


4758820 j 


472 




gi 


13878604| 


472 





gFQF--CQ 
lATLSHC- 

3fqlq|!cq 

|EgFQL^CQ 




490 



500 



510 



520 



530 




ELAE|SPLDSGH 393 
ELAEgSPLDSGH 393 
ELAEgSPLDSGH 455 

ETGEATNSTET 417 

374 

TTNS|EI 522 

TTNsiEI 522 



Tables 9G-9H list the domain descriptions from DOMAIN analysis results against 
NOV9. This indicates that the N0V9 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 9G Domain Analysis of NOV9 

gnl|Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). <SEQ ID NO: 8 10) 

CD-Length = 254 residues, 100.0% aligned 

Score = 146 bits (368) , Expect = 2e-36 



NOV 9: 62 GNALVFYVVTRSKAMRTVTNI FICSLALSDLLITFFCIPVTMIQNISDNWLEGAFICKMV 121 

II II 1+ l + l +11 Mil* +II++III I + + +1+ I + 11 + 1 

Sb j Ct : 1 GNLLVI LVI LRTKKLRTPTNI FLIJ!JLAVADLLFLLTLPPWAL YYLVGGDWVFGDALCKLV 6 0 

NOV 9: 122 PFVQSTAVVTEILTMTCIAVERHQGLVHPFKM^^ 181 

|| + | | +++ |+ + ||| + + 1111++ +II++I+++ I, 
Sbjct: 61 GALFWNGYAS I LLLTAI S IDRYLAI VHPLRYRRI RTPRRAKVLI LLVWVLALLLSLP PL 120 

NOV 9: 182 HVQQLE I KYDFLYEKEHI CCLEEWTS PVHQKI YTTFI LVT LFLL PL 227 

I ■ + I II ++ ++ I ++ l + l I I 

Sbjct: 121 LFSWLR TVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTL 176 

NOV 9: 228 MEKKRAV1MMVTWALFAVCWAPFHVVHMMIEYSNFE KEYDDVT I K 273 

+++I I++ II +1 +11 l+l+l ++ + + 
Sbjct: 177 RKRARSQRSLKRRSSSERKAAKMLLVVVWFVLCWLPYHI VLLL DSLCLLSI WRVLP 233 

NOV 9: 274 MIFAIVQIIGFSNSICNPIVY 294 

I + + II I I 1*1 
Sbjct: 234 TALLI TLWLAYVNS CLNPI I Y 254 
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Table 9H Domain Analysis of NOV9 

gnl|Pfam|pfam016O4, 7tm_5, 7TM chemoreceptor . This large family of 
proteins are related to pfamOOOOl. They are 7 transmembrane „ receptors . 
This family does not include all known members, as there are problems 
with overlapping specificity with pfamOOOOl. This family is greatly 
expanded in the nematode worm C. elegans. (SEQ ID NO: 811) 
CD-Length = 297 residues, 83.8% aligned 
Score - 38.1 bits (87), Expect = 0.001 



10 



15 



20 



NOV 9: 


55 


Sbjct: 


16 


NOV 9: 


110 


Sbjct: 


76 


NOV 9: 


166 


Sbjct: 


135 


NOV 9: 


226 


Sbjct: 


195 


NOV 9: 


285 


Sbjct: 


247 



I FALALFGNALVF YWTR - - SKAMRTVTN 1 FI CSLALSDLLI TFFCI PVTMI QNI SD 109 

r ++i + n + i i++i M n ++i n 

ITI ISIiPIHIFGFYCILFKTPKKMKSVKWSLLNLHFWSALLDLYIiS FLTIPYLFFPVLAG 75 



I + +1 II + + + II 11+ |++ 

YPLGLLSYLGVPTSIQIYIGVTILGWAVSIILLFENRHNSLVNINN-KFRIWKWIRILY 134 



| + +++| |+ |++ + + + | |++ | 



+ + 



11+ +1+ 
*KLQ KKFFIALCIQVSIP 246 



I I++I + I 



The NOV9 nucleic acids and polypeptides share structure similarity to members to the 
Neuropeptide Y/Peptide YY/ Orexin/ Galanin/ Cholecystokinin/pancreatic polypeptide 
receptor family Neuropeptide Y (NPY) is one of the most abundant neuropeptides in the 

25 mammalian nervous system and exhibits a diverse range of important physiologic activities, 
including effects on psychomotor activity, food intake, regulation of central endocrine 
secretion, and potent vasoactive effects on the cardiovascular system. It shows sequence 
homology to peptide YY and over 50% homology in amino acid and nucleotide sequence to 
pancreatic polypeptide. Neuropeptide Y (NPY) signals through a family of G protein-coupled 

30 receptors present in the brain and sympathetic neurons. At least 3 types of neuropeptide Y 
receptor have been defined on the basis of pharmacologic criteria, tissue distribution, and 
structure of the encoding gene. The NPY Yl receptors have been identified in a variety of. 
tissues, including brain, spleen, small intestine, kidney, testis, placenta, and aortic smooth 
muscle. The Y2 receptor is found mainly in the central nervous system. 

35 Orexin A and Orexin B, are derived from the same precursor, orexin, or hypocretin 

(HCRT), by proteolytic processing. One receptor, designated OX2R, binds both orexin A and 

orexin B. The predicted amino acid sequences of human and rat OX2R are 95% identical and 

contain 7 putative transmembrane domains. The other receptor, designated OX1R (HCRTR1), 

binds orexin A only and has 64% identity to OX2R. Northern blot analysis revealed that in the 
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rat a 3.5-kb OX2R mRNA is expressed exclusively in the brain. When administered 
intracerebroventricularly to rats, orexin A and orexin B stimulated food consumption. In 
addition, preproorexin mRNA levels are upregulated upon fasting, thust these peptides are 
mediators in the central feedback mechanism that regulates feeding behavior. 
5 PYY is secreted from endocrine cells in the lower small intestine, colon, and pancreas. 

It acts through the pancreatic polypeptide receptors in the gastrointestinal tract as an inhibitor 
of gastric acid secretion, gastric emptying, digestive enzyme secretion by the pancreas, and gut 
motility. 

The disclosed NOV9 nucleic acid of the invention encoding a Neuropeptide Y/Peptide 
10 YY receptor -like protein includes the nucleic acid whose sequence is provided in Table 9A or 
a fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 9A while still encoding a 
protein that maintains its Neuropeptide Y/Peptide YY receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 15 percent of the bases may be so 
changed. 

The disclosed NOV9 protein of the invention includes the Neuropeptide Y/Peptide YY 
receptor -like protein whose sequence is provided in Table 9B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 9B while still encoding a protein that maintains its Neuropeptide 
Y/Peptide YY receptor -like activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant protein, up to about 70 percent of the residues may be so 
changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fabk that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this Neuropeptide Y/Peptide YY 
receptor -like protein (NOV9) is a member of a '"Neuropeptide Y/Peptide YY receptor 
family". Therefore, the NOV9 nucleic acids and proteins identified here may be useful in 
potential therapeutic applications implicated in (but not limited to) various pathologies and 
5 disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 
marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 
and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
10 The NOV9 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in obesity, diabetes, kidney disorders, cardiovascular 
disorders, anorexia, eating disorders, gastrointestinal and digestive diseases, metabolic 
diseases,CNS disorders, cancer, autoimmune disease, inflammation, and/or other pathologies 
and disorders. 

15 NOV9 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immuno-specifically to the novel NOV9 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV9 protein has multiple hydrophilic regions, each of which 

20 can be used as an immunogen. These novel proteins can be used in assay systems for 

functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 

NOV10 

25 A disclosed NOV 1 0 nucleic acid of 985 nucleotides (also referred to as CG55964-01 ) 

encoding a novel G-Protein Coupled Receptor-like protein is shown in Table 10A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 33-35 and 
ending with a TGA codon at nucleotides 981 -983. • A putative untranslated region upstream 
from the initiation codon is underlined in Table 10A. The start and stop codons are in bold 

30 letters. 
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Table 10A. NOV10 nucleotide sequence (SEQ ID NO:49). 



CA^TCTACCACTTGATTCTGATGAACAAATCATGCCGACATTC^TGGCTC^GTCTTCATGCCCTCTGCGT 
TTATACTAATTGGGATTCCTGGTCTGGAGTCAGTGCAGTGTTGGATTGGGATTCCTTTCTCTGCCATGTATC 
TTATTGGTGTGATTGGAAATT CCCTAATTTTAGTTATAATCAAATATGAAAACAG CCTCCATATACCCATGT 
ACATTTTTTTGGCCATGTTGGCAGCCACAGACATTGCACTTAACACCTGCATTCTTCCCAAAATGTTAGGCA 
TCTTCTGGTTTCATTTGCCAGAGATTTCTTTTGATGCCTGTCTTTTTCAAATGTGGCTTATTCACTCATTCC 
AGGCAATTGAATCGGGTATCCTTCTGGCAATGGCCCTGGATCGCTATGTGGCCATCTGTATCCCCTTGAGAC 
ATGCC^CCATCTTTTCCCAGCAGTTCTTAACTC^ 

TAATACCTTCCTTAGGGCTCATCAAATGCTGTCTGAAACACTATCGAACTACAGTCATCTCTCACTCTTACT 
GTGAGCACATGGCCATCGTGAAGCTGGCTACTGAAGATATCCGAGTCAAC7\AGATATATGGCCTATTCGTTG 
CCTTTG C^TCCTAGGGTTTGACAT AATATTTATAACCTTGTCCTATGT CCAAATTTTTATCACTGTCTTT C 
AGCTGCCCCAGAAGGAGGCACGATTCAAGGCCTTTAATACATGCATTGCCCAC^TTTGTGTCTTCCTACAGT 
TCTACCTTCTTGCCTTCTTCTCTTTCTTCAGA^ 

TCTTGTCAAATCTTTACCTGTTAGTCCCACCTTTTCTCAACCCTATTGTCTATGGAGTGAAGACCAAGCAAA 
TTCGTGACCATATTGTGAAAGTGTTTTTCTTCAAAAAAGTAACTTGATC 



10 
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In a search of public sequence databases, the NOV 10 nucleic acid sequence has 789 of 
974 bases (81%) identical to a gb:GENBANK-ID:AF133300|acc:AF133300.2 mRNA from 
Mus musculus (MOR 3'Betal, MOR 3'Beta2, MOR 3'Beta3, and MOR 3 f Beta4 genes, 
complete cds; Cbx3 pseudogene, complete sequence; and MOR 3 f Beta5 and MOR 3'Beta6 
genes, complete cds) (E = 4.3e~ 136 ). 

The disclosed NOV10 polypeptide (SEQ ID NO:50) encoded by SEQ ID NO:49 has 
3 1 6 amino acid residues and is presented in Table 1 OB using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 10b has a signal peptide and is 
likely to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
Alternatively, NOV10 may also localize to the plasma membrane with a certainty of 0.6400, 
the Golgi body with a certainty of 0.4600, or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1000. The most likely cleavage site for NOV10 is between positions 24 and 25: 
LES-VQ. 



Table 10B. Encoded NOV10 protein sequence (SEQ ID NO:50). 



MPTPNGSVFMPSAFILIGIPGLESVQCWIGIPFSAMYLIGVIGNSLILVIIKYENSIiHIPMYIP 
LAMIJU^TDIAIJSrTCILPKMIiGIFWFHLPEISPDACLFQMWLIHSFQAIESGILZjAMALDRYVAI 
CI PIrflHATI FSQQFLTHIGLGVTLIUVAI LI ^ ^ 

EDIRVNKI YGIiFVAFAILGFDII FITLSYVQI FITVFQLPQKEARFKAFNTCIAHICVFLQFYL 
LAFFSFFTHRFGSHIPPYIHIIJiSNLYLLVPPFLNPIVYGVKTKQIRDHIVKVFFFKKVT 



20 



A search of sequence databases reveals that the NOV10 amino acid sequence has 316 
of 316 amino acid residues (100%) identical to, and 316 of 316 amino acid residues (100%) 
similar to, the 316 amino acid residue ptnr:TREMBLNEW-ACC:AAG42368 protein from 
Homo sapiens (Human) (Odorant Receptor HOR3'BETA5) (E = 5.7e" 169 ). 
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NOV10 is predicted to be expressed in at least Apical microvilli of the retinal pigment 
epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 

5 umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and m 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 

10 spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

The disclosed NOV10 polypeptide has homology to the amino acid sequences shown 

15 in the BLASTP data listed in Table 10C. 



Table 10C. BLAST results for NOV10 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Po 
sitives 
(%) 


Expect 


gi 1 11991867 | gb | AAG4 
2368. l| (AF289204) 


odorant receptor 
HOR3' betas [Homo 
sapiens] 


316 


316/316 
(100%) 


316/316 
(100%) 


e-148 


gi | 7305351 |ref |NP_0 

38648.1) 

(NMJ>13620) 


olfactory- 
receptor 68 [Mus 
musculus] 


315 


258/314 
(82%) 


281/314 
(89%) 


e-122 


gi | 7305353 |ref |NP_0 

38649.1) 

(NM 013621) 


olfactory 
receptor 69 [Mus 
musculus] 


316 


255/314 
(81%) 


279/314 
(88%) 


e-120 


gi 1 11908221 1 gb | AAG4 
1685.1] (AP133300) 


MOR 3»Beta6 [Mus 
musculus] 


316 


238/311 
(76%) 


268/311 
(85%) 


e-115 


gi 1 6912560 1 ref |NPJ) 

36507.1) 

(NM 012375) 


olfactory 
receptor , family 
52, subfamily A, 
member 1 [Homo 
sapiens] 


312 


233/310 
(75%) 


263/310 
(84%) 


e-110 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 10D. In the ClustalW alignment of the NOV10 protein, as 
20 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-higjblighted amino acid residues are less conserved and 
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can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 10D. ClustalW Analysis of NOV10 

1) Novel NOV10 (SEQ ID NO: 50) 

2) gi 1 11991867 |gb| AAG42368 . 1 1 (AF289204) odorant receptor H0R3'beta5 [Homo sapiens] 
(SEQ ID N0:362) 

3) gi| 7305351 |ref|NP_038648.l| (NM_013620) olfactory receptor 68 [Mus musculus] 
(SEQ ID NO: 363) 

4) gi|7305353|ref|NP_03 8649.1) (NM_013621) olfactory receptor 69 [Mus musculus] 
(SEQ ID NO: 364) 



5) gi|H90822l|gb 

6) gij 6912560 |ref 



AAG41685.l| (AF133300) MOR 3'Beta6 [Mus musculus] (SEQ ID N0:365) 
NP_036507.l| (NMJ)12375) olfactory receptor, family 52, subfamily 



A, member 1 [Homo sapiens] (SEQ ID NO: 366) 



NOVU0 



9i 
9i 

gi 
gi 
gi 



11991867) 
7305351) 
7305353) 
11908221) 
6912560 | 



NOV10 


61 


gi 


11991867) 


61 


gi 


7305351) 


61 


gi 


7305353) 


61 


gi 


11908221 | 


61 


gi 


6912560) 


61 




20 



30 



l0gi pgles vqcwigipfj 
l|gipglesvqcvjigipf| 

l|g I PGLES VQCW IGIPFI 
LSGIPGLESVQCWIGIPP 
LHGI PGLES VQgWI GI P F 
■L^GIPGLESVQCWIGIPP 




I 



70 
•I-- 



80 



I 



I. 



90 



100 

..I.. 



L 



110 



120 



MY I FLAMLAATD I AL|TC I LP KMLG I FW FHgP| I S FDACLgQMWLI HS FQAI ESGI LLAM 
MYIFLAMLAATDIALjjflTCILPKML 

myiflaHlaBtdialBtcilpkm^^ 

MYI FLa|laJjTD I AL|tC I LPKMLGI FWFHjjjbgl S FDACLgQMWLI HSFQAjUES GILL AM 
M Y§FIAMLgATD I @l|t@I LP KMLG I FWFH|gjJl gFDACLfflQMWL I Hg)FQglESGIL[|AM 
gYIFLEML§ATDIAL§g§I^ 



120 
120 
120 
120 
120 
120 



NOV10 


121 


gi 


11991867 | 


121 


gi 


7305351) 


121 


gi 


7305353) 


121 


gi 


11908221) 


121 


gi 


6912560 | 


121 



NOV10 


181 


gi 


11991867) 


181 


gi 


7305351| 


181 


gi 


7305353 | 


181 


gi 


11908221) 


181 


gi 


6912560| 


181 



NOV10 


241 


gi 


11991867 | 


241 


gi 


7305351) 


241 


gi 


7305353) 


241 


gi 


11908221) 


241 


gi 


6912560 | 


241 



NOV10 



301 




190 
..I.. 



200 



210 



220 



I 



230 



I 



240 
..I 



YCEHMAIVKLi^|DIRVNKIYGLFVAFAILGFDiaFITLSYV:«lIFITVFQLPQKEARFK^. 

ycekmaivkl^IdirwkiyglfvafailgfdihfitlsyvS 
ycehmaivkla&|dir||n^^ 

YCEHMAIVKLA^DIRyNKIgGLgVAFAI LGFDI|jFITgSw|jlFITVFQLPQKEARFKA 

YCEHMAivKLApb£^^ 

YCEHMAIVKLAfefej^ 



GFDImFITLSYVSIFITVFQIjPQKEARFKA 



240 
240 
240 
240 
240 
240 



250 



260 
.J.. 



270 



280 



290 



300 
••I 



fntciahicvflqfyllaffsffthrfg|hippyihills|lyllvppflnpivygvktk 
fntciahicvflqfyllaffsffthrfg|kippyihills|lyllvppflnpivygvktk 
fntciahicvflqfyllaffsffthrfgSkippyShills|lyllvppflnpivygvktk 
fntciahicvflqfyllaffsffthrfg|hippy|hills|lyllvppflnpivyg|ktk 
fntc|ahicvfl|fyllaffsffthrfg3h§p@yihills|lyllvppjjjjnpivygvktf 
fntciahicvflqfyllaffsffthrfg^hippyihil^spljjyllvppflnp^vyg^kt 



300 
300 
300 
300 
300 
300 



310 



316 
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11991867 | 


301 


gi 


7305351) 


301 


gi 


7305353 | 


301 


gi 


11908221 | 


301 


gi 


6912560| 


301 




10 



Table 10E lists the domain description from DOMAIN analysis results against 
NOV1 0. This indicates that the NOV 1 0 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 10E Domain Analysis of NOV10 

gnl|Pfam|pfamO00Oi, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD- Length = 254 residues, 100.0% aligned 

Score = 67.8 bits (164), Expect =' 9e-13 



15 



20 



25 



30 



35 



40 



NOV10: 


43 


Sbjct: 


1 


NOV10: 


103 


Sbjct: 


61 


NOV10 : 


163 


Sbjct: 


121 


NOV10 : 


220 


Sbjct: 


177 


NOV10 : 


278 


Sbjct: 


237 



GNSLILVIIKYENSIjHIPMyiFIJ^LAATDIAIjOTCILPKMLGIFWFHLPEISFDACLFQ 102 

II I++++I I I III II h +11 I 

GNLLVILVIIiRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 



I +1 I+++III+II 111+ I + + + I I + I +1 +1 I 

GALFWNGYAS ILLLTAI S IDRYLAI VHPLRYRR IRTPRRAKVLI IiLVWVLALLLS LPPL 120 



II 



+ I 



I I 



I + I + 



I llll+l 

:tlwlayvnsclnpxiy 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
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Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

5 The disclosed NOV 10 nucleic acid of the invention encoding a G-Protein Coupled 

Receptor -like protein includes the nucleic acid whose sequence is provided in Table 1 OA or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 10A while still encoding a 
protein that maintains its G-Protein Coupled Receptor-like activities and physiological 

10 functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 

1 5 modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 19 percent of the bases may be so changed. 

20 The disclosed NOV1 0 protein of the invention includes the G-Protein Coupled 

Receptor-like protein whose sequence is provided in Table 10B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 10B while still encoding a protein that maintains its G-Protein 
Coupled Receptor-like activities and physiological functions, or a functional fragment thereof. 

25 In the mutant or variant protein, up to about 25 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor-like 
protein (NOV 10) is a member of a "G-Protein Coupled Receptor family". Therefore, the 

30 NOV10 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
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delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV1 0 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
5 (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 

1 0 fungal, protozoal and viral infections (particularly infections caused by HIV- 1 or HI V-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Osteodystrophy, angina pectoris, myocardial infarction, 

1 5 ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) AcrocaUosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de ia Tourette syndrome, and/or other diseases and pathologies. 

20 NOV 1 0 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV1 0 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV 10 protein has multiple hydrophilic 

>5 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 



30 NOV11 



A disclosed NOV1 1 nucleic acid of 1014 nucleotides (also referred to as Curagen 
Accession No. CG55966-01) encoding a novel G-Protein Coupled Receptor -like protein i 
shown in Table 1 1 A. An open reading frame was identified beginning with an ATG initiation 



is 
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codon at nucleotides 2-4 and ending with a TGA codon at nucleotides 947-949. Putative 
untranslated regions upstream from the initiation codon and downstream of the termination 
codon are underlined in Table 1 1 A. The start and stop codons are in bold letters. 



Table 1 1 A. NOV11 nucleotide sequence (SEQ ID NO:51). 

^TGATTACTTCAGTAAGCCCTAGCAC(^GCACGAATTCTTCCTTTCTTCTCACTGGATTTTCTG 

GCATGGAGCAGCAATACCCCTGGTTTTCCATCCCCTTCTCCTCAATCTATGCCATGGTGCTTTTG 

GGCAATTGCATGGTTCTCCATGTGATATGGACTGAGCCAAGCCTGCACCAGCCTATGTTTTACTT 

CCTGTCCATGCTGGCCCTCACTGACCTGTGCATGGGGCTGTCCACTGTGTACACAGTGCTGGGGA 

TCCTGTGGCGGATCATTCGAGAGATCAGCTTGGATTCCTGC^TTGCCCAGTCCTATTTCATCCAT 

GGTCTGTCCTTCATGGAGTCCTCTGTCCTCCTCACTATGGCCTTTGACCGGTACATTGCAATTTG 

CAATCCACTACGTTATTCCTCCATCCTGACTAATTCCAGAATTATCAAAATTGGGCTCACTATAA 

TAGGTAGGAGTTTTTTCTTTATTACACCCCCCATCATCTGTCTGAAATTTTTTAATTACTGTCAT 

TTCCACATCCTTTCTC^CrCTTTCTGCCTGC^CCAGGATCTTCTCCGCTTAGCCTGTTCAGACAT 

CCGATTCAATAGTTACTATGCCCTGATGCTGGTTATTTGCATACTGTTGTTGGATGCTATACTCA 

TCCTTTTCTCCTACATCCTGATTCrrAAGTCAGTCCTGGCAGTTGCCTCTCAGGAAGAGAGGCAT 

AAATTATTTCAGACCTGCATCTCCCACATCTGT 

CCTCACAATGGTGCACCGTTTTGGCAAGCACC 

TCTACATCCTTTTCCCACCTTTAATC 

ACCAGAATGCTTAGACTCTTTTCTCTGAAAAGATATTG AGAGATATTGAGATGTATTGCCTAAAA 
AAAAGAAAGAAAAGCAGCAACAATAATAAACAAAAATCA 

5 

The disclosed NOV1 1 polypeptide (SEQ ID NO:52) encoded by SEQ ID NO:51 has 
315 amino acid residues and is presented in Table 1 IB using the one-letter amino acid code. 



Table 11B. Encoded NOV11 protein sequence (SEQ ID NO:52). 

MITSVSPSTSTNSSFLLTGFSGMEQQYPWPSIPFSSIYAMVLLGNCMVLHVIWTEPSLHQPMFY 
FLSMLALTDLCMGLS1VYWLGILWRIIREISLDSCIAQSYFIHGLSFMESSVLLTMAFDRYIA 
ICNPLRYSSILTNSRIIKIGLTIIGRSFFFITPPIICLKFFNYCHFHILSHSFCLHQDLLRLAC 
SDIRFNSYYALMLVICILLLDAILILFSYILILKSVLAVASQEERHKLFQTCISHICAVLVFYI 
PI ISLTMVHRFGKHLS PVAHVLIGNIYILFPPLMNPIIYSVKTQQIHTRMLRLFSLKRY 

10 A search of sequence databases reveals that the NOVl 1 amino acid sequence has 1 65 

of 302 amino acid residues (54%) identical to, and 222 of 302 amino acid residues (73%) 
similar to, the 31 1 amino acid residue ptnr: SPTREMBL-ACC:Q9WVN4 protein from Mus 
musculus (Mouse) MORS'BETAl (E = 7.0e"* 8 ). 

The disclosed NOVl 1 polypeptide has homology to the amino acid sequences shown 

1 5 in the BLASTP data listed in Table 1 IC. 



Table 11C. BLAST results for NOV11 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) ■ 


Expect 


gi (11991863 | gb | AAG4 
2364. l| (AF289204) 


odorant receptor 
HOR3»betal [Homo 
sapiens] 


321 


315/315 
(100%) 


315/315 
(100%) 


e-139 
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gi | 11908218 | gb | AAG4 
1683.1) (AF137396) 


HOR5 1 BetaS [Homo 
sapiens] 


312 


165/307 
(53%) 


231/307 
(74%) 


4e-78 


gi 1 17456753 |ref |XP_ 
061614. 1| 
(XM_061614) 


similar to MOR 
3Beta4 (H. 
sapiens) [Homo 
sapiens] 


315 


163/307 
(53%) 


223/307 
(72%) 


le-77 


gi | 7305345 | ref |NP_0 
38645. l| 
(NM_013617) 


olfactory 
receptor 65 [Mus 
muscul us] 


307 


164/305 
(53%) . 


223/305 
(72%) 


5e-77 


gi| 17456767 | ref |XP_ 
061618. l| 
(XM 061618) 


similar to 
prostate specific 
G-protein coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


879 


162/303 
(53%) 


226/303 
(74%) 


2e-76 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1 ID. In the ClustalW alignment of the NOV1 1 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 11D. ClustalW Analysis of NOV11 

10 1) Novel NOV11 (SEQ ID NO: 52) 

2) gi|ll991863 [ gb | AAG42364 . 1 | (AF289204) odorant receptor HOR3 1 betal [Homo sapiens] 
(SEQ ID NO:367) 

3) gi|ll908218]gb|AAG41683.l] (AF137396) HOR5'Beta5 [Homo sapiens] (SEQ ID N0:368) 

4) gi j 17456753 j ref |XP_061614 . 1 | (XM_061614) similar to MOR 3Beta4 (H. sapiens) 
15 [Homo sapiens] (SEQ ID NO: 369) 

5) gi | 7305345 | ref |NP_038645 . 1 | (NM_013617) olfactory receptor 65 [Mus mus cuius] 
(SEQ ID NO:370) 

6) gi | 17456767 | ref |XP_061618 . 1 | (XMJ)61618) similar to prostate specific G-protein 
coupled receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 371) 



25 



30 



35 



40 



NOVll 



gi 
gi 
gi 
gi 
gi 



11991863 
11908218 
17456753 
7305345 | 
17456767 



NOVll 



gi 
gi 
gi 
gi 
gi 



11991863 | 
11908218 j 
17456753 | 
7305345) 
17456767 1 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 1 

1 1 

1 - 1 

1 - - 1 

1 - - - 1 

1 MSLALDLCPLSQRLEAFPSSIVLFFQTAPAVRHPKGLLEIiHKTVPTSIKEELKGFFPTSD 60 

70 80 90 100 110 120 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 1 

1 -- - 1 

1 - 1 

1 - 1 

1 ~ 1 

61 HFIITDFIAKYOTDLKWAVIjGIATPRQQFKALN^ 120 

130 140 150 160 170 180 

....|....|....|....|....|....|....|..;.|....|....|....|....| 
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NOVll 

gi | 11991853 | 
gi | 11908218 | 
gij 17456753) 
gi 1 7305345 | 
gi 1 17456767 | 



1 
1 
1 
1* 
1 



121 DVSPMIHVLMADIFLIiVPPLLOTIVYCVKTHQIREKWGKLCPKN 180 



10 



15 



NOVll 


1 


gi 


11991863) 


1 


gi 


11908218 j 


1 


gi 


17456753 | 


1- 


gi 


7305345 | 


1 


gi 


17456767) 


181 



190 



200 



210 



220 



230 



|....|. 



240 



1 

- 1 

x 

1 

x 

181 GFRLAYyyLPPHPSKVSFLDPVEKANRSAPTQFSPMPSADASLLADLGTFSSLQRATFFL 240 



250 



260 



270 



280 



20 



25 



30 



35 



40 



NOVll 



gi 
gi 
gi 
gi 
gi 



11991863) 

11908218) 

17456753) 

7305345) 

17456767) 



290 



300 



.|....|....|. 



241 TGFQGLEGIJIGWISIPFCFIYLTVIIjGNLTILHVICTDATLHGPMyYFIjGMlAAVTDLGLC 300 



310 



320 



330 



I I I 



340 
..).... | 



350 



360 



NOVll 

gi 1 11991863 | 
gi | 11908218 j 
gi|l7456753| 
gi 1 7305345 j 

gi j 17456767 | 301 LSTLPTVLGIFWFDTREIGIPACFTQLFFIHTLSSMESSVLLSMSIDRYVAVCNPLHDST 360 



1 
1 
1 
1 
1 



370, 



380 



390 



NOVll 



gi 
gi 
gi 
gi 



11991863) 
11908218 j 
17456753) 
7305345) 



400 
..).. 



410 



420 



1 
1 
1 
1 
1 



gi 17456767) 361 VLTPACIVKMGLSSVLRSALLILPLPFIiKRFQYCHSHVIjAHAyCLHLEIMKLACSSII^ 420 



45 



50 



430 



NOVll 

gi | 11991863 | 
gi | 11908218 | 
gi) 17456753 | 
gi j 7305345 | 
gi) 17456767 | 



440 



450 



460 



470 



480 
)....) 



1 
1 
1 
1 



1 - - 1 

421 NHIYGLFWACTVGVDSLLIFLSYALILRTVLSIASHQERLRALNTCVSHICAVLLFYIP 480 



55 



60 



NOVll 



gi 
gi 
gi 
gi 
gi 



11991863 
11908218 
17456753 
7305345) 
17456767 



490 



500 



510 

■|....|.. 



520 



530 



540 



1 
1 
1 
1 



1 1 

481 MlGLSIiVHRFGEHLPRVVHLFMSYVYLLVPPLMNPIIYSIKTKQIRQRIIKKFQFIKSLR 540 



65 



70 



NOVll 



gi 
gi 
gi 
gi 
gi 



11991863 
11908218 
17456753 
7305345) 
17456767 



550 560 570 580 

. | | | | .... | . - | 

_ -jWITSVSPgTSTNSS 

• - MFLSSRMITSVSPOT^TksS 

• - ---MSSSSg- 

_ - -MGDWNNSD-SVEPI 

MWSSNj: 



590 



600 



541 CNHQYCLNLLQDFGGHPPSPLSPHTMTfiGSL^ 
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10 



15 



20 



25 



40 



45 



NOVll 



gi 
gi 
gi 
gi 



1199X863) 

11908218) 

17456753| 

7305345] 

17456767! 



NOVll 


94 




sldS 


gi 


11991863) 


100 




si»r£ 


gi 


11908218) 


89 




gsa] 


gi 


17456753) 


93 




QAS 


gi 


7305345) 


89 






gi 


17456767) 


661 







NOVll 



gi 
gi 
gi 
gi 



11991863 
11908218 
17456753 
7305345) 
17456767 



30 


NOVll 


214 £ES 




gi 


11991863 


220 i&xij 




gi 


11908218 


209 5tgj 




gi 


17456753 


213 JsiSF, 


35 


gi 


7305345) 


209 'Hit 


gi 


17456767 


781 Sht 



NOVll 



gi 
gi 
gi 
gi 
gi 




154 
160 
149 
153 
149 
721 



SP 273 
iSP 279 
>VPH 268 
' P 272 
'H 268 
IAPH 840 



850 



860 



870 



880 



11991863) 
11908218) 
17456753) 
7305345 | 
17456767) 




50 



Table 1 IE lists the domain description from DOMAIN analysis results against 
NOV1 1 . This indicates that the NOV1 1 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table HE Domain Analysis of NOVll 

gnl |pfam|pfam00001, 7tm_i, 7 transmembrane receptor {rhodopsin 

family). (SEQ ID NO:810) 

CD-Length « 254 residues, 100.0% aligned 

Score « 71.2 bits (173), Expect = 8e-l4 
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NOVll: 44 GNCMVLHVIWTEPSLHQPMFYFLSMLALTDLCMGLSTVyTVL IREI SLDSCI AQ 103 

II +1+ II I I II 11+ II 1+ I I I 

SbjCt: 1 GNLLVI LVILRTKKLRTPTNI FLL^TLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKIiV 60 

NOVll : 104 SYFIHGLS FMES S VLLTMAFDRYI AI CNPLRY S S ILTNSRI I KIGLTI IGRS FFFI TPPI 1G3 
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10 



15 



Sbjct: 
NOV11 : 
Sbjct: 
NOV11 : 
Sbjct: 
N0V11 : 
Sbjct: 



61 



+ +1 I I 1*1 I + 1111 III + | + + || + 

GALFWNGYAS ILLLTAI SIDRYLAI VHPLRYRRIRTPRRAKVLI LLVWVLALLLSLPPL 120 



1S4 I CLKFFNYCHFHI LSHSFCLHQDLLRIACSDIRFNSYYAliML VJC JLLLDA JI»ILFSY IL 223 
+ I ♦ + II ♦ I 1+ + +| ++|| I 



121 L FSWLRTVEEGNTTVCLIDF- 



• PEE S VKRSYVLLS TLVGFVIiPLLVI LVCYTR 171 



224 ILKSVLAVA SQEERHKLFQTCISHI CAVLVF - - YIPIISLTMVHRFGKHLS 272 

I III ♦ + II + I I + 

172 ILRTLRKRARSQRSLKRRSSSERKAAKMLLV\AAA/FVIjCWLPYHIVLLLI>SLCLLSIWRV 231 

273 PVAHVLIGNI YILFPPLMNPI IY 295 

♦II ♦UNI 
232 LPTALLITLWLAYVNS CLNPI IY 254 



G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
20 and are likely to underlie the recognition and G-protein-mediated transduction of various 

signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

25 Olfactory receptors (ORs) have been identified as extremely large subfamily of G 

protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 

30 genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV1 1 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 1 1 A or a 

35 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 1 1 A while still encoding a 
protein that maintains its G-Protein Coupled Receptor-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 

40 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
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modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. 
5 The disclosed NOV1 1 protein of the invention includes the G-Protein Coupled 

Receptor-like protein whose sequence is provided in Table 1 IB. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 1 IB while still encoding a protein that maintains its G-Protein 
Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 

1 0 In the mutant or variant protein, up to about 47 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor-like 
protein (NOV1 1) is a member of a "G-Protein Coupled Receptor family". Therefore, the 

15 NOV1 1 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders. The potential 
therapeutic applications for this invention include,, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 

20 research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV1 1 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

25 Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 

30 cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Osteodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 



111 



WO 02/068649 PCT/US02/02785 



disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other pathologies and disorders. 
5 NOV1 1 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV1 1 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. These novel proteins can be used in assay systems for 
10 functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 



NOV12 

A disclosed NOV 12 nucleic acid of 1067 nucleotides (also referred to as Curagen 
Accession No. CG56003-01) encoding a novel G-Protein Coupled Receptor -like protein is 
15 shown in Table 12A. An open reading frame was identified beginning with an ATG initiation 
codon at nucleotides 15-17 and ending with a TGA codon at nucleotides 1023-1025. The 
untranslated regions are underlined and the start and stop codons are in bold letters in Table 
12A. 



Table 12A. NOV12 nucleotide sequence (SEQ ID NO:53). 

AAAACCTGACATAAATGAACAACAATACAACATGTATTCAACCATCT^ 

T CATTT ACATC C TCCTTTGTATTGTTGGTGTTT TTGGAAACACTCT CT CTC 

TAGGTAAAAAAACATCAACGCACATCTACCTGTCACACC 

TGCCTTTCATGAGTATCTATTTCCTGAAAGGTTTCCAATGGGAATATCAATCTGCTCAATGCAGAGTGGTC^ 
ATTTTCTGGGAACTCTATCCATGCATG CAAGTATGTTTGT CAGTCTCTTAATTTTAAGTTGG ATTGCCATAA 
G CCGCTATGCTACCTTAATGCAAAAGGATT CCT CGCAAGAGACTACTTCATGCTATGAGAAAATATTTTATG 
G C C ATTTACTGAAAAAATT TCG CCAGC CCAACT TTG CT AGAAAACTATGCATTT ACATATGGGGAGTTGT AC 
TGGGCATAATCATTCCAGTTACCGTATACTACTCAGTCATAGAGGCTACAGAAGGAGAAGAGAGCCTATGCT 
ACAATCGG CAG ATGGAACTAGGAGC CATGATCT CTCAGATTG C 

TTT C CT TTTT AGTAGTACTAACATCATACT ACT CTTTTGTAAG CCATCTGAGAAAAATAAGAACCTGTACGT 

CCATTATGGAGAAAGATTTGACTTACAGTTCTGTGAAAAGACATCTTTTGGTCA 

TrTGCTTCCTTCCTTATAGTATTTTTAAACCCATTTTTTAT^ 

TGAATTATTTAATAGAAACAAAAAACATTCTCACCTGTCTTGCTTCGGCCAGAACT 

TATTTCTTTTATTAGATAAAACATTCAAGAAGACACTATATAATCrCTTTACAAAGTCTAATTCAG 

TGCAATCATATGGTTG ACTTTTGAATGGAAAACCCCACAATATTAAGAAAAGCATTCAT 

20 " " 

The disclosed NOV12 polypeptide (SEQ ID NO 54) encoded by SEQ ID NO:53 has 
336 amino acid residues and is presented in Table 12B using the one-letter amino acid code. 
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Table 12B. Encoded NOV12 protein sequence (SEQ ID NO:54). 

MNNNTTCIQPSMISSMALPIIYILLCIVGVFGOTLSQWIFLTKIGKKTSTHIYLSHLVTi^LLV 
CSAMPFMSIYFLKGFQWEYQSAQCRWNFLGTLSMHASMFVSLLILSWIAISRYATLMQKDSSQ 
ETTSCYEKIFYGHIiLKKFRQPNFARKLCIYIWGVVLGIIIPVTVYYSVIEATEGEESLCYNRQM 
ELGAMISQIAGLIGTTFIGFSFLWLTSYYSFVSHLRKIRTCTSIMEKDLTYSSVKRHLLVIQI 
LLI VCFLP YS I FKP I FYVLHQRDNCQQLNYLIETKNILTCLASARSSTDPI I FLLLDKTFKKTL YNLFT 
KSNSAHMQSYG 



A search of sequence databases reveals that the NOV 12 amino acid sequence has 52 of 
179 amino acid residues (29%) identical to, and 86 of 179 amino acid residues (48%) similar 
to, the 339 amino acid residue ptnr: SWISSPROT-ACC:Q13304 protein from Homo sapiens 
5 Putative G Protein-Coupled Receptor GPR17 (R12) (E = 1 .6c" 22 ). 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
10 signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

The disclosed NOV12 polypeptide has homology to the amino acid sequences shown 
15 in the BLASTP data listed in Table 1 2C. 



Table 12C. BLAST results for NOV12 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 18201870 |ref|NP 
543007.1 | 
(NMJ)80817) 


G protein- coupled 
receptor 82 [Homo 
sapiens] 


336 


336/336 
(100%) 


336/336 
(100%) 


e-170 


gi | 48853 01 |ref|NP_0 
05282. l| 
(NM_005291) 


G protein- coupled 
receptor 17 [Homo 
sapiens] 


367 


85/322 
(26%) 


144/322 
(44%) 


6e-2l 


gi 1 17462169 |ref |XP_ 
002705.4 | 
(XM_002705) 


G protein- coupled 
receptor 17 [Homo 
sapiens] 


339 


85/322 
(26%) 


144/322 
(44%) 


2e-20 


gi| 2695876 |erab|CAB0 
8108. 1| (Z94155) 


P2Y-like G- 
protein coupled 
receptor [Homo 
sapiens] 


298 


80/302 
(26%) 


135/302 
(44%) 


3e-18 


gi | 5757634 | gb | AAD50 

531.l|AF039686_l 

(AF039686) 


G-protein coupled 
receptor GPR34 
[Homo sapiens] 


381 


77/323 
(23%) 


152/323 
(46%) 


4e-i8 
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The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 12D. In the ClustalW alignment of the NOV12 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



Table 12D. ClustalW Analysis of NO V12 

1) Novel N0V12 (SEQ ID NO: 54) 

9) gi 1 18201870 | ref |NP_543007 . 1 1 (NM_080817) G protein- coupled receptor 82 [Homo 
sapiens] (SEQ ID NO:372) 

9) gi | 4885301 | ref |NP_005282.l| (NMJJ05291) G protein- coupled receptor 17 [Homo 
sapiens) (SEQ ID NO: 373) 

9) gi 1 17462169 1 ref |XP_002705 .4 | (XM_002705) G protein- coupled receptor 17 [Homo 
sapiens] (SEQ ID NO: 3 74) 

9) gi 12695876 1 erab I CAB08108.1) (Z94155) P2Y-like G-protein coupled receptor [Homo 
sapiens] (SEQ ID NO;375) 

9) gi | 5757634 |gb|AAD50531.l|AF039686_l (AF039686) G-protein coupled receptor GPR34 
[Homo sapiens] (SEQ ID NO: 376) 
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55 



60 



NO VI 2 

gi 
gi 
gi 
gi 
gi 



)....). 



18201870) 
4885301) 
17462169) 
2695876) 
5757634 | 



MSKI^WWAGSRKPPREmKLSGSDSSQSMNGLEVAPPGLITNFSiiATAEQ 
MNGLEVAPPGLITNFS2jATAE< 

MRSHTITMTTTSVSSWPYSSHRMRFITNHSDQPPQNFS - - ATPNVTTQpMDEKBl^T 55 




NOV! 2 


17 


ALPII 


gi 


18201870) 


17 


ALPII 


gi 


4885301) 


61 


LFASF 


gi 


17462169| 


33 


LFASF 


gi 


2695876) 


1 




gi 


5757634) 


56 


VLTTS 



NOV12 



gi 
gi 
gi 
gi 
gi 



18201870 | 

4885301) 

17462169) 

2695876) 

57576341 



NOV12 

gi 1 18201870 | 
gi j 4885301 1 
gi j 17462169 1 
gi 1 2695876 | 
gi j 5757634 j 



NOV12 




JyNRQMHLGS 196 
IQmBlG^ 196 
QLYRgKAS 218. 
LQLYRgKAB 190 
(LYRgKAS 149 
»gHN| 213 



250 



260 



270 



|....|. 



260 



197 Mll^gAGI^GTTgiC^S^ 



(SVVtjJs^Y; 



SBfSHlJ^IRTCTSlj 

114 



290 



300 

!QI 256 
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15 



20 



25 



gi 
gi 
gi 
gi 



18201870 | 
4885301) 
1746216$| 
2695876 | 
5757634| 



NOV12 

gi 
gi 
gi 
gi 
gi 



18201870 | 
4885301) 
17462169) 
2695876 | 
5757634 



197 MI^QXAGLJEGT 

219 m&jjysr-i m 

191 HH^vf--I 

150 hhaiAts - -i 

214 KGEA$F 



OVIQI 256 
i-MJAgjgA 270 
-M^EA^gA 242 
*-M2A^gA 201 
bTATTAKNSFiffiSl 273 



NOV12 


257 


gi 


18201870) 


257 


gi 


4885301) 


271 


gi 


17462169) 


243 


gi 


2695876 | 


202 


gi 


5757634) 


274 




Table 12E lists the domain description from DOMAIN analysis results against 
NOV12. This indicates that the NOV12 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 12E Domain Analysis of NOV12 

gnl ) Pf am | pf amOO 0 0 1 , 7tra_l , 7 transmembrane receptor ( rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 99.6% aligned' 

Score =82.0 bits (201), Expect = 5e-17 



NOV12 : 


32 


Sbj ct : 


1 


NOV12 : 


92 


Sbjct: 


61 


NOV12 : 


152 


Sbjct: 


101 


NOV12 : 


212 


Sbjct: 


160 


NOV12 : 


271 


Sbjct: 


220 



GNTLSQWIFLTKIGKKTSTHIYLSHLVTANLLVCSAMPFMSIYFLKGFQWEYQSAQCRVV 

II I * I +1 |+|*| +1 l+ll +1 **l*| I I + I |**| 

GNLLVTLVILRTKICLRTPTNI FLLNLAVADLL FLLTLP PWAL YYL VGGDWVFGDALCKLV 

NFLGTLSMHASMFVSLIilLSWIAISRYATLMQKDSSQETTSCYEKIFYGHLLKKFRQPNF 

I *+ +11+ +1+ IH II | + ++ | | 

GALFWNGYASIL LLTAI S I DRYLA IVHPLRYRRIRTPRR 



1+ I + *l * I + +1 



++ 



II ++I 



91 

60 

151 

100 V 

211 

159 



11*1 I + III |* |* + * 



* III* +* **l*lll I 



- ALLI TLWLAYVNS CLNP 1 1 253 



Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
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transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
5 displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 12 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 12A or a 
fragment thereof The invention also includes a mutant or variant nucleic acid any of whose 

10 bases may be changed from the corresponding base shown in Table 12A while still encoding a 
protein that maintains its G-Protein Coupled Receptor-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 

15 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

20 binding nucleic acids in therapeutic applications in a subject. 

The disclosed NOV 12 protein of the invention includes the G-Protein Coupled 
Receptor-like protein whose sequence is provided in Table 12B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 12B while still encoding a protein that maintains its G-Protein 

25 Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 77 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab>2. that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor-like 

30 protein (NOV1 2) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 1 2 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
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targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV12 nucleic acids and proteins of the invention are useful in potential 
5 therapeutic applications implicated in developmental diseases, MHC1I and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 

1 0 starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1 ), bacterial, 

fungal, protozoal and viral infections (particularly infections caused by HIV-l or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

1 5 Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington f s disease 

20 or Gilles de la Tourette syndrome, and/or other pathologies and disorders. 

NOV 12 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 12 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

25 NOVX Antibodies" section below. The disclosed NOV 12 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

30 NOV13 

NOV 13 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV 13a and NOV 13b. 
NOV13a 
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A disclosed NOV13a nucleic acid of 961 nucleotides (also referred to as CG56075-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 13 A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 12-14 and 
ending with a TGA codon at nucleotides 936,-938. The start and stop codons are shown in 
5 bold in Table 13 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 13A. NOV13a nucleotide sequence (SEQ ID NO:55). 

GACAACAAACTATGAGACAGATAAATCAGACACAAGTGACAGAATTCCTCCTTCTGGGACTCTCTGATGGGC 
CACACACCGAGCAGCTGCTATTTATCGTATTATTGGGTGTCTACCTGGTCACTGTGCTTGGAAATCTGCTTC 
TAATCTCCCTTGTTCATGTTGACTCCCAACTTCACACACCCATGTATTTTTTTCTCTGCAACT 

ctgacctctgtttctctaccaacatagttcctcaggcactagtccaCctgctttccagaaagaaggtcattg 
cattcacactttgcgcagctrcacttct 

TGATGTCCTATGATCGCTATGTTGC^TCTGCAATCCnCT 

GTGTCCAGCTGGCAACAGGATCATGGACCAGTGGCATTCTGGTGTOTGTGGTAGA 
GGCTACCCTACCGAGGCAGTAACAGCATTGCT 

CCACAGACACCCATGCATCAGAGATGGCCATTTTTCTTACGGGGGTTC 

TGATTCTGGTATCCTATGGCCGTATCATAGTAACTGTGGTCAAGATGAAGTCAACTGTGGGGAGTCTCAAGG 
C ATTTT CTAC CTGTGG C T C C CAC CT CATGGTGGTCATACTTTTTTATGGAT CAG CAATTATCACTT ACATGA 
CACCCAAGTCTTCCAAACAGCAGGAAAAATCGGTGTCTGTTT^^ 

C CCTCATCTATAGC CTGAGAAACAAGGATGTGAAGG C AGCTCTGAGG AAAGTAGCCACAAGG AAT TT CC CAT 
GAAGGCTTGGAATCTCACACTGACA 



The disclosed NOV13a polypeptide (SEQ ID NO:56) encoded by SEQ ID NO:55 has 
308 amino acid residues and is presented in Table 13B using the one-letter amino acid code. 

10 



Table 13B. Encoded NOV13a protein sequence (SEQ ID NO:56). 

MRQINQTQVTEFLLLGLSDGPHTEQLLFIVLLGVYLVTVLGNIiLLISLVHVDSQLHTPMYFFLC 
NLSLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLI^ 

PLRYPNIMTWKVCVQIiATGSWTSGILVSVVDTTFTLRLPYRGSNSIAHFFCEAPALLILASTDT 
HASEMAIFLTG^ILLIPVFLILVSYGRIIVTVVKMKSTVGSLKAFSTCGSHLMVVILFYGSAI 
ITYMTPKSSKQQEKSVSVFYAIVTPMLNPLIYSLRNKDVKAALRKVATRNFP 



A search of sequence databases reveals that the NOV 13a amino acid sequence has 216 
of 217 amino acid residues (99%) identical to, and 217 of 217 amino acid residues (100%) 
similar to, the 217 amino acid residue ptnr: SPTREMBL-ACC:095224 protein from Homo 
1 5 sapiens (Human) (Olfactory Receptor) (E = 2.2e* 109 ). 

NOV13b 

A disclosed NOV 13b nucleic acid of 961 nucleotides (also referred to as CG56021-02) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 13C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 12-14 and 
20 ending with a TGA codon at nucleotides 936-938. A putative untranslated region upstream 
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from the initiation codon is underlined in Table 13C. The start and stop codons are in bold 
letters. 



Table 13C NOV13b nucleotide sequence (SEQ ID NO:57). 

GACAACAAACTA TGAGACAGATAAATCAGACAC^GTGACAGAATTCCTCCTTCTGGGACTCTGTGATGGGC 
CACACACCGAGCAGCTGCTATTTATCGTATTATTGGGTGTCTACCTGGTCACTGTGCTTGGAAATCTGCTTC 
TAATCTCCCTTGTTCATGTTGACTCCCAACrTCACACACCCATGTATTTTTTTCTCTGCAACTT^ 
CTGACCTCTGTTTCTCTACCAACATAGTTCCTCAGGC^CTAATC 

CATTCACACTTTGCGCAGCTCGACTTCTCTTTTTCCTCATTTTTGGGTGTACCCAGTGCGCCCTTCr 
TGATGT C CT ATGAT CGCTATGT TG C AAT CTGCAAT C CT CTG CGTTACCCT AACATCATG ACCTGGAAAGTGT 
GTGTCCAGCTGGCAACAGGATC^TGGACCAGTGGCATOCTG 

GGCTACCCTACCGAGGCAGTAACAGCATTGCTCATTTCTTTTGTGAGGCCCCTGCACTATTGATCTTAGCAT 
CCACAGACACCCATGCATGAGAGATGGCCATTTTTCTTATGGGGGTTGTGATTCTCCTCATAC 
TGATTCTGGTAT C CTAT GG CCGT AT C ATAGTAACTGTG GTC AAGATGAAGT CAACTGTG GGGAGTCT CAAGG 
CATTTTCTACCTGTGGCTCCCACCTCATGGTG^ 

CACCCAAGTCTTCCAAACAGCAGGAAAAATCGGTGTCTGTTTTCTATGCAATAGTGACTCCCATGCTTAATC 
C CCT CATCT AT AGCCTGAGAAACAAGGATGTGAAGGCAG CT CT GAGGAAAGTAGCCACAAGG AAT TTC CCAT 
G AAGG CTTGGAATCTCACACTGACA 



In a search of public sequence databases, the NOV 13b nucleic acid sequence has 648 
5 of 653 bases (99%) identical to a gb:GENBANK-ID:AF065876|acc:AF065876.1 mRNA from 
Homo sapiens (olfactory receptor (OR2D2) gene, partial cds) (E = 2.8e' 139 ). 

The disclosed NOV13b polypeptide (SEQ ID NO:58) encoded by SEQ ID NO:57 has 
308 amino acid residues and is presented in Table 13D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 1 3b has a signal peptide and is 
1 0 likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV 13b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV 13b is between positions 53 and 54: 
VDS-QL. 

15 

Table 13D. Encoded NOV13b protein sequence (SEQ ID NO:58). 

MRQINQTQVTEFLLLGLCDGPHTEQLLFIVL^ 

FSTNIVPQALIHLLSRKKVIAFTLCAARLLFFLI FGCTQCALLAVMS YDRYVAI CNPLRYPNTMTWKVCVQL 
ATGSWTSGILVSVVDTTFTLRLPYRGSNSIAHFFCEAPAIXILASTDTHASEMAIFLMGVVILLIPVFLILV 
SYGRIIVTVVKMKSTOGSLKAFSTCGSHLMWILFYG 

SLRNKDVKAALRKVATRNFP 

A search of sequence databases reveals that the NOV 13 amino acid sequence has 52 of 
179 amino acid residues (29%) identical to, and 86 of 179 amino acid residues (48%) similar 
to, the 339 amino acid residue ptnr: SWISSPROT-ACC:Q13304 protein from Homo sapiens 
20 Putative G Protein-Coupled Receptor GPR17 (R12) (E = 3.3e" 157 ). 

NOV 13b is predicted to be expressed in at least Apical microvilli of the retinal 
pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, 
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corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 

cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 

umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 

cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 

5 cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 

> 

nervous, medulla,, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
10 were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

The disclosed NOV 13a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 1 3E. 



Table 13E. BLAST results for NOV13a 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%> 


Positives 
(%) 


Expect 


gi| 14423807 |sp|Q9H2 
10|O2D2_HUMAN 


OLFACTORY 
RECEPTOR 2D2 

(OLFACTORY 
RECEPTOR 11-610) 

(OR11-610) (HB2) 


308 


307/308 
(99%) 


308/308 
(99%) 


e-148 


gi | 17461460 |ref|XP 

062286.1) 

(XMJ562286) 


similar to hB2 
olfactory 
receptor (H. 
sapiens) [Homo 
sapiens] 


308 


308/308 
(100%) 


308/308 
(100%) 


e-148 


gi| 12007409 |gb|AAG4 
5183. l| (AF321233) 


B2 olfactory 
receptor [Mns 
musculus] 


314 


261/305 
(85%) 


278/305 
(90%) 


e-127 


gi'| 3831619 |gb|AAC70 
020.1 | (AF065876) 


olfactory 
receptor [Homo 
sapiens] 


217 


216/217 
(99%) 


217/217 
(99%) 


e-100 


gi|l5293767|gb|AAK9 
5076. l| (AF399591) 


olfactory 
receptor [Homo 
sapiens] 


214 


213/214 
(99%) 


214/214 
(99%) 


e-100 



15 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 13R In the ClustalW alignment of the NOV13 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (jLe. 9 regions that may be required to preserve structural or 

20 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 13F. ClustalW Analysis of NOV13 

1) Novel NOVl3a (SEQ ID NO: 56) 

2) Novel NOVl3b (SEQ ID NO: 58) 

3) gi) 14423807 |sp|Q9H210|O2D2_HUMAN OLFACTORY RECEPTOR 2D2 (OLFACTORY RECEPTOR 11- 
610) (OR11-610) (HB2) (SEQ ID NO:377) 

4) gi| 17461460 |ref|XP_062286 .1 | (XM_062286) similar to hB2 olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 378) 

5) gi) 12007409|gb)AAG45183.l| (AF321233) B2 olfactory receptor [Mus musculus] (SEQ 
ID NO:379) 

6) gi|3831619 |gb | AAC70020 . 1 | (AF065876) olfactory receptor [Homo sapiens] (SEQ ID 
NO:380) 

7) gi|l5293767«|gb|AAK95076.l| (AF399591) olfactory receptor [Homo sapiens] (SEQ ID 
NO:381) 



NOV13a 
NOV13b 



gi 
gi 
gi 
gi 



144238071 
17461460 I 
12007409 j 
3831619] 
152937671 




NOV13a 


61 


NOV13b 


61 


gi 


14423807 


61 


gi 


17461460 


61 


gi 


12007409 


60 


gi 


3831619) 


1 


gi 


15293767 


1 



I. 



70 
.1.. 



80 



I 



90 
. I . . 



100 



I 



no 



I. 
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FFLCNLSLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSY; 
FFLCNLSLADLCFSTNIVPQAL0HLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSY 
FFLCNLSLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSY 
FFLCNLSLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSY 
FJvLCI^SLADLCFSTNIVPQALQhLLSRKkQ^ 

SLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSY 
IlADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSY 



130 



NOV13a 


121 


NOV13b 


121 


gi 


14423807 


121 


gi 


17461460 


121 


gi 


12007409 


120 


gi 


38316191 


55 


gi 


15293767 | 54 



140 



150 



160 



170 



180 



DRYVAICNPLRYPglMTWKVCVQLATGSWTSGILVSVVDTTFTLRLPYRGSNSIAHFFCE, 

dryvaicnplryp|imtwkvcvqlatgswtsgilvswdttftlrlpyrgsnsiahffce [ 
dryvaicnplrypBimtwkvcvqlatgswtsgilvswdttftlrlpyrgsnsiahffce 
dryvaicnplrypSimtwkvcvqlatgswtsgilvswdttftlrlpyrgsnsiahffcE: 
dryvaicnplUjy^imtwSvcSqlatEswtsgilvswdttftlrlpyrgsnsiahffce 
dry-vaicnplrypglmtwkvcvqlatgswtsgilvswdttftlrlpyrgsnsiahffce 
dryvaicnplrypBimtwkvcvqlatgswtsgilvswdttftlrlpyrgsnsiahffce 



180 
180 
180 
180 
179 
114 
113 



N0V13a 


181 


NOV13b 


181 


gi 


14423807 


181 


gi 


17461460 


181 


gi 


12007409 


180 


gi 


38316191 


115 


gi 


15293767) 114 



I 



190 



200 
..I .. 



210 



I 



220 
..I.. 



230 



240 



1 



I. 



apallilastdthasemaifltgwillipvflilvsygriivtvvkmkstvgslkafst! 
apallilastdthaser^aifl^gwillipvflilvsygriivtwkmkstvgslkafst 
apall i las td thas emai fl tg wi l l i p vfl i l vs ygr 1 1 vt wkm ks t vg s lkafs ti 
apallilastdthasejviaifltgvvillipvflilvsygriivtwkmkstvgslkafst! 

APALLgLASTDTg^SEMAI FL^GWILLI PV^LILVS YG{|jI IVTWKMKS^G^JjKAFSt' 
APALLILASTDTHASEI4AIFLTGWILLIPVFLILVSYGRIIVTWKMKSTVGSLKAFST 
APALLILASTDTHASEMAIFLTG WILLI PVFLILVS YGRI IVT WKMKSTVGSLKAFST 



240 
240 
240 
240 
23 9 
174 
173 



H0V13a 


241 


NOV13b 


241 


gi 


14423807 


241 


gi 


17461460 


241 


gi 


12007409 


240 


gi 


3831619) 


175 


gi 


15293767 


174 



250 
..I.. 



260 



270 



280 



290 



300 



cgshlmwilfygsaiitymtpksskqqeksvsvfyaivtpmlnpliyslrnkdvkgalg 
cgshlmwilfygsaiit^tpksskqqeksvsvfyaivtpmlnpliyslrnkdvksalB 
cgshlmwilfygsaiitymtpksskqqeksvsvtyaivtpmlnpliyslrnkdvkBalB 



cgshlmwilfygsaiitymtpksskqqeksvsvfyaivtpmlnpliyslrnkdvkg;^|v 
cgshlmwilfygsaiitymtpksskqqeksvsvfyaivt.pmlnpliyslrnkdvk|alS 
cgshlmwilf ygsai itymtpks sk^qek^vsvf ya^tpml npliyslrnkdvk 
cgshlmwilfygsaiitymtpksskqqeksvsvfyaivtpmlS ~— 
cgshlmwtlfygs ai itymtpk5 skqqeksvsvfyaivt-pj 



rPMLNPLIYSLKNKDVKWALW 
rPMLNPLIYSLRHKDVKffiAL* 
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10 



NOV13a 
NOV13b 



gi 
gi 
gi 
gi 



14423807 
17461460 
12007409 
3831619| 
15293767 




Table 13G lists the domain description from DOMAIN analysis results against 
NO VI 3. This indicates that the NOV 13 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 13G Domain Analysis of NOV13 

gnl |Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 
CD-Length = 254 residues, 94.9% aligned 
Score = 93.2 bits (230), Expect = 2e-20 

15 

NOV13 : 54 
Sb j Ct : 14 
20 NOV13 : 114 
Sbjct: 74 

25 NOV13 : 174 
SbjCt: 134 

30 NOV13: 229 
Sbjct: 180 
NOV13 : 279 

35 

SbjCt: 240 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 

40 seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 

45 olfactory epithelium. 



QLHTPMYFFLC^SLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCA 113 

+1 II II II++III I + I II +1+ I || |** I 

KLRTPTNI FliIjNLAVADLLFljLTLPPWALYYLVGGDWVFGDALCKLVGALFVyNGYAS IL 73 

LLAVM S YDRYVAI CNPLRY PNIMTWKVCVQLATG S WT SG I LVS WOT T FTLRLPYRGSNS 173 

II -I lll+ll +1 1 1 1 11+ I I +1 + 1+ 1+ | + 

LLTAISIDRYIJUiVHPLRYRRIRTPRRAKVLILLVWVIJUaliLSLPPLLFSWIiRTVEEGNT 13 3 



IAHFFC EAPALLIIiASTDTHASEMAIFLTGWJLiJPVFLJLVSYGRIIVTVVKM 228 

+ ++I++ + + I I 11+ 1+ I 

TVCLIDFPEESVKRS YVLLSTLVGFVLPLLVI LV CYTRILRTLRKR 179 



KSTVGSLK AFSTCGSHLMWILFYGSAIITYMTPK55i?00£JCSV5VFYAI - 278 

+111 I ++ 1+ + 1+ + + + I 

ARSQRSLKRRSSSERKAAKMLLVVWVFVLCWLPYHIVLLLDSLCLLSITOVLPTALLIT 239 

VTPMLNPLIY 288 

I lll+ll 
L WLA YVNS CLNP 1 1 Y 254 
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Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
5 Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 13 nucleic acid of the invention encoding a G-Protein Coupled 
10 Receptor -like protein includes the nucleic acid whose sequence is provided in Table 13A, 

14C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 13A, or 14C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
1 5 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
20 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 percent of the bases may be so 
changed. 

25 The disclosed NOV13 protein of the invention includes the G-Protein Coupled 

Receptor -like protein whose sequence is provided in Table 13B, or 14D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 13B, or 14D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 

30 functional fragment thereof. In the mutant or variant protein, up to about 15 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fabb, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV 13) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV13 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

1 0 The NOV 1 3 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt f s lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

15 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by fflV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

20 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

25 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 13 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 13 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

30 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV13 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
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understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV14 

A disclosed NOV14 nucleic acid of 986 nucleotides (also referred to as CG56023-01) 
5 encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 14A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 23-25 and 
ending with a TGA codon at nucleotides 974-976. The start and stop codons are shown in 
bold in Table 14A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 14A. NOV14 nucleotide sequence (SEQ ID NO:S9). 

CTGGGGATTTATGCCCATACTTATGGCTATA 
CTCTTCCCTACCTACTGAAATACAGTCATTGCTCTTCCTGA 

GGGAAACAGCCTCATCATT CTGGTTAC CCTAGCTGACCCCATG CTACACAGCCCCATGTACTTCTTCCTCAG 

AAAOTTATCTTTCCTGGAGATTGGCTTCAACCTAGTCATTGTGCCCAAAATGCTGGGGACCerGCTTGCCCA 

GGACACAAC CAT CT CCTTC CTTGGCTGTGC CACTCAGATGTATTTCTTCTTCTTCTTTGGGGTAGCTGAATG 

CTTCCTCCTGGCTACCATGGCATATGACCGCTATGTGGCCATCTGCAGTCCCTTGCAOT 

GAACCAAAGGACACGGGCCAAACTGGCTGCTGCTTCCTGGTTCCCAGGCTTTCCTGTAGCTACTGTGCAGAC 

CACATGGCTCTTCAGTTTTCCATTCTC 

GCTGAAGCTGGTCTGTGCAGACACAGCACTGTTTGAGATCTACGCCATCGTCGGAACCATTCTGGTGGTCAT 
GAT CCCCTGCTTGCTGATCTTGTGTTC CTATACTCGCATTGCTGCTGCTATCCTCAAGATCC CATCAGCTAA 
AGGGAAGCATAAAGCCTTCTCTACGTGCTCCTCACACCTCCTTGTTGTC 

CCTCACCTACTTCTGGCCTAAATCAAATAATTCTCCTGAGAGCAAGAAGTTGTTATCATTATCCTAC^ 
TGTGACTCCCATGTTGAACCCCATTATCTACAGCTTGAGAAATAGCGAGGTGAAGAATGCCCTCAGCAGGAC 
CTT CCACAAGGTCCTAGC CCTCAGAAACTGTATCCCATAGACCTTAGGAA 



10 

The disclosed NOV14 polypeptide (SEQ ID NO:60) encoded by SEQ ED NO:59 has 
321 amino acid residues and is presented in Table 14B using the one-letter amino acid code. 



Table 14B. Encoded NOV14 protein sequence (SEQ ID NO:60). 

MPILMAIGNWTEISEFILMSFSSLPTEIQSLLFLT^ 

YFFLRNLSFLEIGFNLVIVPKMLGTLLAQDTTISFLGCATQMYFFFFFGVAECFLLATMAYDRY 
VAI CS PLHYPVIMNQRTRAKLAAASWFPGFPVATVQTTWLFSFPFCGTNKVNHFFCDS PPVLKL 
VCADTALFEIYAIVGTILVVMIPCLLILCSYTRIAAAILKIPSAKGKHKAFSTCSSHLLWSLF 
YISSSLTYFWPKSNNSPESKKLLSLSYTVVTPMLNPIIYSLI^SEVKNALSRTFHKVLALRNCIP 



1 5 A search of sequence databases reveals that the NOV14 amino acid sequence has 234 

of 310 amino acid residues (75%) identical to, and 264 of 310 amino acid residues (85%) 
similar to, the 315 amino acid residue ptnr: SPTREMBL-ACC:Q9JKA6 protein from Mus 
musculus (Mouse) (OLFACTORY RECEPTOR P2)(E = 4.0e~ 124 ). 

The disclosed NOV] 4 polypeptide has homology to the amino acid sequences shown 

20 in the BLASTP data listed in Table 14C 
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Table 14C. BLAST results for NOV14 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 14423805 |sp|Q9H2 
07 |OAA5_HUMAN 


OLFACTORY 
RECEPTOR 10A5 
(HP3) 


317 


317/317 
(100%) 


317/317 
(100%) 


e-154 


gi | 12007437 | gb | AAG4 
5207.l|AF321237 4 
(AF321237) 


hP4 olfactory 
receptor [Homo 
sapiens] 


317 


300/317 
(94%) 


305/317 
(95%) 


e-145 


gi | 12007412 | gb| AAG4 
5186. l| (AF321233) 


P3 olfactory 
receptor [Mus 
musculus] 


317 


292/316 
(92%) 


302/316 
(95%) 


e-140 


gi | 15419583 |gb | AAK9 
7076.l|AF293080 1 
(AF293080) 


olfactory 
receptor P3 [Mus 
musculus] 


324 


294/320 
(91%) 


304/320 
(94%) 


e-140 


gi | 12007411 | gb | AAG4 
5185. l| (AF321233) 


P4 olfactory 
receptor [Mus 
musculus] 


317 


281/316 
(88%) 


296/316 
(92%) 


e-136 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 14F. In the ClustalW alignment of the NOV14 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (z.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 14D. ClustalW Analysis of NOV14 

1) Novel NOV14 (SEQ ID NO: 60) 

2) gi|l4423805|sp|Q9H207|OAA5_HUMAN OLFACTORY RECEPTOR 10A5 (HP3) (SEQ ID NO:382) 

3) gi|l2007437|gb|AAG45207.l|AF321237 4 (AF321237) hP4 olfactory receptor [Homo 
sapiens] (SEQ ID NO:383) 

15 4) gi|l2007412|gb|AAG45186.l| (AF321233) P3 olfactory receptor [Mus jnusculus] (SEQ 
ID NO:384) 

5) gi|l5419583 | gb|AAK97 076. 1 |AF2 93 08 0_1 (AF293080) olfactory receptor P3 [Mus 
musculus] (SEQ ID NO: 385) 

6) gi|l20074ii|gb|AAG45l85.i| (AF321233) P4 olfactory receptor [Mus musculus] (SEQ 
20 ID NO:386) 
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30 



35 



NOV14 


1 


gi 


14423805 


1 


gi 


12007437 


1 


gi 


12007412 


1 


gi 


15419583 


1 


gi 


12007411 


1 



NOV14 

gi 1 14423805 
gi j 12007437 
gi 1 12007*12 



58 
54 
54 
54 
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IPMLHSPMYFFLRNLSFLEIGFNLVIVPKMLGTL^QDTglSFIiGCATQMYFFFFFGVAEC 
PML jjs PMY FFLRNLSF^E I G FNLVI ^PKML^TlB AQDT| I S FLGCATQMYFFFFFGVAE C 
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15 



20 



25 



30 



35 



gi 115419583 


61 


gi 


12007411 


54 


NO VI 4 


118 


gi 


14423805 


114 


gi 


12007437 


114 


gi 


12007412 


114 


gi 


15419583 


121 


gi 


12007411 


114 



NOV14 

gi 1 14423805 | 
gi j 12007437 j 
gi | 12007412 j 
gi 115419583) 
gi |12007411 j 



NOV14 


238 


gi 


14423805 


234 


gi 


12007437 


234 


gi 


12007412 


234 


gi 


15419583 


241 


gi 


12007411 


234 


NOVU4 


298 


gi 


14423805 


294 


gi 


12007437 


294 


gi 


12007412 


294 


gi 


15419583 


301 


gi 


12007411 


294 



I 



130 



140 



150 



160 



170 



I 



1 



180 

.4 



fllatmaydryvaicsplhypvimnqgtraklaaaswfpgfpvatvqttwlfsfpfc^ 
fllati^ydry^aicsplhypvir^qgtrakiiaaaswfpgfpvatvqttwlfsfpfc^n 
fllatmaydryvaicsplhypvimnqStraklaaQswfpgfpvatvqttwlfsfpfcBtn 

FLLATMAYDRYVAI CSPLHYPVIMNqBtrKkLAAAS WFPGFPVATVQTTWLFS FPFCgTN 

fllatmaydryvaicsplhypvii4nqStr"klaaaswfpgfpvatvqttv7lfsfpfc3tn 
fllatmaydryvaicsplhyp5lmnq§tja)j^laaaswfpgfpvatvqttv?lfsfpfc 



177 
173 
173 
173 
180 
173 



200 



210 



22 0 



230 



240 




250 



260 



I. 



270 
. -J. . 



I 



28*0 
..I.. 



290 



300 



gkhkafstcsshllwslfyrsssltyfgpksnnspeskkllslsytwtpmlnpiiysl 
gkhkafstcsshllwslfyhsssltyfSpksnnspeskkllslsytvvtpmlnpiiysl 
gkjskafstcsshllwslfyqsjjsl^^ 

gkhkafstcsshllwslfySsssltyfrpksnnspeskkllslsytwtpmlnpiiysl 
gkhkafstcsshllwslfyRsssltyfrpksnnspeskkllslsytvvtpmlnpiiysl 
gkhkafstcsshlSwslfyws?IsltyfrpksISnspeskkllslsytvvtp2lnpiiysl 
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320 



E V KN AL S RTFH KE L ALRNS I 
RNgE VKNALSRT FHk3lALRnSi 

rnnevkkalsrtESkalalrnSi 

RNNEVKgALSRTFHKALALRNgI 
RHNE VK§AL S RT FHKALALRJwBl 
RNNEVKNALSRTFHKALALRr^ " 





321 


jcnp 


317 


cHp 


317 




317 


324 


CFHL 


317 



40 



Table HE lists the domain descriptions from DOMAIN analysis results against 
NOV14. This indicates that the NOV14 sequence has properties similar to those of other 
proteins known to contain this domain. 
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55 



Table 14E Domain Analysis of NOV14 

gnl|Pf am|pf amOOOOl, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). {SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 

Score = 103 bits (256) , Expect = 2e-23 



NOV14 : 


46 


Sb j Ct : 


1 


NOV14 : 


106 


Sbjct: 


61 


NOV14 : 


166 


Sbjct: 


114 



GNSLI ILVTLADPMLHSPMYFFLRNLSFLE IGFNLVI VPKMLGTLLAQDTTISFLGCATQ 105 

li l+lll I I +1 II 11+ ++ I I + I I 1+ I I 

GNLLVILVILRTKKLRTPTNIFLLNIiAVADLLFLIjTLPPWALYYLVGGDWVFGDALCKIiV 60 



I II II ++ 1 11+11 II I I I I I 

GALFWNGYASILLLTA1SIDRYLAIVHPLRYRRIRTPREAKVLILLVWVLAL- 



I I I 



I ++ I++ 



l+ll III 



113 



127 
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N0V14 : 226 IA AAILKIPSAKGKHKAFSTCSSHLLWSLFY ISSSLTYFWPKSNNS 272 

I II 1+ + I I + 

Sbjct : 172 I LRTLRKRAR SQRSLKRRS S S ERKAAKMLL WVWFVLCWL P YHI VLLLDSLCLLS I WRV 231 



NOV14: 273 PESKKLLSLSYTVVTPMLNPIIY 295 

+ I++I I nun 



Sbjct: 232 LPTALLITLWLAYVNSCLNPIIY 254 

10 G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 

subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 

15 respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 

20 transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 

25 epithelium. 

The disclosed NOV 14 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 14A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 14A while still encoding a 

30 protein that maintains its G-Protein Coupled Receptor -like activities and physiological 

functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

35 include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. 
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The disclosed NOV14 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 14B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 14B while still encoding a protein that maintains its G-Protein 
5 Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 12 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fabkthat bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
10 protein (NOV14) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV14 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
1 5 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV14 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCI1 and III diseases 
20 (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsul in-dependent diabetes mellitus (NIDDM1), bacterial, 
25 fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HTV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
30 ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 
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NOV 14 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 14 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV 14 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
10 NOV15 

NOV15 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV 15a and NOV 15b. 

NOV15a 

A disclosed NOV15a nucleic acid of 943 nucleotides (also referred to as CG56065-01) 
1 5 encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 1 5A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 2-4 and 
ending with a TGA codon at nucleotides 935-937. The start and stop codons are shown in 
bold in Table 15 A, and the 5 5 and 3' untranslated regions, if any, are underlined. 



Table 15A. NOVlSa nucleotide sequence (SEQ ID NO:61). 

"aatggcagcagaaaaccattc 

agctacagctgcccctcttcctcgtcttcctgggaatctatgtagtcacagtgctggggaacctg 
ggcatgatcacactgattgggctcagttctcac^ 

tctgtccttcattgacttctgccattccactgtcattacccctaagatgctggtgaactttgtga 
cagagaagaacatcatctcctacccrgaatgcatgactcagctctacttcttcctcgtttttgct 
attgcagagtgtcacatgttggctgcaatggcatatgacggctacgtggccatctgtagcccctt 
gctgtacagcatcatcatatccaataaggcttgcttttctctgattttagtggtgtatgtaatag 
gcctgatttgtgcgtc^gctcatataggctgtatgtttagggttcaattctgcaaatttgatgtg 
atcaaccattatttctgtgatcttatttct 

tgagttactgattttaatctttagtggaattaacatccttgtccccagcctgaccatcctcagct 
cttacatcttcatcattgccagcatcct^ 

agcacttgcagctcccacatctcggctgtttctgttttctttgggtctgcagcattcatgtacct 
gcagccatcatctgtcagctccatggaccaggggaaagtgtcctctgtgttttatactattgttg ■ 
tgcccatgctgaaccccctgatctacagcctgaggaataaagatgtccacgttgccctgaagaaa 
acgctagggaaaagaacattcttatg aacagaa 

20 " — — — — _l 

The disclosed NOVlSa polypeptide (SEQ ID NO:62) encoded by SEQ ID NO:61 has 
311 amino acid residues and is presented in Table 15B using the one-letter amino acid code. 
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Table 15B. Encoded NOV15a protein sequence (SEQ ID NO:62). 

MAAENHSFVTKFILVGLTEKSELQLPLFLVFLGIYVVTVLGNLGMITLIGLSSHLHTPMYCFLS 
SLSFIDFCHSTVITPKMLVNFVTEKNIISYPECMTQLYFFLVFAIAECHMLAAMAYDGYVAICS 
PLLYSIIISNKACFSLILWYVIGLICASAHIGCMFRVQFCKFDVINHYFCDLISILKLSCSST 
YINEIjLILIFSGINILVPSLTILSSYIFIIASILRIRYTEGRSKAFSTCSSHISAVSVFFGSAA 
FMYLQPSSVSSMDQGKVSSVFYTlVVPMIiNPLIYSLiaJKPVHVALKKTLGKRTFL 



A search of sequence databases reveals that the NOV 15a amino acid sequence has 235 
of 31 1 amino acid residues (75%) identical to, and 270 of 31 1 amino acid residues (86%) 
similar to, the 311 amino acid residue ptnr: SPTREMBL-ACC:035184 protein from Rattus 
5 norvegicus (Rat) (Olfactory Receptor) (E = 9.9e* 121 ). 
NOVlSb 

A disclosed NOVlSb nucleic acid of 943 nucleotides (also referred to as CG56065-02) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 1 5C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 2-4 and 
10 ending with a TGA codon at nucleotides 935-937. The start and stop codons are shown in 
bold in Table 15C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 15C. NOV15b nucleotide sequence (SEQ ID NO:63). 

^TGGCAGCAGAAAACCATTCTTTTGTGACTAAGTTTATTCTGGTTGGGCTAACAGAGAAGTCAGAGCTACA 
GCTGCCCCTCTTCCTCGTCTTCCTGGGAATCTATGTAGTC^CAGTGCTGGGGAACCTGGGCATGATCACACT 
GATTGGGCTCAGTTCTCACCTGCACACACCTATGTACHGTTTCCT 

CCATTCCACTGTCATTACCCCTAAGATGCTGGTGAACTTTGTGACAGAGAAGAACATCATCTCCTACCCTGA 
ATGCATGAOTCAGCTCTACTTCTTCCTCGTTTTTGCTATTGCAGAGTGTCACATGTTGGCT 
TGACGGCTACGTGGCCATCTGTAGCCCCGTGCTGTACAGCATCATCATATCCAATAAGGCTTGCTTTTCTCT 
GATTTTAGTGGTGTATGTAATAGGCCTGATTTGTGCGT 

CTGCAAATTTGATGTGATCAACCATTATTTCTGTGATCTTAT1TCTATCTTGAAGCTCTCCTGTTCTAGTAC 
TTAC^TTAATGAGTTACTGATTTTAATCTTTAGTGGAATTAACATCCTTGT CCCCAGCCTGACCATC CTCAG 
CTCTTACATCTTCATCATTGCCAGCATCCTCCGCATTCGCT ACACTGAGGG CAGGTC CAAAGCCTTCAGCAC 
TTGCAGCTCCCACATCTCGGCTGTTTCTGTTTTCTTTGGGTCTGCAGCATTCATGTACCTGCAGCCATCATC 
TGTCAGCTCCATGGACCAGGGGAAAGTGTCCTCrGTGTTTTATACTATTGTTGTGCCCGTGCTGAACCCC 
G ATCTAC AGC C TGAGG AATAAAGATGT C CACGTTGC CCTGAAGAAAACG CTAGGGAAAAGAAC ATT C TTATG 
AACAGAA 



In a search of public sequence databases, the NOV 15b nucleic acid sequence, localized 

1 5 to chromosome 4, has 770 of 937 bases (82%) identical to a gb:GENBANK- 

ID:AF282271 |acc:AF282271.1 mRNA from Mus musculus (odorant receptor Kl 1 gene, 

complete cds) (E = 5.2e" 135 ). 

The disclosed NOVlSb polypeptide (SEQ ID NO:64) encoded by SEQ ID NO:63 has 

311 amino acid residues and is presented in Table 15D using the one-letter amino acid code. 

20 Signal P, Psort and/or Hydropathy results predict that NOV 15b has no signal peptide and is 

likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 

NOV 1 5b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
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reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000, The most likely cleavage site for NOV15b is between positions 41 and 42: 
VLG-NL. 



Table 15D. Encoded NOVlSb protein sequence (SEQ ID NO:64). 

MAAENHSFVTKFILVGLTEKSELQIiPLFLVFIX^ 
HSTVITPKMLVNFVTEKNIISYPECMTQLYFF^ 

ILWYVIGLICASAHIGCMFRVQFCKFDVINHYFCDLISILKLSCSSTYINELLILIFSGINILVPSLTILS 
SYIFIIASILRIRYTEGRSKAFSTCSSHISAVSVFFGSAAFMYLQPSSVSSMDQGKVSSVFYTIVVPVIiNPL 
IYSLRNKDVHVALKKTLGKRTFL 

5 ^ " ^ ~'~ ' ~~ ^ ~ — . - ^ - 

A search of sequence databases reveals that the NOVlSb amino acid sequence has 237 
of 3 1 1 amino acid residues (76%) identical to, and 273 of 3 1 1 amino acid residues (87%) 
similar to, the 314 amino acid residue ptnr:TREMBLNEW-ACC:AAG39856 protein from 
Mus musculus (Mouse) (Odorant Receptor Kl 1) (E = 2.6e" 125 ). 

10 NOV1 5b is predicted to be expressed in at least the following tissues: Apical microvilli 

of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 

15 hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 

20 thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

The disclosed NOV 15a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 1 5E. 

25 



Table 15E. BLAST results for NOV15a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17472672 |ref|XP 
061794. 1| 
(XM 061794) 


similar to 
odorant receptor 
Kli (H. sapiens) 
[Homo sapiens] 


311 


311/311 
(100%) 


311/311 
(100%) 


e-140 
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gi | 11692519 | gb | AAG3 
9856 . 1 | AF282271 1 
(AF282271) 


odorant receptor 
Kll [Mus 
musculus] 


314 


239/311 
(76%) 


273/311 
(86%) 


e-110 


gi j 11692527 | gb j AAG3 
9860.1 |AF282275_1 
(AF282275) 


odorant receptor 
K15 [Mus 
muscul us] 


311 


236/311 
(75%) 


271/311 
(86%) 


e-108 


gi| 17472662 |ref|XP 
061790. l| 
{XMJ361790) 


similar to 
odorant receptor 
K4hll (H. 
sapiens) [Homo 
sapiens]. 


593 


233/301 
(77%) 


261/301 
(86%) 


e-105 


gi 1 2317704 | gb | AAB66 
333. 1| (AF010293) 


olfactory 
receptor [Rattus 
norvegicus] 


311 


235/311 
(75%) 


270/311 
(86%) 


e-105 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 15F. In the ClustalW alignment of the NOV 15 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (Le. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 15F. ClustalW Analysis of NOV15 



1) Novel NOVl5a {SEQ ID NO: 62) 

2) Novel NOVlSb (SEQ ID NO: 64) * 

3) gi| 17472672 | ref |XP_061794 . 1 1 (XM_061794) similar to odorant receptor Kll (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 87) 

4) gi 1 11692519 |gb | AAG398S6 . 1 1 AF282271_1 (AF282271) odorant receptor Kll [Mus 
musculus] (SEQ ID N0:388) 

5) gi 1 11692527 |gb | AAG39860 . 1 1 AF282275_1 (AF282275) odorant receptor K15 [MUS 
musculus] (SEQ ID NO: 389) 

6) gi | 17472662 | ref | XPJ)61790 . 1 1 (XM_061790) similar to odorant receptor K4hll (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 90) 

7) gi | 2317704 | gb | AAB66333.1) (AF010293) olfactory receptor [Rattus norvegicus] (SEQ 
ID N0:391) 
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30 



10 



NOV15a 


1 


NOVlSb 


1 


gi 


17472672 | 


1 


gi 


11692519] 


1 


gi 


11692527 | 


1 


gi 


17472662| 


1 


gi 


2317704) 


1 
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|....|. 



60 



1 

_ _ ! 

± 

1 

_ - 1 

MVKGNHST VTEFNLAGLTDKPELQLPLFLLFLGI YWTWGNLSMITLIGFSSHLHTPMY 6 0 
~ " " - 1 
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NOVlSa 


1 


NOVlSb 


1 


gi 


17472672 


1 


gi 


11692519 


1 


gi 


11692527 


1 


gi 


17472662 


61 


gi 


2317704| 


1 
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HFLSSLSFIDLCQSSVITPKMLVNFVSERNI I SYPACMTQLYFFL VXVISECHMLAAMAY 
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NOV15a 

NOV15b 

gi | 17472672 

gi | 11692519 

gij 11692527 

gi j 17472662 

gij 2317704 | 



NOVlSa 
NOVlSb 



gi 
gi 
gi 
gi 
gi 



17472672 
11692519 
11692527 
17472662 
2317704) 



N0V15a 

N0V15b 

gi 1 17472672 | 

gij 11692519 | 

gij 11692527 j 

gij 17472662 | 

gij 2317704 | 



NOVlSa 
N0V15b 



NOV15a 
NOVlSb 
gi | 17472672 
gij 11692519 
gij 11692527 
gij 17472662 
gij 2317704 | 
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..|.. 



140 
..|.. 



150 



160 



170 



|....|....| 



180 
-•I 



121 DHYIAICNPLLYHVAMSYQVCSWMWEVYFMGFIGATCSHSLHAKSAFLLTILSSYIFIV 180 
1 - i 



190 



200 



210 



220 
..|....| 



230 



240 



181 ASILCIRSTEGRSKTFSTCSSHISAVSVFFGGTSRSRFQVLGLBVRSVRLGGCPDAGQTP 240 
1 " " - 1 



250 



260 



270 



280 



|....|. 



1 

1 

1 - 

1 

1 

241 ETQPPVQSLFSGHRNLAPSAR^EKKNVQPWTLAERMETVDKI 

1 




17 

17 

17 

20 

17 

300 
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310 



320 
. . I . . 



330 
..I 



340 
. . I . . 



350 



360 



18 
18 



gi 


17472672 


18 


gi 


11692519 


21 


gi 


11692527 


18 


gi 


17472662 


301 


gi 


2317704| 


18 


N0V15a 


78 


N0V15b 


78 


gi 


17472672 


78 


gi 


11692519 


81 


gi 


11692527 


78 


gi 


17472662 


361 


gi 


2317704 | 


78 



ekSelqlplflwflgiyvvtvSgnm^ 

ekgelqlplflqflgiyvvtvggnlgmitliglsshijn'pmybflsslsfidfchstvi 
ek^lqlplf^fqgiy^tvggnlgmlqliglsshlhtpmyyflsslsfidfc^stvj 

e^elql3lfl§flgiBwtv|gnlgmitliglsshlhtpmySflsslsfidfchstvi 
2kSelq;EplfS 



77 

77 

77 

80 

77 

360 

77 



NOVlSa 


198 


NOVlSb 


198 


gi 


17472672 


198 


gi 


11692519 


201 


gi 


11692527 


198 


gi 


17472662 


481 


gi 


2317704 | 


198 




138 NgACF^ 

138 iMhc: 

138 NKA< 

141 YpiYI 

138 YgiYTg 

421 YHH< 

138 Y§SYI| 
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NOV15a 


258 


NOVlSb 


258 


gi| 174726721 


258 


gi (11692519 j 


261 


gij 11692527 | 


258 


gij 17472662 | 


541 


gij 2317704 | 


258 




15 



Table 15G lists the domain description from DOMAIN analysis results against 
NOV15. This indicates that the N0V15 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 15G Domain Analysis of NOV15 

gnl|pfam|pfam0000l, 7tm_l, 7 transmembrane receptor (rhodopsin 

family)., (SEQ ID NO: 810} 

CD-Length = 254 residues, 100.0% aligned 

Score = 86.7 bits (213), Expect = 2e-18 





NOV15 : 


41 


GNLGMI TL I GLS SHLHTPMYCFLS SLS FIDFCHS TVI TPKMLVNFVTEKNI I S YPECMTQ 

lil + 1 +1 + 1 II II +|.+ 1 +111 + | 

GNLLV I LV I IiRT KKLRT PTNI FLLNLAVADLLFLLTLPP W ALYYL VGGDWVFGDAL CKLV 


100 


20 


Sbjct: 


1 


60 




NO VI 5 : 


101 


L YFFLVFAIAECHMLAAMAYDGYVAI CSPLLYS III SNKACFSLI L WYVI GLI CAS AHI 


160 


25 


Sbjct: 


61 


l + l 1 + 1 I++ 1 Ml II 1 1 + + III + M+ 1+ + + 

GAL FWNG Y AS I LLLTAI S ID R YLAI VHPLRYRR I RT PRRAK VL I LL VWVLAL LL S L P PL 


120 




N0V15 : 


161 


GCMFRVQFCKFDVINHYFCD- --- -LISILKLSCSSTYINELLILIFSGINILVPSLTIL 


215 




Sb j ct : 


121 


+ + + 1 + II ++ ||+++ || 
LFS WLRTVEEGNTT VCL IDFPEES VKRS Y"VLLSTLVGFVLPLL VI LVCYTRILRTLRKRA 


180 


30 


NO VI 5: 


216 


SSYIFIIASILRIRYTEGRSKAFSTCSSHISAVSVFFGSAAFMYL QPSSVSSMDQG 


271 




Sb j Ct : 


181 


1 l+l+ll +1+ +| | + 
RSQ RSLPCRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRVLPTA 


235 


35 


NOV15 : 


272 


KVSSVFYTIWPMLNPLIY 290 




Sbjct: 


236 


+ +++ 1 II Ml 
LL ITLWLAYVNS CLNP 1 1 Y 254 
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G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 
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Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
5 Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV1 5 nucleic acid of the invention encoding a G-Protein Coupled 
10 Receptor -like protein includes the nucleic acid whose sequence is provided in Table 15 A, 

15C or a fragment thereof. Hie invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 15A or 15C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
1 5 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
20 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 8 percent of the bases may be so 
changed. 

25 The disclosed NOV 1 5 protein of the invention includes the G-Protein Coupled 

Receptor -like protein whose sequence is provided in Table 15B, or 15D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 1 5B, or 1 5D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 

30 functional fragment thereof. In the mutant or variant protein, up to about 23 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV15) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 15 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 

5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

1 0 The NOV 1 5 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

1 5 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), npninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

20 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Osteodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

25 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 15 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 15 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

30 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV 15 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
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understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV16a 

A disclosed NOV 16a nucleic acid of 891 nucleotides (also referred to as CG56067-01) 
5 encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 16A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 5-7 and 
ending with a TAA codon at nucleotides 878-880. The start and stop codons are shown in 
bold in Table 16a, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 16A. NOV16a nucleotide sequence (SEQ ED NO:65). 

GAAAATGTCAGCAGGAAAC CATTCCT CAGTGACTGAGTTCATTCTGGCTGGGCTCTCAGAA.CAGCCAGAGCT 
CCAGCTGCGCCTCTTCCTCCTGTTCTTAGGAATCT 

ACTGATTGGGCTCAGTTCTCACCTGCATACCCCCATGTACTATTTCCTCAGTGGTCTGTCCTTCATTGATAT 
CTGCCATTCCACTATCATTACCCCGAAAA 

TGAATG CATGACTCAGCTTTAC TTCTTCCT CATTTTTG CT ATTG CAGAGTGTC ACATGTTGG CTGTAACG G C 
ATATGACCGCTATGTTGCCATCTGCAGCCCCTTGCTGT^ 

G CT CACAGTGGGAGTTTACATTTTAGGCAT C CTTGG AT CT ACAATTCACAC CGG CTTTATGTT GAGACT CTT 

TTTGTGCAAGACTAATGTGATTAACCATTATTTTTGTGAT^ 

CACCTACATCAATGAATTACTGGTTCTGGTCTTGAGTGCATTO 

TGCTTCTTACATCTTTATC^TTGCCAGCATCCTCCGCACT 

CACTTGCAGCTCCCACATCTTGGCTGTTGCTGTTTTCTTTC 

ATCTGTCAGCTCCATGGACCAGGGGAAAGTGTCCTCTGTGTTTTATACTATTGTTC 
. C CAAT CT AT AG CC T AAGAAATAAGGAT 

10 

In a search of public sequence databases, the NOV 16a nucleic acid sequence, localized 
to chromosome 4, has729 of 888 bases (82%) identical to a gb:GENBANK- 
ID:AF282293|acc:AF282293.1 mRNA from Mus musculus (odorant receptor K4hl 1 gene, 
1 5 complete cds) (E = 9.8e" 127 ). 

The disclosed NOV16a polypeptide (SEQ ID NO:66) encoded by SEQ ID NO:65 has 
311 amino acid residues and is presented in Table 16B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 16a has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
20 NOV16A may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV16A is between positions 41 and 
42: VVG-NL. 



Table 16B. Encoded NOV16a protein sequence (SEQ ID NO:66). 

MSAGNHSSVTEFII^GLSEQPELQLRLI^FLGIYVVTVVGNLSMITLIGLSSHLHTPMYYFLS • 
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GLS FID I CHSTI ITPKMLVNFVTEKNI I S YPECMTQLYFFLI FAI AECHMLAVTAYDRYVAICS 
PLLYNVIMSYHHCFWLTVGVYILGILGSTIHTGFMLRLFLCKTNVINHYFCDLFPLLGLSCSST 
YINELLVLVLSAFNILTPALTILASYIFIIASILRIRSTEGRSKAFSTCSSHILAVAVFFGSAA 
FMYLQPSSVSSMDQGKVSSVFYTIWPMUJPQSIA • 

A search of sequence databases reveals that the NOV 16a amino acid sequence has 232 
of 287 amino acid residues (80%) identical to, and 253 of 287 amino acid residues (88%) 
similar to, the 307 amino acid residue ptnr:TREMBLNEW-ACC:AAG39878 protein from 

5 Mas musculus (Mouse) (Odorant Receptor K4H1 1) (E = 5.1e~ 122 ). 

NOV 16a is predicted to be expressed in at least Apical microvilli of the retinal pigment 
epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 

10 umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 

15 spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thym us 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 
NOV16b 

20 A disclosed NOV16b nucleic acid of 939 nucleotides (also referred to as CG56753-02) 

encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 16C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and 
ending with a TAG codon at nucleotides 934-936. The start and stop codons are shown in 
bold in Table 16C, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 16C. NOV16b nucleotide sequence (SEQ ED NO:67). 

ATGTCAGGAGAAAAT AATT C CT CAGTG ACT GAGTT CATT CT GGCTGGG CTCT CAGAACAG C CAGAG CTCCAG 

CTGCCCCTCTTCCTCCTGTTCTTAGGAATCTATGTGGTCAGAGTGGTGGGC^ 

ATTTGGCTCAGTTCTCACCTGCACACCCCTATGTACTATTTCCTCM 

CATTCCACTGTCATTACCCCTAAGATGCTGGTGAACTTTGTGACAGAGAAGAACATC^TCTCCTACCCTGAA 

TGCATGACTCAGCTCTACTTCTTCCTCGTTTTTGCTATTGCAGAGTGTCACATGTTGGCTGCAATGGCGTAT 

GACCGTTACATGGCCATCTGTAGCCCCTTGCTGTACAGTGTCATCATATCCAATAAGGCTTGCTTTTCTCTG 

ATTTTAGGGGTGTATATAATAGGCCTGGTTTGTGCATCAGTTCATACAGACAGTATGTTTAGGGTTCA^ 

TGCAAATTTGATTTGATTAACCATTATTTCTGTGATCTTCTTCCCCTCCTAAAGCTCTCTTGCTCTAGTATC- 

TATGTCAACAAACTACTTATTCTATGTGTTGGTGCATTTAACATCCTTGTCCCCAGCCTGACCA 

TCTTACATCTTTATTATTGCCAGC^TCCrrCCACATTCGCTCCACTGAGGGCAGGTCCAAAGCCTTCA 

TGT AGCTCC CACATGTTGGCGGTTGTAATCTTTTT TGG ATCTGCAGCAT TCATGT ACTTG CAGC CAT CTT CA 

ATC^GCTCC^TGGACCAGGGGAAAGTATCCTCTGTGTTTTATACTATTATTGTGCCCATGTTGAACCCTCTG 

ATTT ATAGC CT GAG GAATAAAGATGTC CATGTTTC C CTG AAGAAAATG C TACAGAGAAGAAC ATTATTGTAA 

ACA 



In a search of public sequence databases, the NOV 16b nucleic acid sequence has 770 
of 935 bases (82%) identical to a gb:GENBANK-ID:AF282271|accAF28227Ll mRNA from 
Mus musculus (odorant receptor Kl 1 gene, complete cds) (E = 1.3e" 136 ). 
5 The disclosed NOV16b polypeptide (SEQ ID NO:68) encoded by SEQ ID NO:67 has 

311 amino acid residues and is presented in Table 16D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 1 6b has A signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV16b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
1 0 reticulum (membrane) with a certainty of 0.3000, or in the endoplasmic reticulum (lumen) 
with a certainty of 0.3000. The most likely cleavage site for NOV16b is between positions 41 
and 42: VVG-NL. 



Table 16D. Encoded NOV16b protein sequence (SEQ ID NO:68). 

MSGEmiSSVTEFIIAGIiSEQPELQLPLFLLFLGIYVVTWGNLGMTTLIWLSSHLHTPMYYFLSSLSFIDFC 
HS WITPKMIiVNFOTEKNI IS YPECMTQLYFFLVFAI AECH I SNKACFSL 

ILGVYIIGLVC^VHTDSMFRVQFCKFDLINHYFO^ 
SYIFIIASILHIRSTEGRSKAFSTCSSHMLAWIFFGSAAFMYLQPS 
IYSLRNKDVHVSLKKMLQRRTLL 



15 A search of sequence databases reveals that the NOV16b amino acid sequence has 238 

of 31 1 amino acid residues (76%) identical to, and 274 of 31 1 amino acid residues (88%) 
similar to, the 3 14 amino acid residue ptnr:SPTREMBL-ACC:Q9EQB8 protein from Mus 
musculus (Mouse) (Odorant Receptor Kl 1) (E = 1 .0e~ 127 ). 

NOV16b is predicted to be expressed in at least the following tissues: Apical microvilli 
20 of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
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artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and EI nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 

5 pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

10 The disclosed NOV16a polypeptide has homology to the amino acid sequences shown 

in the BLASTP data listed.in Table 16E. 



Table 16E. BLAST results for NOV16a 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17472662 |ref |XP_ 

061790.1) 

(XM_061790) 


similar to 
odorant receptor 
K4hll (H. 
sapiens) [Homo 
sapiens] 


593 


265/284 
(93%) 


267/284 
(93%) 


e-121 


gi| 11692519 |gb|AAG3 
9856.1 |AF282271_1 
(AF282271) 


odorant receptor 
Kll [Mus 
musculus] 


314 . 


223/287 
(77%) 


250/287 
(86%) 


e-104 


gi| 11692563 |gb|AAG3 
9878.1 |AF282293_1 
(AF282293) 


odorant receptor 
K4hll [Mus 
musculus] 


307 


232/287 
(80%) 


253/287 
(87%) 


e-102 


gi | 17472672 |ref |XP_ 
061794. 1| 
<XMJ>61794) 


similar to 
odorant receptor 
Kll (H. sapiens) 
[Homo sapiens} 


311 


226/287 
(78%) 


252/287 
(87%) 


e-102 


gi 1 11692527 | gb | AAG3 

9860.1|AF282275_1 

(AF282275) 


odorant receptor 
K15 [Mus 
musculus] 


311 


224/287 
(78%) 


246/287 
(85%) 


e-102 



The homology between these and other sequences, is shown graphically in the 
15 ClustalW analysis shown in Table 16R In the ClustalW alignment of the NOV16 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (Le. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
20 function. 
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Table 16F. ClustalW Analysis of NOV16 

1) Novel N0V16a (SEQ ID NO: 66) 

2) Novel NOVl6b (SEQ ID NO: 68) 

3) gi 1 17472662 | ref |XP_061790 . 1 | (XM_061790) similar to odorant receptor K4hll (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 390) 

4) gi|ll692519|gb|AAG39856.l|AF282271_l (AF282271) odorant receptor Kll [Mus 
muSCUlus] (SEQ ID NO:388) 

5) gi|ll692563|gb|AAG39878.l|AF282293_l (AF282293) odorant receptor K4 hi 1 [Mus 
musculus] (SEQ ID NO: 392) 

6) gi 1 17472672 | ref |XP_061794 . 1 | (XM_061794) similar to odorant receptor Kll (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 93) 

7) gi|ll692527|gb|AAG39860.l|AF282275JL (AF282275) odorant receptor K15 [Mus 
musculus] (SEQ ID NO: 389) 



N0V16a 


1 


N0V16b 


1 


gi 


17472662 


1 


gi 


11692519 


1 


gi 


11692563 


1 


gi 


17472672 


1 


gi 


11692527 


1 



10 



20 



30 
■|.- 



40 



50 



60 



MVTCGNHSTVTEFNLAGLTDKPELQLPLPLIiPLGIYvVTW 



1 
1 

60 

1 

1 

1 

1 



NOV16a 
NO VI 6b 
gi | 17472662 
gi | 11692519 
gi | 11692563 
gi | 17472672 
gi | 11692527 



1 
1 

61 

1 

1 

1 

1 



70 



80 



90 



100 



|....|....|....| 



110 



120 
|....| 



1 

x 

HFLSSLSFIDLCQSSVT TPKMLVNFVSERNI I SYPACMTQLYFFLVLVI SECHMLAAMAY 120 

-- - 1 

- — 1 

1 

- - - - 1 



130 



140 



150 



160 



170 



N0V16a 


1 


NOV16b 


1 


gi 


17472662 


121 


gi 


11692519 


1 


gi 


11692563 


1 


gi 


17472672 


1 


gi 


11692527 


1 



180 



1 

1 

18 

1 

1 

1 

1 



N0V16a 

N0V16b 

gi 1 17472662 | 

gi 1 11692519 J 

gi | 11692563 | 

gij 17472672| 

gij 11692527 j 



190 



200 



210 



220 



230 



240 
|....| 



1 - 1 

181 ASILCIRSTEGRSKTFSTCSSHISAVSVTFGGTSRSRFQVI^^ 240 

1 - r 



1 
1 
1 
1 



N0V16a 


1 


NOV16b 


1 


NOV16b 


1 


gi 


17472662 


241 


gi 


11692519 


1 


gi 


11692563 


1 


gi 


17472672 


1 


gi 


11692527 


1 



250 260 270 

..|....|....|....|....|.. 



280 



290 



241 ETQPPVQSLFSGHRNIiAPSARAMEKKNVQPWTLAERMETVDKI 



300 
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310 



320 



330 



340 



350 



360 



10 



15 



20 



25 



30 



35 



40 



45 



50 



NOVl6a 
NOV16b 



NOVl6a 
NOVl6b 



gi 
gi 
gi 
gi 
gi 



17472662 
11692519 
11692563 
17472672 
11692527 



NOVl6a 
NOVl6b 



I 



IS 
18 



gi 


17472662 


301 


gi 


11692519 


21 


gi 


11692563 


18 


gi 


17472672 


18 


gi 


11692527 


18 


NOVl6a 


78 


NOVl6b 


78 


gi 


17472662 


361 


gi 


11692519 


81 


gi 


11692563 


78 


gi 


17472672 


78 


gi 


11692527 


78 



S EffiP E LQLJ^IjFLL FLG I YWTWGNLgM I TL I GhS S HLHT PM Y YFLSfiLS F IDHCH S T Jjl I 
SEfepELQLPLFLLFLGIYVVTVVGNLGMaTLliJiLSSHLHTPMYYFLSSLSFID|CHSTVI 
SEVpELQL[3LFLLFLGlffvVTWGNLGMITLIGLSSHLHTPMYYFLSSLSFIDBcHSTVI 

se^pelqlplf^f^gi^Stv^gnlgmiRliglsshlhtpmyyflsslsfidJSc^stvP 

I^PELQLPLFLLFLGIYj^vgG^ 

i^elqlplflEflgiywtvBgnlgmi^ 

St^^^LQj^PLFLLFLGI YVVTVVGISrLGMITLlSliS SHLHTPMYYFL5 SLSFIPSCHSTVI 



77 

77 

360 

80 

77 

77 

77 



370 



380 



390 



400 
. . I . . 



410 



420 
-J. 



TPKMLVNFVTEKNIISYPECMTQLYFFLIFAIAECHMLA^YDRYVAICSPLLYNVIMS 
TPKMLTOFVTEKNIISYPECMTQLYFFLjjjj^ 

TPKMLWF2TEKNIISYPECMSQLY2FglFAIAECHMLAAMAYDgYVAICSPLLYNVIMS 
TPKI^WFVTEKNIISYPgcMTQLYFFLIFAIAECSLAAMAYDRYVAICgPLLY^ 
TPKML§^IFjj2E§NI I S YPE CMTQLYFF^FAI AE CHMLAjjjMAYDRYVAI O^PLLY^VuMS 
TPKMLWFVTEKNIISYPECMTQLYFFI^FAIAECHMLAAMAYDSYVAICSPLLYaJlBs 
tpthvit .^r\Tr\^T? TfMT T £v7?cjCMj?QL YFFLI FAI AJSCHMLA AMAYDRYVAIQ^PLLYlTVffilS 



137 
137 
420 
140 
137 
137 
137 



450 



460 



470 



480 



gi 


17472662 


481 


gi 


11692519 


201 


gi 


11692563 


198 


gi 


17472672 


198 


gi 


11692527 


198 


NOV16a 


258 


NOVlSb 


258 


gi 


17472662 


541 


gi 


11692519 


261 


gi 


11692563 


258 


gi 


17472672 


258 


gi 


11692527 


258 




550 



560 



570 



I. 



580 
..|.. 



590 



MYLQPSSVSSMDQGKVSSVFYTIWPMLNP 
MYLQPSSSsSMDQGKVSSVFYTlRvPMLNPLIYSLRNKDVjj 1 

mylqpssvssmdqSkvssvfyt(Jjvpmlnpliyslrnkd\ 
mylqps svs smdqgkvs svfyti wpmlnpli yslrnkd 
]jylqpssvssmjsqgkvssvfytj5wpmlnpliyslrnkd 
mylqps svs smdqgkvs svfyti wpmlnpl i yslrnkdvj 
mylqpssvssmdqgkvssvfytiwpmlnpliyslrnxdvi 




291 

iSRTLIi 311 
AC- 593 
iE&KTFM 314 
307 
ITFL 311 
iFM 311 



Table 16G lists the domain description from DOMAIN analysis results against 
NOV16. This indicates that the NOV16 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 16G Domain Analysis of NOV16 

gnl |Pf am|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 98.8% aligned 

Score = 85.9 bits (211), Expect = 3e-l8 





NOV18 : 


41 


5 


Sbjct: 


1 




NOV18: 


101 




Sbjct: 


61 


10 


NO VI 8: 


161 




Sbjct: 


121 


15 


NOV18 : 


216 




Sbjct: 


176 




NOV18 : 


272 


20 


Sbjct: 


236 



GNLSMITLIGLSSHLHTPrTlTXTLSGLSFIDICHSTIITPKMLVNFVTEKNIISyPECMTQ 100 

III +1 +1 + I II II 1+ |+ +11 I + I 

GNLLVILVILRTKKLRTPTNIFLLNIAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

LYFFLIFAIAECHMLAWAYDRWAICSPLLYNVIMSYHHC^mTVGVYILGILGSTIHT 160 

I++ I +1 + 111*1-1 II I I + I + |+*| +| I 

GALFWNGYAS ILLLTAI S IDRYLAIVHPLRYRRIRTPRRAKVLILLVWVIiALLLSLPPL 12 0 

GFMLRIiFLCKTNVINHYFCDLFPLLG LSCSSTYINELLVLVLSAFNILTPALTIL 215 

I + + I + II ++ III+++ II 
LFSWLRTVEEGNTTVCLIDFPEES VKRSYVLLSTLVGFVLPLLVI LVCYTRIL RT 175 

ASYIFIIASILRIRSTEGRSKAFSTCSSHILAVAVFFGSAAFMYL QPSSVSSMDQG 271 

1+ 11+ I I ++ | + +| | * 

LRKRARSQRSLKRRSS SERKAAKMLLVVVVVFVLCWLPYHIVLLLDSLCLLS IWRVLPTA 235 

KVSSVFYTIWPMLNP 287 

+ +++ I III 
LLITLWLAYVNSCLNP 251 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
25 and are likely to underlie the recognition and G-protein-mediated transduction of various 

signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

30 Olfactory receptors (ORs) have been identified as extremely large subfamily of G 

protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 

35 genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 16 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 16A or a 
40 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 16A while still encoding a 

144 



WO 02/068649 



PCT/US02/02785 



protein that maintains its G-Protein Coupled Receptor -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
5 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

1 0 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 1 8 percent of the bases may be so changed. 

The disclosed NOV16 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 16B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 

1 5 residue shown in Table 16B while still encoding a protein that maintains its G-Protein 

Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 23 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

20 The above disclosed information suggests that this G-Protein Coupled Receptor -like 

protein (NOV16) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 16 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 

25 protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV16 nucleic acids and proteins of the invention are useful in potential 

30 therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
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starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
5 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Osteodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

10 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 16 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV16 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

15 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV 16 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

20 disorders. 

NOV17 

NOV 1 7 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV17a, NOV17b, NOV17c, and 
NOV17d. 
25 NOV17a 

A disclosed NOV17a nucleic acid of 962 nucleotides (also referred to as CG56657-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 18-20 and 
ending with a TAG codon at nucleotides 954-956. The start and stop codons are shown in 
30 bold in Table 1 7A, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 17A. NO VI 7a nucleotide sequence (SEQ ID NO:69). 

GATCGTATGAATGCCCC ATGGAAAATTACAATCAAACGTCAACTGATTTCATCTTATTGGGGCTGTTCCCAP 
CATCAAAAATTGGCCTTTTCCTCTTCATTCTCT 

CCATGATTCTTCTCAT CTT CTTGGACACCCATCTCCACACACCCATGTATTTCCTGCTTAGTCAGCTCTCCC 
TCATTGACCTAAATTACAT CTCTACGATTGTTC CTAAGATGGCTTCTGATTTTCTGTATGGAAACAAGTCTA 
TCTCCTTCATTGGGTGTGGGATTCAGAGTTTCTTCTTCATGACTTTTGCAGGTGCAGAAGCGCTGCTCCTGA 
CATCAATGGCCTATGATCGTTATGTGGCC^TTTGCTTTCCTCTCCACTATCCCATCCGTATGAGCAAAAGAA 
TGTATGTGCTGATGATAACAGGATCTTGGATGATAGGCTCCATCAACTCT 

TCCGTATCCCATATTGCAAGTCCAGAGCCATCAATCATTTTTTCTGTGATGTTCC7VGCTATGTTGACATTAG 
CCTGTACAGACACCTGGGTCTATGAGTACACAGTGTTTTTGAGCAGCACCATCTTTCTTGTGTTTCCCTTCA 
CTGGCATTGCGTGTTCCTATGGCTGGGTTCTCCTTGCTGTCTACCGCATGCACTCTGCAGAAGGGAGGAAAA 
AGGCCTATTCGACCTGCAGCACCCACCTCACTGTAGTAACTTTCTACTATGCACCCTTTGCXTATACCTATC 
TATGTCCAAGATCCCTGCGATCTCTGACAGAGGACAAGGTTCTGGCTGTTTTCTACACCATCCTCACCCCAA 
TGCTCAACCCCATCATCTACAGCCTGAGAAAO^GGAGGTGATGGGGGCCCTGACACGAGTGATTCAGAATA 
TCTTCTCGGTGAAAATGTAGACATAC 



The disclosed NOV 17a polypeptide (SEQ ID NO:70) encoded by SEQ ID NO:69 has 
3 12 amino acid residues and is presented in Table 17B using the one-letter amino acid code. 



Table 17B. Encoded NO VI 7a protein sequence (SEQ ID NO:70). 

MENYNQTSTDFILLGLFPPSKIGLFLFILFVLIFLMALIGNLSMILLIFLDTHLHTPMYFLLSQ 

LS L I DLNY 1ST I VPKMASDFL YGNKS ISFIGCGIQS FFFMT FAGAE ALLLTSMAYDR YVAI CFP 

LHYPIRMSKRMYVLMITGSWMIGSINSCAHTVYAFRIPYCKSRAINHFFOT 

VYEYTVFLS STI FLVFP FTGI ACS YGWVLLAVYRMHS AEGRKKAYSTCS THLTWT FYYAP FAY 

TYLCPRSXiRS LTEDKVLAVF YTI LTPMLNPI I YSLRNKEVMGALTRVIQNIFS VKM 



A search of sequence databases reveals that the NOV 17a amino acid sequence has 148 
of 305 amino acid residues (48%) identical to, and 192 of 305 amino acid residues (62%) 
similar to, the 316 amino acid residue ptnr: TREMBLNEW-ACC: AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 OLFACTORY RECEPTOR) (E = 8.0e" 73 ). 

10 NOV17b 

A disclosed NOV17b nucleic acid of 962 nucleotides (also referred to as CG56657-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 18-20 and 
ending with a TAG codon at nucleotides 954-956. The start and stop codons are shown in 

1 5 bold in Table 17C, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 17C. NOV17b nucleotide sequence (SEQ ID NO:71). 

GATCGT ATGAATGCCCCA TGGAAAATTACAATCAAACGTCAACTGATTTCATCT TATTGGGG CTGTTCCCAC 
CAT CAAAAATTG GC C TTT T CCT CTT CAT T CTCTTTGTT CTC ATTT TC CT AATGG CT CTAATTGGAAACCT AT 
CCATGATTCTTCTC^TCTTCTTGGACACCCATCTCCACIACACCCATGTATTTCCTGCTTAGTCAGCTCTCCC 
TCATTGACCTAAATTACATCTCTACGATTGTTCCTAAGATGGCTTCTGATTTTCTGTATGGAAACAAGTCTA 
TCTCCTTCATTGGGTGTGGGATTCAGAGTTTCTTCTTCATGACTTTTGCAGGTGCAGAAGCGCTGCTCCTGA 
CAT CAATGGCCTATGATCGTTATGTGGCCATTTGC TTTCCTCTCCGCTATCCCATCCGTATGAGCAAAAGAA 
TGTATGTGCTGATGATAACJ^ATCTTGGATGATAGGCTCCATC^CrCTTGTGCTCACACAGTATATGCAT 
TCCGTATCCCATATTGCAAGTCCAGAG CCATCAATCATTTTTT CTGTGATGTTCCAG CTATGTTGACATTAG 

CTGGCATTGCGTGTTCCTATGGCTGGGTTCTCCTTGCTGTCTACCG CATGCACTCTG CAGAAGGGAGGAAAA 
AGGCCTATTCGACCTGCAGCACCCACCTCACTGTAGTAACTTTCTACTATGCACCCT 

TATGTCCAAGATCCCTGCGATCTCTGACAGAGGACAAGGTTCTGGCTGTTTTCTACACCATCCTCACCCC^ 
TGCTCAACCCCATCATCTACAGCCTGAGAAACAAGGAGGTGATGGGGGCCCTGACACGAGTGATTCAGAATA 
TCTTCTCGGTGAAAATGTAGACATAC 



In a search of public sequence databases, the NOV 17b nucleic acid sequence, localized 
to chromosome 4, has321 of 342 bases (93%) identical to a gb:GENBANK- 
ID:HSHTPRH07|acc:X64978.1 mRNA from Homo sapiens (Hsapiens mRNA HTPCRH07 
5 for olfactory receptor) (E = 2.9c -62 ). 

The disclosed NOV17b polypeptide (SEQ ID NO:72) encoded by SEQ ID NO:71 has 
311 amino acid residues and is presented in Table 1 7D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 17b has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.4600. Alternatively, 
1 0 NOV 1 7b may also localize to the microbody (peroxisome) with a certainty of 0.23 1 1 , the 
endoplasmic reticulum (membrane) with a certainty of 0.1 000, or in the endoplasmic 
reticulum (lumen) with a certainty of 0.1000. The most likely cleavage site for NOV17B is 
between positions 43 and 44: NLS-MI. 



Table 17D. Encoded NOV17b protein sequence (SEQ ID NO:72). 

MBNYNQTSTDFILLGIJPPSKIGLFLF 
ISTIVPKMASDFLYGITCCSISFIGCXJIQSFFFMTFAGAEA 
TGSWMIGSINSCAHTVYAFRIPYCKSRAINHFFCDV^ 
YGWVLIAVYRMHSAEGRKKAYSTCSTHLTWTFYYAP 
YSLRNKEVMGAIiTRVIQNI FSVKM 



A search of sequence databases reveals that the NOV17b amino acid sequence has 148 
of 305 amino acid residues (48%) identical to, and 191 of 305 amino acid residues (62%) 
similar to, the 316 amino acid residue ptnnTREMBLNE W-ACC : AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 Olfactory Receptor) (E = 8.0e" 73 ). 
20 NOV 17b is predicted to be expressed in at least the following tissues: Apical microvilli 

of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
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tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 

5 MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 

1 0 sources, Public EST sources, Literature sources, and/or RACE sources. 
NOV17c 

A disclosed NOV17c nucleic acid of 883 nucleotides (also referred to as CG56659-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17E. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 44-46 and 
15 ending with a TAG codon at nucleotides 875-877. The start and stop codons are shown in 
bold in Table 17E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 17E. NOV17c nucleotide sequence (SEQ ID NO:73). 

AATTCGCCTTO 

TCTTCTCATCTTTTTGGACATCCATCTC^ 

CCTAAATTACATCTCCACCATTGTT CCAAAGATGGTTTATGATTTT CTGTATGGAAACAAGTCTATCTCCTT 

CACTGGATGTGGGATTCAGAGTTTCTTCTTCTTG 

GGCCTATGATCGTTATGTGGCCATTTGCT^ 

G ATGAT GAT AACAGG AT CTTGG ATG AT AAGCT CT AT CAACT CTTGTGCTCACACAGT AT ATGCACT CTGT AT 
CCC ATATTGCAAGT CCAGAGCCATCAATCATTTTTTCTGTGA CCTGCAC 
AGACACTTGGGTCTATGAGAGCACAGTGTTTTTGA 

TGCATGTTCCTATGGCCGGGTTCTCCTTGCTGTCTACCGCATGCACTCTGCAGAAGGGAGGAAGAAGGCCTA 
TTCAACCTGTAGCACCCACCTCACTGTAGT^^ 

AAGATCCCTGCGATCTCCAACAGAGGACAAGATTCTGGCTGTTTTCTACACCATCCTCACCCCAATGCTCAA 

CCCCATCATCTACAGCCTGASAAAC^ 

AGTGAAAATGTAGACATAC 



The disclosed NOV17c polypeptide (SEQ ID NO:74) encoded by SEQ ID NO:73 has 
20 277 amino acid residues and is presented in Table 17F using the one-letter amino acid code. 



Table 17F. Encoded NOV17c protein sequence (SEQ ID NO:74). 

MALIGNLSMILLIFLDIHLHTPMYFLLSQLSLIDLNyiSTIVPKMVYDPLYGNKSISFTGCGIQ 
SFFFLTLAVAfiGLLLTSMAYDRYVAICFPLHYPIRI^ 

CI P YCKS RAINHFFCDVPAMLTLACTDTWVYESTVFLS STI FLVLPFTGIACSYGRVLLAVYRM 
HSAEGRKKAYSTCSTHLTWSFYYAPFAYTYVRPRSLRSPTED^^ 

GALTQVI QKI FS VKM • ' , 
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A search of sequence databases reveals that the NOV17c amino acid sequence has 139 
of 272 amino acid residues (51%) identical to, and 181 of 272 amino acid residues (66%) 
similar to, the 3 16 amino acid residue ptnr: TREMBLNEW-ACC: AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 OLFACTORY RECEPTOR) (E « 4.0e~ 71 ). 
5 NOV17d 

A disclosed NOV1 7d nucleic acid of 926 nucleotides (also referred to as CG56659 _02) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17G. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 87-89 and 
ending with a TAG codon at nucleotides 918-920, The start and stop codons are shown in 
10 bold in Table 17G, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 17G. NOV17d nucleotide sequence (SEQ ID NO:75). 

CATCAACTGATTTCATCTTATTGGGGCTGTTCCCACAATCA^GAATTGGCCTTTTCGTATTCACCCTCATTT 
TTCTCATTTTCCTAATGGCTCTAATTGGAAATCTATC 

ACACACCTATGTATTTCCTACTTAGTCAGCTCTCCCTCATTGACCTAAATTACATCTCCACCAT^ 

AGATGGTTTATGATTTTCTGTATGGAAACAAGTCTATCT 

TCTTGACTTTAGCAGTTGCAGAAGGGCTGCTCCTGACA 

TTCCTCTCCACTATCCCATCCGTATAAGCAAAAGAGTGTGTGTGATGATGATAACAGGATCTTGGATGATAA 
G CT CT ATCAACT CTTGTGCTCACACAGTATATGCACTCTGTATCCCATATTGCAAGT CC AGAG CCAT CAATC 
ATTTTTTCTGTGATGTTCCAGCTATGTTGACGCTAGCCTGCACAGAC^CrTGGGTCTATGAGAGCACAGT^ 
T TTTGAGC AGCAC CAT CTTT CT TGTGCTTC CTTT C ACT GGT ATTGCATGTT CCT ATGG C CGGGTT CT C CTTG 
CTGTCTACCGCATGCACTCTGCAGAAGGGAGGA^ 
TGTCCTTCTACTATGCACCCTTTGCTTATACCTATCT^ 
AGATTCTGGCTGTTTTCTACACCATCCTCACCCCAATGCTCAACCCC^ 
AGGTGATGGGGGTCCTGACACAAGTGATTCAGAAAATCTTCTCAGTGAAAATGTAGACATAC 



In a search of public sequence databases, the NOV17d nucleic acid sequence has343 of 
343 bases (100%) identical to a gb:GENBANK-ID:HSHTPRH07|acc:X64978.1 mRNA from 
15 Homo sapiens (H.sapiens mRNA HTPCRH07 for olfactory receptor) (E = 5.4e" 71 ). 

The disclosed NOV17D polypeptide (SEQ ID NO:76) encoded by SEQ ID NO:75 has 
277 amino acid residues and is presented in Table 17H using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV17d has no signal peptide and is 
likely to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
20 Alternatively, NOV 1 7d may also localize to the plasma membrane with a certainty of 0.6400, 
the Golgi body with a certainty of 0.4600, or in the endoplasmic reticulum (lumen) with a 
certainty of 0. 1 000. The most likely cleavage site for NO VI 7D is between positions 22 and 
23: HTP-MY. 



Table 17H. Encoded NOV17d protein sequence (SEQ ID NO:76). 

MALIGNLSMIIXIFLDIHLOTPMYFLLSQLS 
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VAEGLLLTSMAYDRYVAICFPLHYPI RI S KRVCVMMITGSWMI SS INSCAHTVYALCI PYCKSRAINHFFCD 
VPAMLTLACTDTWVYESTVFLSSTIFLVLPFTGIACSYGRVLI^VyRMHSAEGRKKAYSTCSTHLTW 
APFAYTYVRPRSLRSPTEDKILAVFYTILTPMLNPIIYSIiRNKEVMGVLTQVIQKIFSVKM 



A search of sequence databases reveals that the NOV17d amino acid sequence has 138 
of 269 amino acid residues (51%) identical to, and 183 of 269 amino acid residues (68%) 
similar to, the 316 amino acid residue ptnr:SPTREMBL-ACC:Q9D3U9 protein from Mus 
5 musculus (Mouse) (4933433E02rik Protein) (E = 3.9e" 71 ). 

NOV17d is predicted to be expressed in at least the following tissues: Apical microvilli 
of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
10 artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 

hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and HI nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
15 artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 

thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

The disclosed NOV 17a polypeptide has homology to the amino acid sequences shown 
20 in the BLASTP data listed in Table 1 71. 



Table 171. BLAST results for NOV17a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


' Identity 
(%) 


Positives 
(%) 


Expect 


>gi | 17445356 |ref|XP 
_060561.ll 
(XMJ>60561) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(0R1-25) (H. 
sapiens) [Homo 
sapiens] 


312 


312/312 
(100%) 


312/312 
(100%) 


e-149 


gi | 17445348 |ref|XP 
060559. 1| 
(XM 060559) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(0R1-25) (H. 
sapiens) [Homo 
sapiens] 


533 


199/233 
(85%) 


206/233 
(88%) 


le-95 
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gi | 17437047 |ref |XP_ 

0G0312.1) 

(XM_060312) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(OR1-25) (H. 
sapiens) [Homo 
sapiens] 


472 


149/299 
(49%) 


211/299 
(69%) 


5e-78 


gi | 17437056 |ref |XP_ 
060314. l| 
(XM_060314) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(OR1-25) (H. 
sapiens ) {Homo 
sapiens] 


695 


155/295 
(52%) 


209/295 
(70%) 


le-74 


gi| 17456595 |ref |XP_ 
065 073. l| 
(XM_065073) 


similar to 
olfactory 
receptor (H. 
sapiens) [Homo 
sapiens] 


638 


138/296 
(46%) 


193/296 
(64%) 


le-73 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 17J. In the ClustalW alignment of the NOV 17 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
.5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 17 J. ClustalW Analysis of NOV17 



1) Novel NOV17a (SEQ ID NO: 70) 

2) Novel NOV17b (SEQ ID NO: 72) 
2) Novel N0V17c (SEQ ID NO: 74) 

2) Novel NOV17d (SEQ ID NO: 76) 

3) gi | 17445356 | ref |XP_06056l . 1 | (XM_060561) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (OR1-25) (H. sapiens) [Homo sapiens) (SEQ ID NO:394) 

4) gi 1 17445348 | ref |XP_060559 . 1 1 (XM_060559) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (OR1-25) . (H. sapiens) [Homo sapiens] (SEQ ID NO:395) 

5) gi 1 17437047 | ref |XP_060312.l| (XM_060312) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (OR1-25) (H. sapiens) [Homo sapiens] (SEQ ID NO:396) 

6) gi 1 17437056 | ref |XP_060314 . 1 1 (XM_060314) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (OR1-25) (H. sapiens) [Homo sapiens] (SEQ ID NO:397) 

7) gi | 17456595 | ref |XP_065073 . 1 | (XM_065073) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 98) 
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NOV17a 


NOV17b 


NOV17C 


NOV17d 


gi 


17445356 


gi 


17445348 


gi 


17437047 


gi 


17437056 


gi 


17456595 
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NOVX7a 


1 


NOV17b 


1 


NOV17c 


1 


NOV17d 


1 


gi 


17445356 


1 


gi 


17445348 


1 


gi 


17437047 


43 


gi 


17437056 


61 


gi 


17456595 


1 
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.|....| 



100 

..I.. 



110 



120 



- - QTDWSRREI ISEDKMFRTTTAGFQAESGVAG CTGPDVTLMWLRLD - LEGFMR 95 

HTPMYFFI SQLALMDLMYLCVTVPKMLVGQVTGDDT I S PSGCGI QMFFYLTLAGAEVFLL 120 
1 
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NOV17a 
NOV17b 
NOV17C 
NOV17d 



gi 
gi 
gi 
gi 
gi 



17445356 
17445348 
17437047 
17437056 
17456595 



NOV17a 
NOV17b 
NOV17C 
NOV17d 



gi 


17445356 


1 


gi 


17445348 


1 


gi 


17437047 


146 


gi 


17437056 


181 


gi 


17456595 


1 


NOV17a 


1 


NOV17b 


1 


NOV17C 


1 


N0V17d 


1 


gi 


17445356 


1 


gi 


17445348 


1 


gi 


17437047 


147 


gi 


17437056 


241 


gi 


17456595 


1 



140 
..|.. 



150 
..|.. 



160 



170 
..|.. 



180 
■•I 



1 _ 

1 r 1 

96 QQGDRGKVRGTTRPLAWKLHPDG- -TLRSVTSTADLSHLDRVLLPP SWSLCLP 146 

121 AAMAYDRYAAVCRPLHYPLLMNQRVCQLLVSACWVLGMVDGLLLTPITMSFPFCQSRKIL 180 

1 — - 1 



190 
|....|.. 



200 
..|....|. 



210 



220 



23 0 



I I 



240 



_ - l 

" 1 

± 

" 1 

x 

" 1 

" — 146 

181 SFFCETPALLKLS CSDVSLYKTXMYLCCI LMLLAP IMVI SS S YTLILHLI HRMNS AAGHR 24 0 
1 



250 



260 



270 
..|.. 



280 



290 



300 



1 
1 
1 
1 
1 
1 

150 



310 



N0V17a 

N0V17b 

N0V17C 

NOV17d 

gi | 1744535 6 | 

gi 1 17445348 I 

gi 1 17437047 | 

gi j 17437056 j 

gi 1 17456595 j 




NOV17a 


48 


NOV17b 


48 


N0V17C 


13 


N0V17d 


13 


gi 


17445356| 


48 


gi 


17445348] 


49 


gi 


17437047 | 


- 202 



370 
I 1 



380 

■ J.. 



390 
. . I . . 



400 



410 



IyLDjgJLHTPiMYFLLSQLSLM DL|YI 

iBldB'LHTPmyfllsqlslB dlSyi 
iSldBSlhtpmyfllsqlslS dlSyi 
iSldWilhtpmyfllsqlslS dlSyi 
iBld^ilhtpmyfllsqlslS dlSyi . 

ISLSSLHTPMYFLLSnT.snffinT .iffiYT 



I^jSgLHTPMYFLLSQLS 

i^dSSlktpmyfllsols 



LSQLSjnp: 
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g± 1 17437056 | 
gi 1 1 17456595 j 



NOV17a 
NOV17b 
NOV17C 
NOV17d 



NOV17a 
NOV17b 
NOV17C 
NOV17d 



NOV17a 

NOV17b 

NO VI 7 c 

NOV17d 

gi 1 17445356 | 

gi j 17445348 | 

gi j 17437047 | 

gij 17437056 | 

gi j 17456595 j 



NOV17a 
N0V17b 
NOV17c 
NOV17d 
gi | 17445356 | 



82 
82 
47 
47 
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17445356 


82 


gi 


17445348 
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gi 


17437047 
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gi 


17437056 
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gi 
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83 
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47 


gi 
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gi 


17445348 
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gi 


17437047 
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gi 


17437056 
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gi 
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82 
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47 
47 



gi 


17445356 
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gi 


17445348 
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gi 
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gi 
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82 
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17445348 
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17437047 
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gi 


17437056 
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17456595 


83 
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93 
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236 

395 
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gi 1 17445348 
gi 1 17437047 
gi 1 17437056 
gi j 17456595 



NOV17a 
HOV17b 
HOV17C 
HOV17d 



NOV17a 
HOV17b 
NOV17c 




NOV17a 


248 


NOVl7b 


248 


NOV17c 


213 


N0V17d 


213 


gi 


17445356 


248 


gi 


17445348 


469 


gi 


17437047 


402 


gi 


17437056 


561 


gi 


17456595 


249 



298 
298 
263 
263 



gi 


17445356 


298 


gi 


17445348 


519 


gi 


17437047 


452 


gi 


17437056 


611 


gi 


17456595 


309 


NOV17a 


312 


NOV17b 


312 


NOV17C 


277 


NOV17d 


277 


gi 


17445356 


312 


gi 


17445348 


533 


gi 


17437047 


472 


gi 


17437056 


652 


gi 


17456595 


369 



NOV17a 


312 


NOV17b 


312 


NOV17C 


277 


NOV17d 


277 


gi 


17445356 


312 


gi| 17445348 


533 


gij 17437047 


472 


gi 


17437056 
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gi| 17456595 
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N0V17a 


312 


N0V17b 


312 


NOV17C 


277 
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gi 
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17445348 
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gi 
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gi 
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695 


gi 
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I — | — I — l^_-^-_L^ — 1 — I . — I 
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312 
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|FA CCSSAQKVMSDA 472 

!RM--EWKTLPFQALQVRCVKWRRs!vIjVSSFIA TER 651 

iTCSGRCLPGESHVSLI SLVEPPAVE^v|GASVKGCPRTWCLPREQ 368 
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- - 312 
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_ -- 312 
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TLADTSHSS SHAEFPERGVR - MNCSK^FSLVEEPVTSLGDLFNFR 695 

VLWTCPDSGTSLESKQPHQEGLSDMHLSNTICTLVSELNQFWAyPIQHDLPKEVLLTPAP 428 

970 980 990 1000 1010 1020 

....|....|....|....|-...|....|....|....|....|....|....|....| 
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312 
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-- 312 
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" - -- 695 
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NOV17d 
gi | 17445356 | 
gi | 17445348 | 
gi | 17437047 | 
gi | 17437056 | 
gi | 17456595 | 



NOV17a 
NOV17b 
NOV17C 
NOV17d 



gi 
gi 
gi 
gi 
gi 



17445356 
17445348 
17437047 
17437056 
17456595 



277 277 

312 - — r — - 312 

533 ~ - 533 

472 -- 472 

695 — " ~ 695 

549 SIRRQREFMPEEKKDTVYWEKRRKNNEAAKRSREKRRLNDAAI EGRLAALMEENALLKGE 608 

1150 1160 1170 

....|....|....|....|....|....| 

312 — 312 

312 - 312 

277 - 277 

277 277 

312 - 312 

533 533 

472 472 

695 - - 695 

609 LKALKLRFGLLPLTGSAI GSPLDWGPPAWG 638 
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Table 17F lists the domain description from DOMAIN analysis results against NOV17. 
This indicates that the NOV17 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 17F Domain Analysis of NOV17 

gnl|Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 

Score = 99.4 bits (246), Expect = 3e-22 



50 



NOV17: 40 GNLSMILLIFLDTHLHTPMYFLLSQLSL 99 

Ml +||+| III I I++ II ++ 1+ I +1+ I + 

Sbjct: 1 GNLLVI LVI LRTKKLRTPTMI FLLNLAVADl^FLLTLPPWALYYLVGGDWVFGDALCKLV 60 

NOV17: 100 SFFFMTFAGAEALLLTSMAYDRYVAI CFPLHYP IRMSKRMYVLMITGSWMIGSINSCAHT 159 

1+ I 1111+++ 1 11+11 Ml +1 ++I I++ + I 

Sbjct: 61 GALFWNGYASILLLTAI S IDRYLAI VHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPL 120 

NOV17: 160 VYAFRI PYCKSRAINHFFCDVPAMLTLACTDTWVYEYTVFLSSTI FIjVFPFTGI ACS YGW 219 

+ + + I I 11+ + I I I I 

Sbjct: 121 LF SWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTR 171 

NOV17: 220 VLLAV YRMHSAEGRKKAYSTCSTHLTWTFYY AP FAYTYLCPRS LRS 266 

*| + + |+ || | + I + + I 

Sbjct: 172 I LRTLRKRARSQRS LKRRS SS ERKAAKMLLWVWFVLCWLP YH I VLLLDSLCLLS I WRV 231 

NOV17: 267 LTEDKVLAVFYTILTPMLNPI IY 289 

I ++ ++ + I I I I I I 
Sbjct: 232 LPTALLITLWLAYVNSCLNPIIY 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
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respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
5 protein-coupled receptors in a number of species. These receptors share a seven 

transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
1 0 displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 17 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 17A, 
17C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

15 whose bases may be changed from the corresponding base shown in Table 17A or 17C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 

20 invention additionally includes nucleic acids or nucleic acid fragments, or complements 

thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 

25 antisense binding nucleic acids in therapeutic applications in a subject In the mutant or 
variant nucleic acids, and their complements, up to about 7 percent of the bases may be so 
changed. 

The disclosed NOV17 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 17B or 17D. The invention also 
30 includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 17B or 17D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 54 percent of the 
residues may be so changed. 
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The invention further encompasses antibodies and antibody fragments, such as F ab or 
(FabH that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV 17) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
5 NOV1 7 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

10 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 17 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

15 Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 

20 cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 

25 disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 

mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 17 nucleic acids and polypeptides are further useful in the generation of 

30 antibodies that bind immuno-specifically to the novel NOV17 substances for use in 

therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV17 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
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assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV18 

5^ NOV 18 includes three novel G-Protein Coupled Receptor -like proteins disclosed 

below. The disclosed sequences have been named NOV18a and NOV18b. 
NOV18a 

A disclosed NOV 18a nucleic acid of 1062 nucleotides (also referred to as CG56663- 
01) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 18 A. An 
10 open reading frame was identified beginning with an ATG initiation codon at nucleotides 10- 
12 and ending with a TAA codon at nucleotides 948-950. The start and stop codons are 
shown in bold in Table 18A, and the 5' and 3 5 untranslated regions, if any, are underlined. 



Table 18A. NOV18a nucleotide sequence (SEQ ID NO:77). 

TAGAGATGGA TGGAACCAATGGCAGCACCCAAACCCATTTCATCCT^ 

TGGAGAGGATCCTCTTTGTGGTCATCCTGATCGCGTACCTCCTGACCCTCGTAGGCAACACCACCATCATCC 
TGGTGTCCCGGCTGGACCCCCACCTCCACACCCCCATGTACTTCTTCCTCGCCCACCTTTCCTTCCTGGACC 
T CAGTTT CAC CAC CAG CT C CAT CC C CCAGCTGCTCTACAACCTTAATGG ATGTG ACAAG ACCAT CAG CTACA 

CCTATGACCGGTGTGTGGCTATCTGCAAGCCCCTGCACTACATGGTGATCATGAACCCCAGGCTCTGCCGGG 

CCCGCTGTGGGCACCACGAGGTGGACCACTTCCTGCGTGAGATGCCCGCCCTGATCCGGATGGCCTGCGTCA 
G CACTGTGGCCAT CGAAGGCACCGTCTTTGTCCTGAAAAAAGGTGTTGTGCTGTCCCCCTTGGTGTTTATCC 
TGCT CT CTTACAGCT AC ATTGTGAGGG C TGTGTT ACAAATTCGGT CAGC ATCAGGAAGG CAGAAG GCCTT CG 
GCACCTGCGGCTCCCATCTCACTGTGGTCTCCCTTTTCTATGGAAACATCATCTACATG 

CTCTCATCTACACCCTCAGAAACAGAGAGGTGAAGGGGGCACTGGGAAGGTTGCTTCTGGGGAAGAGAGAGC 
TAGGAAAGGAGTAAAGGCATCTCCACCTGACTTGACTTCCATCCAGGGCCACTGGCAGCATCT^ 



1 5 The disclosed NOV 18a polypeptide (SEQ ID NO:78) encoded by SEQ ID NO:77 has 

3 14 amino acid residues and is presented in Table 18B using the one-letter amino acid code. 



Table 18B. Encoded NOV18a protein sequence (SEQ ID NO:78). 

MDGTNGSTQTHFILLGFSDRPHLERILFWILIAYLLTLVGNTTIIlJVSRLDPHIiHTPMYFFLA 
HLS FLDLS FTTS S I PQLLYNLNGCDKT I S YMGCAI QLFLFLGLGGVE CLLLAVMAYDRCVAI CK 
PLHYMVIMNPRLCRGLVSVTWGCGVANSLAMSPVTLRLPRCGHHEVDHFLREMPALIRMACVST 
VAIEGWFVLKKGVVLSPLVFILIiSYSYIVRAVLQIRSASGRQKAFGTCGSHLTVVSLFYGNII 
YMYMQPGASSSQDQGMFIjMLFYNIVTPLLNPLIYTIiRNREVKGALGRLLLGKRELGKS 



A search of sequence databases reveals that the NOV18a amino acid sequence has 194 
20 of 237 amino acid residues (81%) identical to, and 215 of 237 amino acid residues (90%) 
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similar to, the 237 amino acid residue ptnr: SPTREMBL-ACC:Q9R0G5 protein from 
Marmota marmota (European marmot) (Olfactory Receptor) (E = 3.5e" 102 ). 



NOV18b 

A disclosed NOV18b nucleic acid of 1062 nucleotides (also referred to as CG56663- 
02) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 18C. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 6-8 
and ending with a TAA codon at nucleotides 948-950. The start and stop codons are shown in 
bold in Table 18C, and the 5' and 3* untranslated regions, if any, are underlined. 



Table 18C. NOV18b nucleotide sequence (SEQ ID NO:79). 

TAGAGATGGATGGAACCAATC^CAGCACCG 

TGGAGAGGATCCTCTTTGTGGTCATCCTGATCGOT 

TGGTGTCCCGGCTGGACCCCCACCTCCA^ 

TCAGTTTCACCACCAGCTCCATCCCCCAGCTGCT 

TGGGCTGTGCCATCC^GCTCTTC^ 

CCTATGACCGGTGTGTGGCTATCTGCAAGCCC 

CCCGCTGTGGGCACCACGAGGTGGACCACTTCCTC 

GCACTGTGGCCATCGACGGCACCGTCTTTGTCCTGGCGGTGGGTGTTGTGCTGTCCCCCTTGGTGTTTATCC 
TGCTCTCTTACAGCTACATTGTGAGGGCTGTGTTAC^^ 
GCACCTGCGGCTCCCATCTCACTGTGGTCTCCCTTTTCTATG<^^ 
GAGCCAGTTCTTCCCAGGACCAGGGCATGTTCCT 

CTCTCATCTACACCCTCAGAAACAGAGAGGTGAAGGGGGCACTGGGAAGGTTGCTTTTGGGGAAGAGA 

TAGGAAAGGAGTA AAGGCATCTCCACCTGACTTCACTTCCATCCAGGGCCACT 

GAATTCCAGCTGATATTAGCCCACGACTCCCAACTTGCCTTTTTCTGGACTTTT 



In a search of public sequence databases, the NOV 18b nucleic acid sequence has600 of 
710 bases (84%) identical to a gb:GENBANK-ID:AX008326|acc:AX008326.1 mRNA from 
Marmota marmota (Sequence 24 from Patent W09967282) (E = 8.8e~ 109 ). 

The disclosed NOV18D polypeptide (SEQ ID NO:80) encoded by SEQ ID NO:79 has 
314 amino acid residues and is presented in Table 18D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV1 8b has A signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV1 8b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or in the endoplasmic reticulum (lumen) 
with a certainty of 0.3000. The most likely cleavage site for NOV18b is between positions 42 
and 43: LVG-NT. 



Table 18D. Encoded NOV18b protein sequence (SEQ ID NO:80). 

MDGTNGSTQTHFI LLGFSDRPHLERIIiFWI LI AYLLTLVGNTTI ILVSRLDPHLHTPMYFFLAHLS FLDLS 
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FTTSSIPQLLYNIJSTGCDKTISYMGCM 
VSVTWGCGVANSLAMSPVTLRLPRCGHHEVDHFLR^ 

sysyivravlqirsasgrqkafgtcgshltwslfygniiy^^ 

IYTLRNREVKGALGRLLLGKRELGKE 



A search of sequence databases reveals that the NOV 18b amino acid sequence has 183 
of 305 amino acid residues (60%) identical to, and 237 of 305 amino acid residues (77%) 
similar to, the 320 amino acid residue ptnr:SPTREMBL-ACC:Q9Y3N9 protein from Homo 
5 sapiens (Human) (DJ88J8.1 (Novel 7 Transmembrane Receptor (Rhodopsin Family) 
(Olfactory Receptor Like) Protein) (HS6M1-15))) (E = 2.8e 98 ). 

NOV 18b is predicted to be expressed in at least the following tissues: Apical microvilli 
of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 

10 tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 

1 5 pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

20 The disclosed NOV1 8a polypeptide has homology to the amino acid sequences shown 

in the BLASTP data listed in Table 1 8E. 



Table 18E. BLAST results for NOV18a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
<%> 


Positives 
(%) 


Expect 


gi| 17445344 |ref|XP 

060558.1) 

(XM_060558) 


similar to 
olfactory 
receptor (H. 
sapiens) [Homo 
sapiens] 


314 


314/314 
(100%) 


314/314 
(100%) 


e-164 


gi | 5901478 |gb| AAD55 
304.1|AF044033 1 
(AF044033) 


olfactory 
receptor [Marmota 
marmot a] 


237 


194/237 
(81%) 


215/237 
(89%) 


2e-99 


gi | 13624329 |ref|NP 

112165.1) 

(NM 030903) 


olfactory 
receptor, family 
2, subfamily W, 
member 1 [Homo 
sapiens] 


320 


184/305 
(60%) 


236/305 
(77%) 


le-94 
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gi|l205443l|emb|CAC 
20523. l| (AJ302603) 


olfactory 
. receptor [Homo 
sapiens] 


320 


184/305 
(60%) 


236/305 
(77%) 


le-94 


gi 1 12054429 | emb| CAC 
20522. l| (AJ302S02) 


olfactory 
receptor [Homo 
sapiens] 


320 


184/305 
(60%) 


235/305 
(76%) 


2e-94 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1ST. In the ClustalW alignment of the NOV18 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 Table 18F. ClustalW Analysis of NOV18 

1) Novel NOV18a (SEQ ID NO: 78) 

2) Novel NOVl8b (SEQ ID NO: 80) 

3) gi 1 17445344 | ref |XP_060558 . 1 | (XM_Q60558) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 399) 

15 4) gi| 5901478 |gb|AAD55304.l|AF044033_l (AF044033) olfactory receptor [Marmota 
marmota] (SEQ ID NO: 4 00) 

5) gi | 13624329 | ref |np_112165 . 1 | (NM_030903) olfactory receptor, family 2, subfamily 
W, member 1 [Homo sapiens] (SEQ ID NO:40l) 

6) gi|l205443l|emb|CAC20523.l| (AJ302603) olfactory receptor [Homo sapiens] (SEQ ID 
20 NO:402) 

7) gi|l2054429|erab|CAC20522.l| (AJ302602) olfactory receptor [Homo sapiens] (SEQ ID 
NO: 403) 



25 



30 



35 



40 



45 



50 



NOV18a 


1 


NOV18b 


1 


gi 


17445344 


. 1 


gi 


5901478 | 


1 


gi 


13624329 


1 


gi 


12054431 


1 


gi 


12054429 


1 



NOV18a 


61 


NOV18b 


61 


9i 


17445344) 


61 


gi 


5901478| 


4 


gi 


13624329) 


61 


gi 


12054431) 


61 


gi 


12054429 j 


61 



NOV18a 
NOV18b 



gi 
gi 
gi 
gi 



17445344 | 
5901478) 
13624329) 
12054431 



121 

121 

121 

64 

121 

121 
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0 



5 



0 



5 



0 



gi | 12054429 | 121 



NOV18a 
NOV18b 



gi 
gi 
gi 
gi 
gi 



17445344 
5901478| 
13624329 
12054431 
12054429 



NOV18a 

NOV18b 

gi | 17445344 

gi j 5901478 | 

gi|l3624329 

gi | 12054431 

gi | 12054429 



NOV18a 
NO VI 8b 



gi 
gi 
gi 
gi 
gi 



17445344 | 

5901478) 

13624329) 

12054431] 

12054429) 



190 



200 210 



180 



220 



230 



240 




250 



260 



270 



280 



290 



300 




310 



320 




314 
314 
314 
237 

HFHHKS Tgl KRNCKS 320 
XFHHKSTBlKRNCKS 320 
SIKRNCKS 320 



Tables 18G lists the domain descriptions from DOMAIN analysis results against 
NOV18. This indicates that the NOV1 8 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 18G Domain Analysis of NOV18 

gnl |Pfam|pf amOOOOl, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 

Score = 95.1 bits (235), Expect = 5e-21 



NOV18: 


41 


GNTTI ILVSRLDPHIiHTPMYFFLAHIi5FZiDLSFTrSSIPQLLYNIjNGCDKTI SYMGCAIQ 


100 






II -III III II +1 + II 1 + 1 II 1 1 1 1 + 




Sbjct: 


1 


GNLIiVI LVILRTKKLRTPTNT FLLNLAVADLLFLLTIiPPWALYYLVGGDWVFGDALCKLV 


60 


HOV18: 


101 


LFLFLGIX^VEC^LIAVMAYDRC^^ 


160 






11+ 1 III ++ II +11 III 1 II + 1+ + 1 +11 




Sb j jet : 


61 


GALFWNGYAS ILLLTAI S IDRYLAI VHPLRYRRIRTPRRAKVLILLVWVLALLLSLP- - 


118 


NOV18 : 


161 


PVTLRLPRCGHHEVDHFLREMPALIRMACVSTVAIEGTVFVLKKGVVLSP 


220 






1+ 1 * III 11+ 11+ 1+ 




Sb j Ct : 


119 


PLLFSWLRTVEEGKTTVCLIDFPEESVKRSYVLLSTLVGFVL -PLLVILVCYTR 


171 


NOV18: 


221 




267 






l + l + 1+ ll + l 1+ 1 ^ + 1 + ++ ^ 




Sbjct: 


172 


ILRTLRKi^SQRSLKRRSSSERKAAKr^VVVVVFVLCWLPYra IWRV 


231 


NOV18 : 


268 


QDQGMFLMLFYNIVTPLLNPLIY 290 





+ + I- I IIHI 
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Sbjct: 232 LPTALLITLWLAYVNSCLNPIIY 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
5 seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 

10 olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 

1 5 Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 18 nucleic acid of the invention encoding a G-Protein Coupled 
20 Receptor -like protein includes the nucleic acid whose sequence is provided in Table 18A, 

20C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 18A or 20C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention farther includes 
25 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes.nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
30 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 16 percent of the bases may be so 
changed. 
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The disclosed NOV 18 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 1 8B or 20D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 18B or 20D while still encoding a protein that 

5 maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 40 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as Fab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

10 The above disclosed information suggests that this G-Protein Coupled Receptor -like 

protein (NOV 18) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV18 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 

15 protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 18 nucleic acids and proteins of the invention are useful in potential 

20 therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 

25 starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 

fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

30 Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
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autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington ! s disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 1 8 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 18 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV 18 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
1 0 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV19 

NOV19 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV19a and NOV19b. 
15 NOV19a 

A disclosed NOV 1 9a nucleic acid of 1 046 nucleotides (also referred to as CG56665- 
01) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 19A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 14- 
16 and ending with a TGA codon at nucleotides 1019-1021. The start and stop codons are 
20 shown in bold in Table 19A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 19A. NOV19a nucleotide sequence (SEQ ID NO:81). 

"TCAACATra 

TCAAAGTTTTGGGACAGATTTTCTACTTCTTGG 
CGTCATTGCCACCCTCTTTACAGTTGCTCTGACAGGA 
CAGACTCCACACTCCAATGTACTTTCTGCTCAGTG^^ 
AGTGCCCAAGATGGCAGTCAGCTTCCTCTCAC&GAGTAAGACC^ 

GTATGTGTTCTTGGCCCTTGGTGGAACTGAAGCCCTTCTCCTTGGTTTTATGTCTTATGATCGCTATGTAGC 

TATCTGTCACCCTTTACATTATCCTATGCTTATGAGCAAGAAGATCT 

GGCCAGTGGTTCTATCAATGCTTTCATACATACATTGTATGTC 

CATTAACCACTTTTTCTGTGAAGTTCCAGCTCTA 

TACAGTCCTCCTGAGTGGACTTATTATCTTGCTACTAC 

GCTTATTGTGGTATTCCAGATGAGCTCAGGAAAAGGACAGGCAAAAGCTGTTTCCACTTGTTCCT 

GATTGTGGCAAGCCTGTTCTATGCAACCACTCTCTTTACCTACACAAGGCCACACTCCTTGCGTTCCCCTTC 

ACGGGATAAGGCGGTGGCAGTATTTTACACCATTGTCACACCTCTACTGAACCCATTTATCTACAGCCTGAG 

AAATAAGGAAGTGACGGGGGCAGTGAGGAGACTGTTGGGATATTGGATATC 

ATCTCTGTATTGATTGAGCATTAACAACATAAAAAGCT 



The disclosed NOV19a polypeptide (SEQ ID NO:82) encoded by SEQ ID NO:81 has 
335 amino acid residues and is presented in Table 19B using the one-letter amino acid code. 

25 
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Table 19B. Encoded NOV19a protein sequence (SEQ ID NO:82). 



MNISDVISFDIIiVSAMKTGNQSFGTDFLLVGLFQYGWINSLLFWIATLFTVALTGNIMLIHLI 

RLNTRLHTPMYFLIiSQLS I VDLMY I ST TVPKMAVSFLSQSKTI RFLGCEIQTYVFLALGGTEAL 

LLGFMSYDRWAICHPLHYPMLMSKKICCXMVACAWASGSINAFIHTLYVFQLPFCRSRLINHF 

FCEVPALLSLVCQDTSQYEYTVLLSGLIILLLPFLAILASYARVLIVVFQMSSGKGQAKAVSTC 

SSHLIVASLFYATTLFTYTRPHSLRSPSRDKAVAVFYTIVTPLLNPFIYSLRNKEVTGATORLLGYWIC 

CRKYDFRSLY 



A search of sequence databases reveals that the NOV19a amino acid sequence has 1 55 
of 309 amino acid residues (50%) identical to, and 199 of 309 amino acid residues (64%) 
similar to, the 316 amino acid residue ptnr: TREMBLNEW-ACC:AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 Olfactory Receptor) (E = 9.3e" 79 ). 



10 



NOV19b 

A disclosed NOV19b nucleic acid of 1046 nucleotides (also referred to as CG56665- 
02) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 19C. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 59- 
60 and ending with a TGA codon at nucleotides 1019-1021. The start and stop codons are 
shown in bold in Table 19C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 19C NOV19b nucleotide sequence (SEQ ID NO:83). 



T CAACAT T ATT ACATGAACATTTCAGATGT CAT CT C CTTTG ATATTTTGGT TT CAGCCATGAAAACAGGAAA 

CGT CAT TG C CA CCC T CTTT ACAGTTGCTCTG ACAGGAAATAT CATG CTGAT CCAC CT CATTCGACTGAACAC 

CAGACTCCACACTCCAATGTACTTTCTGCTCAGTCAGCTCTCC^^ 

AGTGCCCAAGATGGCAGTCAGCTTCCTCTCACAGAGTAAGACCATT^^ 

GTATGTGTTCTTGGCCCTTGK3TGGAACTGAAGCCCIT 

TATCTGTCACCCTTTACATTATCCTATGCTTATGAG 

GGCCAGTGGTTCTATCAATGCT'rrCATACATACATTGTATGTGTTTCAGCTTCCATTCTGTAGGTCTCGG 



TACAGTCCTCCTGAGTGGACTTATT ATCTTGCTACTACCATTCCTAGCCATTCTGGCTT CCTATG CTCGTGT 
GCTTATTGTGGTATTCCAGATGAGCTCAGGAAAAGGACAGGCAAAAGCTGTTTCCACTTGTTCCTCCCACCT 
GATTGTGGCAAGCCTGTTCTATGCAACCACTCTCTTTACCTACACAAGGC^ 
ACGGGATAAGGCGGTGGCAGTATTTTACACCATTGTCACACCTCT^ 

AAATAAGGAAGTGACGGGGGCAGTGAGGAGACTGTTGGGATATTGGATATGCTGTAGAAAATATGACTTCAG 
ATCTCTGTATTGATTGAGCATTAACAACATAAAAAGCT 



15 In a search of public sequence databases, the NOV19b nucleic acid sequence has 592 

of 910 bases (65%) identical to a gb:GENBANK-ID:GGCOR4GEN|acc:X94744.1 mRNA 
from Gallus gallus (G.gallus cor4 DNA for olfactory receptor 4) (E = 7.8c" 48 ). 

The disclosed NOV 19b polypeptide (SEQ ID NO:84) encoded by SEQ ID NO:83 has 
320 amino acid residues and is presented in Table 19D using the one- letter amino acid code. 

20 Signal P, Psort and/or Hydropathy results predict that NOV1 9b has A signal peptide and is 

likely to be localized to the plasma membrane with a certainty of 0.4600. Alternatively, 
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NOV 1 9b may also localize to the microbody (peroxisome) with a certainty of 0.2 188, the 
endoplasmic reticulum (membrane) with a certainty of 0.1000, or in the endoplasmic 
reticulum (lumen) with a certainty of 0.1000. The most likely cleavage site for NOV19b is 
between positions 40 and 4 1 : ALT-GN. 

5 

Table 19D. Encoded NOV19b protein sequence (SEQ ID NO:84). 

MKTGNQS FGTDFLLVGLFQYGWINS LLFWI ATL FTVALTGNI ML I HLIRLNTRLHTPMY FLLSQL S I VDLM 
YI STTVPKMAVSFLSQSKTIRFLGCEIQTYVFLAI^ 
VACAWASGSINAFIHTLYVFQLPFCRSRL 
SYARVLIVVFQMSSGKGQAKAVSTCSSHLIVASLFYATTL 

IYSLKNKEVTGAVRRLLGYWI CCRKYDFRSLY 

A search of sequence databases reveals that the NOV19b amino acid sequence has 155 
of 306 amino acid residues (50%) identical to, and 198 of 306 amino acid residues (64%) 
similar to, the 316 amino acid residue ptnr:TREMBLNEW-ACC:BAB30304 protein from Mus 

1 0 musculus (Mouse) (Adult Male Testis cDNA, Riken Fuil-Length Enriched Library, 
Clone:4932441h21, Full Insert Sequence) (E - 1.3e" 79 ). 

NOV19b is predicted to be expressed in at least the following tissues: Apical microvilli 
of the retinal pigment epithelium, arterial (aortic), basal forebraih, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 

15 tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
: lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 

20 pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

25 The disclosed NOV1 9a polypeptide has homology to the amino acid sequences shown 

in the BLASTP data listed in Table 19E. 
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Table 19E. BLAST results for NOV19a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17445348 |ref|XP_ 
060559. l| 
(XM_060559) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(ORl-25) (H. 
sapiens) [Homo 
sapiens] 


533 


300/301 
(99%) 


301/301 
(99%) 


e-143 


gi| 17437056 |ref |XP_ 
060314. l| 
(XM_060314) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(ORl-25) (H. 
sapiens) [Homo 
sapiens] 


695 


169/310 
(54%) 


224/310 
(71%) 


5e-84 


gi | 17445356 |ref |XP_ 
060561. l| 
(XM_060561) 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 1-25) 
(ORl-25) (H. 
sapiens) [Homo 
sapiens] 


312 


172/305 
(56%) 


223/305 
(72%) 


3e-80 


gi | 17456595 | ref |XP_ 
065 073. l| 
(XM_065073) 


similar to 
olfactory 
receptor (H. 
sapiens ) [Homo 
sapiens] 


638 


142/292 
(48%) 


188/292 
(63%) 


7e-78 


gi|l7475192|ref |XP 
062796. 1| 
<XM 062796) 


similar to 
olfactory 
receptor (H . 
sapiens) [Homo 
sapiens] 


315 


154/299 
(51%) 


209/299 
(69%) 


2e-77 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 19F. In the ClustalW alignment of the NOV19 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 Table 19F. ClustalW Analysis of NOV19 

1) Novel NOV19a (SEQ ID NO: 82) 

2) Novel NOV19b (SEQ ID NO: 84) 

3) gi | 17445348 |ref|XP_060559.1 | (XM_060559) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens) [Homo sapiens] (SEQ ID NO:395) 

15 4) gi 1 17437056 | ref |XP_060314 . 1 1 (XM_060314) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens) [Homo sapiens] (SEQ ID NO:397) 
5) gi 1 17445356 | ref |XP_060561 . 1 1 (XM_06056l) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (ORl-25) . (H. sapiens) [Homo • sapiens] (SEQ ID NO: 3 94) 
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6) gi | 17456595 | ref |XP_065073 . 1 1 (XM_065073) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 98) 

7) gi | 17475192 | ref |XP_062796 . 1 1 (XM_062796) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 4 04) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



NOV19a 
N0V19b- 



gi 
gi 
gi 
gi 
gi 



17445348 
17437056 
17445356 
17456595 
17475192 



NOV19a 
N0V19b 



gi 
gi 
gi 
gi 
gi 



17445348 
17437056 
17445356 
17456595 
17475192 



NOV19a 

NOV19b 

gi 1 17445348 | 

gi | 17437056 j 

gi j 17445356 | 

gi | 17456595 | 

gi | 17475192 j 



NOV19a 
NOV19b 



gi 
gi 
gi 
gi 
gi 



17445348 
17437056 
17445356 
17456595 
17475192 



NOV19a 
NOV19b 



gi 
gi 
gi 
gi 
gi 



17445348 
17437056 
17445356 
17456595 
17475192 



NOV19a 
NOV19b 



gi 
gi 
gi 
gi 
gi 



17445348 
17437056 
17445356 
17456595 
17475192 



NOV19a 
NOV19b 




190 



200 



210 



220 



230 



FQ 180 
tiYVFQ 165 
FQ 165 
?ITMS 170 
SfAFR 164 
3ITLH 165 
166 

240 




310 



320 



330 



340 



350 



360 




|....| 



317 

302 

DFLSGNKS 311 

jSMMQAMEQSNYSVYADFILLGLFSNARFPWLLFALILLVFLTS 350 

301 



PPGQRPSARPLNGPAQHAVLTCSG RCLPG- 



332 
303 



370 



380 



390 



400 



410 



420 



317 
302 



|....|.-.| 



317 
302 
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10 



15 





17445348 


311 


gi 


17437056 


351 


gi 


17445356 


301 


gi 


17456595 


332 


gi 


17475192 


3 03 


NOV19a 


317 


NOV19b 


3 02 


gi 


17445348 


319 


gi 


17437056 


411 


gi 


17445356 


3 01 


gi 


17456595 


344 


gi 


17475192 


303 



" - — : ISFTGCG 318 

351 IASNWKI I LI H I DSRLHT PM YFLLS QLSLKD I L YI ST I VPKMLVDQ VMS QRAI S F AGCT 410 

- 301 

ESHVSLISLVE 343 

3Q3 



430 



440 



450 
..|.. 



460 



470 



|....|-,-.|. 



480 



-RLLGYW^C 325 

-RELGYWijC 310 



xpwiwiGGSIDGFL 470 

-RVIIQNIFS-- 309 

PPAVEVVTGASVKGCPRTWCLPI^QVLWDGPDSGTSLESKQPHQEGLSDMI^SNTICTLV 403 
- KGLDRCRXG 312 



20 



25 



490 



500 



510 



NOV19a 


325 


NOV19b 


310 


gi 


17445348 


379 


gi 


17437056 


471 


gi 


17445356 


309 


gi 


17456595 


404 


gi 


17475192 


312 



520 



530 
..|.. 



540 



SELNQFWAYPIQHDLPj 



C§KYDFRSLY 335 

--C|kYDFRSLY - - 32 0 

YVLHI PYCgSRAINHFFCDVPAMVTLACMDTWVYEGTVFLSTTIFIiVFPFIA 43 3 
VTMQFPFC^REINHFFCEVPALLKLSCTDTSAYETAMYVCCIMMLLIPFSV 525 

v|m_ _ 312 

itpapckvgah ihpaaredtlntsqetpgtpkcyrgkni 463 

315 



30 



35 



40 



45 



50 



55 



NOV19a 
NOV19b 
gi | 17445348 
gi | 17437056 
gi ) 17445356 
gi (17456595 
gi | 17475192 



NOV19a 
NOV19b 



335 



550 
..|.. 



560 



570 



■|....|....|. 



580 



590 



600 



335 



320 



320 

434 ISCSYGRVLIAVYHMKSAEGRKKAYLTCSTHLTVVTFYYAPFVYTYLRPRSLRSPTO 493 
526 I SGSYTRI LI TVYRMSEAEGRGKAVATCS S HM VWSLFYGAAMYTYVLPHS YHTPEQDKA 585 

""" ~ 312 

KG VKEGKAEPEGPVGPE WGS KTEMNFAGSEFKEVNFRCTASMHN- - S PDVTSDPVLQAA 521 
315 " 315 



312 
464 



335 
320 



610 
..|.. 



620 



630 
..|.. 



64 0 
..|.. 



650 



660 
--I 



gi 


17445348 


494 


gi 


17437056 


586 


gi 


17445356 


312 


gi 


17456595 


522 


gi 


17475192 


315 


NOV19a 


335 


NOVl9b 


320 


gi 


17445348 


533 


gi 


17437056 


646 


gi 


17445356 


312 


gi 


17456595 


579 


gi 


17475192 


315 



335 
320 



312 



315 



670 

..|.. 



680 



690 
..|.. 



700 



710 
..|.. 



720 
-.1 



335 
320 
533 
695 
312 



RSREKRRLNDAAIEGRI^ALMEENALLKGELKALKLRFGLI^ 638 
" - 315 



60 



65 



N0V19a 


335 


335 


N0V19b 


320 


320 


gi 


17445348 


533 


533 


gi 


17437056 


695 


695 


gi 


17445356 


312 


312 


gi 


17456595 


638 


638 


gi 


17475192 


315 


315 
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Table 19G lists the domain description from DOMAIN analysis results against 
NOV19. This indicates that the NOV19 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 19G Domain Analysis of NOV19 

gnl|Pfam|pfam0000l, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 

Score =91.3 bits (225), Expect = 8e-20 



5 


NOV19 : 
Sbjct: 


56 
1 


GNIMLIHLIRLOTRLOTPMYFLL 

II+++I +1 +111 1 I++ II+++ I 1 + I++ 
GNLLVI L VI LRTKKLRT P TNI FLLNLAVADI^FLLTLP PWALYYL VGGDWVFGDALCKLV 


115 
60 


10 


NOV19 : 
Sbjct: 


116 
61 


TYWIiALGGTEALLLGFMSYDRYVAICHPLHYPMLMSKKICCIiMVACA 

+1+1 III +1 lll+ll 1111*++ +++ 1 + + 
GAIiFVVNGYAS ILLLTAI S I DRYLAI VHPLR YRRI RTPRRAKVLI LLVWVLALLLSLPPL 


175 
120 


15 


NO VI 9: 
Sbjct: 


176 
121 


0 

LYVFQLPFCRSRLINHFFCTVPALLSLVCQDTSQYEYTVLLSGLJJLLIiPFIAJLASyAR 
|+ + + ||| + | ++ + 
LFSWLRTTOEGNTTVCLIDFPEESVKRSYVIiLSTLVGFVLPLLVI LVCYTRILRTLRKRA 


235 
180 


20 


NOV19 : 
Sbjct: 


236 
181 


VLIWFQMSSGKGQAKAVSTCSSHLIVASLFY ATTLFTYTRPHSLRSPSRDKAVAV 

+ |+| ++ + | + | + + 
RSQRSLKRRSSSERKAAKMLLWVVVFVLC^ 


291 
240 




NOV19 : 


292 


FYTIVTPLLNPFIY 305 




25 


Sbjct: 


241 


+ 1 III II 
WLAYVNSCLNPIIY 254 





G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
30 and are likely to underlie the recognition and G-protein-mediated transduction of various 

signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

35 Olfactory receptors (ORs) have been identified as extremely large subfamily of G 

protein-coupled receptors in a number of species. Th?se receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 

40 genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
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displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 19 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 19A, 

5 1 9C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 19A or 19C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 

1 0 acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 

15 chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 35 percent of the bases may be so 
changed. 

The disclosed NOV 19 protein of the invention includes the G-Protein Coupled 
20 Receptor -like protein whose sequence is provided in Table 19B or 19D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 19B or 19D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 52 percent of the 
25 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as Fab or 
(FabH that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV 19) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
30 NOV19 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
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delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 19 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
5 (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NDDDM1), bacterial, 

10 fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 

15 ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

20 NOV 1 9 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV 19 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV19 protein has multiple hydrophilic 

25 regions^ each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV20 

30 A disclosed NOV20 nucleic acid of 1 027 nucleotides (also referred to as CG56665-0 1) 

encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 20A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and 
ending with a TAG codon at nucleotides 940-942. The start and stop codons are shown in 
bold in Table 20A, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 20A. NOV20 nucleotide sequence (SEQ ID NO:85). 

ATGATCTGCTCA6CTATCAACCTACACTTACTACTGGCAGTTAAGATGATTCACCCTGTCTGGATTCTT 

CCTCGGGAGCAAGGGCTGTTTCTGCTGATTTATCTGGCAGTGCTGGTGGGGAACCTGCTC^TC^TTGCAGTC 

ATCACTCTCGATCAGCATCTTCACACACCCATGTACTTCTTCCTGAAGAACCTCTCCGTTTTGGATCTGTGC 

TACATCTCAGTCACTGTGCCTAAATCCATCCGTAACTCCCTGACTCGCAGAAGCTCCATCTCTTATCTTGGC 

TGTGTGGCTCAAGTCTATTTTTTCTCTGCCTTTGC^TCTGCTGAGCTGGCCrrCCTTACTGTCATGTCTTAT 

GACCGCTATGTTGCCATTTGCCACCCCCTCCAATACAGAGCCGTGATGACATCAGGAGGGTGCTATCAGATG 

GCAGTCACCACCTGGCTAAGCTGCTTTTCCTACGCAGCCGTCCAC^CTGGCAACATGTTTCGGGAGCACGTT 

TGCAGATCCAGTGTGATCCACCAGTTCTTCCGTGACATCCCTCATGTGTTGGCCCTGGTTTCCTO 

TCCTATTTCCAAATCTTCTCAACGGTGCTCAGAATCCCTTCAGGACAGAGTCGAGCAAAA 
TGCTCCCCCCAGCTC^TTGTC&TCATGCTCTTT 

AAAGCTCTGTCCATTCAGGATTTAGTGATTGCTCTGACATACACAGTTTTGCCTCCCTTCCTCAATCCCATC 
ATATATAGTCTTAGGAATAAGGAGATTAAAACAGC CATGTGGAGACTCTTTGTGAAGATATATTTTCTGCAA 
AAGTAGAACATCCTGGTCTTTACTATAGAAGATCTGCAACAAAACCCCAAAAAAGCATAAATACTTTATGAC 
AAAAAAAGATGAAAAAATT 



The disclosed NOV20 polypeptide (SEQ ID NO:86) encoded by SEQ ID NO:85 has 
313 amino acid residues and is presented in Table 20B using the one-letter amino acid code. 

5 



Table 20B. Encoded NOV20 protein sequence (SEQ ID NQ:86). 

MI CSAINLHLLLAVKMIHPVWI LAPREQGLFLLI YLAVLVGNLL I IAVITLDQHLHTPMYFFLK 
NLSVLDLCYISVTVPKS I ENS LTRRSS I SYLGCVAQVYFFSAFASAELAFLTVMSYDRYVAICH 
PLQYRAVMTSGGCTQMAVTTWLSCFSYAAVHTGNMFREHVCRSSVIHQFFRDIPHVIiALVSCEV 
FFVEFLTLALSSCLVLGCFIIiMMISYFQIFSTVLRIPSGQSRAKAFSTCSPQLIVIMLFLTTGL 
FAALGPIAKALSIQDLVIALTYTVLPPFLNPIIYSLRNKEIKTAMWRLFVKIYFLQK 

A search of sequence databases reveals that the NOV20 amino acid sequence has 134 
of 278 amino acid residues (48%) identical to, and 179 of 278 amino acid residues (64%) 
similar to, the 321 amino acid residue ptnr: SPTREMBL-ACC:Q9UGF5 BA150A6.4 protein 
1 0 from Homo sapiens (Human) (NOVEL 7 TRANSMEMBRANE RECEPTOR (RHODOPSIN 
FAMILY) (E = 2.4c" 64 ). 

The disclosed NOV20 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 20C. 



• 


Table 20C. BLAST results for NOV20 




Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 17437075 |ref|XP 

060319.1) 

(XM 060319) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens) 


311 


287/294 
(97%) 


288/294 
(97%) 


e-134 
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gi 1 17445373 | ref |XP_ 

060567.1) 

(XM_060567) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


309 


147/272 
(54%) 


188/272 
(69%) 


8e-63 


gi | 17445394 |ref|XP 
060572. l| 
(XM_060572) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


316 


133/283 
(46%) 


187/283 
(65%) 


2e-61 


gi|l7437015|ref |XP 
060307. l| 
(XM_060307) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


312 


139/291 
(47%) 


189/291 
(64%) 


9e-59 


gi | 17464351 | ref |XP 
069462. l| 
(XM_069462) 


similar to 
OLFACTORY 
RECEPTOR 501 
(HS6M1-28) (H. 
sapiens ) [Homo 
sapiens] 


321 


133/278 
(47%) 


175/278 
(62%) 


3e-57 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 20D. In the ClustalW alignment of the NOV20 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (/.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



Table 20D. ClustalW Analysis of NOV20 



1) Novel NOV20 (SEQ ID NO: 86) 

3) gi | 17437075 | ref |XP_060319.l| (XM_060319) similar to 
(HS6M1-28) (H. sapiens) [Homo sapiens] (SEQ ID NO:405) 

4) gi | 17445373 | ref | XP_060567 . 1 | (XM_060567) similar to 
(HS6M1-28) (H. sapiens) [Homo sapiens] (SEQ ID NO: 4 06) 

5) gi | 17445394 | ref |XP_060572 . 1 1 <XM_060572) similar to 
(HS6M1-28) (H. sapiens) [Homo sapiens] (SEQ ID NO: 4 07) 

6) gi | 17437015 |ref | XP_0603 07 . 1 | (XM_060307) similar to 
(HS6M1-28) (h. sapiens) [Homo sapiens] (SEQ ID NO:408) 

7) gi | 17464351 | ref |XP_069462 . 1 1 (XM_069462) similar to 
(HS6M1-28) (H. sapiens) [Homo sapiens] (SEQ ID NO:409) 



OLFACTORY RECEPTOR 5U1 
OLFACTORY RECEPTOR 5U1 
OLFACTORY RECEPTOR 5U1 
OLFACTORY RECEPTOR 5U1 
OLFACTORY RECEPTOR 5U1 



25 



30 



NOV20 


1 


gi 


17437075 


1 


gi 


17445373 


1 


gi 


17445394 


1 


gi 


17437015 


1 


gi 


17464351 


1 




70 



80 



90 



100 
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NOV20 

gi | 17437075 | 
gi j 17445373 j 
gi j 17445394 j 
gi | 17437015) 
gi j 17464351 | 



51 
49 
49 
60 
49 
49 



LDWHLHTPMY?FLKNLS|LDLCYI SVTVPKS IgNSLj 
LDfflHLHTPMYFFIiKNLslLDLCYISVTVPKSIgrvrSL 1 

ldShlhtp^yfflknlsBldlc^isvt^pksiBmsl 

LDgHLgTPMYFFL|NLSgLD|CY I S VTfJPKS lS§S L 
P§§ LH 5i PMYFFLgNLS|LDgC YI S VTVPySWpSL 



NOV20 


111 


gi 


17437075 


109 


gi 


17445373 


109 


gi 


17445394 


120 


gi 


17437015 


109 


gi 


17464351 


109 


NOV20 


171 


gi 


17437075 


169 


gi 


17445373 


169 


gi 


17445394 


180 


gi 


17437015 


169 


gi 


17464351 


169 


NOV20 


231 


gi 


17437075 


229 


gi 


17445373 


229 


gi 


17445394 


240 


gi 


17437015 


229 


gi 


17464351 


229 


NOV20 


291 


gi 


17437075 


289 


gi 


17445373 


288 


gi 


17445394 


300 


gi 


17437015 


289 


gi 


17464351 


289 



USgQNSL 




130 



140 
..I.. 



150 



160 



170 



180 



ELAgLTVMSY 
ELA|LTVMSY 

E^^LTVMSY 

EL 



'DRYMAI CgPLS YgY 

'dryWai cBplSyS^ 



[LTVMS YDRY^AI CgPLg]Ygfvi| 

ltvms§dry JJai cSplhySBimIi 

|LTVMS YDRY[|AI c|pLHYg|lfei 
LT0MgDRY|A|cfflpLH Y^ 10 



E^AIJLTVMS YDRYgAI cfflPLHYi* 




I 



190 



I 



200 



210 



220 



230 



240 



SgVIHQFFgDIPgSLSL 
s|viHQFFgDIP^JL|L 
Si^HQFFCDIpffi|Li§ 
"lilSQFFC|l Pg|L|L 
S§VIHQFFCDIp||LraL 

K/ihqffcd^pH^lBl 



250 




260 



270 



280 



290 



300 




SLRJSIKglKjp!^ 
SLRNKgIKWMg|^ 

SLRN§ffi|KgAkp^ 

slrni&ik^Jm^ 

SLRNil 



Table 20E lists the domain descriptions from DOMAIN analysis results against 
NOV20. This indicates that the NOV20 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 20E Domain Analysis of NOV20 

gnl |Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin family) 
(SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 
Score =83.6 bits (205), Expect = 2e-17 



NOV20 : 


41 


GNLLI IAVITLDQHLHTPOTFFLKNLSVLDLCT 


100 






IIIM II +111 II IN II ++ 1 ++ + 1 




Sbjct: 


1 


GNLLVILVILRTKKLRTPTNI FLLNLAVADI^FLLTLPPWALYYLVGGDWVFGDALCKLV 


60 


NOV20: 


101 


VYFFSAFASAELAFLTVMS YDRYVAI CHPLQYRAVMTSGGCY QMAVTTWLS CFS YAAVHT 


160 






1 I- + II +1 lll + ll IIMI + 1 + + | + 




Sbjct: 


61 


GALFWNGYAS I LLLTAI S IDRYLAI VHPLRYRRIRTPRRAKVLI LLVWVI1ALI1LSLPPL 


120 






177 
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NOV20: 161 GNMFREHVCRSS VI HQ FFRDI PHVLAL VS CE VFFVEFLTLALS SCLVLGC FI LMMI S YFQ 220 
+ | + + + + | | || || + 

^ Sbjct: 121 LFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTLRKRA 180 

NOV20 : 221 IFSTVLRIPSGQSRAKAFSTCSPQLIVIMLFLTTGLFAALGPIAKALSIQDLVIALT 277 

1+ I I I ++ ++ +| + I + +||| 

Sbjct : 181 RSQRSLKRRSSSERKAAKMLLVVVWFVLCVnjPYHIVLLLDSLCLLSIWRVLPTALLITL 240 

10 NOV20: 278 -YTVLPPFLNPIIY 290 

+ linn 

Sbjct: 241 WLAYVNSCLNPIIY 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
15 subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
20 subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
25 likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 

Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

30 The disclosed NOV20 nucleic acid of the invention encoding a G-Protein Coupled 

Receptor -like protein includes the nucleic acid whose sequence is provided in Table 20A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 20A while still encoding a 
protein that maintains its G-Protein Coupled Receptor -like activities and physiological 

35 functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 

40 modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

derivatized. These modifications are carried out at least in part to enhance the chemical 
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stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. 

The disclosed NOV20 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 20B. The invention also includes 
5 a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 20B while still encoding a protein that maintains its G-Protein 
Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 54 percent of the residues may be so changed. 
The invention further encompasses antibodies and antibody fragments, such as F a b or 

1 0 (Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV20) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV20 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 

1 5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

20 The NOV20 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCII and ID diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

25 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

30 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 

i 

ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
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autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV20 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV20 substances for use in 
5 , therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV20 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
10 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV21 

NOV21 includes three novel adrenal secretory serine protease-like proteins disclosed 
below. The disclosed sequences have been named NOV21a and NOV21b. 
15 NOV21a 

A disclosed NOV21a nucleic acid of 1028 nucleotides (also referred to as CG56639- 
01) encoding a novel adrenal secretory serine protease-like protein is shown in Table 21 A. An 
open reading frame was identified beginning with an TCG initiation codon at nucleotides 1-3 
and ending with a TGA codon at nucleotides 769-771 . The start and stop codons are shown in 
20 bold in Table 21 A, and the 5' and 3* untranslated regions, if any, are underlined. Because the 
start codon of NO V2 la is not a traditional initiation codon, NOV21a could be a partial reading 
frame that extends further in the 5' direction. 



Table 21A. NOV21a nucleotide sequence (SEQ ID NO:87). 

TCGCCATTTCCAGACGCCCCGGAGGCCACCACACACACCCAGCTACCAGACTGTGGCCTGGCGCCGGCCGCG 

CTCACCAGGATTGTGGGCGGCAGCGCAGCGGGCCGTGGGGAGTGGCCGTGGCAGGTGAGCCTGTGGCTGCGG 

CGCCGGGAACACCGTTGCGGGGCC^TGCTGGTGGCAGAGAGGTGGCTGCTGTCGGCGGCGCACTGCTTCGAC 

GTCTACGGGGACCCCAAGCAGTGGGCGGCCTTCCTAGGCACGCCGTTCCTGAGCGGCGCGGAGGGGCAGCTG 

GAGCGCGTGGCGCGCATCTACAAGCACCCGTTCTACAATCTCTACACGCTCGACTACGACGTGGCGCTGCT 

GAGCTGGCGGGGCCGGTGCGTCGCAGCCGCCTGGTGCGTCCCATCTGCCTGCCCGAGCCCGCGCCGCGACCC 

CCGGACGGCACGCGCTGCGTCATCACCXKaCrGGGGCTCGGTGCGCGAAGGAGGCTCCATGGCGCGGCAGCTG 

CAGAAGGCGGCCGTGCGCCTCCTCAGCGAGCAGACCTGCCGCCGCTTCTACCCAGTGCAGATCAGCAGCCGC 

ATCTCTGAACCCCCTTTCTTCTCTCCCCAACAGGGTGACGCTGGGGGACCCCTGGCCTGCAGGGAGCCCT 

GGACGGTGGGTGCTAACTGGGGTCACTAGCTGGGGCTATGGCTGTGGCCGGCCCCACTTCCCAGGTGTCTAT 

ACCCGGGTGGCAGCTGTGAGAGGCTGGATAGGACAGCACATCCAGGAGTQ AC 

CGAGACTCTACGTGAAAGCAACAGGAGCAGC^GGCCACCCAACACCCC^CCCaVCCGTACCCTACCCAAGGA 

CGGGTGTGGGGGGGCTGTGGGTCATGGGGATGCATTTTGGT 

CCACCCTAGCTCACTGGCTCAGCACCTCAGTGTC^ 

GGGGTGGAACGAAACAACCC ' ~~ 
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In a search of public sequence databases, the NOV21a nucleic acid sequence, located 
on chromosome 19, has 296 of 466 bases (63%) identical to a gb:GENBANK- 
ID:E13204|acc:E13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 
protease) (E = 3.9e~ 18 ). 

5 The disclosed NOV21a polypeptide (SEQ ID NO:88) encoded by SEQ ID NO:87 has 

256 amino acid residues and is presented in Table 2 IB using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV21a has A signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. Alternatively, 
NOV21a may also localize to the lysosome (lumen) with a certainty of 0.3168, or the 
10 mitochondrial matrix space with a certainty of 0.1000. The most likely cleavage site for 
NOV21a is between positions 68 and 69: SAA-HC. 



Table 21B. Encoded NOV21a protein sequence (SEQ ID NO:88). 

SPFPDAPEATTHTQLPDCGLAPAALTRIVGGSAAG 
WGDPKQWAAFTjGTPFLSGAEGQLERVARIYKHPFYNL^ 

PDGTRCVITGWGSVREGGSMARQLQKAAVRLLSEQTCRRFYPVQI SSRISEPPFFS PQQGDAGGPLACREPS 
GRWVLTGVTS WGYGCGRPHFPG VYTRVAAVRGWI GQHI QE ■ 

A search of sequence databases reveals that the NOV21a amino acid sequence has 99 
15 of 250 amino acid residues (39%) identical to, and 134 of 250 amino acid residues (53%) 
similar to, the 279 amino acid residue ptnr:SPTREMBL-ACC:Q9QZ74 protein from Rattus 
norvegicus (Rat) (Adrenal Secretory Serine Protease Precursor) (E = 1 .Se" 42 ). 

NOV21a is predicted to be expressed in at least the following tissues: Ovary, kidney, 
breast, lung, muscle, liver, spleen, blood, lymphocyte. This information was derived by 
20 determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. 

NOV21b 

In the present invention, the target sequence identified previously, NOV21a, was 
25 subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
30 the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
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sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
5 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

1 0 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated Accession Number NOV21b. This differs 

1 5 from the previously identified sequence (NOV2 1 a) in being a splice variant and a mature 
protein starting with serine. 

A disclosed NOV21b nucleic acid of 785 nucleotides (also referred to as CG56639-02) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 21C. An 
open reading frame was identified beginning with an CTT initiation codon at nucleotides 1-3 

20 and ending with a TGA codon at nucleotides 783-785. The start and stop codons are shown in 
bold in Table 21 C, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon of NOV21b is not a traditional initiation codon, NOV21b could be a partial 
reading frame that extends further in the 5' direction. 



Table 21C. NOV21b nucleotide sequence (SEQ ID NO:89). 

~cttcgccatto 

cgctcaccaggattgtgggcggcagcgcagcgggcc6tggggagtggccgtggcaggtgagcctgtggctgc 
ggcgccgggaacaccgttgcggggccgtgctggtggcagagaggtggctgctgtcggcggcgcactgcttcg 
acgtctacggggaccccaagcagtgggcggccttcctaggcacgccgttcctgagcggcgcggaggggcagc 
tggagcgcgtggcgcgcatctacaagcacccgttctacaatctctacacgctcgactacgacgtggcgctgc 
tggagctggcggggccggtgcgtcgcagccgcctggtgcgtcccatctgcctgcccgagcccgcgccgcgac 
ccccggacggcacgcgcrtgcgtcatcacc^gctggggctcggtgcgcgaaggaggctccatggcgcggcagc 
tgcagaaggcggccgtgcgcctcctcago;agcag 

gcatgctgtgtgccggcttcccgcagggtggcgtggacagctgctcgggtgacgctgggggacccctggcct 
gcagggagccctctggacggtgggtgctaactggggtcactagctggggctatggctgtggccggccccact 
tcccaggtgtctatacccgggtggcagctgtgagaggctggataggacagcacatccaggagtga 

25 



In a search of public sequence databases, the NOV21b nucleic acid sequence, located , 
on chromosome 19, has 160 of 162 bases (98%) identical to a gbrGENBANK- 
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ID:HUMLAMBBB|acc:M94363.1 mRNA from Homo sapiens (Human Iamin B2 (LAMB2) 
gene and ppvl gene sequence) (E = 4.3e" 59 ). 

The disclosed NOV21b polypeptide (SEQ ID NO:90) encoded by SEQ ID NO:89 has 
260 amino acid residues and is presented in Table 21D using the one-letter amino acid code. 
5 Signal P, Psort and/or Hydropathy results predict that NOV21b has A signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. Alternatively, 
NOV21b may also localize to the lysosome (lumen) with a certainty of 0.3082, or the 
mitochondrial matrix space with a certainty of 0.1000. The most likely cleavage site for 
NOV2 1 b is between positions 68 and 69: SAA-HC. 

10 

Table 21D. Encoded NOV21b protein sequence (SEQ ID NO:90). 

SPFPDAPEATTHTQLPDCGIAPAALTRIVGGSAAGRGEWPWQVSLWLRRREHRCGAVLV/^RWLLSAAHCFD 
VYGDPKQWAAFLGTPFLSGAEGQLERVARIYKHPFYNL 

PDGTRCVITGWGSVREGGSMARQLQKAAVRLLSEQTCHRFYPVQI SSRMLCAGFPQGGVDS CSGDAGGPLAC 
REPSGRWVLTGVTSWGYGCGRPHFPGVYTRVAAVRGWIGQHIQE 

A search of sequence databases reveals that the NOV21b amino acid sequence has 123 
of 250 amino acid residues (49%) identical to, and 154 of 250 amino acid residues (61%) 
similar to, the 855 amino acid residue ptnr:SPTREMBL-ACC:Q9Y5Y6 protein from Homo 
sapiens (Human) (Matriptase) (E = 3.5e" 59 ). 

NOV21b is predicted to be expressed in at least the following tissues: adrenal gland, 
Ovary, kidney, breast, lung, muscle, liver, spleen, blood, lymphocyte. . 

The disclosed NOV21a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 2 IE.. 



Table 21E. BLAST results for NOV21a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
{%) 


Positives 
(%) 


Expect 


gi| 12836503 |dbj|BAB 
23684. l| (AK004939) 


data source ;SPTR, 
source 
key:095519, 
evidence : ISS~homo 
log to DJ1170K4.4 
(NOVEL PROTEIN) 
(FRAGMENT) -put at i 
ve [Mus musculus] 


799 


118/244 
(48%) 


153/244 
(62%) 


7e-55 


gij 10257390 |gb|AAGl 
5395.l|AF057145 1 
(AF057145) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


115/250 
(46%) 


146/250 
(58%) 


6e-52 
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20 
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gi | 11415040 |ref |NP_ 

068813,1) 

(NM_021978) 


suppression of 
tumorigenicity 14 

(colon carcinoma, 
matriptase, 
epithin) ; 
suppression of 
tumorigenicity 14 

(colon 
carcinoma) ,- 
matriptase [Homo 
sapiens] 


855 


115/250 
(46%) 


146/250 
(58%) 


7e-52 


gi | 7363445 |ref|NP 0 
35306. 2| 
(NM 011176) 


protease, serine, 
14 (epithin) [Afus 
mus cuius] 


855 


115/250 
(46%) 


144/250 
(57%) 


8e-52 


gi | 16758444) ref |NP_ 
446087. l| 
(NM_053635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) [Rattus 
norvegicus] 


855 


112/247 
(45%) 


141/247 
(56%) 


le-51 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 21F. In the ClustalW alignment of the NOV21 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 21F. ClustalW Analysis of NOV21 

1) Novel NOV2la (SEQ ID NO: 88) 

2) Novel NOV2lb (SEQ ID NO: 90) 

3) gi| 12836503 |dbj |BAB23684.l| (AK004939) data source :SPTR, source key: 095519, 
evidence : ISS-horaolog to DJ1170K4.4 (NOVEL PROTEIN) (FRAGMENT) -putative [Mus 

15 musculus] (SEQ ID NO: 410) 

4) gi | 10257390 |gb | AAG15395 . 1 | AF057145_1 (AF057145) serine protease TADG15 [Homo 
sapiens] (SEQ ID NO: 4 11) 

5) gi | 11415040 | ref |NP_068813 -l| (NM_021978) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin); suppression of tumorigenicity 14 (colon carcinoma); 

20 matriptase [Homo sapiens] (SEQ ID N0:412) 

6) gi 1 7363445 |ref |NP_035306 . 2 | (NM_011176) protease, serine, 14 (epithin) [Mus 
musculus] (SEQ ID NO: 413) 

7) gi| 16758444 | ref |NP_446087.l | (NM_053635) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) [Rattus norvegicus] (SEQ ID NO: 414) 



30 



35 



NOV21a 


1 


NOV21b 


1 


g± 


12836503) 


1 


gi 


10257390 | 


. 1 


gi 


11415040) 


1 


gi 


73 63445) 


1 


gi 


16758444) 


1 
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20 



30 



|....| 




70 



80 



tfSRHEKVNGL 
STSRHEKVNGL 
ttSRLENMNGFfj 
SSRLENMNGl 

90 



40 
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100 



110 
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N0V21a 
N0V21b 



gi 
gi 
gi 
gi 
gi 



12836503 
10257390 
11415040 
7363445) 
16758444 




10 



15 



N0V21a 
NOV21b 



gi 
gi 
gi 
gi 
gi 



12836503 
10257390 
11415040 
7363445 | 
16758444 | 



130 
|....|.. 



140 



150 



160 



170 



180 

|....| 




fcVAST-R 
iYSGVPF 
iYSGVPF 
jJYNEVPA? 
■MYSEVPV 




jFILDraE^QRLTLSPE^VR&iS 165 





2HL 1 

SHLVEEAEir 

pISlaeevd: 
phleeevdrama 
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N0V21a 


1 


N0V21b 


1 


gi 


12836503) 


166 


gi 


10257390] 


179 


gi 


11415040) 


179 


gi 


7363445) 


179 


gi 


16758444) 


179 



N0V21a 
N0V21b 



ga. 
gi 
gi 
gi 
gi 



12836503) 

10257390) 

11415040) 

7363445) 

16758444) 



190 200 
.|....|....|....|.. 



210 



220 



230 



.|....|....|. 



240 
•-I 



;glvi^^|votivvln- stlScyrysxvn 



— i 
■-- i 

3QVLPLK 224 

S ppo 235 

--FPD 235 

3 FPN 235 

3 FPN 235 




310 



320 



330 



NOV21a 


1 


N0V21b 


1 


gi 


12836503 


280 


gi 


10257390 


296 


gi 


11415040 


296 


gi 


7363445) 


296 


gi 


16758444 


296 



340 
|....|.-...|. 



350 



360 
)....) 




^VWKKGMHSYpjDPFLLSV^ 33 9 

LITNTERRHP GF^j^TFffiLPjRMJS»S CGGRLRKAbShTF 351 

tLITL I TNTERRi(p GFEATF^LPpiSS CGGRLRKaSmTF 351 

>\^TL I TNTDRRiJP GF^ATF^^PK^SCGGFLSDI^BtF 351 

^YTLITNTDRrSp GF^ATF^LP^gSCGGLLSEA^TF 351 



50 



55 



60 



65 



370 



380 



390 



400 



410 



N0V21a 
N0V21b 

gi 
gi 
gi 
gi 
gi 



12836503 
10257390 


340 
352 




NSj 


11415040 
7363445| 
16758444 


352 
352 
352 


NS; 
sb 



N0V21a 


1 


NOV21b 


1 


gi 


12836503 


400 


gi 


10257390 


412 


gi 


11415040 


412 


gi 


7363445) 


412 


gi 


16758444 


412 



)....).. ..I.... I.... I.... I.... I. 



420 




CiQP^ERiPMV. 
-jSQF^VTJ 




— i 

~- 453 
LTGRCIR 466 
1TGRCIR 466 
rRCIR 466 
RCIR 466 



70 



490 



.|....|. 



500 



510 
....).. 
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520 



530 
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NOV21a 1 

NOV21b 1 -- 

gi 12836503) 453 VN-G: 

gi 10257390 j 467 KELRCDGWADCTDHSDELNCS CDAGHQFTCKN1 

gi 11415040) 467 kelrcdgwadctdhsdelncscdaghqftcknk: 

gi 7363445] 467 KELRCDGWADCPDYSDERYCRCNATHQFTCKNQ: 

gi 16758444) 467 KDLROTGWADCPDYSDERHCRCNATHQFMCKNQ: 




1 

1 

480 
526 
526 
526 
526 



10 



15 



550 



NOV21a 
NOV21b 



9± 

gi 
gi 
gi 
gi 



12836503 
10257390 
11415040 
7363445) 
16758444 



1 
1 

481 
527 
527 
527 
527 



560 



570 



580 



590 



600 



CQE0STg|SLPH^Rj3PaSlii 



rFRCSNG] 
rFRCSNGI 
SFKCSgGI 
SFKCSgGI 



lKS 9f 
jISKSQi 

jpQsjp; 

" 'QSj 



JGKDDSGC e ' ! = " ^ 
JGKbjti • dp e I » : AS 
jGEDgaGb^^AS 
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N0V21a 
N0V21b 



gi 
gi 
gi 
gi 
gi 



12836503 
10257390 
11415040 
7363445 | 
16758444 



N0V21a 

K0V21b 

gi | 12836503 

gi 1 10257390 

gi j 11415040 

gi 1 7363445 | 

gi | 16758444 | 



N0V21a 
NOV21b 



gi 
gi 
gi 
gi 
gi 



12836503 
10257390 
11415040 
7363445| 
16758444| 



N0V21a 
NOV21b 



gi 
gi 
gi 
gi 
gi 



12836503 
10257390 
11415040 
7363445) 
16758444) 



NOV21a 


228 


N0V21b 


232 


gi 


12836503 


772 


gi 


10257390 


827 


gi 


11415040 


827 


gi 


7363445 | 


827 


gi 


16758444 


827 
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Tables 21G-H lists the domain descriptions from DOMAIN analysis results against 
NOV2 1 . This indicates that the NOV21 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 21G Domain Analysis of NOV21 

gnl| Smart |smart00020, Tryp_SPc, Trypsin-like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO:812) 

CD-Length « 230 residues, 100.0% aligned 
Score = 221 bits (563), Expect = 4e-59 





N0V21: 


27 




Sbjct: 


1 


10 


NOV21: 


87 




Sbjct: 


60 




N0V21 : 


146 


15 








Sbjct: 


120 




NOV21 : 


203 


20 


Sbjct: 


180 



RIVGGSAAGRGEWPWQVSLWLRRREHRCGAVLVRERWLLSAAHCFDVYGDPKQWAAFLGT 8 s 

nun i i +MIIII i i ii i++ ii+miii i n + 

RIVGGSEANIGSFPWQVSLQYRGGRHFCGGSLISPRWVLTAAHCVY-GSAPSSIRVRLGS 59 
PFL- SGAEGQIiERVARIYKHPFYNLYTLDYDVALLELAGP VRRSRLVRP I CLPEPAPRPP 145 

i ii i i +1+++ ii ii ii 1+111+1+ ii i mini i 

HDLSSGEETQIVKTVSKVIVHPNYNPSTYDNDIA^ 119 
DGTRCVTTGWGSVRE-GGSMARQLQKAAViaiLSEQTCRRFYPVQISSRISEPPFFSPQ-. - 202 

ii i ++m i n+ ii+ i ++i mi i + + 

120 AGTTCTVSGWGRTSES SGSLPDTLQEVNVPIVSNATCRRAYSGGPAITDNMLCAGGLEGG 179 
QGDAGGPLACRE PS GRWVLTGVTS WG - YGCGRPHFPGVYTRVAAVRGWI 250 

ni+iin i mi i+ iii iii n+ 1111111++ ii 

KDACQGDSGGPLVCN- -DPRWVLVGIVSWGSYGCARPlSfKPGVYTRVSSYLDWI 23 0 



Table 21H Domain Analysis of NOV21 

gnl | Pfam|pfam00089, trypsin, Trypsin. Proteins recognized include all 
proteins in families Si, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*). (SEQ ID NO:813) 
CD-Length = 217 residues, 100.0% aligned 
Score » 177 bits (448) , Expect = 9e-46 
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Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 
found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, 
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 
families (denoted SI - S27) of serine protease have been identified, these being grouped into 6 
5 clans on the basis of structural similarity and other functional evidence. 

Tryptase is a tetrameric serine protease that is concentrated and stored selectively in 
the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
10 selective marker of intact mast cells using immunohistochemical techniques with antitryptase 
antibodies. 

In addition, NOV21 nucleic acids and polypeptides are useful, inter alia, as novel 
members of the protein families according to the presence of domains and sequences related to 
previously described proteins. For example, NOV21 nucleic acids and polypeptides contain a 

15 structural motif that is characteristic of protein sbelonging to the serine protease family of 
proteins. Accordingly, NOV21 may be useful in the same ways other members of this family 
are useful as detailed above. 

The disclosed NOV21 nucleic acid of the invention encoding a Adrenal secretory 
serine protease -like protein includes the nucleic acid whose sequence is provided in Table 

20 2 1 A, 2 1C or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 21 A or 2 1C 
while still encoding a protein that maintains its Adrenal secretory serine protease -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

25 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

30 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 2 percent of the 
bases may be so changed. 
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The disclosed NOV21 protein of the invention includes the Adrenal secretory serine 
protease -like protein whose sequence is provided in Table 21B or 24D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 21B or 21D while still encoding a protein that 

5 maintains its Adrenal secretory serine protease -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 54 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F ab )2,that bind immunospecifically to any of the proteins of the invention. 

10 The above disclosed information suggests that this Adrenal secretory serine protease - 

like protein (NOV21) is a member of a "Adrenal secretory serine protease family"." Therefore, 
the NOV21 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 

1 5 protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV21 nucleic acids and proteins of the invention are useful in potential 

20 therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 

transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, autoimmune disease, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, allergies, 
immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular dystrophy, 
Lesch-Nyhan syndrome, myasthenia gravis, and/or other diseases and pathologies. 

25 NOV2 1 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV21 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV21 protein has multiple hydrophilic 

30 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV22 
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NOV22 includes three novel adrenal secretory serine protease-like proteins disclosed 
below. The disclosed sequences have been named NOV22a, and NOV22b. 
NOV22a 

A disclosed NOV22a nucleic acid of 796 nucleotides (also referred to as CG56643-01) 
5 encoding a novel adrenal secretory serine protease-like protein is shown in Table 22A. An 
open reading frame was identified beginning with an ACC initiation codon at nucleotides 1-3 
and ending with a TGA codon at nucleotides 763-765. The start and stop codons are shown in 
bold in Table 22A, and the 5' and 3 ' untranslated regions, if any, are underlined. Because the 
start codon of NOV22a is not a traditional initiation codon, NOV22a could be a partial reading 
10 frame that extends further in the 5' direction. 



Table 22A. NOV22a nucleotide sequence (SEQ ID NO:91). 

ACCCGAGCAGGCCAAGATCCCCAGACCTGGTCTTGTGTCCTCCTTC CAG AATGTGGGGCCAGGCCTG CAATG 

GAGAAGCCCACCCGGGTCGTGCGCGGGTTCGGAGCTGCCTCCGGGGAGGTGCCCTGGCAGGTCAGCCTGAAG 

GAAGGGTCCCGGCACTTCTGCGGAGCAACTGTGGTGGGGGACCGCTGGCTGCTGTCTGCCGCCCACTGCTTC 

CATAGCACGAAGGTGGAGC^GGTTCGGGCCCACCTGGGCACTGCGTCCCTCCTGGGCCTGGGCGGGAGCCCG 

GTGAAGATCGGGCTGCGGCGGGTAGTGCTGCACCCCCTCTACAA 

GTCCTGGAGCTGGCCAGCCCCCTGGCCTTCAACAAATA 

AAGTTCCCTGTGGGCCGGAAGTGCATGATCTCCGGATGGGGAAATACGCAGGAAGGAAATCTGCAGAAGGCG 

TCCGTGGGCATCATAGACCAGAAAACCTGTAGTGTGCTCTACAACTTCTCCCTCACAGACCGC^ 

GCAGGCTTCCTGGAAGGCAAAGTCGACTCCTGCCAGGGTGACTCTGGGX3GCCCCCTGGCCT 

CCTGGCGTGTTTTATCTGGCAGGGATCGTGAGCTGGGGTATTGGCTG05CTCAGGTTAAGAAGCCGGGCGTG 

TACACGCGCATCACCAGGCTAAAGGGCTGGATCATCCAGGAGTG ACCACCACGTGACTGCCCAGGCCGAGAC 

TCTA " ' 



In a search of public sequence databases, the NOV22a nucleic acid sequence, located 
on chromosome 19, has 278 of 428 bases (64%) identical to a gbrGENBANK- 
15 ID:E13204|acc:E13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 
protease) (E = 1.6e" 29 ). 

The disclosed NOV22a polypeptide (SEQ ID NO:92) encoded by SEQ ID NO:91 has 
254 amino acid residues and is presented in Table 22B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV22a has no signal peptide and is 
20 likely to be localized to the microbody (peroxisome) with a certainty of 0.5090. Alternatively, 
NOV22a may also localize to the cytoplasm with a certainty of 0.4500, to the lysosome 
(lumen) with a certainty of 0.2082, or the mitochondrial matrix space with a certainty of 
0.1000. 



Table 22B. Encoded NOV22a protein sequence (SEQ ID NO:92). 

TRAGQDPQTWSCVIiLPECGARPAMEKPTRVVRGF^ 

HS TKVEQ VRAHLGT AS LLGLGG SPVKI GLRRWLHPLYNP GI LDFPLAVLEIiAS P LAFNKY IQPVCL PLAI Q 
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KFPVGRKCMISGWGNTQEGNLQKASVGI IDQKTCSVLYNFSLTDRMICAGFLEGKVDSCQGDSGGPLACEEA 
PGVFYIAG J VS WGI GCAQVKK PG VYT RI TRLKGW I IQE 

A search of sequence databases reveals that the NOV22a amino acid sequence has 100 
of 241 amino acid residues (41%) identical to, and 149 of 241 amino acid residues (61%) 
similar to, the 273 amino acid residue ptnr:TREMBLNEW-ACC:BAB20278 protein from Mus 
5 musculus (Mouse) (Type 1 Spinesin) (E = 3-le* 49 ). 

The adrenal secretory serine protease disclosed in this invention is predicted to be 
expressed in at least the following tissues: Ovary, kidney, breast, lung, muscle, liver, spleen, 
blood, lymphocyte. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
10 Public EST sources, Literature sources, and/or RACE sources. 

NOV22b 

In the present invention, the target sequence identified previously, NOV22a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 

15 downstream sequence available for the reverse primer. In each case, the sequence was 

examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 

20 sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 

25 lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 

30 ESTs were included as components for an assembly when the extent of their identity with 

another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV22b. This differs from the previously 
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identified sequence (NOV22a) in having 43 additional aminoacids and different N and C 
terminus. 

A disclosed NOV22b nucleic acid of 992 nucleotides (also referred to as CG56643-02) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 22C. An 
5 open reading frame was identified beginning with an ATG initiation codon at nucleotides 101- 
103 and ending with a TAA codon at nucleotides 920-922. The start and stop codons are 
shown in bold in Table 22C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 22G. NOV22b nucleotide sequence (SEQ ID NO:93). 

GCTAGTCTATCCCGAGACCCCTCCCACTCCAACAGTTAATGCTO 

CTTACCCAGGTGCTAGGGCGGCAGCCCCA TOGGGACAGTGGGGAGACTCTTGCGCTCTGAGCGGGCCATCAG 
GCC CACCTCCTCCTCACTCTGTGGCTTTGTGAGATTCCTG CAACTCTGTGAGCCCTGGTTTCTTCGTCTGTG 
GGGTGGGGATGCTGCATCTCGGGGCTCTTATCGGAGCGGAACTGGAGCTGCTCTGATGAT 
GGCCTTTCTGGCTCTTTCCCTGGTAGCCACCAAGCC^ 

CCAGAAAACCTGTAGTGTGCTCTACAACTTCTCCCTCACAGACCGCATGATCTGCGCAGGCTTCCTGGAAGG 
CAAAGTCGACTCCTGCC^GGGTGACTCrGGGGGCCCCCTGGCCTGCGAGGAGGCCCCTGGCGTGTTTTATCT 
GGCAGGGATCGTGAGCTGGGGTATTGGCTGCGCTC^GGTTAAGAAGCCGGGCGTGTACACGCGCATCACCAG 
GCTAAAGGGCTGGATCCTGGAGATCATGTCCTCCCAGCCCCTTCC 

GCTGGCCACCACCAGCCCCAGGACGACAGCTGGCCTCACAGTCCCGGGGGCCACACCCAGCAGACCCACC 

TGGGGCTGCCAGCAGGGTGACGGGCCAACCTGCCAACTCAACCTTATCTGCCGTGAGCACCACTGCT 

ACAGACGCCATTTCCAGACGCCCCGGAGGCCACCACACACACCCAGCTACCAGGTACCGGGAGAGACGGAGG 

GATCCCTGGGAGTGGAGGGTCCCATGTTAATCAGCCTGGGCTGCCTAACAAGACATA ACGTCGTCCACTTTG 

GGAGGCCGAGGCGGGCGGATCAAGAGGTCAGGAGATCGAGACCATCCTGGCGAACA 



10 In a search of public sequence databases, the NOV22b nucleic acid sequence, located 

on chromosome 19, has 203 of 294 bases (69%) identical to a gb:GENBANK- 
ID:AF133086|acc:AFl 33086.1 mRNA from Homo sapiens (membrane-type serine protease 1 
mRNA, complete cds) (E = 3.6e 16 ). 

The disclosed NOV22b polypeptide (SEQ ID NO:94) encoded by SEQ ID NO:93 has 

15 273 amino acid residues and is presented in Table 22D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV22b has A signal peptide and is 
likely to be localized to the mitochondrial inner membrane with a certainty of 0.8723. 
Alternatively, NOV22b may also localize to the plasma membrane with a certainty of 0.6500, 
to the mitochondrial intermembrane space with a certainty of 0.5053, or the mitochondrial 

20 matrix space with a certainty of 0.36 1 7. The most likely cleavage site for NOV22b is between 
positions 43 and 44: GDA-AS. 



Table 22D. Encoded NOV22b protein sequence (SEQ ID NO:94). 

MGTVGRIJjRSERAIRPTSSSLCGFVRFLQLCEPWFLRLWGGDA 
TKPELLQKASVGIIDQKTCSVLYNFSLTDRMICA 

CAQVKKPGVYTRITRLKGWILEIMSSQPLPMSPPSTTRMLATTSPRTTAGLTVPGATPSRPTPGAASRVTGQ 
PAN STLS AVSTTARGQT P FPDAPEATTHTQIiPGTGRDGGI PGSGGSHVNQPGLPNKT 
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A search of sequence databases reveals that the NOV22b amino acid sequence has 49 
of 90 amino acid residues (54%) identical to, and 63 of 90 amino acid residues (70%) similar 
to, the 277 amino acid residue ptnr:SPTREMBL-ACC:096899 protein from Scolopendra 
5 subspinipes (Plasminogen Activator Spa) (E = 4.3e~ 24 ). 

NOV22b is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
10 salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 
. NOV22c 

A disclosed NOV22c nucleic acid of 912 nucleotides (also referred to as CG56643-03) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 22E. An 
15 open reading frame was identified beginning with an ATG initiation codon at nucleotides 77- 
79 and ending with a TAA codon at nucleotides 896-898. The start and stop codons are 
shown in bold in Table 22E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 22E. NOV22c nucleotide sequence (SEQ ID NO:95). 

CCCCATGGGGACAGTGGGGAGACTCTTGCGCTCTGAGCGGGCCATCAGGCCCACCTCCTCCTCACTCTGTGG 

CTTTGTGAGATTCCTGCAACTCTCT 

CTGTTATCGGAGCGGAACTGGAGCTGCTCTGATGATCA 

AGCCACCAAGCCCGAGCTCCTGCAGAAGGCGTCCGTGGGCATCATA 

CAACTTCTCCCTCACAGACCGCATGATCTGC 

CTCTGGGGGCCCCCTGGCCTGCGAGGAGGCCCCTGGCGTGTTTTATCTGGCAGGGATCGTGAGCTGGGGTAT 

TGGCTGCGCTCAGGTTAAGAAGCCGGGCGTGTACACGCGCATCACCAGGCTAAAGGGCTGGATCCTGGAGAT 

CATGTCCTCCCAGCCCCTTCCGATGTCTCCCCCCTCGACCACAAGGATGCTGGCCACCACCAGCCCCAGGAC 

GACAGCTGGCCTCACAGTCCCGGGGGCCACACCCAGCAGACCCACCCCTGGGGCTGCCAGCAGGGTGACGGG 

CCAACCTGCCAACTCAACCTTATCTGCCGTGAGC^CC^ 

GGAGGCCACCACACACACCCAGCTACCAGGTAC 

TGTTAATCAGCCTGGGCTGCCTAACAAGACATAACGTCGTCCACTTTG 



20 In a search of public sequence databases, the NOV22c nucleic acid sequence, located 

on chromosome 19, has 203 of 294 bases (69%) identical to a gb:GENBANK- 
ID:E13204|acc:E13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 
protease) (E=1.3e~ 18 ). 

The disclosed NOV22c polypeptide (SEQ ID NO:96) encoded by SEQ ID NO:95 has 

25 273 amino acid residues and is presented in Table 22F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV22c has A signal peptide and is 
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likely to be localized to the mitochondrial inner membrane with a certainty of 0,8723. 
Alternatively, NOV22c may also localize to the plasma membrane with a certainty of 0.6500, 
to the mitochondrial intermembrane space with a certainty of 0.5053, or the mitochondrial 
matrix space with a certainty of 0.3617. The most likely cleavage site for NOV22c is between 
5 positions 43 and 44: GDA-AS. 



Table 22F. Encoded NOV22c protein sequence (SEQ ID NO:96). 

MGTVGRIiLRSERAIRPTSSSLCGFVRFLQLCEPW 

TKPELLQKASVGI IDQKTCSVIiYNFSLTDRMI CAGFLEGKVDSCQGDSGGPIiACEEAPGVFyLAGI VSWGIG 
CAQVKKPGVYTRITRLKGWILEI^SQPLPM 

PANS TLSAVSTTARGQTPFPDAPEATTHTQLPGTGRDGGI PGSGGSHVNQPGLPNKT 



A search of sequence databases reveals that the NOV22c amino acid sequence has 49 
of 90 amino acid residues (54%) identical to, and 63 of 90 amino acid residues (70%) similar 

10 to, the 277 amino acid residue ptrir:SPTREMBL-ACC:096899 protein from Scolopendra 
subspinipes (Plasminogen Activator SPA) (E = 4.5e" 24 ). 

NOV22c is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 

1 5 kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 

salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 

The disclosed NOV22a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 22G„ 

20 



Table 22G. BLAST results for NOV22a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 16758444 |ref|NP 
446087. 1| 
(NM_053635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase r 
epithin) (Rattus 
norvegicus] 


855 


109/251 
(43%) 


148/251 
(58%) 


7e-55 


gi | 7363445 |ref|NP 0 
35306. 2| 
(NMJJ11176) 


protease, serine, 
14 (epithin) [Wus 
musculus] 


855 


110/248 
(44%) 


150/248 
(60%) 


7e-54 


gi| 9757702 |dbj|BAB0 
8218.1) (AB038498) 


homolog of human 
MT-SP1 [Xenopus 
laevis] 


845 


113/261 
(43%) 


156/261 
(59%) 


2e-52 


gi | 10257390 | gb| AAG1 
5395.1|AF057145 1 
(AF057145) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


107/248 
(43%) 


145/248 
(58%) 


3e-52 
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gi | 11415040 | ref |NP_ 


suppression of 


855 


107/248 


145/248 


3e-52 


068813.1 | 


tumorigenicity 14 




(43%) 


(58%) 




(NM 021978) 


(colon carcinoma, 












matriptase, 
epithin) ; 
suppres s ion oj. 
tumorigenicity 14 
(colon 
carcinoma) ; 
matriptase [Homo 
sapiens] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 22H. In the ClustalW alignment of the NOV22 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 22H. ClustalW Analysis of NOV22 



15 



20 



25 



(NM_053635) suppression of tumorigenicity 14 (colon 
[Rattus norvegicus] (SEQ ID N0:414) 
(NM_011176) protease, serine, 14 (epithin) [Mus 



1) Novel NOV22a (SEQ ID NO: 92) 

2) Novel NOV22b (SEQ ID NO: 94) 

3) Novel N0V22C (SEQ ID NO:96) 

4 ) gi 1 16758444 | ref |NP_446087 . 1 1 
carcinoma, matriptase, epithin) 

5) gi | 7363445 | ref |NP_035306 . 2 | 
musculus] (SEQ ID NO: 4 13) 

6) gi|9757702|dbj |BAB08218.1| (AB038498) homolog of human MT-SP1 [Xenopus laevis] 
(SEQ ID NO:415) 

7) gi 1 10257390 |gb| AAG15395 . 1 1 AF057145_1 (AF057145) serine protease . TADG15 [Hojno 
sapiens] (SEQ ID NO: 411) 

8) gi 1 11415040 1 ref |NP_068813 .1 1 (NMJ)21978) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) ; suppression of tumorigenicity 14 (colon carcinoma) ; 
matriptase [Homo sapiens] (SEQ ID NO: 412) 



30 



35 



NOV22a 




NOV22b 
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16758444 Iref 


gi 
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ref 


gi 


9757702 
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gi 


10257390 |gb 


gi 


11415040| ref 




MGNNRGRKAGGGS 
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M 
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HpsplI 
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SfsfiJ 


gi 


9757702 
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10257390 |gb 




miGhiJ 


gi 


11415040| ref 


HlGLlJ 
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NOV22a 
NOV22b 
NOV22C 



gi 
gi 
gi 
gi 



16758444 | ref 
7363445 | ref 
9757702) dbj 
10257390 | gb 
11415040 ref 



NOV22a 
NOV22b 
NOV22C 



gi 
gi 
gi 
gi 
gi 



167584441 ref 
7363445 | ref | 
9757702 j dbj j 
10257390 | gb j 
11415040 ref 



NOV22a 
NOV22b 
NOV22C 

gi 1 16758444 | ref 
gi 1 7363445 | ref | 
gi | 9757702 j dbj j 
gi j 102573 90 |gb| 
gi 1 11415040 | ref 



NOV22a 
NOV22b 
NOV22C 



gi 
gi 
gi 
gi 
gi 



16758444 | ref 
7363445 | ref | 
9757702 j dbj | 
10257390 |gb| 
11415040 ref 



NOV22a 
NOV22b 
NOV22C 



gi 
gi 
gi 
gi 
gi 



16758444 I ref 
7363445 | ref 
9757702 | dbj 
10257390 |gb 
11415040 ref 



NOV22a 
NOV22b 
NOV22c 



gi 
gi 
gi 
gi 
gi 



16758444 | ref 
7363445 | ref | 
9757702 | dbj | 
10257390 |gb| 
11415040 | ref 



HOV22a 



PHLEEEVDRAMAVER 177 
IPHLAEEVDRAMAVEH 177 
'REAAFEKAlSjEliK 168 
)HLVEEA£KVM&EeS 177 
)HLVEEA^VMAEE^ 177 




370 



380 



|....|....|....| 



390 



400 



410 



420 



wpqRfiwffl 40 

wiEwBwB 40 




490 
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NOV22b 
NOV22C. 

gi 
gi 
gi 
gi 
gi 



16758444 I ref 
73 63445 | ref | 
3757702 j dbj j 
10257390 |gb| 
11415040 ref 



NOV22a 
»OV22b 
NOV22C 



gi 
gi 
gi 
gi 
gi 



16758444 I ref 
7363445 | ref 
9757702 jdbj 
10257390 |gb 
11415040 ref 



NOV22a 
NOV22b 
NOV22c 

gi 
gi 
gi 
gi 
gi 



16758444 | ref 
7363445 | ref | 
9757702 | dbj | 
10257390 | gb j 
11415040 | ref 



NOV22a 
HOV22b 
NOV22c 



gi 
gi 
gi 
gi 
gi 



16758444 I ref 
7363445 jref 
9757702 jdbj 
10257390 | gb 
11415040 I ref 



NOV22a 
NOV22b 
NOV22c 



gi 
gi 
gi 
gi 
gi 



16758444 I ref 
7363445 |ref | 
9757702 jdbj | 
10257390 |gb| 
11415040 j ref 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi [7363445 | ref 
gi j 9757702 j dbj 
gi j 10257390 |gb 
gij 11415040 | ref 



NOV22a 
NOV22b 
NOV22C 

gi 1 16758444 | ref 
gij 7363445 | ref | 
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gi|9757702|dbj| 
gij 10257390 |gb| . 
gi | 11415040 | ref 





PLF 










PLF 









845 
855 
855 



Tables 22I-J lists the domain descriptions from DOMAIN analysis results against 
NOV22. This indicates that the NOV22 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 221 Domain Analysis of NOV22 

gnl| Smart |smart00020, Tryp_SPc, Trypsin-like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however , are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO:812) 

CD-Length = 230 residues, 100.0% aligned 
Score = 220 bits (560) , Expect = 9e-59 





NOV22 : 


29 




Sbjct: 


1 


15 


N0V22: 


B8 




Sbjct: 


61 




N0V22: 


148 


20 








Sbjct: 


120 




N0V22: 


200 


25 


Sbjct: 


180 



2 9 RWRGFGAASGEVPWQVSLK- EGSRHFCGATWGDRWLLSAAHCPHSTKVEQVRAHLGM 8 7 

Mill MINI* I Mill +++ 1 1— I— I 111 + ♦ +1 11 + 

RI VGGSEANI GS FPWQVSLQYRGGRHFCGGSLI SPRWVIjTAAHCVYGSAPSS IRVRLGSH 60 



SLLGLGGSVVKI GLRRVVLHPLYNPGI LDFDLAVLELAS PLAFNKYI QPVCLPLAI QKFP 

I + + +I++II III I I+I+I+I+ 1+ + ++I + MI + I 

DLSSGEE -TQTVKVSKVI VHPNYNPSTYDNDI ALLKLSEP VTLSDTVRPI CLPSSGYNVP 



147 



119 



199 



VGRKCMI SGWGNTQEGN LQKASVGIIDQKTCSVLY- -NFSLTDRMICAGFIiEGK 

I I ♦MM II + 11+ +1 1+ II I ++II l + lll III 

AGTTCTVSGWGRTSESSGSLPDTLQEVNVP I VSNATCRRAYSGGPAI TDNMLCAGGLEGG 17 9 



l + llll Mill I + 



I MUM 111+ IIMIM++ II 



Table 22J Domain Analysis of NOV22 

gnl)Pfam|pfam00089, trypsin, Trypsin. Proteins recognized include all 
proteins in families SI, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*). (SKQ ID NO;813) 
CD-Length = 217 residues, 100.0% aligned 
Score m 192 bits (488) , Expect = 2e-50 



NOV22: 


30 


VVRGFGAASGEVPWQVSLKEGSRHFCGATWGDRWLLSAAHCFHSTKVEQVRAHLGTASL 


89 






+ 11 1 +1 IIIIII+ 1 MM +++ + l+l + IMI +1 1 +1 




Sbjct: 


1 


IVGGREAQAGSFPWQVSLQVSSGHFCGGSIiISENWVLTAAHCVSGASSVRVVL — GEHNL 


58 


NOV22 : 


90 


LGLGGS2VKI GLRRWLHPL YNPG I LDFDLAVLELAS PLAFNKY I QPVCLPLAI QKFPVG 


149 






1+ 1 ++++++M III l + l + l + l 11+ ++I + III 1 III 




Sbjct: 


59 


GTTEGTEQKFDVKKI IVHPNYNPD - - TNDI ALLKLKSPVTLGDTVRP I CLPSASSDLPVG 


116 


NOV22: 


150 


RKCMI SGWGNTQEGN LQKASVGI IDQKTCSVLYNFSLTDRMI CAGFLEGKVDSCQG 


205 






1 +1111 1+ 11+ 1 1+ ++II 1 ++M Mill 1 II l+MI 




Sbjct: 


117 


TTCSVSGWGRTKNLGTSDTLQEVWP I VSRETCRS AYGGTVTDTMI CAGALGGK- DACQG 


175 



198 
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NOV22-. 206 DSGGPIACEEAPGVFYIAGIVSWGIGCAQVKKPGVYTRITRLKGWI 251 

nun i ■+ i linn mi ii 

Sbjct: 176 DSGGPLVCSDG EI/VGI VS WG YGCAVGNYPG VYTR VS RYLDWI 217 

5 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 

found in viruses, bacteria and eukaryotes [1] . They include a wide range of peptidase activity, 
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 
families (denoted SI - S27) of serine protease have been identified, these being grouped into 6 
clans on the basis of structural similarity and other functional evidence [1]. 

10 Tryptase is a tetranieric serine protease that is concentrated and stored selectively in 

the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
selective marker of intact mast cells using immunohistochemical techniques with antitryptase 

15 antibodies. Vanderslice [2] demonstrated the existence of multiple tryptases. In this respect, 
mast cell tryptase is like other serine proteases such as glandular kallikrein and trypsin, which 
are also members of multigene families. Miller et al. [3] mapped both alpha-tryptase and beta- 
tryptase to human chromosome 16 by PCR analysis of DNA from human/hamster somatic cell 
hybrids. Miller et al. [3] cloned a second cDNA for human tryptase, called beta-tryptase, from 

20 a mast cell cDNA library. The 1,142 bases of beta-tryptase were found to encode a 30-amino 
acid leader sequence of 3,089 daltons and a 245-amino acid catalytic region of 27,458 daltons. 
The amino acid sequence of beta-tryptase was found to be 90% identical with that of alpha- 
tryptase, the first 20 amino acids of the catalytic portions being 100% identical. Both alpha- 
and beta-tryptase sequences were localized to human chromosome 16 by analysis of DNA 

25 preparations from 25 human/hamster somatic cell hybrids by PCR. 

Because of the presence of the trypsin domains and the homology to the adrenal . 
secretory serine protease, we anticipate that the novel sequence described here will have useful 
properties and functions similar to these genes. 

The disclosed NOV22 nucleic acid of the invention encoding a Adrenal secretory 

30 serine protease -like protein includes the nucleic acid whose sequence is provided in Table 
22A, 25C, 25E or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 22A, 
25C, or 25E while still encoding a protein that maintains its Adrenal secretory serine protease 
-like activities and physiological functions, or a fragment of such a nucleic acid. The 

35 invention further includes nucleic acids whose sequences are complementary to those just 

described, including nucleic acid fragments that are complementary to any of the nucleic acids 
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just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
5 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 36 percent of the 
bases may be so changed. 

The disclosed NOV22 protein of the invention includes the Adrenal secretory serine 

10 protease -like protein whose sequence is provided in Table 22B, 25D, or 25F. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 22B, 25D, or 25F while still encoding a protein that 
maintains its Adrenal secretory serine protease -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 57 percent of the 

1 5 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Adrenal secretory serine protease - 
like protein (NOV22) is a member of a "Adrenal secretory serine protease family". Therefore, 

20 the NOV22 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

25 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV22 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 
transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, autoimmune disease, 

30 hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, allergies, 

immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular dystrophy, 
Lesch-Nyhan syndrome, myasthenia gravis, and/or other diseases and pathologies. 

NOV22 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV22 substances for use in 
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therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV22 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
5 assay systems for functional analysis of various human disorders, which will help in 

understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV23 

NOV23 includes three novel serine protease DESC1 protease-like proteins disclosed 
10 below. The disclosed sequences have been named NOV23a, NOV23b, NOV23c, and 
NOV23d. 

NOV23a 

The disclosed NOV23a nucleic acid of 1546 nucleotides (also referred to as CG56647- 
02) encoding a novel serine protease DESCl-like protein is shown in Table 23A. An open 
15 reading frame was identified beginning with an ATG initiation codon at nucleotides 101-103 
and ending with a TAG codon at nucleotides 148 1-1483. Putative untranslated regions, if any, 
found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 23 A, and the start and stop codons are in bold letters. 



Table 23A. NOV23a Nucleotide Sequence (SEQ ID NO:97) 

GCCCCTGCCATAGGAGGCGGGGACTGTCATTTCACCGTCTCCTGATGCCATTCCAGAGGTTACGCCCTGA 
AGTCAGCTCAGATCCTGGGCGAGGCACTGG&T 

GGAAAACACGGGGGCATCTGGGGCTCACTTGGCACTCATCC^CCTTGTGCTGTACCTGGGGACCTCCGGC 
CTTCCTCTCTACACAGGGCTTCCACGTGGACCACACGGCCGAGCTGCGGGGAATCCGGTGGACCAGCAGT 
TTGCGGCGGGAGACCTCGGACTATCACCGCACGCTGAC^CCCACCCTGGAGGCACTGTTTGTAAGTAGTT 
TT CAG AAGACAGAGTTAG AGG CAAG CTG CGTGGGTTG CT CGGT AC TGAATTAT AGGG ATGGGAACTCCAG 
TGTCCTCGTACATTTCCAGCTGCACTTTCTGCTG 

GAGCTATTGCAGCGAGGGATCCGGGCAAGGCTGCGGGAGCACGGCATCTCCCTGGCTGCCTATGGCACAA 
TTGTGT CGG CTTG AGCTCACAGGT AGACAT AAGGG ACC CTTGG CAG AAAGAGACTTCAAATCAGGT CG CTG 
TCCAGGGAACTCCTTTTCCTGCGGGAACAGCCAGTGTGTGACCAAGGTGAACCCGGAGTGTGACGACCAG 
iSAGGACTGCTCCGATGGGTCCGACGAGGCGCACTGCGAGTGTGGCTTGCAGCCTGCCTGGAGGATGGCCG 
GCAGGATCGTGGGCGGCATGGAAGCATCCCCGGGGGAGTTT CCGTGGCAAGCCAGCCTT CGAGAGAACAA 
GGAGCACTTCTGTGGGGCCGCCATCATCAACGCCAGGTGGCTGGTGTCTG CTG CTCACTGCTT CAATGAG 
TTCCAAGACCCGACGAAGTGGGTGGCCTACGTGGGTGCGACCTACCTCAGCGGCTCGGAGGCCAGCACCG 
TGCGGGCCCAGGTGGTCCAGATCGTC^GC^CCCCCTGTAC^CGCGGACACGGCCGACITrGACGTGGC 
TOTG^GGAGCTGACCAGCCCTCTGCCTTTCGGCCGGCACATC 

AGCC^GAGGTGCTGCAGAAAGCCACTGTGGAGCTGCTGGACCAGGCACTGTGTGCCAGCTTGTACGGCCA 

TTCACTCACTGACAGGATGGTGTGCGCTGGCTACCTGGACGGGAAGGTGGACTCCTGCCAGGGTGACTCA 

GGAGGACCCCTGGTCTGCGAGGAGCCCTCTGGCCGGTTCTTTCTGGCTGGCATCGTGAGCTGGGGAATCG 

GGTGTGCGGAAGCCCGGCGTCCAGGGGTCTATGCCCGAGTCACCAGGCTACGTGACTGGATCCTGGAGGC 

CACQgAAAGGTA GAAGATGATGTACGTGCCTATCTTGATTTAGGGAGA^ 

CATAAT " _— : 
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The disclosed NOV23a nucleic acid sequence, located on chromosome 19, has 356 of 
566 bases (62%) identical to a gb:GENBANK-ID:AF133086|acc:AF133086.1 mRNA from 
Homo sapiens (membrane-type serine protease 1 mRNA, complete cds) (E - l.le" 23 ) .' 

A disclosed NOV23a polypeptide (SEQ ID NO:98) encoded by SEQ ID NO:97 is 460 
5 amino acid residues and is presented using the one-letter amino acid code in Table 23B. 

Signal P, Psort and/or Hydropathy results predict that NOV23a contains no signal peptide and 
is likely to be localized in the microbody (peroxisome) with a certainty of 0.5387. 

Table 23B. Encoded NOV23a protein sequence (SEQ ID NO:98). 

MGDRHEQDLFLPSRKTRGHLGLTWHSSTLC 
LEALFVSSFQKTELEASCVGCSVLNYRI^SS 

YGTIVSAELTGRHKGPLAERDFKSGRCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHCECGLQPAWRMAGR 
IVGGMEASPGEFPWQASLRENKEHFCGAAIINARWLVSAAH 

IVKHPLYNADTADFDVAVLELTS PLPFGRHIQPVCLPAATHI FPPSKKCL I SGWGYLKEDFWKPEVLQKATVEL 
LDQALCAS LYGHS LTDRMVCAGYLDGKVDSCQGDSGGPLVCEE PSGR FFLAGI VS WGIGCAEARRPGVYARVTRL 
RDWILEATER 

The disclosed NOV23a amino acid sequence has 1 12 of 248 amino acid residues 
1 0 (45%) identical to, and 157 of 248 amino acid residues (63%) similar to, the 422 amino acid 
residue ptnr:SPTREMBL-ACC:Q9UL52 protein from Homo sapiens (Human) (serine 
protease DESC1) (E = Lie 58 ). 

NOV23a is predicted to be expressed in at least Ovary, kidney, breast, lung, muscle, 
liver, spleen, blood and lymphocyte. This information was derived by determining the tissue 
1 5 sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, and/or RACE sources. 

NOV23b 

A disclosed NOV23b nucleic acid of 1777 nucleotides (also referred to as CG56647- 
03) encoding a novel serine protease DESCl-like protein is shown in Table 23C. An open 
20 reading frame was identified beginning with an ATG initiation codon at nucleotides 1 01 -103 
and ending with a TAG codon at nucleotides 1631-1633. Putative untranslated regions, if any, 
found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 23C, and the start and stop codons are in bold letters. 

Table 23C NOV23b Nucleotide Sequence (SEQ ID NO:99) 

GCCCCTGCCATAGGAGGCGGGGACTGTCATTTCACCGTCTCCTGATGCC^TTCCAGAGGTTACGCCCTGA 
AGTCAGCTCAGATCCTGGGCCAGGCACTGCATGGGAGAC&G 
GGAAAACACGGGGGCATCTGGGGCTC^CTTGGCACTCATCC 
CTTCCTCTCTACACAGGGCTTCCACGTGGACCACA 

TTGCGGCGGGAGACCTCGGACTATCACCGCACGCTGACGCCCACCCTGGAGGCACTGTTTGTAAGTAGTT 
TT CAGAAGAC AGAGT TAGAGGCAAG CTGCGTGGGTTG CT CGGTACTGAATTATAGGGATGGGAACT C CAG 
TGTCCTCGTACATTTCCAGCTGCACTTTCTGCT 

GAGCTATTGCAGCGAGGGATCCGGGCAAGGCTGCGGGAGCACGGCATCT 



202 



WO 02/068649 



PCT/US02/02785 



TTGTGTCGGCTGAGCTCACAGGGAGACATAAGGGACCCTTGGCAGAAAGAGACTTCAAATCAGGCCGCTG 
TC CAGGGAACTCCTTTTCCTGCGGGAACAGC CAGTGTGTGACCAAGGTGAACCCGG AGTGTGACGACCAG 
GAGGACTGCTCCGATGGGTCCGACGAGGCGCACTGCGAGTGTGGCTTGCAGCCTGCCTGGAGGATGGCCG 
GCAGGATCGTGGGCGGCATGGAAGCATCCCCGGGGGAGTTTCCGTGGCAAGCCAGCCTTCGAGAGAACAA 
GGAGCACTTCTGTGGGGCCGCCATCATCAACGCCAGGTGGCTGGTGTCTGCTGCTCACTGCTTCAATGAG 
TTCCAAGACCCGACGAAGTGGGTGGCCTACGTGGGTGCGACCTACCTCAGCGGCTCGGAGGCCAGCACCG 
TGCGGGCCCAGGTGGTCCAGATCGTCAAGCACCCCCTGTACAACGCGGACACGGCCGACTTTGACGTGGC 
TGTGCTGGAGCTGACCAGCCCTCTGC CTTTCGGCCGGCACATCCAGCCCGTGTG CCTCCCGGCTGCCACA 
CACATCTTCCCACCCAGCAAGAAGTGCCTGATCTCAGGCTGGGGCTACGTGCTGCAGAAAGCCACTGTGG 
AGCTG CTGGACCAGGCACTGTGTGCCAGCTTGTACGGCCATTCACTCACTGACAGGATGGTGTG CGCTGG 
CTACCTGGACGGGAAGGTGGACTCCTGCCAGGGTGACTCAGGAGGACCCCTGGTCTGCGAGGAGCCCTCT 
GGCCGGTTGTTTCTGGCTGGCATCGTGAGCTGGGGAATCGGGTGTGCGGAAGCCCGGCATCCAGGGGTCT 
ATGCCCGAGTCACCAGGCTACGCGACTGGATCCTGGAGGCCACCACCAAAGCCAGCATGCCTCTGGCCCC 
CACCATGGCTCCTGCCCCTGCCGCCCCCAGCACAGCCTGGCCCACCAGTCCTGAGAGCCCTGTGGTCAGC 
ACCCCCACCAAATCGATGCAGGCCCTCAGTACCGTGCCTCTTGACTGGGTCACCGTTCCTAAGCTACAAG 
GTATTTTCGGGGCAGAAAGGTA GAAGATGATGTACGTGCCTATCTTGATTTAGGGAGAACGGATATCGTC 
ATAGTATCTTCATAATTTTGGATCTTCCTGTTCAAGGAAAGGTCACATGTGTATCCGTTTA TTCCCATCT 
TACGTTGCGTGTACCCTCATGGTATCT 



The disclosed NOV23b nucleic acid sequence, located on chromosome 19, has 208 of 
327 bases (63%) identical to a gb:GENBANK-ID:AF098327|acc:AF098327.1 mRNA from 
Homo sapiens (putative mast cell mMCP-7-like II typtase gene, complete cds) (E = 2.8e~ 14 ) . 
5 A disclosed NOV23b polypeptide (SEQ ID NO: 1 00) encoded by SEQ ID NO:99 is 

5 1 0 amino acid residues and is presented using the one-letter amino acid code in Table 23D. 
Signal P, Psort and/or Hydropathy results predict that NOV23b contains no signal peptide and 
is likely to be localized in the microbody (peroxisome) with a certainty of 0.5131. 



Table 23D. Encoded NOV23b protein sequence (SEQ ID NO:100). 

MGDRHEQDLFLPS RKTRGHLGLTWHS STLCCTWGPPAFLSTQGFHVDHTAELRGI RWTS SLRRETSDYHRTLTPT 

LEALFVSSFQKTELEASCVGCSVLNYRDGNSSVLVHFQ 

YGTIVSAELTGRHKGPIAERDFKSGRCPGNSFSCGNSQCVTKVNPEOT 

IVGGMEASPGEFPWQASLRENKEHFCGAAI INARWLVSAAHCFNE FQDPTKWVAYVGATYLSGSEASTVRAQWQ 
I VKHPLYNADTADFDVAVLELTS PLPFGRHIQPVCLPAATHI FPPSKKCLI SGWGYVLQKATVELLDQALCASLY 
GHSLTDRMVCAGYLDGKVDSCQGDSGGPLVCEEP SGRLFIiAGI VS WG I GCAEARHPGVYARVTRLRDWILEATTK 
ASMPLAPTMAPAPAAPSTAWPTSPESPWSTPTK SMQALSTVPLDWVTVPKLQGIFGAER 

io ; 1 

The disclosed NOV23b amino acid sequence has 109 of 246 amino acid residues 
(44%) identical to, and 152 of 246 amino acid residues (61%) similar to, the 422 amino acid 
residue ptnr:SPTREMBL-ACC:Q9UL52 protein from Homo sapiens (Human) (serine 
protease DESC1) (E = L3e" 55 ). 

1 5 NOV23b is predicted to be expressed in at least the following tissues: adrenal gland, 

bone marrow, brain-amygdala, brain-cerebellum, brain-hippocampus, brain-substantia nigra, 
brain-thalamus, brain-whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma-Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, 
skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and 

20 uterus. Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of the NOV23b sequence. 
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NOV23c 

A disclosed NOV23c nucleic acid of 815 nucleotides (also referred to as CG56647-01) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 23E. An 
open reading frame was identified beginning with a GGT initiation codon at nucleotides 1-3 
5 and ending with a TAA codon at nucleotides 787-789. The start and stop codons are shown in 
bold in Table 23E, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon of NOV23c is not a traditional initiation codon, NOV23c could be a partial reading 
frame that extends further in the 5 ' direction. 



Table 23E. NOV23c nucleotide sequence (SEQ ID NOrlOl). 

GGTCCTGCCTTCGTGGOTTCATGGCTGGTGACCTGCTGTCTTGCAGAGTGTGGCTTGCAGCCTGCCTGGAGG 
ATGGCCGGCAGGATCGTCGGCGGCATGGAAGCATCCCCGGGGGAGTTTCCGTGGCAAGCCAGCCTTCGAGAG 
AACAAGGAGCACTTCTGTGGGGCCGCCATCATCAACGCCAGGTGGCT 

GAGTTCCAAGACCCGACGAAGTGGGTGGCCTACGTGGGTGCGACCTACCTCAGCGGCTCGGAGGCCAGCACC 
GTG(^GCCCAGGTGGTCC^GATCGTCAAGCACCCCCTGTACAACGCGGACACGGCCGACTTTGACGTGGCT 
GTGCTGGAGCTGACCAGCCCTCTGCCTTTCGGCCGGCACATCCAGCCCGTGTGCCTCCCGGCTGCCACA 
ATCTTCCCACCCAG CAAGAAGTGCCTGATCTCAGGGTGGGG CTACCTCAAGGAGGACTTCCGTAAGC ATCTT 
CCTCTGCAGAAAGCCACTGTGGAGCTGCTGGACCAGGCACTGTGTGC 

GACAGGATGGTGTGCGCTGGCTACCTGGACGGGAAGGTGGACTCCTGCCAGGGTGACTCAGGAGGACCCCTG 
GTCTGCGAGGAGCCCTCTGGCCGGTTCTTTCTGGCTGGCATCGTGAGCTGGGGAATCGGGTGTGCGGAAGCC 
CGGCGTCCAGGGGTCTATGCCCGAGTCACCAGGCTACGTGACTGGATCCTGGAGGCCACCCGTTCCTAAGCT 
ACAAGGTATTTTCGGGGCAGAAA 



10 

In a search of public sequence databases, the NOV23c nucleic acid sequence, located 
on chromosome 19, has 350 of 564 bases (62%) identical to a gb:GENBANK- 
ED:E132G4|acc:E132G4.1 mRNA from Homo sapiens (Human eDNA encoding a serine 
protease) (E = 3. 2e" 26 ). 

15 The disclosed NOV23c polypeptide (SEQ ID NO:102) encoded by SEQ ID NOtlOl 

has 262 amino acid residues and is presented in Table 23F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV23c has no signal peptide 
and is likely to be localized extracellularly with a certainty of 03750. Alternatively, NOV23c 
may also localize to the microbody (peroxisome) with a certainty of 0.1391, to the 

20 endoplasmic reticulum (membrane) with a certainty of 0. 1 000, or the endoplasmic reticulum 
(lumen) with a certainty of 0.1000. The most likely cleavage site for NOV23c is between 
positions 15 and 16: CLA-EC. 



Table 23F. Encoded NOV23c protein sequence (SEQ ID NO:102). 

"gpafvgswiot 

EFQDPTKWAYVGATYX^GSEASTVRAQVVQIVKHPLYNADTADFDVAVLELTSPLPFGRHIQPVCIiPAATH 
IFPPSKKCLISGWGYLKEDFRKHLPLQKATVELLDQ 

VCEEPSGRFFLAG IVS WGIGCAEARRPGVYARVTRIiRDWILE ATRS 
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A search of sequence databases reveals that the NOV23c amino acid sequence has 1 14 
of 248 amino acid residues (45%) identical to, and 152 of 248 amino acid residues (61%) 
similar to, the 273 amino acid residue ptnr:TREMBLNEW-ACC:BAB20278 protein from Mus 
5 musculus (Mouse) (Type 1 Spinesin) (E = 1.1 e~ 53 ). 

NOV23c is predicted to be expressed in at least the following tissues: Ovary, kidney, 
breast, lung, muscle, liver, spleen, blood, lymphocyte. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
10 sources. 

The disclosed NOV23a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 23G. 



Table 23G. BLAST results for NOV23 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 

(%.) 


Expect 


gi| 12836503 |dbj| BAB 
23684. l| (AK004939) 


data source :SPTR, 
source 
key:0955l9, 
evidence : ISS-homo 
log to DJ1170K4.4 
(NOVEL PROTEIN) 
(FRAGMENT) -put at i 
ve [Mus musculus] 


799 


136/280 
(48%) 


180/280 
(63%) 


le-75 


gx | 16758444 |ref |NP_ 
446087. l| 
(NMJD53635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) [Rattus 
norvegicus] 


855 


133/289 
(46%) 


182/289 
(62%) 


3e-72 


gi 1 10257390 jgb | AAGl 
5395.l|AF057145 1 
(AF057145) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


132/289 
(45%) 


185/289 
(63%) 


3e-72 


gi 1 11415040 |ref |np 
068813. l| 
(NM_021978) 


suppression of 
tumorigenicity 14 

(colon carcinoma, 
matriptase , 
epithin) 
suppression of 
tumorigenicity 14 

(colon 
carcinoma) ; 
matriptase [Homo 
sapiens] 


855 


132/289 
(45%) 


185/289 
(63%) 


3e-72 


gi| 12249015 |dbj| BAB 
20376. l| (AB030036) 


prostamin [Homo 
sapiens] 


855 


131/289 
(45%) 


184/289 
(63%) 


9e-72 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 23H. In the ClustalW alignment of the NOV23 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
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regions of conserved sequence (z.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 23H. Clustal W Analysis of NO V23 

1) Novel N0V23a {SEQ ID NO: 98) 

2) Novel NOV23b (SEQ ID NO:100) 
2) Novel N0V23C (SEQ ID NO:102) 

10 4) gi | 12836503 |dbj |BAB23684.l| (AX004939) data source :SPTR, source key: 095 519, 
evidence : ISS-homolog to DJ1170K4.4 (NOVEL PROTEIN) (FRAGMENT) -putative [Mus 
musculus] (SEQ ID NO: 416) 

5) gi 1 16758444 | ref |NP_446087 . 1 | (NM_053635) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) [Rattus norvegicus] (SEQ ID NO:417) 
15 6) gi|l0257390|gb|AAG15395.l|AF057145_l (AF057145) serine protease TADG15 [Homo 
sapiens] (SEQ ID NO:4l8) 

7) gi 1 11415040 I ref |NP_068813 . 1 | (NM_021978) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin); suppression of tumorigenicity 14 (colon carcinoma); 
matriptase {Homo sapiens] (SEQ ID NO:419) 
20 8) gi| 12249015 |dbj|BAB20376.l| (AB030036) prostamin [Homo sapiens] (SEQ ID NO: 420) 



10 



25 



30 



35 



40 



45 



50 



55 



60 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



12836503 
16758444 
10257390 
11415040 
12249015 



NOV27a 
NOV27b 
NOV27C 

gi 
gi 
gi 
gi 
gi 



12836503 | 
16758444| 
10257390 | 
11415040 | 
12249015 I 



1 
1 
1 
1 
1 
1 
1 
1 



23 
23 
1 



NOV27a 


40 


NOV27b 


40 


NOV27c 


1 


gi 


12836503 


107 


gi 


16758444 


119 


gi 


10257390 


119 


gi 


11415040 


119 


gi 


12249015 


119 



20 

L 



30 



40 



50 



|....|. 




I-.. I. 



60 



mpttevpqaadgqgd 
mgnnrgrkagggsqd: 
mgsdrarkggggpkdi 
mgsdrarkggggpkd1 
mgsdrarkggggpkd: 



RKTRGHEG^- 23 

RKTRGHjjGgT - 23 

1 

NTKRKNRDY^RFTP 47 




4 6 LLL - VLAAL VS AGVMI 

6 1 WFSFLLLS LMAGLL\i 

61 VLIGLLLVLLGIGFLV 

61 VLIGLLLVLLGIGFLv 

6 1 VLIGLLLVLLGIGFLV 

130 



SfiFLGYKAETOjVSQV^SGSLRVLNRHFSQDLGRRES I 
f FHYR- -NVR^QKVlFNGHLRITNENFLDAyENSTSTE 

LQYR- -DVT^QKV^GYMRITNENFVDAYENSNSTE 
SLQYR- -DVl^QKV^GYMRITNENFVDAYENSNSTE 

LQYR- -DV^QKV^GYMRITNENFVDAYENSNST] 



140 



150 



160 



170 




--WIRWTS 
--HIRWTS 



180 



lrretsdShr- tltp - t; 
lrretsd§hr-tltp 



NOV27a 


81 


NOV27b 


81 


NOV27C 


1 


gi| 12836503) 


166 


gij 16758444] 


179 




,F 80 
»F 80 
1 



IVYSFGEGPLTCFFWFILDIPE^QRLTLSPEWRELL 165 
•VTAFSEGSVIAYYWSEFSIPPgLEEEVDRAMAVERV 178 
OTAFSEGSVIAYYWSEFSIPQ&VEEAERTOAgERV 178 
OTAFSEGSVIAYYWSEFSIPC^VEEAERVWaSrv 178 
kVTAFSEGS VI AYYWSEFS I PQBLVT2EAER VMAgERV 178 



220 



230 



240 



-YR- 
-YR- 



102 

102 

- 1 

DIWLNSTLGCYRYSYVNPGQVLPLKG 225 
CQDNS CS FALHARGRTVTRFTTPG - - FPNS 23 6 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



10257390 


179 




11415040 


179 


I 


12249015 


179 





gi 
gi 



NOV27a 
NOV27b 
NOV27C 
gi | 12836503 
gi 1 1 16758444 
gi | 10257390 
gi ) 11415040 
gi 1 12249015 



NOV27a 


NOV27b 


NOV27C 


gi 


12836503 


gi 


16758444 


gi 


10257390 


gi 


11415040 


gi 


12249015 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 



12836503) 
16758444) 
10257390 | 
11415040 | 
12249015 I 




N0V27a 


NOV27b 


NOV27C 


gi 


12836503 


gi 


16758444 


gi 


10257390 


gi 


11415040 


gi 


12249015 



'S WAFPTDSKT^QRTQDNS CS FGLHARGVELMRFTTPG- - FPDS 23 6 
ISWAFPTDSKTVQRJQDNSCSFGLHARGVELMRFTTPG- - FPDS 23 6 
" WAFPTDSKTjVgR3!QDNS CS FGLHARGVELMRFTTPG- - FPDS 236 



250 



260 



270 



102 - -DGNSSVLVHFQ- 
102 --DGtTSSVLVHFjg- 



I.... I.... I 



280 



290 



|....| 



300 



113 
113 
1 



226 PDQQTTS CLWHL@GPEDLM I KVRLEWTRVDCRDR VAMYD AAG PLEKRL ITS VYG C 280 

237 PYPA^CQWVLRGDADSVLS^ 2 96 
237 PYPA$DfecQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYOT 296 
237. PYPAI3AR CQWALRGDADS VLSLTFRSFDLAS CDERGSDL VTVYNTLSPMEPHALVQLCGT 296 
237 P YPAHfiiRCQWAligGDADS VLS LTFRS FDLAS CDERGSDLVTVYNTLS PMEPHALVQLCGT 296 



310 



320 



330 



340 



350 



360 



113 
113 
1 



iLRP- 



-LQ 122 
-LQ 122 



281 SRQEPVMEVLASGSVMAWWKKGMHSYYDP 
297 FSPSYNLTFLSSQNVFLVTLITNTDRRHPG 
297 YPPSYNLTFHSSQNVLLITLITNTERRHPl 
297 YPPSYNLTFHSSQNVLLITLITNTERRHPl 
297 YPPSYNLTFHSSQNVLLITLITNTERRHPG 



370 



380 



390 



LLSVKSVAFQDCQVNLTLEGRLDTQGFLR 340 

&TFFQLPKMSSCGG LLSEAQGTFS 352 

&TFFQLPRMSSCGG RLRKAQGTFN 352 

SATFPQLPRMSSCGG RLRKAQGTFN 352 

2ATFFQLPRMSSCGG RLRKAQGTFN 352 



400 



NOV27a 


123 


N0V27b 


123 


NOV27C 


1 


gi 


12836503 


341 


gi 


16758444 


353 


gi 


10257390 


353 


gi 


11415040 


353 


gi 


12249015 


353 




§pyypghyppnidctwnievpnnqhvkvrfkf: 
Ipyypghyppnidctwnievpnnqhvkvrfkf: 




iqkynrlctqgqwmiqnrrlcgf 400 
dpn^pvgsctkdyveingekfcge 412 
™ IpSgtcpkdyveingekycge 412 
ptcpkdyve ingekycge 412 
Igtcpkdyveingekycge 412 



145 
145 
1 



430 



440 



450 



460 



470 



480 



Svsaelt- 

§VSAELT- 



160 
160 



4 01 RTLQP YAERI PMVASDGVT iNFTSQi 
413 RS QFWSSNSSKITVHFHSD] 

413 RS qfwtsnsnkitvrfhs: 

413 RS QFWTSNSNKITVRFHSDi 

413 RS qfwtsnsnkitvrfhs: 




- 1 

QVYYSLYNQSDPCPGEFLCSV 454 

|F|iAEYLSYDSNDPCPGMFMCKTGRCIRK 467 
LSYDSSDPCPGQFTCRTGRCIRK 467 
YDSSDPCPGQFTCRTGRCIRK 467 
LSYDSSDPCPGQFTCRTGRCIRK 467 



490 



NOV27a 


160 


NOV27b 


160 


NOV27C 


1 


gi 


12836503 


454 


gi 


16758444 


468 


gi 


10257390 


468 


gi 


11415040 


468 


gi 


12249015 


468 



500 
..|.. 



510 



530 



520 

-I — _i — I 

GRHKGP-- 

grSbkgp-- 



540 



NGLCVPACDGIKDCPNGLD^NL - 

DLRO)GWADCPDYSDERHCROTATHQmCKNQFCKPLFWVCT>SVNDCGDGSD ' EGCSCR 



170 
170 
1 

CVCRA 481 
»A 527 
CSCPA 527 
CSCPA 527 
CSCPA 527 




482 MFQCQEDSTCISLPRVCDRQP 
528 GSFKCSNGKCLPQSQQCNGKD 
528 QTFRCSNGKCLSKSQQCNGKD 
528 QTFRCSNGKCLSKSQQCNGKD 
528 QTFRCSNGKCLSKSQQCNG] 
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610 



620 



10 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



12836503) 
16758444 | 
10257390| 
11415040 | 
12249015 | 




QPAWRMAGj 
QfPAWRMAGj 
afPAWRMAGj 

2--£glss 

R-jSETKQj 
R-£ETRQ, 




670 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



12836503 
16758444 
10257390 
11415040 
12249015 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



12836503 
16758444 
10257390 
11415040 
12249015 



NOV27a 
HOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



12836503 
16758444 
10257390 
11415040 
12249015 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



12636503 
16758444 
10257390 
11415040 
12249015 




680 
---EQ5 

— HQ 

|Q 

|QEDS MAS 

|QDETIFKj|S 
XiPDRGFRSjs 

i|drgfr|cs 

IDDRGFR^Sg 



690 



700 



710 



720 




730 



740 



HOV27a 


371 


NOV27b 


361 


NOV27C 


173 


gi 


12836503 


711 


gi 


16758444 


766 


gi 


10257390 


766 


gi 


11415040 


766 


gi 


12249015 


766 




850 



860 



870 



B80 



890 




900 
••I 



460 

KASMPLAPTMAPAPAAPSTAWPTSPESPVVS 480 

262 

JQQVLT 799 

855 
855 
855 
855 



910 



920 



930 



460 460 

481 TPTKSMQALSTVPLDWTVPKLQGIFGAER 510 

262 262 

799 799 

855 855 

855 855 

855 - - 855 

855 - 855 
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Tables 23I-L list the domain descriptions from DOMAIN analysis results against 
NOV23. This indicates that the NOV23 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 231 Domain Analysis of NOV23a 

gnl|Smart [smart 00 020, Tryp_SPc, Trypsin-like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO:812) 

CD-Length = 230 residues, 100.0% aligned 
Score - 269 bits (687), Expect = 3e-73 



5 





N0V23 : 


225 




Sbjct: 


1 


10 


NOV23 : 


284 




Sbjct: 


60 




N0V23 : 


344 


15 








Sbjct: 


119 




NOV23 : 


402 


20 


Sbjct: 


179 



RI VGGMEASPGEFPWQASLR- ENKEHFCGAAI INARWLVSAAHCFNEFQDPTKWVAYVGA 

Kill 11+ I 1 1 1 1 11+ Mil ++I+ II+++IIII 1+ +1+ 

RIVGGSEANIGSFPWQVSLQYRGGRHFCGGSLIS PRVTVLTAAHCVYGSA- PSSIRVRLGS 



i+iiimiiiii 



i+ 1 Mini in +IM) n+ in 



283 



59 



343 



TYLSGSEASTVRAQWQIVKHPLYNADTADFDVAVI^LTSPLPFGI^IQPVCLPAATHIF 

111+ +1 +++ II II II I+I+I+I+ 1+ ++I+III++ + 

HDLSSGEETQTV-KVSKVIVHPiraJPSTYDNDIALLKLSEPvTLSDTVRPICLPSSGYNV 118 



344 PPSKKCLI SGWG YLKEDFWKPE VLQ KATVELLDQALCAS LY - - GHSLTDRMVCAG YLDG 401 

I +1111 I 1+ 11+ I ++ I I II ++II l+lll |+l 

119 PAGTTCTVSGWGRTSESSGSLPDTLQEVNVPIVSNATCRRAYSGGPAITDNMLCAGGLEG 178 



Table 23 J Domain Analysis of NOV23a 

gnl)Pfam|pfam00089, trypsin, Trypsin. Proteins recognized include all 
proteins in families 31, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*). (SEQ ID NO:813) 
CD-Length = 217 residues, 100.0% aligned 
Score = 223 bits (568) , Expect * 2e-59 



25 



30 



35 



NOV27: 


226 


Sbjct: 


1 


NOV27 : 


286 


Sbjct: 


58 


NOV27 : 


346 


Sbjct: 


116 


NOV27: 


406 


Sbjct: 


173 



226 IVGGMEASPGEFPWQASLRENKEHFCGAAIINARWLVSAAHCFNEFQDPTKWVAYVGATY 285 

llll II I till 11+ + Mil ++I + I+++IIII + + +1 

IVGGREAQAGSFPWQVSLQVSSGHFCGGSI.ISENWVLTAAHCVS GASSVRWLGEHN 5 7 



I +1 ♦ ♦ 1+1+ II II II l+l+l+l 11+ I ++I+III+I+ I 
LGTTEGTEQKFDVKKI IVHPNYNPDT - -NDIALLKLKSPVTLGDTVRP ICLPSASSDLPV 115 



lllllllllll + 



I I II ++II l+lll I II 1+ 



.1 + 1 
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Table 23K Domain Analysis of NOV23b 

gnl | Smart j smart o 0192, LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N- terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2-macroglobulin receptor, and in ■ 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 814) 
CD-Length « 38 residues, 100.0% aligned 
Score * 50.4 bits (119), Expect = 2e-07 



NOV23: 176 RCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHCEC 214 

ii ii i +i+ ii +ii iiiii +i 

Sbjct: 1 TCPPGEPQCKNGRCIPLSWV-CDGVDDCGDGSDEENCPS 38 



Table 23L Domain Analysis of NOV23b 



gnl|Pfam|pfam00057, ldl_recept_a , Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 815) 
CD-Length = 39 residues, 94.9% aligned 
Score = 43.5 bits (101), Expect = 3e-05 



10 



15 



20 



25 



N0V23: 175 GRCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHC 212 

III 11+ +h II II IIIII +| 

Sbjct: 1 STCGPNEFQCGSGECIPMSW-VCDGDPDCEDGSDEKNC 37 

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 
found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, 
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 
families (denoted SI - S27) of serine protease have been identified, these being grouped into 6 
clans on the basis of structural similarity and other functional evidence. 

Tryptase is a tetrameric serine protease that is concentrated and stored selectively in 
the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
selective marker of intact mast cells using immunohistochemical techniques with antitryptase 
antibodies. Vanderslice demonstrated the existence of multiple tryptases. In this respect, mast 
cell tryptase is like other serine proteases such as glandular kallikrein and trypsin, which are 
also members of multigene families. Miller et al. mapped both alpha-tryptase and beta-tryptase 
to human chromosome 16 by PCR analysis of DNA from human/hamster somatic cell hybrids. 
Miller et al. cloned a second cDNA for human tryptase, called beta-tryptase, from a mast cell 
cDNA library. The 1,142 bases of beta-tryptase were found to encode a 30-amino acid leader 
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sequence of 3,089 daltons and a 245-amino acid catalytic region of 27,458 daltons. The amino 
acid sequence of beta-tryptase was found to be 90% identical with that of alpha-tryptase, the 
first 20 amino acids of the catalytic portions being 100% identical. Both alpha- and beta- 
tryptase sequences were localized to human chromosome 16 by analysis of DNA preparations 
5 from 25 human/hamster somatic cell hybrids by PCR. 

Because of the presence of the trypsin domains and the homology to the adrenal 
secretory serine protease, it is anticipated that the novel sequences described here will have 
useful properties and functions similar to these proteins. 

The disclosed NOV23 nucleic acid of the invention encoding an Adrenal secretory 

10 serine protease -like protein includes the nucleic acids whose sequences are provided in Tables 
23A, 23C, 23E or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 23 A, 
23c, or 23E while still encoding a protein that maintains its Adrenal secretory serine protease - 
like activities and physiological functions, or a fragment of such a nucleic acid. The invention / 

15 further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

20 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used^for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 37 percent of the 
bases may be so changed. 

25 The disclosed NOV23 protein of the invention includes the Adrenal secretory serine 

protease -like protein whose sequence are provided in Table 23B, 23D, or 23F. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 23B, 23D, or 23F while still encoding a protein that 
maintains its Adrenal secretory serine protease -like activities and physiological functions, or a 

30 functional fragment thereof. In the mutant or variant protein, up to about 55 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as or 
(Fab)^ that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that these Adrenal secretory serine protease - 
like proteins (NOV23) is a member of a "Adrenal secretory serine protease family". 
Therefore, the NOV23 nucleic acids and proteins identified here may be useful in potential 
therapeutic applications implicated in (but not limited to) various pathologies and disorders as 
5 indicated below. The potential therapeutic applications for this invention include, but are not 
limited to: protein therapeutic, small molecule drug target, antibody target (therapeutic, 
diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene 
therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of 
all tissues and cell types composing (but not limited to) those defined here. 

10 The NOV23 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 
transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, autoimmune disease, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, allergies, 
immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular dystrophy, 

1 5 Lesch-Nyhan syndrome, myasthenia gravis, and/or other diseases and pathologies. 

NOV23 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV23 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

20 NOVX Antibodies" section below. The disclosed NOV23 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

25 NOV24 

NOV24 includes two novel parchorin-like proteins disclosed below. The disclosed 
sequences have been named NOV24a and NOV24b. 
NOV24a 

A disclosed NOV24a nucleic acid of 2091 nucleotides (also referred to as CG56455- 
30 01) encoding a novel parchorin-like protein is shown in Table 24A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 7-9 and ending with a 
TGA codon at nucleotides 2080-2082. The start and stop codons are shown in bold in Table 
24 A, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 24A- NOV24a nucleotide sequence (SEQ ID NO:103). 

GCGGCCA TGGCCGAGGCCGCGGAGCCGGAGGGGGTTGCCCCGGGTCCCCAGGGGCCGCCGGAGGTCCCCGCG 
CCTCTGGCTGAGAGACCCGGAGAGCCAGGAGCCGCGGGCGGGGAGGCAGAAGGGCCGGAGGGGAGCGAGGGC 
GCAGAGGAGGCGCCGAGGGGCGCCGCCGCTGTGAAGGAGGCAGGAGGCGGCGGGCCAGACAGGGGCCCGGAG 
GCCGAGGCGCGGGG CACGAGGGGGG CGCACGGCGAGACTGAGGCCGAGGAGGGAGCCCCGGAGGGTGCCGAG 
GTGCCCCAAGGAGGGGAGGAGACAAGCGGCGCGCAGCAGGTGGAGGGGGCGAGCCCGGGACGCGGCGCGCAG 
GGCGAGCCCCGCGGGGAGGCTCAGAGGGAGCCCGAGGACTCTGCGGCCCCCGAGAGGCAGGAGGAGGCGGAG 
CAGAGGCCTGAGGTCCCGGAAGGTAGCGCGTCCGGGGAGGCGGGGGACAGCGTAGACGCGGAGGGCCCGCTG 
GGGGACAACATAGAAGCGGAGGGCCCGGCGGGCGACAGCGTAGAGGCGGAGGGCCGGGTGGGGGACAGCGTA 
GACGCGGAAGGTCCGGCGGGGGACAGCGTAGACGCGGAGGGCCCGCTGGGGGACAACATACAAGCCGAGGGC 
CCGGCGGGGGACAGCGTAGACGCGGAGGGCCGGGTGGGGGACAGCGTAGACGCGGAAGGTCCGGCGGGGGAC 
AGCGTAGACGCGGAGGGCCGGGTGGGGGACAGCGTAGAGGCGGGGGACCCGGCGGGGGACGGCGTAGAAGCG 
GGGGTCCCGGCGGGGGACAGCGTAGAAGCCGAAGGCCCGGCGGGGGACAGCATGGACGCCGAGGGTCCGGCA 
GGAAGGGCGCGCCGGGTCTCGGGTGAGCCGCAGCAATCGGGGGACGGCAGCCTCTCGCCCCAGGCCGAGGCA 
ATTGAGGTCGCAGCCGGGGAGAGTGCGGGGCGCAGCCCCGGTGAGCTCGCCTGGGACGCAGCGGAGGAGGCG 
GAGGTCCCGGGGGTAAAGGGGTCCGAAGAAGCGGCCCCCGGGGACGCAAGGGCAGACGCTGGCGAGGACAGG 
GTAGGGGATGGGCCACAGCAGGAGCCGGGGGAGGACGAAGAGAGACGAGAGCGGAGCCCGGAGGGGCCAAGG 
GAGGAGGAAGCAGCGGGGGGCGAAGAGGAATCCCCCGACAGCAGCCCACATGGGGAGGCCTCCAGGGGCGCC 
GCGGAGCCTGAGGCC CAGCTCAGCAACCAC CTGGCCGAGGAGGGCC CCGCCGAGGGTAGCGGCGAGGCCGCG 
CGCGTGAACGGCCGCCGGGAGGACGGAGAGGCGTCCGAGCCCCGGGCCCTGGGGCAGGAGCACGACATCACC 
CTCTTCGTCAAGGCTGGTTATGATGGTGAGAGTATCGGAAATTGCCCGTTTTCTCAGCGTCTCTTTATGATT 
CTCTGGCTGAAAGGCGTTATATTTAATGTGACCACAGTGGACCTGAAAAGGAAACCCGCAGACCTGCAGAAC 
CTGGCTCCCGGAACAAACCCTCCTTTCATGACTTTTGATGGTGAAGTCAAGACGGATGTGAATAAGATCGAG 
GAGTTCTTAGAGGAGAAATTAGCTCCCCCGAGGTATCCCAAGCTGGGGACCCAACATCCCGAATCTAATTCC 
GCAGGAAATGACGTGTTTGCCAAATTCTCAGCGTTTATAAAAAACACGAAGAAGGATGCAAATGAGGTTCAT 
GAAAAGAAC CTGCTGAAGG CC CTGAGG AAGCTGG ATAATT ACTT AAATAGC CCT CTG CCTG ATGAAATAGAT 
GCCTACAGCACCGAGGATGTCACTGTTTCTGGAAGGAAGTTTCTGGATGGGGACGAGCTGACGCTGGCTGAC 
TGCAAC CTCTTACCCAAGCTCCATATTATT AAGGTT CTTCATTTTCAGATTGTGGCCAAGAAGTACAGAGAT 
TTTGAATTTCCTTCTGAAATGACTGGCATCTGGAGATACTTGAATAATGCTTATGCTAGAGATGAGTTCACA 
AATACGTGTCCAGCTGAT CAAGAGATTGAACACGCATATTC AGATGTTG CAAAAAGAATGAAATG &AGCTGG 
GCT 



In a search of public sequence databases, the NOV24a nucleic acid sequence, located 
on chromosome 21, has 1347 of 1897 bases (71%) identical to a gb:GENBANK- 
ID:AB035520|acc:AB035520.1 mRNA from Oryctolagus cuniculus {Oryctolagus cuniculus 
5 mRNA for parchorin, complete cds) (E = 2.4e" ! 75 ). 

A disclosed NOV24a polypeptide (SEQ ID NO: 104) encoded by SEQ ID NO: 103 has 
691 amino acid residues and is presented in Table 24B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV24a has no signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.3000. Alternatively, NOV24a may 
1 0 also localize to the mitochondrial matrix space with a certainty of 0. 1 000, or to the lysosome 
(lumen) with a certainty of 0. 1 000. 



Table 24B. Encoded NOV24a protein sequence (SEQ ID NO:104). 

MAEAAEPEGVAPGPQGPPEVPAPIJVERPGEPGAAGGEAEGPEGS 

MGTRGAHGETEAEEGAPEGAEVPQGGEETSGAQQVEGASPGRGAQGEPRGEAQREPEDSAAPERQEEAEQR 
PEVPEGSASGEAGDSVDi^GPLGDNIEAEGPAGDSVEAEGRVGDSVDAEGPAGDSVDAEGPLGDNIQAEGPA 
GDSVDAEGRVGDSVDAEGPAGDSVDAEGRVGDSVEAGDPAGDGVEAGVPAGDSVEAEGPAGDSMDAEGPAGR 
ARRVSGEPQQSGDGSLSPQAEAIEVAAGESAGRSPGEIAWDAAEEAEVPGVKGSEEAAPGDARADAGEDRVG 
DGPQQEPGEDEERRERSPEGPREEEAAGGEEESPDSSPHGEASRGAAEPEAQLSNHLAEEGPAEGSGEAARV 
NGRREDGEASEPRALGQEHDI TLFVKAGYDGES IGNCPFSQRLFMILWLKGVI FNVTTVDLKRKPADLQNLA 
PGTNPPFMTFDGEVKTDVNKI EEFLEEKLAPPRYPKLGTQHPE SNS AGNDVFAKFS AFIKNTKKDANEVHEK 
NLLKAIiRKIJ3]mj^SPLPDEIDAYSTEDVTVSGRKFLIX3DELTIiM 
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I FPSEMTGIWRYLNNAYARDEFTNTCPADQEIEHAYSDVAKRMK | 

A search of sequence databases reveals that the NOV24a amino acid sequence has 414 
of 655 amino acid residues (63%) identical to, and 453 of 655 amino acid residues (69%) 
similar to, the 637 amino acid residue ptnr:SPTREMBL-ACC:Q9N2G5 protein from 
5 Oryctolagus cuniculus (Rabbit) (Parchorin) (E = 2.5e 1 82 ). 

NOV24a is predicted to be expressed in at least the following tissues: brain, lung, and 
kidney. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 
10 In addition, the sequence is predicted to be expressed in gastric parietal cells, choroid 

plexus, salivary duct, lacrimal gland, kidney, airway epithelia and chorioretinal epithelia 
because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:AB035520|acc:AB035520.1) a closely related Oryctolagus cuniculus mRNA for parchorin, 
complete cds homolog. 
15 NOV24b 

A disclosed NOV24b nucleic acid of 859 nucleotides (also referred to as CG56455-02) 
encoding a novel parchorin-like protein is shown in Table 24C. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 1-3 and ending with a TGA 
codon at nucleotides 853-855. The start and stop codons are shown in bold in Table 24A, and 
20 the 5' and 3' untranslated regions, if any, are underlined. 

Table 24C NOV24b nucleotide sequence (SEQ ID NO:105). 

ATQGCCGAGGCCGCGGAGCCTGAGGCCCAGCrCAGCAACCACCTGGCCGAGGAGGGCCCCGCCGAGGGTAGC 
GGCGAGGCCGCGCGTGTGAACGGCCGCCGGGAGGACGGAGAGGCGTCCGAGCCCCGGGCCCTGGGGCAGGAG 
CACGACATCACCCTCTTCGTCAAGGCTGGTTATGATGGTGAGAGTATCGGAAATTGCCCGTTTTCTCAGCGT 
CTCTTTATGATTCTCTGGCTGAAAGGCGTTATATTTAATGTGACCACAGTGGACCTGAAAAGGAAACCCGCA 
GACCTGCAGAACCTGGCTCCCGGAACAAACCCTCCTTTTATGACTTTTGATGGTGAAGTCAAGACGGATGTG 
AATAAGATCGAGGAGTTCTTAGAGGAGAAATTAGCTCCCCCGAGGTATCCCAAG CTGGGGAC CCAACATCCC 
GAATCTAATTCCGCAGGAAATGACGTGTTTGCCAAATT 

AATGAGATTCATGAAAAGAACCTGCTGAAGGCCCTGAGGAAGCTGGATAATTACTTAAATAGCCCTCTGCCT 
GATGAAATAGATGCCTACAGCACCGAGGATGTCACTGTTTCrrGGAAGGAAGTTTCTGGATGGGGACGAGCTG 
ACGCTGGCTGACTGCAACCTCTTACCCAAGCTCCATATTATTAAGATTGTGGCCAAGAAGTACAGAGATTTT 
GAATTTCCTTCTGAAATGACTGGCATCTGGAGATACTTGAATAATGCTTATGCTAGAGATGAGTTCACAAAT 
ACGTGTCCAGCTGATCAAGAGATTGAAC^CGC^TATTCAGATGTTGCAAAAAGAATGAAATGAAGCT 



In a search of public sequence databases, the NOV24b nucleic acid sequence, located 
on the q22.12 region of chromosome 21, has 741 of 847 bases (87%) identical to a parchorin 
mRNA from oryctolagus cuniculus gb accno AB035520.1) (E = 3.2e~ 140 ). 
25 A disclosed NOV24b polypeptide (SEQ ID NO:106) encoded by SEQ ID NO:105 has 

284 amino acid residues and is presented in Table 24D using the one-letter amino acid code. 
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Signal P, Psort and/or Hydropathy results predict that NOV24b has no signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.3000. Alternatively, NOV24b may 
also localize to the mitochondrial matrix space with a certainty of 0.1 000, or to the lysosome 
(lumen) with a certainty of 0.1 000. 

5 

Table 24D. Encoded NOV24b protein sequence (SEQ ID NO:106). 

MAEAAEPEAQLSbmLAEEGPAEGSGEAARWGRREDGEASEPRALGQEHDITLFVKAGYDGESIGNCPFSQR 
LFMI LWLKGVI FNVTTVDLKRKP ADLQNLAPGTNPPFMTFDGEVKTDVNK I EE FLEEKLAPPR YPKLGTQHP 
ESNSAGNDVFAKFSAFIKOTKKDANEIHEKNLLKALtt 

TLADCNLLPKLHI I KIVAKKYRDFEFPSEMTGI WRYLNNAYARDE FTNTCPADQE IEHAYSDVAKRMK 

A search of sequence databases reveals that the NOV24b amino acid sequence has 255 
of 281 amino acid residues (90%) identical to, and 263 of 281 amino acid residues (93%) 
similar to, the 637 amino acid residue ptnr:SPTREMBL-ACC:Q9N2G5 protein from 
10 Oryctolagus cuniculus (Rabbit) (Parchorin) (E = L6e" 134 ). 

NOV24b disclosed in this invention is predicted to be expressed in at least the 
following tissues: heart, placent, skeletal muscle, stomach, and lung. , 

The disclosed NOV24a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 24E. 

15 



Table 24E. BLAST results for NOV24a 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
<%) 


Positives 
(%) 


Expect 


gi| 7592636 |dbj |BAA9 
4345.1 1 (AB035520) 


parchorin 

(Oryctolagus 

cuniculus] 


637 


436/715 
(60%) 


475/715 
(65%) 


e-130 


gi| 6685319 |sp|Q9Y69 
6 | CLI4_HUMAN 


CHLORIDE 
INTRACELLULAR 
CHANNEL PROTEIN 4 
(INTRACELLULAR 
CHLORIDE ION 
CHANNEL PROTEIN 
P64H1) 


253 


182/238 
(76%) 


207/238 
(86%) 


e-108 


gi | 7330335 |ref|NP 0 
39234. l| 
(NM_013943) 


chloride 
intracellular 
channel 4 ,- 
chloride 
intracellular 
channel 4 like 
[Homo sapiens] 


253 


182/238 
(76%) 


208/238 
(86%) 


e-108 


gi | 7304963 |ref|NP 0 
38913. l| 
(NM_013885) 


chloride 
intracellular 
channel 4 
(mitochondrial ) 
[Mus musculus) 


253 


181/238 
(76%) 


207/238 
(86%) 


e-107 


gi | 4588524 | gb | AAD26 
136.l|AF109196 1 
(AF109196) 


intracellular 
chloride channel 
P64H1 [Homo 
sapiens] 


253 


180/238 
(75%) 


205/238 
(85%) 


e-106 



215 



WO 02/068649 



PCIYUS02/02785 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 24F. In the ClustalW alignment of the NOV24 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
5 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 24F. ClustalW Analysis of NOV24 



10 



15 



20 



1) Novel N0V24a (SEQ ID NO: 104) 

2) Novel NOV24b (SEQ ID NO: 106) 

3) gi| 7592636 |dbj | BAA94345 . 1 1 (AB035520) parchorin [Oryctolagus cuni cuius] (SEQ ID 
N0:421) 

4) gi| 6685319 |sp|Q9Y696|CLI4_HUMAN CHLORIDE INTRACELLULAR CHANNEL PROTEIN 4 
(INTRACELLULAR CHLORIDE ION CHANNEL PROTEIN P64H1) (SEQ ID NO:422) 

5) gi| 7330335 |ref |NP_039234 . 1 | (NM_013943) chloride intracellular channel 4; 
chloride intracellular channel 4 like [Homo sapiens) (SEQ ID NO: 423) 

6) gi | 7304963 |ref|NP_0389l3.l| (NM_013885) chloride intracellular channel 4 
(mitochondrial) [Mus musculus] (SEQ ID NO: 424) 

7) gi|4588524|gb|AAD26l36.i|AFl09l96_l (AF109196) intracellular chloride channel 
p64Hl [Homo sapiens] (SEQ ID NO: 425) 



25 



30 



N0V24a 


1 


NQV24b 


1 


gi 


7592636) 


1 


gi 


6685319| 


1 


gi 


7330335) 


1 


gi 


7304963) 


1 


gi 


4588524) 


1 



10 20 30 

..-.|-...|....|....|.-...|....|....|. 
MAEAAEPEGVAPGPQGPPEVPAPLAERPGEPGAAG- 



40 50 60 

.|....|....|....|....| 

- -GEAEGPEGSEGAEEAPRGAA 



MAETAEPEGGAPSPQGPPEGSALLEERPGEPDPAGPEASEGAAKAPSGEGAGAAAKAGAT 



55 
1 

60 

1 

1 

1 

1 
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NOV24a 
NOV24b 



56 
1 





7592636 


61 


gi 


6685319 


1 


gi 


7330335 


1 


gi 


7304963 


; i 


gi 


4588524 


i 


NOV24a 


110 


NOV24b 


1 


gi 


7592636 


121 


gi 


6685319 


1 


gi 


7330335 


1 


gi 


7304963 


1 


gi 


4588524 


1 



70 

....|....|... . |.. ..).... 

AVKEAGGGGPDRGPEAEARGTRG- 



90 100 110 120 

..i — i — i — i — i — r — i 

- - AHGETEAEEGAPEGAEVPQGGEETSGAQQVE 109 
_ a 

EEASGGRDGEGAGEQAPDAGTESGGETPDAKGAQIEAEGAPEGTKAPQLGEEGSGGKQVE 120 
_ _ - 1 



130 



140 



150 



160 



170 
..|.. 



180 

••I 



-GDSVDAEG- 



1 

--S 163 
1 

--- 1 

— - 1 
--- 1 



55 



NOV24a 
NOV24b 
gi (7592636 | 
gi j 6685319 | 



190 200 210 220 230 240 

| | |....| | | |....|....| |....|.... | 

170 IEAEGPAGDSVEAEGRVGDSVDAEGPAGDSVDAEGPLGDNIQAEGPAGDSVDAEGRVGDS 229 

1 - 1 

164 IDAGG SVDAAG SVDAGG SIDAGGSM-D AGGSVDAGG S 199 
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gi 
gi 
gi 



7330335 
7304963 
4588524 



NOV24a 
NOV24b 



gi 
gi 
gi 
gi 
gi 



7592636 
6685319 
7330335 
7304963 
4588524 



250 260 270 280 290 300 

| | | |....| )..... | | | |....|....| 

230 VDAEGPAGDS VDAE6RVGDS VEAGDPAGDGVEAGVP AGDS VEAEG - - PAGDSMD AEGPAG 287 

1 --- - 1 

200 IDTGG SVDAAG SVDAGGS IDTG - - RNVDAGGS I DAGGS VDAGGSMDAEGPAG 249 

1 - - 1 

1 --- 1 

! 1 

1 1 



310 320 330 340 350 360 

,...|....|....|....|....|....|..^|....|....|....|....|....| 

NOV24a 288 RARRVSGEPQQSGDGSLSPQAEAIEVAAGESAGRSPGELAWDAAEEAEVPGVKGSEEAAP 347 

NOV24b 1 - - 1 

IGAGGE P QDLGAG S P Q PRS EAVEVAAAENEGH S PGE S VE DAAAE EAA- G TREP 303 

- - 1 

- 1 

1 

— - 1 



gi 


7592636 


250 


gi 


6685319 


1 


gi 


7330335 


1 ' 


gi 


7304963 


1 


gi 


4588524 


1 
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gi 
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1 . 


gi 


4583524 
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gi 


4588524 
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NOV24a 


466 


NOV24b 


64 


gi 


7592636 


417 


gi 


6685319 


32 


gi 


7330335 


32 


gi 


7304963 


32 


gi 


4588524 


32 



406 AAEPEAQ? 
4 AAEPEAQP 



|.. ..).... | 
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IGNCPFSQRLFMILWLKGVg?gVTTVDLKRKPADLQNLAPGTgPPFjTF^EVKTDVNKI 
IGNCPFSQRLFMILWLKGvJpSvTTVDLKRKPADLQN^ 

IGNCPFSQRLFMILWLKGVj^gVTTVDLKRKPADLQl^APGTSpPFSTFj^VKTDWKI 



I GN C P F S QRL FM I L WLKG VW Fg VTT VDLKRKP ADLQNL APGTg P P F FJS £ VKTDVNK I 
IGNCPFSQRLFMILWLKGV|F| VTTVDLKRKPADLQNLAPGTijiPPF S TFg ' EVKTDVNKI 
IGNCPFSQRLFMILWLKGvBf| VTTVDLKRKPADLQNLAPGTgPPF ' TFg : EVKTDVNKI 
IGNCPFSQRLFMILWLKGvSF§VTTVDLKRKPADLQNLAPGT§jPPFE TrS : EVKTDVNKI 
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The gene of invention encodes a homolog of parchorin, a new member of the 
intracellular chloride channel family. Parchorin was discovered as a 120 kDa phosphoprotein 
in gastric parietal cells (Urushidani etal., J Membr Biol. 1999 Apr l;168(3):209-20). 
Subsequent analysis revealed that this protein had significant homology to the family of 
intracellular chloride channels, especially in the C terminal domain (Nishizawa et ah, J Biol 
Chem 2000 Apr 14;275(15):1 1 164-73). However, unlike other members of this family, 
parchorin exists mainly in the cytoplasm and translocated to the plasma membrane upon 
stimulation of chloride ion efflux. In addition, parchorin shows only two hydrophobic domains 
relative to the ten to twelve domains seen in other intracellular chloride channels. Tissue 
expression of parchorin in the rabbit is enhanced in cells that secrete water, like parietal cells, 
choroid plexus, salivary duct, lacrimal gland, kidney, airway epithelia, and chorioretinal 
epithelia. It is therefore thought that this protein plays a critical role in these tissues, possibly 
by modulating chloride ion transport. 

Intracellular chloride channels have diverse roles within cells, such as volume 
regulation, acidification of intracellular vesicles, vectorial transepithelial chloride transport 
and regulation of cellular excitability (Jentsch et al., Pflugers Arch 1999 May;437(6):783-95). 
Loss of function mutations affecting three different members of this family lead to three 
human inherited diseases: myotonia congenita, Dent's disease, and Bartter ! s syndrome. In 
addition, a mouse knockout model involving a member of this family has been generated that 
mimics diabetes insipidus (Matsumura et al., Nat Genet 1999 Jan;21(l):95-8). 

It is likely, therefore, that the protein of invention participates in physiological 
functions similar to those of other members of the intracellular chloride channel family, 
particularly parchorin. 

The disclosed NOV24 nucleic acid of the invention encoding a Parchorin-Iike protein 
includes the nucleic acids whose sequences are provided in Table 24A or 24C or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 24A or 24C while still encoding a 
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protein that maintains its Adrenal secretory serine protease -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
5 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

10 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV24 protein of the invention includes the Parchorin -like protein 
whose sequence is provided in Table 24B or 24D. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

1 5 in Table 24B or 24D while still encoding a protein that maintains its Adrenal secretory serine 
protease -like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 40 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

20 The above disclosed information suggests that this Parchorin -like protein (NOV24) is 

a member of a "Parchorin family". Therefore, the NOV24 nucleic acids and proteins identified 
here may be useful in potential therapeutic applications implicated in (but not limited to) 
various pathologies and disorders as indicated below. The potential therapeutic applications 
for this invention include, but are not limited to: protein therapeutic, small molecule drug 

25 target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic 
and/or prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

The NOV24 nucleic acids and proteins of the invention are useful in potential 
30 therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, Alzheimer's 

disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neurodegeneration, systemic 
lupus erythematosus, autoimmune disease, asthma, emphysema, scleroderma, allergy, ARDS, 
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diabetes, autoimmune disease, renal artery stenosis, interstitial nephritis, glomerulonephritis, 
polycystic kidney disease, systemic lupus erythematosus, renal tubular acidosis, IgA 
nephropathy, hypercalcemia, Lesch-Nyhan syndrome, cancer, trauma, bacterial/viral/parasitic 
infection, tissue degeneration, and/or other diseases and pathologies. 
5 NOV24 nucleic acids and polypeptides are fiirther useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV24 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV24 protein has multiple hydrophilic 
10 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV25 

15 A disclosed NOV25 nucleic acid of 1 123 nucleotides (also referred to as CG56457-01) 

encoding a novel protein phosphatase-like protein is shown in Table 25A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 60-62 and ending 
with a TGA codon at nucleotides 768-770. The start and stop codons are shown in bold in 
Table 25 A, and the 5' and 3 5 untranslated regions, if any, are underlined. 



Table 25A. NOV25 nucleotide sequence (SEQ ID NO:107). 

"tccggatcgcttcccgggcggcgagctgggggtgcacccggaccgccgcccccgggatca tgggcaatggca 
tgaccaaggtacttcctggactctacctcggaaacttcattgatgccaaagacctggatcgcctgggccgaa 
ataagatcacacacatcatctctatccatgag 

tcccggtcgctgatacccctgaggtacccatcaaaaagcacttcaaagaatgtatcaacttcatccactgct 
gccgccttaatggggggaactgccttgtgcactgctttgc&^ 

cgtatgtgatgactgtgacggggctaggctggcgggacgtgcttgaagccatcaaggccaccaggcccatcg 

ccaaccccaacccaggctt taggcagcagcttgaagagtttgg ctgggccagttcccagaagcttcgccggc 

agctggaggagcgcttcggcgagagccccttccgcgacgaggaggacttgcgcgcgctgctgcctcrctg^ 

ggcgctgtcgccagggtccggggacttcggccccgtcggccaccacagcgtcctcggccgcttccgagggga 

ccc tgca gcgcctggtgccgcgatcgccgcgggaatcacaccggccgctgccgctgctggcgcgcgtcaagc 

agactttctcttgcctcccccggtgtctgtcccgcaagggcggcaagtg aggatgcagtccagccgtggctc 

cccacttccgactggctcccrtcgggggctgtctgcgccttccacgccccccaggacgg 

gggagccccgcggcggcctoaacccntscctcc^ 

caacctgtgcgtctctgtgtccgggccggcctgctgcagccacctggtgccttagtccttgggctgggggag 
ggggcccacccttaaaggcggcgggaggggagggagggagagtggagggtttgacgggcctggagggtatta 

AAGAGACACAGAAG AAG CTG C CTGT CAAAAAAAAAAAAAAAAA 

20 — 

In a search of public sequence databases, the NOV25 nucleic acid sequence, located on 
chromosome 20, has 324 of 505 bases (64%) identical to a gb:GENBANK- 
ID:AF165519|acc:AF165519.1 mRNA from Homo sapiens (mitogen-activated protein kinase 
phosphatase x (MKPX) mRNA, complete cds) (E = 2;3e" 31 ). 
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A disclosed NOV25 polypeptide (SEQ ID NO: 108) encoded by SEQ ID NO: 107 has 
236 amino acid residues and is presented in Table 25B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV25 has no signal peptide and is 
likely to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV25 may 
also localize to the lysosome (lumen) with a certainty of 0.1805, to the mitochondrial matrix 
space with a certainty of 0.1000, or to the plasma membrane with a certainty of 0.1000. 



Table 25B. Encoded NOV25 protein sequence (SEQ ID NO:108). 

MGNGMTKVLPGLYLGNFIDAKDLDRLGRNKI THI I S I HE S PQPLLQD I TYLRI PVADTPEVP I KKHFKE C IN 
FIHCCRLNGGNCL VHCFAGI SRS TTI VTAYVMTVTGLGWJ^VLEAIKATRPIANPNPGFRQQLEEFGWASSQ 
KLRRQLEERFGESPFRDEEDLRALLPLCRRCRQGPGTSAPSATTASSAASEGTLQRLVPRSPRESHRPLPLL 
ARVKQTFSCLPRCLSRKGGK 



A search of sequence databases reveals that the NOV25 amino acid sequence has 91 of 
169 amino acid residues (53%) identical to, and 125 of 169 amino acid residues (73%) similar 
to, the 184 amino acid residue ptnr:SPTREMBL-ACC:Q9NRW4 protein from Homo sapiens 
(Human) (Mitogen-Acti vated Protein Kinase Phosphatase X) (E - 7.3e" 50 ). 

NOV25 is predicted to be expressed in at least brain, testis, exocrine pancreas, adipose, 
bone, peripheral blood, salivary glands, spinal cord, thyroid . This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, public EST sources and/or RACE sources. 

In addition, the sequence is predicted to be expressed in hematopoietic stem cells 
because of the expression pattern of (GENBANK-ID: gb:GENBANK- ' 
ID:AF165519|acc:AF165519.1) a closely related Homo sapiens mitogen-activated protein 
kinase phosphatase x (MKPX) mRNA, complete cds homolog. 

The disclosed NOV25 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 25C. 



Table 25C BLAST results for NOV25 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 17458347 |ref|XP 

059288.1) 

(XM_059288) 


similar to 
bA243J16.6 (novel 
protein with a 
dual specificity 
phosphatase , 
catalytic domain) 
(H. sapiens) 
[Homo sapiens] 


235 


223/236 
(94%) 


229/236 
(96%) 


e-124 


gi 1 18104942 |ref |NP 
542178. l| 
(NM 080611) 


dual specificity 
phosphat as e - 1 ike 
15 [Homo sapiens] 


243 


216/251 
(86%) 


222/251 
(88%) 


e-115 
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gi | 9910432 | ref |NP_0 

64570.1) 

(NM_020185) 


mitogen -activated 
protein kinase 
phosphatase x; 
homolog of mouse 
dual specificity 
phosphatase LMW- 
DSP2; JNK- 
stimulating 
phosphatase 1 
[Homo sapiens] 


184 


91/169 
(53%) 


125/169 
(73%) 


4e-53 


gi 1 13183069 | gb |AAK1 
5038.1 |AF237619_1 
lAF237bl9) 


dual specificity 
phosphatase TS- 

DSP2 [MUS 

musculus] 


184 


90/169 
(53%) 


125/169 
(73%) 


2e-52 


gi 1 14726046 | ref |XP 
046543. 1| 
(XM 046543) 


mitogen-activated 
protein kinase 
phosphatase x 
[Homo sapiens] 


184 


89/169 
(52%) 


118/169 
(69%) 


2e-50 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 25D. In the ClustalW alignment of the NOV25 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



Table 25D. ClustalW Analysis of NOV25 



15 



20 



25 



30 



35 



1) Novel NOV25 (SEQ ID NO:108) 

2) gi 1 17458347 | ref |XP_059288 . 1 | (XMJ)59288) similar to bA243J16.6 (novel protein 
with a dual specificity phosphatase, catalytic domain) (H. sapiens) [Homo sapiens] 
(SEQ ID NO: 426) 

3) gi 1 18104942 | ref |NP_542178 . 1 1 (NM_080611) dual specificity phosphatase- like 15 
[Homo sapiens] (SEQ ID NO: 427) 

4) gi | 9910432 | ref |NP_064570 .1 | (NM_020185) mi tog en- activated protein kinase 
phosphatase x; homolog of mouse dual specificity phosphatase LMW-DSP2; JNK- 
stimulating phosphatase 1 [Homo sapiens] (SEQ ID NO: 428) 

5) gi|l3183069|gb|AAK15038.l|AF237619_l (AF237619) dual specif i city phosphatase TS- 
DSP2 [MUS imiSCUlus] (SEQ ID N0:429) 

6) gi 1 14726046 1 ref |XP_046543 .1 1 (XM_046543) mitogen-activated protein kinase 
phosphatase x [Homo sapiens] (SEQ ID NO: 430) 



NOV25 


1 


gi 


17458347 | 


1 


gi 


18104942 | 


1 


gi 


9910432 | 


1 


gi 


13183069| 


1 


gi 


14726046) 


1 



NOV25 

gi | 17458347 | 
gi j 18104942 | 
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10 



15 



20 



25 



30 



gi I 9910432 I 
gi | 13183069 | 
gi j 14726046) 



NOV25 



gi 
gi 
gi 
gi 
gi 



17458347) 

18104942) 

9910432) 

13183069) 

14726046 



NOV25 



gi 
gi 
gi 
gi 
gi 



17458347) 
18104942 j 
9910432) 
13183069) 
14726046 



NOV25 



gi 
gi 
gi 
gi 
gi 



17458347) 
18104942) 
9910432 | 
13183069) 
14726046) 




130 



SQNLT£ 
^SQNIiTRgggSl 

140 




150 



160 



170 



190 





200 
....|....| 
'LCRRCRQGPGTSAPSATTASS, 
'LCKRCRQGS ATS AS SAG- PHS 
IPLCKRCRQGSATSASSAG- PHS. 



250, 

..|. 

RKGGK 236 
RKGGK 235 
'RKGGK 243 

184 

184 

--- 184 



105 
105 
105 




210 220 , 230 240 

J.-..|....|....|....|....| 

.QRLVPRSPRESHRPLPLLARVKQTRsC 225 

;qrlvprtpreahrplpllarvkqtBsc 224 
^orlvprtpreahrplpllarvkqtgsc 232 

P gWA 179 

fSCWA 179 

|IMK- §WA 179 




Tables 25E-H list the domain descriptions from DOMAIN analysis results against 
NOV25. This indicates that the NOV25 sequence has properties similar to those of other 
35 proteins known to contain this domain. 



Table 25E Domain Analysis of NOV25 

gnl|Smart|smart00195, DSPc, Dual specificity phosphatase, catalytic 
domain (SEQ ID NO: 816) 

CD-Length = 139 residues, 97.8% aligned 
Score * 139 bits (349), Expect « 2e-34 



NOV25 : 
40 Sbjct: 


4 
1 


GMTKVLPGLYLGNFIDAKDLDRIiGRNKITHlIS - IHESPQPLLQDITYLRI PVADTPEVP 
I || +| | + |||+|+ || || I)) | | 
GPSEILPHLYLGSYSDASNLALLKKLGITHVINVTEEVPNSNKSGFLyLGlPVDDirrETO 


62 
60 


NOV25 : 


63 


I KKHFKECINFIHCCRLNGGNCLVHCFAGISRSTT IVTAYVMTVTGLGWRDVIjEAI KATR 

1 - + 1 + II II INI 1 1*1 1 1 |*+ || + | + | M| | 

I S P YLPEAVE FIEDAEKKGGKVLVHCQAGVSRS ATL 1 I AYLMKYRNMS LNDAYDFVKERR 


122 


45 


61 


120 


N0V25: 


123 


PIANPNPGFRQQLEEF 138 




Sbjct: 


121 


II +11 II +11 1 + 
PIISPNFGFLRQLIEY 136 
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Table 25F Domain Analysis of NOV25 

gnl | Pfam|pfam00782, DSPc, Dual specificity phosphatase, catalytic 
domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary 
fold is highly similar to that of tyrosine- specific phosphatases, 
except for a "recognition" region. (SEQ ID NO: 817) 
CD-Length =139 residues, 97.8% aligned 
Score « 136 bits (342), Expect = 2e-33 



NOV25 : 4 GMTKVLPGLYLGNFIDAKDLDRLGRNKITHI IS- IHESPQPLLQDITYLRI PVADTPEVP 62 

I +++II IIII++ hi h 1)1 + 1+ I.I II III I I 

^ Shjct: 1 GPSEILPHLYLGSYPTASNIAFLSKLGITHVINVTEEVPNS 60 

NOV25: 63 IKKHFKECINFIHCCRLNGGNCLVHCFAG 122 

i * i * ii in mi linn i++ ii+i i + +11 

Sbjct: 61 IS PYLDEAVEFIEDARQKGGKVLVHCQAGISRSATLI IAYLMKTRNIiSLNEAYSFVKERR 120 

10 NOV25: 123 PIANPNPGFRQQLEEF 138 

II +11 II++II 1+ 
Sbjct: 121 PIISPNFGFKRQLIEY 136 



Table 25G Domain Analysis of NOV25 

gnl | Smart | smart 00404, PTPc_motif, Protein tyrosine phosphatase, 
catalytic domain motif (SEQ ID NO: 818) 
CD-Length = 105 residues, 53.3% aligned 
Score = 41.2 bits (95), Expect = 7e-05 



15 N0V25: 50 YLRI PVADTPEVP IK -KHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVM 104 

II Ml I + I I +111 11+ 1+ I I ++ 

Sbjct: 7 YTGWPDHGVP ES PDS I LE FLRAVKKS LNK S ANNGP VWHC S AGVGRTGT FVAI DI L 62 



Table 2SH Domain Analysis of NOV25 



ghl|Pfam|pfam00102, Y_phosphatase, Protein- tyrosine phosphatase (SEQ 
ID NO: 819) 

CD-Length = 235 residues, 31.9% aligned 
Score = 38.5 bits (88), Expect = 4e-04 



20 



25 



NOV25: 50 YLRI PVADTPEVP I KKHFKEC INF IHCCRLNG - - GNCLVHCFAGI SRSTTI VTAYVM - - T 105 

I I II I I + + + + I +111 III 1+ I + ++ 

Sbj Ct : 13 9 YTGWPDHGVPESP- - KSILDLLRKVRKSKGTPDDGPI WHCSAGIGRTGTFIAIDILLQQ 196 

NOV25: 106 VTGLGWRDVL EAI KATR 122 

+ I II + +1 I 
Sbjct: 197 LEKEGWDVFDTVKKLR 213 



The gene of invention is a member of the family of dual specificity protein 

phosphatases (DSPs; Martell et ah, Mol Cells 1998 Feb 28;8(1):2-11). DSPs recognize either 

30 Ser/Thr or Tyr moieties as targets for dephosphorylation. These enzymes regulate mitogenic 

signal transduction and can thereby regulate the cell cycle. Some members of this family are 

effective tumor suppressors, for example, PTEN. PTEN is required during embryonic 

development and later in life, and mutations in this gene give rise to different kinds of 
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inherited and sporadic cancers (Eng, Recent Prog Horm Res 1999;54:441-52; discussion 453). 
In Drosophila, members of the DSP family, such as puckered, have important roles in 
development (Martin-Blanco et ah, Genes Dev 1 998 Feb 1 5; 12(4):557-70). The crystal 
structure of one member of the DSP family has been elucidated (Yuvaniyama at al., Science 
5 1996 May 3 1;272(5266): 1328-31) and this family has been successfully targeted for small 
molecule drug development (Ducruet et al., Bioorg Med Chem 2000 Jun;8(6): 145 1-66). In 
addition, overexpression of a DSP has been demonstrated to be a potential therapy for cardiac 
hypertrophy (Bueno et al., Circ Res 2001 Jan 19;88(l):88-96). The gene of invention has 
greatest homology to a DSP identified in hematopoietic stem/progenitor cells from a patient 
1 0 with myelodysplastic syndromes. It shows the presence of a distinct domain present in all 
DSPs , which qualifies it as a bona fide member of this family. Its localization is predicted to 
be cytoplasmic, which makes it a good candidate to interact with members of the signal 
transduction cascade governing the cell cycle. 

The disclosed NOV25 nucleic acid of the invention encoding a Protein phosphatase - 
1 5 like protein includes the nucleic acid whose sequence is provided in Table 25A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 25A while still encoding a protein 
that maintains its Protein phosphatase -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
20 are complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
25 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 36 percent of the bases may be so changed. 

The disclosed NOV25 protein of the invention includes the Protein phosphatase -like 
protein.whose sequence is provided in Table 25B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 25B while still encoding a protein that maintains its Protein phosphatase -like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 48 percent of the residues may be so changed. 



30 
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The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fabk that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Protein phosphatase -like protein 
(NOV25) is a member of a "Protein phosphatase family". Therefore, the NOV25 nucleic acids 
5 and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 
therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
10 research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV25 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in for example Von Hippel-Lindau (VHL) syndrome, 
pancreatitis, obesity, Alzheimer's disease, stroke, tuberous sclerosis, hypercalceimia, 
15 Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, 
multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, 
anxiety, pain, neurodegeneration, psychiatric disorders, metabolic disorders, fertility, 
hypogonadism, xerostomia, hyperthyroidism, hypothyroidism, cancer, trauma, tissue 
degeneration, viral/bacterial/parasitic infections, and/or other diseases and pathologies. 
20 NOV25 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV25 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV25 protein has multiple hydrophilic 
25 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV26 

30 NOV26 includes two novel GAGE-7-like proteins disclosed below. The disclosed 

sequences have been named NOV26a and NOV26b. 
NOV26a 

A disclosed NOV26a nucleic acid of 550 nucleotides (also referred to as CG56461-01) 
encoding a novel GAGE-7-like protein is shown in Table 26A. An open reading frame was 
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identified beginning with a ATG initiation codon at nucleotides 67-69 and ending with a TAA 
codon at nucleotides 400-402. The start and stop codons are shown in bold in Table 26A, and 
the 5' and 3' untranslated regions, if any, are underlined. 

Table 26A. NOV26a nucleotide sequence (SEQ ID NO:109). 

liOTCCTGCTGTCTGGACTTTTTCTGTCCC^^ 
TGGCGAGGAAGATCAACATATAGGCCTAGGCCGAGGAGAAGTGTACCACCTCCTGAGCTGATTGGGCCTATG 
CTGGAGCCCGGTGATGAGGAGCCTCAGCAAGAGGAACCCCCAACTG 

GAGAGAGAAGAAGATCAGGGTGCAGCTGAGACTCAAGTGCCTGACCTGGAAGCTGATCTCCAGGAGCTGTCT 
CAGTCAAAGACTGGGGGTGAATGTGGAAATGGTCCTGATGACCAGGGGAAGATTCTGCCAAAATCAGAACAA 
T TT AAAATGC CAGAAGG AGGTG ACAGG CAAC CACAGGT TTAAATG AAGACAAG CTGAAACAAC ACAAAACT G 

TTTTTATCTAAGATATTTGACTTAAAAATATCGAAATAAACTTTTGCAGCTTTCTCCAAAAAAAAAAAAAAA 
^ftAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCC^ 



5 In a search of public sequence databases, the NOV26a nucleic acid sequence, located 

on the X chromosome, has 293 of 360 bases (81%) identical to a gb.GENBANK- 
ID:AF251237|acc:AF251237.1 mRNA from Homo sapiens (XAGE-1 mRNA, complete cds) 
(E = 3.6c- 46 ). 

A disclosed NOV26a polypeptide (SEQ ID NO:l 10) encoded by SEQ ID NO: 109 has 
10 111 amino acid residues and is presented in Table 26B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV26a has no signal peptide and is 
likely to be localized to the mitochondrial matrix space with a certainty of 0.4462. 
Alternatively, NOV26A may also localize to the nucleuswith a certainty of 0.3000, to the 
mitochondrial inner membrane with a certainty of 0.1347, or to the mitochondrial 
1 5 intermembrane space with a certainty of 0. 1347. 



Table 26B. Encoded NOV26a protein sequence (SEQ ED NO:110). 

MIWRGRSTYRPRPRRSVPPPELIGPMLEPGDEEPQQEEPPTESRDPAPGQEREEDQGAAETQVPDLEADLQE 
LSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 

A search of sequence databases reveals that the NOV26a amino acid sequence has 60 
of 1 1 5 amino acid residues (52%) identical to, and 72 of 1 1 5 amino acid residues (62%) 
20 similar to, the 1 16 amino acid residue ptnr:SPTREMBL-ACC:Q9UEU5 protein from Homo 
sapiens (Human) (GAGE-7) (E = 1 .4e" 23 ). 

NOV26a is predicted to be expressed in at least placenta. This information was derived 
by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, and/or RACE sources. 
25 NOV26b 
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In the present invention, the target sequence identified previously, NOV26a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
5 examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 

1 0 related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

15 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 

20 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV26b. This is 100% identical to the 
previously identified sequence (NOV26a). 

A disclosed NOV26b nucleic acid of 494 nucleotides (also referred to as CG56461-02) 

25 encoding a novel GAGE-7-Iike protein is shown in Table 26C. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 67-69 and ending with a TAA 
codon at nucleotides 400-402. The start and stop codons are shown in bold in Table 26C, and 
the 5' and 3' untranslated regions, if any, are underlined. 

Table 26C NOV26b nucleotide sequence (SEQ ID NO:lll). 

GTTCCTGCTGTCTGGACTTTTTCTGTCCCACTGAGAC^ 

TGGCGAGGAAGATCAACATATAGGCCTAGGCCGAGGAGAAGTGTACCACCTCCTGAGCTGATTGGGCCTATG 
CTGGAGCCCGGTGATGAGGAGCCTCAGCAAGAGGAACGA.C 

GAGAGAGAAGAAGATCAGGGTGCAGCTGAGACTCAAGTGCCTGACCTGGAAGCTGATCTCCAGGAGCTGTCT 
CAGTCAAAGACTGGGGGTGAATGTGGAAATGGTCCTGATGAC 

TTT AAAATG CCAGAAGGAGGTGACAGG CAACCACAGGTT TAAATGAAGACAAGCTGAAACAACACAAAACTG 
TTTTTATCTAAGATATTTGACTTAAAAATATCAAAATAAAC 
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In a search of public sequence databases, the NOV26b nucleic acid sequence, located 
on the X chromosome, has 346 of 426 bases (81%) identical to a gb:GENBANK- 
ID:HSL185E6A|acc:Z68274.1 mRNA from Homo sapiens (Human DNA sequence from 
cosmid L129H7, Huntington's Disease Region, chromosome 4pl6.3 contains Pseudogene and 
5 CpG island) (E = 5.7e" 53 ). 

The disclosed NOV26b polypeptide (SEQ ID NO:l 12) encoded by SEQ ID NO:l 1 1 
has 1 1 1 amino acid residues and is presented in Table 26D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV26b has no signal peptide 
and is likely to be localized to the mitochondrial matrix space with a certainty of 0.4462. 
10 Alternatively, NOV26b may also localize to the nucleus with a certainty of 0.3000, to the 
mitochondrial inner membrane with a certainty of 0.1347, or to the mitochondrial 
intermembrane space with a certainty of 0.1347. 



Table 26D. Encoded NOV26b protein sequence (SEQ ED NO:112). 

MIWRGRSTYRPRPRRSVPPPELIGPMLEPGDEEPQQEEPPTESRDPAPGQEREEDQGAAETQVPDLEADLQE 
LSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 

1 5 A search of sequence databases reveals that the NOV26b amino acid sequence has 60 

of 1 15 amino acid residues (52%) identical to, and 72 of 1 15 amino acid residues (62%) 
similar to, the 1 16 amino acid residue ptnr:SPTREMBL-ACC:Q9UEU5 protein from Homo 
sapiens (Human) (GAGE-7) (E = I At 23 ). 

NOV26b is predicted to be expressed in at least the following tissues: Placenta, Whole 
20 Organism. . 

The disclosed NOV26a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 26E. 



Table 26E. BLAST results for NO V26a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
I (aa) 


Identity 
(%)' 


Positives 
(%) 


Expect 


gi | 17486397 |ref|XP 

060048.1) 

(XMJ>60048) 


similar to G 
antigen 3 (H. 
sapiens) [Homo 
sapiens] 


137 


84/84 
(100%) 


84/84 
(100%) 


2e-33 


gi 1 18027836 |gb | AAL5 
5879.l|AF318372 1 j 
(AF318372) 


unknown [Homo 
sapiens) 


111 


110/111 
(99%) 


110/111 
(99%) 


4e-33 


gi| 18157212 |emb|CAC 
83008.1) (AJ318881) 


XAGE-3 protein 
[Homo sapiens) 


111 


111/111 
(100%) 


111/111 
(100%) 


2e-29 
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gi [17486394 |ref|XP 
066835. l| 
(XM_066835) 


similar to G 
antigen Bl; 
prostate 

associated gene 1 
(H. sapiens) 
[Homo sapiens] 


185 


64/78 
(82%) 


69/78 
(88%) 


2e-26 


gi 1 14765261 |ref|XP 
032309. l| 
(XM 032309) 


hypothetical 
protein XP_0323 09 
[Homo sapiens] 


111 


80/111 
(72%) 


93/111 
(83%) 


4e-26 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 26F. In the ClustalW alignment of the NOV26 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 Table 26F. ClustalW Analysis of NOV26 

1) Novel NOV26a (SEQ ID NO: 110) 

2) Novel NOV26b (SEQ ID NO: 112) 

3) gi ) 17486397 | ref |XP_06004B . 1 1 (XM_060048) similar to G antigen 3 (H. sapiens) 
[Homo sapiens] (SEQ ID NO: 431) 

15 4) gi| 18027836 |gb|AAL55879.l|AF318372_l (AF318372) unknown [Homo sapiens] (SEQ ID 
NO:131) gi|l8157212|emb|CAC83008.l| (AJ318881) (SEQ ID NO:432) 

6) gi| 18157212 |einb|CAC83008.l| (AJ318881) XAGE-3 protein [Homo sapiens] (SEO ID 
NO:433) 

7) gi | 17486394 | ref |XP_066835 . 1 | (XM_066835) similar to 6 antigen Bl; prostate 
20 associated gene 1 (H. sapiens) [Homo sapiens] (SEQ ID NO: 434) 

8) gi | 14765261 | ref |XP_032309.1 | (XM_032309) hypothetical protein XP 032309 [Homo 
sapiens] (SEQ ID NO: 43 5) 
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gi 1 17486394 | 119 

gi j 14765261 j 69 




X11 

Ill 

IFLCSRCILSKLYTMFAIYFKPLL 178 
X11 



10 



15 



20 



25 



30 



35 



40 



NOV26a 


111 


NOV26b 


111 


gi 


17486397 


137 


gi 


18027836 


111 


gi 


18157212 


111 


gi 


17486394 


179 


gi 


14765261 


111 



111 
111 

137 
111 
111 
185 
111 



The gene of invention is a member of GAGE family of proteins. It belongs to the broad 
family of GAGE/MAGE/PAGE genes that are expressed in various human cancers. Many 
human tumors express antigens that are recognized in vitro by cytolyticT lymphocytes (CTLs) 
derived from the tumor-bearing patient. The MAGE(melanoma-specific antigen), PAGE 
(Prostate cancer antigen) and GAGE (G antigen) gene family members encode such antigens. 
Therefore these antigens can serve as therapeutic targets in cancer. 

The LNCaP progression model of human prostate cancer consists of lineage-related 
sublines that differ in their androgen sensitivity and metastatic potential. A differential display 
polymerase chain reaction was employed by Chen ME, et al. (J Biol Chem 1998 Jul 
10;273(28): 1761 8-25) to evaluate mRNA expression differences between the LNCaP sublines 
in order to define the differences in gene expression between the androgen-sensitive, 
nontumorigenic LNCaP cell line and the androgen-insensitive, metastatic LNCaP sublines, 
C4-2 and C4-2B. An amplicon, BG16.21, was isolated that showed increased expression in the 
androgen-independent and metastatic LNCaP sublines, C4-2 and C4-2B. Hybridization 
screening of a lambda gtl 1 expression library with BG 16.21 revealed two transcripts, both 
homologous to BG 16.21 at the 3' end. A GenBankTM data base search using the GCG 
Wisconsin software package revealed the shorter approximately 600-bp transcript (designated 
GAGE-7) to be a new member of the GAGE family. The second approximately 700-bp 
transcript was a novel gene (designated PAGE-1, "prostate associated gene") with only 45% 
homology to GAGE gene family members. RNA blot analysis demonstrated that GAGE-7 
mRNA was expressed at equal levels in all lineage related prostate cancer cell sublines, while 
PAGE-1 mRNA levels were elevated 5-fold in C4-2 and C4-2B as compared with LNCaP 
cells. Neither GAGE-7 nor PAGE-1 demonstrated any regulation by androgens in the prostate 
cancer cell lines used in this study. PAGE-1 and GAGE-7 expression was found to be 
restricted to testes (high) and placenta (low) on human multiple tissue Northern blots. As 
GAGE/MAGE antigens were reported previously to be targets for tumor-specific cytotoxic 
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lymphocytes in melanoma, these results suggest that PAGE-1 and GAGE-7 may be related to 
prostate cancer progression and may serve as potential targets for novel therapies. 

The GAGE-1 gene was identified previously as a gene that codes for an antigenic 
peptide, YRPRPRRY, which was presented on a human melanoma by HLA-Cw6 molecules 
5 and recognized by a clone of CTLs derived from the patient bearing the tumor. By screening a 
cDNA library from this melanoma, De Backer O, et al. (Cancer Res 1999 Jul 1;59(13):3157- 
65) identified five additional, closely related genes named GAGE-2-6. We report here that 
further screening of this library led to the identification of two more genes, GAGE-7B and -8. 
GAGE-1, -2, and -8 code for peptide YRPRPRRY. Using another antitumor CTL clone 

1 0 isolated from the same melanoma patient, they identified antigenic peptide, YY WPRPRRY, 
which is encoded by GAGE-3, -4, -5, -6, and -7B and which is presented by HLA-A29 
molecules. Genomic cloning of GAGE-7B showed that it is composed of five exons. Sequence 
alignment showed that an additional exon, which is present only in the mRNA of GAGE-1, 
has been disrupted in gene GAGE-7B by the insertion of a long interspersed repeated element ' 

1 5 retroposon. These GAGE genes are located in the pi 1 .2-pl 1 A region of chromosome X. They 
are not expressed in normal tissues, except in testis, but a large proportion of tumors of various 
histological origins express at least one of these genes. Treatment of normal and tumor 
cultured cells with a demethylating agent, azadeoxycytidine, resulted in the transcriptional 
activation of GAGE genes, suggesting that their expression in tumors results from a 

20 demethylation process. ■ 

The disclosed NOV26 nucleic acid of the invention encoding a GAGE-7 -like protein 
includes the nucleic acid whose sequence is provided in Table 26A, 26C or a fragment thereof. 
The invention also includes a mutant or variant nucleic acid any of whose bases may be 
changed from the corresponding base shown in Table 26A or 26C while still encoding a 

25 protein that maintains its GAGE-7 -like activities and physiological functions, or a fragment of 
such a nucleic acid. The invention further includes nucleic acids whose sequences are 
complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 

30 chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
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binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 19 percent of the bases may be so changed. 

The disclosed NOV26 protein of the invention includes the GAGE-7 -like protein 
whose sequence is provided in Table 26B or 26D. The invention also includes a mutant or 
5 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 26B or 26D while still encoding a protein that maintains its GAGE-7 -like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 28 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as Fab or 
10 (Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this GAGE-7 -like protein (NOV26) is a 
member of a "GAGE-7 family". Therefore, the NOV26 nucleic acids and proteins identified 
here may be useful in potential therapeutic applications implicated in (but not limited to) 
various pathologies and disorders as indicated below. The potential therapeutic applications 
15 for this invention include, but are not limited to: protein therapeutic, small molecule drug 

target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic 
and/or prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

20 The NOV26 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in fertility disorders, cancer, trauma, tissue degeneration, 
viral/bacterial/parasitic infections, and/or other diseases and pathologies. 

NOV26 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV26 substances for use in 

25 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV26 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can bd used in 
assay systems for functional analysis of various human disorders, which will help in 

30 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV27 

NOV27 includes three novel sodium - glucose cotransporter -like proteins disclosed 
below. The disclosed sequences have been named NOV27a, NOV27b, and NOV27c. 
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NOV27a 

A disclosed NOV27a nucleic acid of 1914 nucleotides (also referred to as CG56645- 
01) encoding a novel sodium - glucose cotransporter-like protein is shown in Table 27A. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 51-53 
5 and ending with a TGA codon at nucleotides 1 839- 1 84 1 . The start and stop codons are shown 
in bold in Table 27A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 27A. NOV27a nucleotide sequence (SEQ ID NO:113). 

TTGCCCCTCAGTCCCTCGGGCTCATACCTAGTGCCTG 
ACCTCCACACTCCCGGGACGCAGCTGAGCGTGGCTGACATCATre 

TGGCCGTGGGCATATGGTCCTCTTGTCGGGCCAGTAGGAACACGGTGAATGGCTACTTCCTGGCAGGCCGGG 
ACATGACGTGGTGGCCGATTGGAGCCTCCCTCTTCGCCAGCAGCGAGGGCTCTGGCCTCTTCATTGGACTGG 
CGGGCTCAGGCGCGGCAGGAGGTCTGGCCGTGGCAGGCTTCGAGTGGAATGCCACGTACGTGCTGCTGGCAC 
TGGCATGGGTGTTCGTGCCCATCTACATCTCCTCAGAGATCGTCACCTTACCTGAGTACATTCAGAAGCGCT 

TGGACCTGTACGCGGGGGCTCTGTTTGTGCACATCTGCCTGGGCTGGAACTTCTACCTCTCCACCATCCTCA 
CGCTCGGCATCACAGCCCTGTACACCATCGCAGGGGGCCTGGCT 

CGCTCATCATGGTGGTGGGGGCTGTCATCCTGACAATCAAAGCTTTTGACCAGATCGGTGG^ 
TGGAGGC^GCCTACGCCCAGGCCATTCCCTCCAGGACCAT^ 

ACGCCATGCACATGTTTCGAGACCCCCACACAGGGGACCTGCCGTGGACCGGGATGACCTTTGGCCTGACCA 

TCATGGCCACCTGGTACTGGTGCACCGACCAGGTGATCGTGCAGCGATCACTGTCAGCCCGGGACCTGAACC 

ATGCCAAGGCGGGCTCCATCCTGGCCAGCTACCTCAAGATGCTCCCCATGGGCCTGATCATAATGCCGGGCA 

TGATCAGCCGCGCATTGTTCCCAGATGATGTGGGCTGCGTGGTGCCGTCCGAGTGCCTGCGGGCCTGCGGGG 

CCGAGGTCGGCTGCTCCAACATCGCCTACCCCAAGCTGGTCATGGAACTGATGCCCATCGGTCTGCGGGGGC 

TGATGATCGCAGTGATGCTGGCGGCGCTCATGTCGTCGCTGACCTCCATCTTCAACAGCAGCAGCACCCTCT 

TCACTATGGACATCTGGAGGCGGCTGCGTCCCCGCTCCGGCGAGCGGGAGCTCCTGCTGGTGGGACGGCTGG 

TCATAGTGGCACTCATCGGCGTGAGTGTGGCCTGGATCCCCGTCCTGCAGGACTCCAACAGCGGGCAA 

TCATCTACATGCAGTCAGTGACCAGCTCCCTGGCCCCACCAGTGACTGCAGTCTTTGTCCTGGGCGTC^ 

GGCGACGTGCCAACGAGCAGGGGGCCTTCTGGGGCCTGATAGCAGGGCTGGTGGTGGGGGCCACGAGGCTGG 

TCCTGGAATTCCTGAACCCAGCCCCACCGTGCGGAGAGCCAGACACGCGGCCAGCCGTCCTGGGGAGCATCC 

ACTACCTGC^CTTCGCTGTCGCCCTCrrTTGCACTCAGrrGGTGCTGTTGTGGTGGCTGG 

CACCCCC^CAGAGTGTCCAGATTGAGAACCTTACCTGGTGGACCCTGGCTCAGGATGTG 

AAGCAGGTGATGGCCAAACACCCCAGAAACACGCCTTCTGGGCC^ 

TCATGTGTGTCAACATATTCTTTTATGCCTACTTCGCCTGACACTGCCATCCTGGACAGAAAGGCAGGAGCT 
CTGAGTCOTCAGGTCCACCCATTTCCCTCATGGGGATCCCGA 



In a search of public sequence databases, the NOV27a nucleic acid sequence, located 
on chromosome 1 7, has 1 598 of 1 838 bases (86%) identical to a gb:GENBANK- 

10 ID:OCU08813|acc:U08813.I mRNA from Oryctolagiis cuniculus (Oryctolagus cuniculus 
Na+/glucose cotransporter-related protein mRNA, complete cds) (E = 2.6e" 309 ). 

A disclosed NOV27a polypeptide (SEQ ID NO:l 14) encoded by SEQ ID NO:l 13 has 
596 amino acid residues and is presented in Table 27B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV27A has a signal peptide and is 

1 5 likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
NOV27a may also localize to the endoplasmic reticulum (membrane) with a certainty of 
0.6850, to the Golgi body with a certainty of 0.4600, or to the enoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV27A is between positions 42 
and 43: CRA-SR. 
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Table 27B. Encoded NOV27a protein sequence (SEQ ID NO:114). 

SGLFIGLAGSGAAGGIAVAGFEWNATYVLLALAWVFVPIY I S SEI VTLPE YIQKRYGGQRI RMYLS VLSLLL 

SVFTKISLDLYAGALPVHICLGWNFYLSTILTLGITALYTIAGGLAAVIYTDALQTLIMWGAVILTIKAFD 

QIGGYGQLEAAYAQAIPSRTIANTTCHLPRTDAMHMFRDPHTGDLPWTGMTFGLTIMATWYWCTDQVIVQRS 

LSARDLNHAKAGSILASYLKMLPMGLIIMPGMISRALFPDDVGCWPSECLRACGAEVGCSNIAYPKLVMEIj 

MPIGLRGLM I AVMLAALMS SLTS I FNS S STLFTMD I WRRLRPRSGERELLLVGRLVI VALI GVS VAWI P VLQ 

DSNSGQLFIYMQSVTSSLAPPVTAVFVLGVFWRRAITEQGAFWGLIAGLWGATRLVLEFIjNPM 

PAVLGSlHYLHFAVALFALSGAVWAGSLLTPPPQSVQIEmT^ 

CGFNAILLMCVNIFFYAYFA 

A search of sequence databases reveals that the NOV27a amino acid sequence has 531 
of 596 amino acid residues (89%) identical to, and 559 of 596 amino acid residues (93%) 
5 similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q28610 protein from 

Oryctolagus cuniculus (Rabbit) (Na+/Glucose Cotransporter-Related Protein) (E = Lie* 289 )/ 

NOV27a is predicted to be expressed in at least heart and kidney. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, public EST sources, and/or RACE sources. 
10 I" addition, the sequence is predicted to be expressed in the following tissues because 

of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:OCU08813|acc:U08813.1) a 
closely related Oryctolagus cuniculus Na+/glucose cotransporter-related protein mRNA, 
complete cds homolog. 

NOV27b 

15 In the present invention, the target sequence identified previously, NOV27a, was 

subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 

20 suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 

25 PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 

marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
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uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
5 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV27b. This differs from the previously 
identified sequence (NOV27a) in having 16 extra internal amino acids. 

A disclosed NOV27b nucleic acid of 1912 nucleotides (also referred to as CG56645-02 
10 encoding a novel sodium - glucose cotransporter-like protein is shown in Table 27C An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 35-37 and 
ending with a TGA codon at nucleotides 1871-1873. The start and stop codons are shown in 
bold in Table 27C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 27C NOV27b nucleotide sequence (SEQ ID NO:115). 

CGGGCTCATACCH'AGTGCCTGCGGCAGGACAGCCA TTOCCGCCAACTCCACCAGCGACCTCCACACTCCCGG 

GACGCAGCTGAGCGTGGCTGACATCATCGTCATCACTGTGTATTTTGCTCTGAACGTGGCCGTGGGCATATG 

GTCCTCTTGTCGGGC«GTAGGAACACGGTGAATGGCTACTTCCTGGCAGGCCGGGACATGACGTGGTGGCC 

GATTGGAGCCTCCCTCTTCGCCAGCAGCGAGGGCTCTGGCCT 

AGGAGGTCTGGCCGTGGCAGGCTTCGAGTGGAATGCCACGT 

GCCCAT CTACATCTCCTCAGAGATCGTCAC CTTACCTGAGTACATTCAGAAGCGCTACGGGGGCCAG CGGAT 
CCGCATGTACCTGTCTGTCCTGTCCCTGCTACTGTCTGTCTTCACCAAGATATCGCTGGACCTGTACGCGGG 
GGC TCTGTTTG TG CACATCTGCCTG GG CTGGAACTTCTACCTCT CC AC CAT CCT CACG CTCGGCATC ACAG C 
CCTGTACACCATCGCAGGGGGCCTGGCTGCTGTAATCTACACGG 

GGGGGCTGTCATCCTGACAATCAAAGCTTTTGACCAGATCGGTGGTTACGGGCAGCTGGAGGCAGCCTACGC 

CCAGGCCATTCCCTCCAGGACCATTGCC^C^CC^CCT 

TCGAGACCC CCACACAGGGGAC CTGCCGTGGACCGGGATGACCT 

CTGGTGCACCGACCAGGTCATCGTGCAGCGATC^CTC^ 

CATCCTGGCCAGCTACCTCAAGATGCTCCCCATGGGCCTGAT 

GTTCCCAGGTGCTCATGTCTATGAGGAGAGAC&CC^VAGTGTCCGTCT 

GGTGCCGTCCGAGTGCCTGCGGGCCT<3CGGGGCCGAGGTCGGCTGCTCCAACATCGCCTACCCCAAGCTGGT 
CATGGAACTGATGCCCATCGGTCTGCGGGGGCTGATGATCGCAGTGATGCTGGCGGCGCTCATGTCGTCGCT 
GAC CTCCATCTTCAACAGCAGCAGCACC CTCT^C^CTATGGACATCTGGAGGCGGCTGCGTCCCCGCTCCGG 
CGAGCGGGAGCTCCTGCTGGTGGGACGGCTGGTCATAGTGGCACTCATCGG CGTGAGTGTGGCCTGGATC CC 
CGTCCTGCAGGACTCCAACAGCGGGCAACTCTTCATCTAC^^ 
AGTGACTGC^GTCTTTGTCCTGGGCGTCTTCTGGCGACGTGCCAAro 

AGCAGGGCTGGTGGTGGGGGCCACGAGGCTGGTCCTGGAATTCCTGAACCCAGCCCCACCGTGCGGAGAGCC 
AGACACGCGGCCAGCCGTCCTGGGGAGCATCCACTACCTGCACTTC^ 
TGCTGTTGTGGTGGCTGGAAGCCTGCTGACCCCACCCCCACAG 
GACCCTGGCTCAGGATGTGCCCTTGGGAACTAAAGCAGGTGATG^ 

GGCCCGTGTCTGTGGCTTCAATGCCATCCTCCT(^TGTGTGTCAACATATTCTTTTATGCCTACTTCGCCTG 
ACACTGCCATCCTGGACAGAAAGGCAGGAGCTCTGAGTCC 



15 In a search of public sequence databases, the NOV27b nucleic acid sequence, located 

on chromosome 17, has 903 of 1017 bases (88%) identical to a gb:GENBANK- 
ID:OCU08813|acc:U08813.1 mRNA from Oryctolagus cuniculus {Oryctolagus cunicidus 
Na+/glucose cotransporter-related protein mRNA, complete cds) (E = 4.4e" 176 ). 

The disclosed NOV27b polypeptide (SEQ ID NO: 1 16) encoded by SEQ ID NO:l 15 
20 has 612 amino acid residues and is presented in Table 27D using the one-letter amino acid 
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code. Signal P, Psort and/or Hydropathy results predict that NOV27b has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
NOV27b may also localize to the endoplasmic reticulum (membrane) with a certainty of 
0.6850, to the Goigi body with a certainty of 0.4600, or to the endoplasmic reticulum (lumen) 
5 with a certainty of 0. 1 000. The most likely cleavage site for NOV27B is between positions 42 
and 43: CRA-SR. 



Table 27D. Encoded NOV27b protein sequence (SEQ ID NO:116). 



MAANSTSDLHTPGTQLS VAD 1 I VITVYFALNVAVGI WS S CRASKNTVNGYFIiAGRDMT WWP IGASLFAS SEG 
SGLFIGLAGSGAAGGIAVAGFEV^ATYVLLALAWVEVP^^ 

SVFTKISLDLYAGALFVHICLGVWFYLSTILTLGITALYTIAGGIiAAVIYTDALQTLIMWGAVILTIKAFD 
QIGGYGQLEAAYAQAIPSRTIANTTCHLPRTDAMHMFRDPHTGD^ 

LS ARDLNHAKAGS ILAS YLKMLPMGLI IMPGMI SRALFPGAHVYEERHQVS VSRTDDVGCWPSECLRACGA 
EVGCSNIAYPKLVMEI^PIGLRGLMIAVMLAALMSSLTSIFNSSSTLFTMDIPmRI^ 
IVALIGVSVAWIPVLQDSNSGQLFIYMQSVTSSLAPPVTAVFVLGVFWRRANEQGAFWGLIAGLW 
LEFLNPAPPCGEPDTRPAVLGSIHYLHFAVALFALSGAVW . 
AGDGQTLQKHAFWARVCGFNAI LLMCVNI FFYAYFA 



A search of sequence databases reveals that the NOV27b amino acid sequence has 530 
10 of 612 amino acid residues (86%) identical to, and 558 of 612 amino acid residues (91%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q28610 protein from 
Oryctolagus cuniculus (Rabbit) (Na+/Glucose Cotransporter-Related Protein) (E = 1 .9e~ 284 ). 

NOV27b is predicted to be expressed in at least heart and kidney. .The sequence is 
predicted to be expressed in the following tissues because of the expression pattern of 
15 (GENBANK-ID: gb:GENBANK-ID:OCU08813|acc:U08813.1) a closely related Oryctolagus 
cuniculus Na-f7glucose cotransporter-related protein mRNA, complete cds homolog. 
NOV27c 

A disclosed NOV27c nucleic acid of 1741 nucleotides (also referred to as 191 828203) 
encoding a novel sodium - glucose cotransporter-like protein is shown in Table 27E. An open 
20 reading frame was identified beginning with a ATG initiation codon at nucleotides 5-7 and 
ending with a TGA codon at nucleotides 1688-1690. The start and stop codons are shown in 
bold in Table 27E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 27E. NOV27c nucleotide sequence (SEQ ID NO:117). 

AGCCATGGC^^ 

CATCACTGTGTATTTTGCTCTGAATGTGGCCGTGGGCATATG 
GAATGGCTACTTCCTGGCAGGCCGGGACATGACGTGGTGGCCGATTG 

GGGCTCTGGCCTCTTmTTGGACTGGCGGGCTCAGGCGCGGCAGGAGGTCTGGCCGTGGCAGGCTTCGAGTG 

GAATGCCACGTACGTGCTCCTGGCACTGGCATGGGTGOT 

GTACGCGGGGGCTCTGTTTGTGCACATCTGCCTGGGCTGGAACT 

CATCACAGCCCTGTACACCATCGCAGGGGGCCTGGCTGCTGTAATCTACACGGACGCCCTGCAGACGCTCAT 
CATGGTGGTGGGGGCTGTCATCCTGACAATCAAAGCTTTTGACCAGATCGGTGGTTACGGGCAGCTGGAGGC 
AGCCTACGCCCAGGCCATTC£CTCCAGGACCATTGC^ 
GGACATGTTTCGAGACCCCCACACAGGGGACCTGCCGTGG^ 

CACCTGGTACTGGTGCACCGACCAGGTCATCGTGCAGCGATCACTGTCAGCCCGGGACCTGAACCATGCCAA 
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GG CGGGCTCCATCCTGG CC AGCTACCTCAAGATG CTCCCCATGGGCCTGAT CAT CATGCCGGGCATGATC AG 

CCGCGCATTGTTCCCAGATGATGTGGGCTGCGTGGTGCCGTCCGAGTGCCTGCGGGCCTGCGGGGCCGAGGT 

CGGCTGCTCCAACATCGCCTACCCCAAGCTGGTCATGGAACTGATGCCCATCGGTCTGCGGGGGCTGATGAT 

CACAGTGATGCTGGCGG CGCTCATGT CGTCGCTGACCTCCATCTTCAACAGCAGCAG CACCCTCTTC ACT AT 

GGACATCTGGAGGCGGCTGCGTCCCCGCTCCGGCGAGCGGGAGCTCCTGCTGGTGGGACGGCTGGTCATAGT 

GGCACTCATCGGCGTGAGTGTGGCCTGGATCCCCGTCCTGCAGGGCTCCAACAGCGGGCAACTCTTCATCTA 

CATGCAGTCAGTGACCAGCTCCCTGGCCCCACCAGTGACTGCAGTCTTTGTCCTGGGCGTCTTCCGGCGACG 

TGCCAACGAGCAGGGGGCCTTCTGGGGCCTGATAGCAGGGCTGGTGGTGGGGGCCACGAGGCTGGTCCTGGA 

ATTCCTGAACCCAGCCCCACCGTGCGGAGAGCCAGACACGCGGCCAGCCGTCCTGGGGAGCATCCACTACCT 

GCACTTCGCTGTCGCCCTCTTTGCACTCAGTGGTGCTGTTGTGGTGGCTGGAAGCCTGCTGACCCCACCCCC 

ACAGAGTGT CCAGATTGAGAACCTTACCTGGTGGACCCTGGCTCAGGATGTGCCCTTGGGAACTAAAGCAGG ' 

TGATGGCCAAACACCCCAGAAAC7VCGCCTTCTGGGCCCGCGTCTGTGGCTTCAATGCCATCCTCCTC 

TGTCAACATATTCTTTT ATGCCTACTTCGCCTG ACACTGCCATC CTGGACAGAAAGGCAGGAG CTCTGAGTT 

GGCGGCCATGGCT 



In a search of public sequence databases, the NOV27c nucleic acid sequence, located 
on chromosome 17, has 1409 of 1445 bases (97%) identical to a gb:GENBANK- . 
ID:AX191622|acc:AX191622.1 mRNA from Homo sapiens (Sequence 144 from Patent 
5 WO0149728)(E = 0.0). 

A disclosed NOV27c polypeptide (SEQ ID NO:l 18) encoded by SEQ ID NO:l 17 has 
561 amino acid residues and is presented in Table 27F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV27c has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
10 NOV27c may also localize to the endoplasmic reticulum (membrane) with a certainty of 

0.6850, to the Golgi body with a certainty of 0.4600, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV27C is between positions 42 
and 43:CRA-SR. 



Table 27F. Encoded NOV27c protein sequence (SEQ ID NO:118). 

MAANSTSDLHTPGTQLSVADI IVITWFALNVAVGIWS SCRASRNTVNGYFLAGRDMTWWP IGASLFASSEG 
SGLFIGLAGS GAAGGLAVAGFEWNATYVLLALAWVFVP I YI S S ELDL YAGALFVH ICLGWNFYLSTILTLGI 
TALYTIAGGLAAVrYTDALQTLIMWGAVILTIKAFDQIGGY 
MFRDPOTGDLPOTGMTFGLTIMATWYWCTDQVIVQRSLSA 

ALFPDDVGCVVPSECLRACGAEVGCSNI AYPKLVMELMP IGM I FNSS STLFTMD 

IWRRLRPRSGERELLLVGRLVIVALIGVSVAWIPVLQGSNSGQLFi™ 

NEQGAFWGLIAGLWGATRLVLEFLNPAPPCGEPDTRPAVLGSIHYLHFAVALFALSGAVVVAGSLLTPPPQ 
SVQIENLTWWTLAQDVPLGTKAGDGQTPQKHAFWARVCGFNAILLMCVNIFFYAYFA 

A search of sequence databases reveals that the NOV27c amino acid sequence has 394 
of 460 amino acid residues (85%) identical to, and 423 of 460 amino acid residues (91%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q28610 protein from 
Oryctolagus ctmiculus (Rabbit) (Na+/Glucose Cotransporter-Related Protein) (E = 2.6e" 125 ). 

NOV27c is predicted to be expressed in at least heart, kidney, and colon. Expression 
information was derived from the tissue sources of the sequences that were included in the 
derivation of the sequence of CuraGen Acc. No. 191828203. The sequence is predicted to be 
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expressed in kidney because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:OCU08813|acc:U08813.1) a closely related Oryctolagus cunicaliis Na+/glucose 
cotransporter-related protein mRNA, complete cds homolog. 

The disclosed NOV27a polypeptide has homology to the amino acid sequences shown 
5 in the BLASTP data listed in Table 27G. 



Table 27G. BLAST results for NOV27a 


Gene Index/ 
Identifier 


Protein/ Organism 


i>engtn 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 520469 |gb|AAA660 
65. l| (U08813) 


597 aa protein 
related to 
Na/glucose 
cotransporters 
[Oryctolagus 
cuni cuius] 


597 


531/596 
(89%) 


559/596 
(93%) 


0.0 


gi| 16553933 |dbj|BAB 
71619. l| (AK057946) 


unnamed protein 
product [Homo 
sapiens] 


517 


440/456 
(96%) 


440/456 
(96%) 


0.0 


gi| 16203958 |gb|AAH2 
1357. l|AAH2 1357 
(BC021357) 


Unknown (protein 
for MGC:29197) 
iMus mus'culus] 


678 


346/545 
(63%) 


435/545 
(79%) 


0.0 


gi| 9588428 |emb|CAC0 
0574.1 | (AL109659) 


dJl024N4.1 (novel 
Sodium : solute 
symporter family 
member similar to 
SLC5A1 (SGLT1) ) 
[Homo sapiens] 


552 


344/522 
(65%) 


425/522 
(80%) 


e-180 


gi| 2564063 | db j |BAA2 
2950. l| (AB008225) 


Na+- glucose 
cotransporter 
type 1 (SGLT-l) - 
like protein 
[Xenopus laevis] 


673 


315/539 
(58%) 


415/539 
(76%) 


e-174 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 27H. In the ClustalW alignment of the NOV27 protein, as 
10 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (Le. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 

15 



Table 27EL ClustalW Analysis of NOV27 

1) Novel N0V27a (SEQ ID NO: 114) 

2) Novel NOV27b {SEQ ID NO: 116) 

3) Novel NOV27b (SEQ ID NO: 118) 

20 4) gi | 520469 1 gb | AAA66065 . l| (U08813) 597 aa protein related to Na/glucose 
cotransporters [Oryctolagus cuni cuius] (SEQ ID NO: 43 6) 

5) gi 1 16553 933 | dbj | BAB71619 . 1 1 (AK057946) unnamed protein product [Homo sapiens] 
(SEQ ID NO: 437) 
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6) gi|l8203958|gb|AAH21357.l|AAH2l357 (BC021357) Unknown (protein for MGC-.29197) 
[Mus musculus] (SEQ ID NO; 438) 

7) gi|9588428|emb|CAC00574.l| (AL109659) dJ1024N4.1 (novel Sodium: solute symporter 
family member similar to SLC5A1 (SGLT1) ) [Homo sapiens] (SEQ ID NO:439) 

8) gi|2564063|dbj|BAA22950.l| (AB008225) Na+-glucose cotransporter type 1 (SGLT-l)- 
like protein [Xenopus laevis] (SEQ ID NO: 440) 



NOV27a 
NOV27b 
NOV27C 

gi 
gi 
gi 
gi 
gi 



520469 | 
16553533 | 
18203958| 
9588428 | 
2564063 | 



NOV27a 
NOV27b 
NOV27C 

gi 
gi 
gi 
gi 
gi 



520469) 
16553933) 
18203958| 
9588428) 
2564063 j. 



NOV27a 

NOV27b 

NOV27C 

gi | 520469 | 

gi (16553933 | 

gi|l8203958| 

gi | 9588428 | 

gi j 2564063 j 



NOV27a 
NOV27b 
NOV27C 
gi|520469| 
gi j 16553933 | 
gi j 18203958 j 
gi j 9588428 | 
gi j 2564063 I 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



520469) 
16553933) 
18203958) 
9588428 | 
2564063 | 



NOV27a 

NOV27b 

NOV27C 

gi 1 520469 | 

gi (16553933 | 




291 
291 
256 
292 
196 



310 320 330 340 350 

. . | f .... ( ..... | .... I I I . . . . | .... | .... | . 



360 



arjglnhakagsilasylkmlpmgliimpgmisralfp 
arglnhakags ilasylkmlpmgli impgmisralfp 
arglnhakagsilasylkhlpmgliimpgmisralfp 
arg lnkaxag js i las ylkmlpmgl'jj i mp g m i s ral f p 
a^Sliotvkagsilasyl^lpmgliimpgmisralfp, 



3AHVYEERHQVSVSR* 



3AHVYEERHQVSVS: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



gi 1 18203958 I 297 
gi j 9588428 | 301 
gi|2S640S3| 298 



HOV27a 
HOV27b 
NOV27C 
gi | 520469 | 
gi |16553933 | 
gi | 18203958 j 
gi I 9588428 | 
gi | 2564063 j 



NOV27a 


395 


* 


NOV27b 


411 


i 


NOV27C 


360 


i 




520469 | 


396 


1 


gi 


16553933 | 


316 




gi 


18203958 j 


401 




gi 


9588428) 


405 


i 


gi 


2564063] 


402 





NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



520469) 
16553933| 
18203958) 
9588428) 
2564063 1 



NOV27a 


515 


NOV27b 


531 


NOV27C 


480 


gi 


520469) 


516 


gi 


16553933 | 


436 


gi 


18203958) 


521 


gi 


9588428) 


525 


gi 


2564063 j 


522 



NOV27a 
NOV27b 
NOV27C 



gi 
gi 
gi 
gi 
gi 



520469| 

16553933) 

18203958) 

9588428) 

2564063 



KOV27a 
NOV27b 
NOV27c 



gi 
gi 
gi 
gi 
gi 



520469] 
16553933) 
18203958 j 
9588428) 
2564063) 




455 
471 
420 
456 
376 
461 
465 
462 



610 



550 
566 
515 
551 
471 




620 
..|.. 



I 



630 



LGgKAGDGQT 
LGjjKAGDGQT 

lgBkagdgqt 
lgSkagdgqt 
lgRkagdgqt 



640 



650 



660 



567 
583 
532 
568 
488 



574 NTEGDNSPG^GRPV^GPj^^Ep^EANTTQGPEQPGALHRSWGKWLWNWFCGLSGAPQQA 633 



552 

582 EDI SEEPHTT|T[^I YNDTTDjgNPgS - S 

670 680 690 

■ I. -I. 

567 
583 
532 
566 
488 

634 LSPAEKAVLEQKLTSIgEEP: 
552 
629 



552 

SLLKKT ILWLCGMDSRKGDK 628 

700 
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Table 271 lists the domain description from DOMAIN analysis results against NOV27. 
This indicates that the NOV27 sequence has properties similar to those of other proteins 
5 known to contain this domain. 



Table 271 Domain Analysis of NOV27 

gnl|p£am|pfam00474, SSF, Sodium.-solute symporter family. (SEQ ID 
NO: 820) 

CD-Length = 406 residues, 100.0% aligned 
Score = 310 bits (793) , Expect = 2e-85 



10 



15 



20 



25 



30 



35 



40 



45 



NOV27: 


50 


Sbjct: 


1 


NOV27: 


110 


Sbjct: 


61 


NOV27: 


170 


Sbjct: 


121 


NOV27: 


230 


Sbjct; 


181 


NOV27 : 


290 


Sbjct: 


227 


N0V27 : 


350 


Sbjct: 


278 


NOV27 : 


410 


Sbjct: 


338 


NOV27 : 


470 


Sbjct: 


397 



YFLAGRDMTWWPI GAS LFASSEGSGLFIGMG^GMGCLAVAGFE WNATYVLLALAWVFV 109 

nun ii + i ii ii + i+iiii+in in + i * i i + i 

YFjLAGRSMTGFVNGLSLAAS YMS AAS FVGLAGAGAAS GLAGGL YAIGAL VGVWLLL WL FA 60 



I + I+I+I++II+II+II +111 Mill II +1+ + II + + II 1 + 

PRIjRNLGAYTMPDYLRKRFGGKRILW^ 12 0 

YLST ILTLGI TALYT I AGGLAAVI YTDALQTL IMWGAVI LTI KAFD QI GGYGQLEAAYA 22 9 
I + +1 +11+11 II II +11 +1 ++I+ I +11 | | + + ||| | 



I +1+ III II I 1+ I I * 
-PDGLHILRDPLTGLSLWPGLVLGTTGL- 



l + ll I 
-PHILQRCL 226 



II I + I I +I + IIIIIII II + I I I I I I I I I 

- AKC I RCGVL ILTPMF 1 1 VMPGMI SRGLFAIALAGANP RACGTWGCS 277 



Mill I ++I I II l + l + llllll+.ll + lll 111+ II I+++ +| ++ I 



+ 1 



+ 1 



I +1 +11+1 II 



III ll+l I 



The gene of invention codes for a human ortholog of a rabbit sodium-glucose 
cotransporter (SGLT) and belongs to the large family of SGLTs that has been described to 
date. The rabbit gene is expressed in the kidney (Pajor,Biochim Biophys Acta 1994 Sep 
14;1 194(2):349-51), and the novel gene described herein is expressed in the heart in addition 
to the kidney. It shows the characteristic sodium-solute symporter protein motif shared by 
members of the SGLT family. 

SGLTs are critical in the maintenance of glucose homeostasis in the body, in a variety 
of tissues. Inhibitors of SGLTs are being studied in the treatment of diabetes. Treatment of 
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Zucker diabetic fatty rats with the SGLT inhibitor T-1095 lowers both fed and fasted blood 
glucose levels to near-normal levels (Nawano et al., Am J Physiol Endocrinol Metab 2000 
Mar;278(3):E535-43). In streptozotocin-induced diabetic rats, T-1095 also exerts an 
antihyperglycemic effect which is nullified by nephrectomy, indicating "that the drug acts 
5 through inhibition of renal SGLTs rather than intestinal ones (Oku et al., Biol Pharm Bull 
2000 Dec;23(12): 1434-7) In addition, SGLT-1 seems to have a role in mammalian renal 
tubulogenesis (Yang et al., Am J Physiol Renal Physiol 2000 Oct;279(4):F765-77). 

The disclosed NOV27 nucleic acid of the invention encoding a Sodium - Glucose 
Cotransporter -like protein includes the nucleic acid whose sequence is provided in Table 27A, 

10 27C, 27E or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 27A, 27C, 
or 27E while still encoding a protein that maintains its Sodium - Glucose Cotransporter -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

1 5 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

20 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject 
In the mutant or variant nucleic acids, and their complements, up to about 14 percent of the 
bases may be so changed. 

The disclosed NOV27 protein of the invention includes the Sodium - Glucose 

25 Cotransporter -like protein whose sequence is provided in Table 27B, 30D, or 30F. The 
invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residue shown in Table 27B, 27D, or 27F while still encoding a 
protein that maintains its Sodium - Glucose Cotransporter -like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 42 

30 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Sodium - Glucose Cotransporter - 
like protein (NOV27) is a member of a "Sodium - Glucose Cotransporter family". Therefore, 
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the NOV27 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
5 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV27 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in diabetes, obesity, hypertension, cardiomyopathy, 

10 atherosclerosis, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, 
transplantation, autoimmune disease, renal artery stenosis, interstitial nephritis, 
glomerulonephritis, polycystic kidney disease, systemic lupus erythematosus, renal tubular 

15 acidosis, IgA nephropathy, hypercalceimia, Lesch-Nyhan syndrome, cancer, tissue 

degeneration, diabetic nephropathy, microvascular and macrovascular disease, and/or other 
diseases and.pathologies. 

NOV27 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV27 substances for use in 

20 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV27 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

25 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV28 

A disclosed NOV28 nucleic acid of 1 560 nucleotides (also referred to as CG56 185-01) 
encoding a MYD-l-like protein is shown in Table 28A. An open reading frame was identified 
30 beginning with a ATG initiation codon at nucleotides 3 1 -33 and ending with a TGA codon at 
nucleotides 1537-1539. The start and stop codons are shown in bold in Table 28A, and the 5' 
and 3' untranslated regions, if any, are underlined. 
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Table 28A. NOV28 nucleotide sequence (SEQ ID NO:119). 



CAGCCCTCGCGGGCGGCGTAGCCGCGGCCCA TGGAGCCCGCGGGCCGGTCCCCGGCCGCCTCGGGCCGCTGC 
TCTGCCTGCTGCTCCCCGCGTCCTGCGCCTGGTGGAGTGGCGGGTGAGGAGGAGCTGCAGGTGATTCAGCCT 
GAGAAGTCTGTATCAGTTGCAGCTGGAGAGTCGGCCGCTCTGCAGTGCACTGTGACCTCCCTGAACCCTGTG 
GGGCCCATCC^VACGGTTCAGAGGAGCTGGACCAGGCCGGAAATTAATCTACCATCAAAAAGAAGGCCACTTC 
CCCCGGGTAACAACTGTTTCAGATCTCACAAAGAGAACCAACA^ 

ACCCCAGCAGATGCCGGCACCTACTACTGTGTGAAGTTCCAGAAAGGGAGCCCTGACGTGGAGTTGAAGTCT 
GGAGCAGGCACTGAGCTGTCTGTGCGTGCCAAACCCTCTGCCCCCGTGGTATCGGGCCCCGCAGCGAGGGCC 
ACACCI^CCACACAGTGAGCTTCACCTGCGAGTCTCATGGCTTCT 

TTCAAAAATGGGAATCAGCTCTCAGACTTCGAGACCAACGTGGACCCCGCAAGAGAGAGCGTGTCCTACA 
ATCCAC^GCACAGCCAATGTGGTGCTGACCCGCGGGGACATTCACTCTCAAGTCATCTGCGAGGTGGCCCAC 
GTCACCTTGCGGGGGGACTCTTTTCGTGGGACTGCCAACTTGTCTGAGACTATCCAAGTTCCACCCACCTTG 
GAGGTT ACTCAACAGCC CATGAGGGCAGAGAACCAGGTGAATATCACCTGCCAGGTGACGAAATT CTACCCC 
CAGAGACTACAGTTGACCTGGTTGGAGAACGGCAATGTGTCCCGGACAGAAACGGCCTCAACTCTTACAGAG 
AAC AAGGATGGCAC CT ACAACTGGATGAGC TGG CT CCTGGTG AATGTATCTG C C CAC AGGGATGATGTGAAG 
CTCACCTGCC^GGTGGAGCATGACGGGCAGTCAGCGGTCAGCAAAAGCCATGACCTGAAGGTCTCAGCCCAC 
CTGAAGGAGCAGAGCTCAAATACCGCCGCTGAGAACACTGGACCT 

GGCGTGGTGTGCACCTTGCTGGTGGCCCTACTGATGGAGGCTCTCTACCTCGTCCGAATCAGACAGAAGAAA 
GCCCAGGGCTCCACTTCITCTACAAGGTTGCATGAACCCGAGAAGAATGCCAGAAAAATAACCCAGGACA^ 
AATGATATCACATATGCGGACCTGAACCTGCCCAAGGGGAAGAAGCCTGCTCCCCGGGCCGCGGAGCCCAAC 
AACCACACAGAGTATGCCAGCATTCAGACCAGCCTGCAGCCTGCGTCGGAGGACACCCTCA 



GC C AG CAT C CAGGT CCCGAGGAAGTGAATGGGAC CGTGGTTTGCTC TA 



10 



In a search of public sequence databases, the NOV28 nucleic acid sequence, located on 
chromosome 22, has 1466 of 1544 bases (94%) identical to a gb:GENBANK- 
ID:HSSIRPALP|acc:Y10375.1 mRNA from Homo sapiens (H.sapiens mRNA for SIRP- 
aIphal)(E = 7.4e 310 ). 

The disclosed NOV28 polypeptide (SEQ ID NO:120) encoded by SEQ ID NO:l 19 has 
503 amino acid residues and is presented in Table 28B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV28 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.4600. Alternatively, 
NOV28 may also localize to the endoplasmic reticulum (membrane) with a certainty of 
0.1000, to the endoplasmic reticulum (lumen) with a certainty of 0.1000, or extracellularly 
with a certainty of 0.1000. The most likely cleavage site for NOV28 is between positions 30 
and 31: VAG-EE. 



15 



Table 28B. Encoded NOV28 protein sequence (SEQ ID NO.120). 



MEPAGPVPGRLGPLLCLLLPASCAWSGVAGEEELQVIQPEKSVSVAAGESAALQCTVTSIiNPVGPIQRFRGA 
GPGRKLIYHQKEGHFPRVTTVSDLTKRTNTO 

AKPSAPWSGPATJ^TPDHTVSFTCESHGFSPRDISLKWFKNGNQLSDFQTNVDPAIffiSVSYSIHSTANVVL 

TRGDIHSQVICEVAHVTLRGDSFRGTAl^SETIQVPPT^ 

NGNVSRTETASTLTENKDGTYNTWSWLLVN^ 

AENTGPNEQNIYIWGWCTLLVALLMEALY^ 

LPKGKKPAPRAAEPNNHTEYASIQTSLQPASEDTL^^ 



A search of sequence databases reveals that the NOV28 amino acid sequence has458 

of 503 amino acid residues (91%) identical to, and 475 of 503 amino acid residues (94%) 

similar to, the 503 amino acid residue ptnr:SPTREMBL-ACC:P78324 protein from Homo 
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sapiens (Human) (Protein Tyrosine Phosphatase, Non-Receptor Type Substrate 1 Precursor 
(Shp Substrate-1) (Inhibitory Receptor Shps-1) (Shps-1) (Signal- Regulatory Protein Alpha-1) 
(SIRP-Alphal) (MYD-1 Antigen)) (E - 5.7e 247 ). 

NOV28 is predicted to be expressed in at least myeloid, macrophages, Adrenal 
5 Gland/Suprarenal gland, Bone Marrow, Brain, Whole Organism. This information was derived 
by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

In addition, the sequence is predicted to be expressed in myeloid and macrophages 
10 because of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:HSSIRPALP| acc: 
Y10375.1) a closely related H.sapiens mRNA for SIRP-alphal homolog. 

The disclosed NOV28 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 28C. 



Table 28C. BLAST results for NOV28 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 14771369 |ref |XP_ 
044897. 1| 
(XM__044897) 


hypothetical 
protein XP_044897 
[Homo sapiens] 


504 


458/504 
(90%) 


476/504 
(93%) 


0.0 


gi | 4758978 |ref|NP 0 
04639. 1| 
(NM_004648) 


protein tyrosine 
phosphatase, non- 
receptor type 
substrate 1; SHP 
substrate-1 [Hojiio 
sapiens] 


503 


458/503 
(91%) 


475/503 
(94%) 


0.0 


gi | 6624134 | gb | AAF19 

260.1|AC004832_5 

(AC004832) 


similar to SHPS-1 
[Homo sapiens] ; 
similar to 
BAA12974.1 
(PID:gl864011) 


402 


402/402 
(100%) 


402/402 
(100%) 


0.0 


gi| 2842392 |emb|CAA7 
1944. 1| (Y11047) 


MyD-1 antigen 
[Homo sapiens] 


429 


391/429 
(91%) 


407/429 
(94%) 


0.0 


gi| 2842390 | emb|CAA7 
1942. l| (Y11045) 


MyD-l antigen 
[Bos taurusl 


506 


373/510 
(73%) 


415/510 
(81%) 


0.0 



15 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 28D. In the ClustalW alignment of the NOV28 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (z.e., regions that may be required to preserve structural or 
20 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 28D. ClustalW Analysis of NOV28 

1) Novel NOV28 (SEQ ID NO: 120) 

2) gi 1 14771369 1 ref |XP_044897 . 1 1 (XM_044897) hypothetical protein XPJ)44897 [Homo 
sapiens) (SEQ ID NO: 441) 

3) gi | 475 89 78 | ref | NP_0 0463 9.1 | (NM_0 04648) protein tyrosine phosphatase, non- 
receptor type substrate 1; SHP substrate- 1 {Homo sapiens] (SEQ ID NO: 442) 

4) gi | 6624134 |gb| AAF19260 .1 | AC004832_5 <AC004832) <SEQ ID NO:443) 

5) similar to SHPS-1 [Homo sapiens]; similar to BAA12974.1 (PID:gl864011) (SEQ ID 
NO:444) 

6) gi|2842392|emb|CAA71944.l| (Y11047) MyD-1 antigen [Homo sapiens] (SEQ ID NO: 445) 

7) gi|2842390|emb|CAA7l942.l| (Y11045) MyD-1 antigen [Bos taurus) (SEQ ID NO:446) 



NOV28 


1 


gi 


14771369| 1 


gi 


4758978 


1 


gi 


6624134 


1 


gi 


2842392 


1 


gi 


2842390 


1 




PgPGRLraPLLCLLLgASraAWrtGgAGSEELQVIQPwSSVBVAAGE 











...1. 




70 
-1. 


NOV28 


61 


N 


PVGPI 




FRG 


gi 


14771369) 61 


I 


PVGPI 




b'RG 


gi 


4758978 


61 


I 


PVGPI 


w 
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30 Tables 28E-F list the domain descriptions from DOMAIN analysis results against 

NOV28. This indicates that the NOV28 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 28E Domain Analysis of NO V28 

gnl|Sraart |smart00407, igci, Immunoglobulin C-Type (SEQ ID NO: 821) 
CD-Length » 75 residues, 94.7% aligned 
Score =50.8 bits (120), Expect = 2e-07 



35 NOV28: 267 QVNITCQVTKFYPQRLQLTWLENGNVSRTETAST - LTENKDGTYNWMS WLLVNVSAHRDD 325 

+ I M HI + +111 + 11 + +1 ++MIII l + l 1+ I 
Sbj Ct: • 1 PATLVCLVTGFYPPDITVTWLKNGQEWSGVKTTDPLKDKDGTYFLSSYLTVSASTWESG 60 

NOV28: 326 VKLTCQVEHDG 336 

40 MM | + | 

Sbjct: 61 DVYTCQVTHEG 71 



Table 28F Domain Analysis of NOV28 



gnl|Smart|sraart00407, IGcl, Immunoglobulin C-Type (SEQ ID NO: 821) 
CD-Length = 75 residues, 96.0% aligned 
Score = 47.8 bits (112), Expect = 2e-06 



45 



NOV28: 164 TVSFTCESHGFSPRDISLKWFKNGNQLSDFQTNVDPARES-VSYSIHSTANWLTRGDIH 222 

+1 II I II++ I III +++ || ++ +|+| | + + 

Sbjct: 1 PATLVCLVTGFYPPDITVTWLKNGQEVTSGVKTTDPLKDKIX3TYFLSSYLTVSASTWESG 60 



NOV28: 223 SQVICEVAHVTL 234 
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Ml I 

Sbjct: 61 DVYTCQVTHEGL 72 

Protein tyrosine phosphatases (PTPases), such as SHP-1 and SHP-2, that contain Src 
5 homology 2 (SH2) domains play important roles in growth factor and cytokine signal 

transduction pathways. A protein of approximately 1 15 to 120 kDa that interacts with SHP-1 
and SHP-2 was purified from v-src-transformed rat fibroblasts (SR-3Y1 cells), and the 
corresponding cDNA was cloned. The predicted amino acid sequence of the encoded protein, 
termed SHPS-1 (SHP substrate 1), suggests that it is a glycosylated receptor-like protein with 

10 three immunoglobulin-like domains in its extracellular region and four YXX(L/V/I) motifs, 
potential tyrosine phosphorylation and SH2-domain binding sites, in its cytoplasmic region. 
Various mitogens, including serum, insulin, and lysophosphatidic acid, or cell adhesion 
induced tyrosine phosphorylation of SHPS-1 and its subsequent association with SHP-2 in 
cultured cells. Thus, SHPS-1 may be a direct substrate for both tyrosine kinases, such as the 

1 5 insulin receptor kinase or Src, and a specific docking protein for SH2-domain-containing 

PTPases. In addition, we suggest that SHPS-1 may be a potential substrate for SHP-2 and may 
function in both growth factor- and cell adhesion-induced cell signaling. (Fujioka et al. Mol 
Cell Biol. 1996 Dec;16(12):6887-99.) 

The rat 0X41 antigen is a cell surface protein containing three immunoglobulin 

20 superfamily domains and intracellular immunoreceptor tyrosine-based inhibitory motifs 

(ITIM). It is a homologue of the human signal-regulatory protein (SIRP) also known as SHPS- 
1, BIT or MFR. Cell activation-induced phosphorylation of the intracellular ITIM motifs 
induces association with the tyrosine phosphatases SHP-1 and SHP-2. To identify the 
physiological OX41 ligand, recombinant OX41-CD4d3+4 fusion protein was coupled to 

25 fluorescent beads to produce a multivalent cell binding reagent. The OX41-CD4d3+4 beads 
bound to thymocytes and concanavalin A-stimulated splenocytes. This interaction was blocked 
by the monoclonal antibody (mAb) OX101. Affinity chromatography with OX101 mAb and 
peptide sequencing revealed the rat SIRP ligand to be CD47 (integrin-associated protein). A 
direct interaction between human SIRP and human CD47 was demonstrated using purified 

30 recombinant proteins and surface plasmon resonance ruling out the involvement of other 
proteins known to be associated with CD47. The affinity of the SIRP/CD47 interaction was 
K(d) approximately 8 microM at 37 degrees C with a k(off )>/=2.1 s(-l). The membrane-distal 
SIRP V-like domain was sufficient for binding to CD47.(Vernon-Wilson EF, et al. Eur J 
Immunol. 2000 Aug;30(8):2130-7.) 



249 



WO 02/068649 PCT/US02/02785 



The transmembrane glycoprotein SHPS-1 binds the protein tyrosine phosphatase SHP- 
2 and serves as its substrate. Although SHPS-1 has been implicated in growth factor- and cell 
adhesion-induced signaling, its biological role has remained unknown. Fibroblasts 
homozygous for expression of an SHPS-1 mutant lacking most of the cytoplasmic region of 
5 this protein exhibited increased formation of actin stress fibers and focal adhesions. They 
spread more quickly on fibronectin than did wild-type cells, but they were defective in 
subsequent polarized extension and migration. The extent of adhesion-induced activation of 
Rho, but not that of Rac, was also markedly reduced in the mutant cells. Activation of the Ras- 
extracellular signal-regulated kinase signaling pathway and of c-Jun N-terminal kinases by 

10 growth factors was either unaffected or enhanced in the mutant fibroblasts. These results 

demonstrate that SHPS-1 plays crucial roles in integrin-mediated cytoskeletal reorganization, 
cell motility and the regulation of Rho, and that it also negatively modulates growth factor- 
induced activation of mitogen-activated protein kinases. (Inagaki, A. et al., EMBO J. 2000 
Dec 15;19(24):6721-3L) 

15 Machida K. et al. (Oncogene. 2000 Mar 23;19(13):1710-8.) investigated the effect of 

cell transformation by v-src on the expression and tyrosine phosphorylation of SHPS-1, a 
putative docking protein for SHP-1 and SHP-2. They found that transformation by v-src 
virtually inhibited the SHPS-1 expression at mRNA level. While nontransforming Src kinases 
including c-Src, nonmyristoylated forms of v-Src had no inhibitory effect on SHPS-1 

20 expression, transforming Src kinases including wild-type v-Src and chimeric mutant of c-Src 
bearing v-Src SH3 substantially suppressed the SHPS-1 expression. In cells expressing 
temperature sensitive mutant of v-Src, suppression of the SHPS-1 expression was temperature- 
dependent. In contrast, tyrosine phosphorylation of SHPS-1 was rather activated in cells 
expressing c-Src or nonmyristoylated forms of v-Src. SHPS-1 expression in SR3Y1 was 

25 restored by treatment with herbimycin A, a potent inhibitor of tyrosine kinase, or by the 
expression of dominant negative form of Ras. Contrary, active form of Mekl markedly 
suppressed SHPS-1 expression. Finally, overexpression of SHPS-1 in SR3Y1 led to the drastic 
reduction of anchorage independent growth of the cells. Taken together, their results suggest 
that the suppression of SHPS-1 expression is a pivotal event for cell transformation by v-src, 

30 and the Ras-MAP kinase cascade plays a critical role in the suppression. 

SHPS-1 (SH2-domain bearing protein tyrosine phosphatase (SHP) substrate- 1), a 
member of the inhibitory-receptor superfamily that is abundantly expressed in macrophages 
and neural tissue, appears to regulate intracellular signaling events downstream of receptor 
protein-tyrosine kinases and integrin-extracellular matrix molecule interactions. To investigate 
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the function of SHPS-1 in a hematopoietic cell line, SHPS-1 was expressed in Ba/F3 cells, an 
IL-3-dependent pro-B-cell line that lacks endogenous SHPS-1 protein. Interestingly, 
expression of either SHPS-1, or a mutant lacking the intracellular domain of SHPS-1 (DeltaCT 
SHPS-1), resulted in the rapid formation of macroscopic Ba/F3 cell aggregates. As the 
5 integrin-associated protein/CD47 was shown to be a SHPS-1 Iigand in neural cells, Babic, J. et 
al. (J Immunol. 2000 Apr l;164(7):3652-8.) investigated whether CD47 played a role in the 
aggregation of SHPS-1 -expressing Ba/F3 cells. In support of this idea, aggregate formation 
was inhibited by an anti-CD47 Ab. Furthermore, erythrocytes from control, but not from 
CD47-deficient mice, were able to form rosettes on SHPS-1 -expressing Ba/F3 cells. Because 

10 erythrocytes do not express integrins, this result suggested that SHPS-1 -CD47 interactions can 
take place in the absence of a CD47-integrin association. They also present evidence that the 
amino-terminal Ig domain of SHPS-1 mediates the interaction with CD47. Although SHPS-1 - 
CD47 binding likely triggers bidirectional intracellular signaling processes, these results 
demonstrate that this interaction can also mediate cell-cell adhesion. 

1 5 Inhibitory immunoreceptors downregulate signaling by recruiting Src homology 2 

(SH2) domain-containing tyrosine and/or lipid phosphatases to activating receptor complexes 
[1]. There are indications that some inhibitory receptors might also perform other functions [2] 
[3]. In adherent macrophages, two inhibitory receptors, SHPS-1 and PIR-B, are the major 
proteins binding to the tyrosine phosphatase SHP-1. SHPS-1 also associates with two tyrosine- 

20 phosphorylated proteins (pp55 and ppl30) and a protein tyrosine kinase [4]. Here, Timms, JF. 
et al. (Curr Biol. 1999 Aug 26;9(16):927-30.) have identified pp55 and pp!30 as the adaptor 
molecules SKAP55hom/R (Src-kinase-associated protein of 55 kDa homologue) and 
FYB/SLAP-130 (Fyn-binding protein/SLP-76-associated protein of 130 kDa), respectively, 
and the tyrosine kinase activity as PYK2. Two distinct SHPS-1 complexes were formed, one 

25 containing SKAP55hom/R and FYB/SLAP-1 30, and the other containing PYK2. Recruitment 
of FYB/SLAP-130 to SHPS-1 required SKAP55hom/R, whereas PYK2 associated with 
SHPS-1 independently. Formation of both complexes was independent of SHP-1 and tyrosine 
phosphorylation of SHPS-1. Finally, tyrosine phosphorylation of members of the SHPS-1 
complexes was regulated by integrin-mediated adhesion. Thus, SHPS-1 provides a scaffold for 

30 the assembly of multi-protein complexes that might both transmit adhesion-regulated signals 
and help terminate such signals through SHP-1 -directed dephosphorylation. Other inhibitory 
immunoreceptors might have similar scaffold-like functions. 

SHPS-1 (or SIRP) is a member of the immunoglobulin (Ig) superfamily abundantly 
expressed in neurons and other cell types. Within its cytoplasmic domain, it possesses at least 
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two immunoreceptor tyrosine-based inhibitory motifs, which are targets for tyrosine 
phosphorylation and mediate the recruitment of SHP-2, an Src homology 2 (SH2) domain- 
containing protein-tyrosine phosphatase. Since other immunoreceptor tyrosine-based 
inhibitory motifs-containing receptors have critical roles in the negative regulation of 
5 hemopoietic cell functions, the expression of SHPS-1 in cells of hematological lineages was 
examined. By analyzing a panel of hemopoietic cell lines, evidence was provided that SHPS-1 
is abundantly expressed in macrophages and, to a lesser extent, in myeloid cells. No 
expression was detected in T-cell or B-cell lines. Expression of SHPS-1 could also be 
documented in normal ex vivo peritoneal macrophages. Further studies showed that SHPS-1 

10 was an efficient tyrosine phosphorylation substrate in macrophages. However, unlike in non- 
hemopoietic cells, tyrosine-phosphorylated SHPS-1 in macrophages associated primarily with 
SHP-1 and not SHP-2. Finally, analyses allowed identification of several isoforms of SHPS-1 
in mouse cells. In part, this heterogeneity was due to differential glycosylation of SHPS-1 . 
Additionally, it was caused by the production of at least two distinct shps-1 transcripts, coding 

15 for SHPS-1 polypeptides having different numbers of Ig-like domains in the extracellular 

region. Taken together, these findings indicate that SHPS-1 is likely to play a significant role 
in macrophages, at least partially as a consequence of its capacity to recruit SHP-1 . Veilette, 
A. et al. (J Biol Chem. 1998 Aug 28;273(35):22719-28.) 

SHPS-1 is a 120 kDa glycosylated receptor-like protein that contains immunoglobulin- 

20 like domains in its extracellular region and four potential tyrosine phosphorylation for SH2 
domain binding sites in its cytoplasmic region. Epidermal growth factor (EGF) stimulated the 
rapid tyrosine phosphorylation of SHPS-1 and subsequent association of SHPS-1 with SHP-2, 
a protein tyrosine phosphatase containing SH2 domains, in Chinese hamster ovary cells 
overexpressing human EGF receptors. In the cells overexpressing SHPS-1, the tyrosine 

25 phosphorylation of SHPS-1 was more evident than that observed in parent cells. However, 
overexpression of SHPS-1 alone did not affect the activation of MAP kinase in response to 
EGF. These results suggest that SHPS-1 may be involved in the recruitment of SHP-2 from 
the cytosol to the plasma membrane in response to EGF. Copyright 1997 Academic Press. 
Ochi, F. et al. (Biochem Biophys Res Commun. 1997 Oct 20;239(2):483-7.) 

30 The immune system recognizes invaders as foreign because they express determinants 

that are absent on host cells or because they lack 'markers of self that are normally present. 
Oldenborg et al. (2000) demonstrated that CD47 functions as a marker of self on murine red 
blood cells. Red blood cells that lack CD47 were rapidly cleared from the bloodstream by 
splenic red pulp macrophages. CD47 on normal red blood cells prevented this elimination by 
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15 



binding to the inhibitory receptor signal regulatory protein alpha (SIRP-alpha). Thus, 
Oidenborg et al. (2000) concluded that macrophages may use a number of nonspecific 
activating receptors and rely on the presence or absence of CD47 to distinguish self from 
foreign. Oidenborg et al. (2000) suggested that CD47-SIRP-alpha may represent a potential 
pathway for the control of hemolytic anemia. 

The disclosed NOV28 nucleic acid of the invention encoding a MYD-1 -like protein 
includes the nucleic acid whose sequence is provided in Table 28A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 28A while still encoding a protein that maintains 
its MYD-1 -like activities and physiological functions, or a fragment of such a nucleic acid. 
The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 6 percent of the 
20 bases may be so changed. 

The disclosed NOV28 protein of the invention includes the MYD-1 -like protein 
whose sequence is provided in Table 28B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
28B while still encoding a protein that maintains its MYD-1 -like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 27 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as or 
(Fab^that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this MYD-1 -like protein (NOV28) is a 
member of a "MYD-1 family. Therefore, the NOV28 nucleic acids and proteins identified 
here may be useful in potential therapeutic applications implicated in (but not limited to) 
various pathologies and disorders as indicated below. The potential therapeutic applications 
for this invention include, but are not limited to: protein therapeutic, small molecule drug 
target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic 
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and/or prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

The NOV28 nucleic acids and proteins of the invention are useful in potential 
5 therapeutic applications implicated in epilepsy, eating disorders, schizophrenia, ADD, and 
cancer; heart disease; inflammation and autoimmune disorders including Crohn's disease, 
IBD, allergies, rheumatoid and osteoarthritis, inflammatory skin disorders, allergies, blood 
disorders; psoriasis colon cancer, leukemia AIDS; thalamus disorders; metabolic disorders 
including diabetes and obesity; lung diseases such as asthma, hemolytic anemia, emphysema, 
10 cystic fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency and cancer; 
and prostate disorders including prostate cancer, and/or other diseases and pathologies. 

NOV28 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV28 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
1 5 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV28 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
20 disorders. 

NOV29 

NOV29 includes three novel CRAL-TRIO -like proteins disclosed below. The 
disclosed sequences have been named NOV29a, NOV29b, and NOV29c. 
NOV29a 

25 A disclosed NOV29a nucleic acid of 1327 nucleotides (also referred to as CG56187- 

01) encoding a CRAL-TRIO-like protein is shown in Table 29A. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 16-18 and ending with a TGA 
codon at nucleotides 1261-1263. The start and stop codons are shown in bold in Table 29A, 
and the 5' and V untranslated regions, if any, are underlined. 

Table 29A. NOV29a nucleotide sequence (SEQ ID NO:121). 

TOAGTTGACTGGTGGA TQATGTGC^AAG^ 

CCGACCATCAGATCCTCCTCCGCTCAGTTCCGGGAGAACCTCCAGGACCTGCTGCCCATACTGCCCAATGCT 
GATGACTACTTCCTCCTGCGCTGGCTGG 

AGGCACATGGAGTTCCGGAAGCAACAAGACCTGGACAACATTGTCACATGGC^ 

TCCCTCGACCCCAAGGGTCTCCTGCTGTCAGCCTCCAA 
GAGCTGCTGTTGCATGAGTGTGAGCTGCAAACTCAGAAGCTGGGCA 

TTTGACATGGAGGGGCTGAGCCTGAAACACCTGTGGAAGCCAGCTGTGGAGGTCTACCAGCAGTTTTTTAGC 
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ATCCTGGAAGCAAATTATCCTGAGACCCTGAAGAATTTAATTGTTATTCGAGCCCCAAAACTGTTCCCCGTG 
GCCTTCAACTTGGTCAAGTCGTTCATGAGTGAGGAGACACGCAGGAAGATTGTGATTCTGGGAGACAACTGG 
AAGCAGGAGCTGACAAAATTCATCAGCCCCGACCAGCTGCCTGTGGAGTTTGGGGGGACCATGACTGACCCC 
GATGGCAACCCCAAGTGCCTGACCAAGATCAACTATGGGGGTGAGGTGCCCAAGAGCTACTACCTGTGCGAG 
CAGGTGAGGCTGCAGTATGAGCaCACGAGGTCCGTGGGCCGCGGCTCCTCCCTGCAGGTGGAGAACGAGATC 
CTGTTCCCGGGCTGTGTGCTCAGGTGGCAGTTTGCTTCAGATGGTGGGGACATCGGCTTTGGGGTTTTCCTG 
AAGACCAAGATGGGGGAGCAGCAGAGTGCTAGGGAGATGACGGAGGTGCTGCCCAGCCAGCGCTACAATGCC 
CACATGGTGCCTGAGGATGGGAGCCTCACCTGCCTCCAGGCTGGCGTCCTGCGCTTCGACAACACCTACAGC 
^^^°^ G ^ GOT ^ GCTA ^ CTG TGGAGGTGCTGCTTCCCGACAAGGCCTCTGAGGAGACGCTG 
CAGAGTCTCAAGGCGATGAGACCCTCCCCAACACAGTGAAGACCCCAGCCACCTCTACCTGTCrArTPraai- 
CCCTTCACACCCACCCCTCTGACCPrTGPrT ' ■ 



In a search of public sequence databases, the NOV29a nucleic acid sequence, located 
on chromosome 22, has 935 of 1263 bases (74%) identical to a gb.GENBANK- 
ID:RN0132352|acc:AJ132352.1 mRNA from Rattus norvegicus (Rattus norvegicus mRNA 
for 45 kDa secretory protein, partial) (E = 4.0e -132 ). 

A disclosed NOV29a polypeptide (SEQ ID NO: 122) encoded by SEQ ID NO: 121 has 
415 amino acid residues and is presented in Table 29B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV29a has no signal peptide and is 
likely to be localized extracellularly with a certainty of 0.6500. Alternatively, NOV29a may 
also localize to the mitochondrial membrane spacewith a certainty of 0.1000, to the lysosome 
(lumen) with a certainty of 0.1 000, or to the microbody (peroxisome) with a certainty of 
0.0348. 



15 



20 



25 



Table 29B. Encoded NOV29a protein sequence (SEQ B) NO:122). 



A search of sequence databases reveals that the NOV29a amino acid sequence has 387 
of 397 amino acid residues (97%) identical to, and 390 of 397 amino acid residues (98%) 
similar to, the 406 amino acid residue ptnr:SPTREMBL-ACC:Q9UDX3 protein from Homo 
sapiens (Human) (WUGSC:H_DJ0539M06.4 PROTEIN) (E = 7.2e' 208 ). 

NOV29a is predicted to be expressed in at least Bone, liver, brain, and prostate. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in Bone because of the 
expression pattern of (GENBANK-ID: gb:GENBANK-ID:RN0132352|acc: AJ132352.1) a 
closely related Rattus norvegicus mRNA for 45 kDa secretory protein, partial homolog. 
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NOV29b 

A disclosed NOV29b nucleic acid of 1305 nucleotides (also referred to as CG56187- 
03) encoding a CRAL-TRIO-like protein is shown in Table 29C. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 14-16 and ending with a TGA 
5 codon at nucleotides 1 262-1264. The start and stop codons are shown in bold in Table 29C, 
and the 5 ' and 3' untranslated regions, if any, are underlined. 

Table 29C. NOV29b nucleotide sequence (SEQ ED NO:123). 

AGTTGACTGGTGGA TQATGTGGGAAGGGTTAGGGGCGGGGTTGGTGGCCCC^ 

GACCAT(^GATCCTCCTCCGCTCAGTTCCGGGAGAACCTCCAGGACCTGCTGCCCATACTGCCCAATGCTGA 
TGACTACTTCCTCCTGCGCTGGCTGCGAGCTCGAAACTTTGACCTGC^ 

GCACATGGAGT T C CGGAAG CAACAAGACCTGGAC AACATTGT C ACATGG CAGC C CCCTG AGGTCAT C CAG CT 

GTATGACTCGGGTGGTCTTTGTGGCTACGACTACGAAGGCTGCCCrrGTGTACTTCAACATCATTGGGTCCCT 

CGACCCCAAGGGTCTCCTGCTGTCAGCCTCCAAGCAGGATATGATCCGGAAGCGCATCAAAGT 

GCTGTTGCATGAGTGTGAGCTGCAAACTCAGAAGCTGGGCAGGAAGATCGAGATGGCGCTGATGGTGTTTGA 

CATGGAGGGGCTGAGCCTGAAACACCTGTGGAAGCCAGCTGTGGAGGTCTACCAGCAGTTTTTTAGCATCCT 

GGAAGCAAATTATCCTGAGACCCTGAAGAATTTAATTGTTATTCGAG^^ 

CAACTTGGTCAAGTCGTTCATGAGTGAGGAGACACG(^GGAAGATTGTGATTCTGGGAGACAACTGGAAG^ 
GGAGCTGACAAAATTCATCAGCCCCGACCAGCTGCCTGTGGAGTTO 

CCACCCC^GTGCCTGACCAAGATCAACTATGGGGGTGAGGTGCCCAAGAGCTACTACCTGTGCGAGCAGGT 

GAGGCTGCAGTATGAGCACACGAGGTCCGTGGGCCGCGGCTCCTCCCTGCAGGTGGAGAACGAGATCCTGTT 

CCCGGGCTGTGTGCTCAGGTGGCAGTTTGCTTCAGATGGTGGGGACATCGGCTTTGGGGTTTTCCTGAAGAC 

CAAGATGGGGGAGOVGCAGAGTGCTAGGGAGATGACGGAGGTGCTGCCCAGCCAGCGCTACAATGCCCACAT 

GGTGC CTGAGGATGGGAGCCTCACCTGCCTCCAGG CTGGCGTCTATGTCCTGCG CTTCGACAACACCTACAG . 

CCGGATGC^TGCCAAGAAGCTCAGCTACACTIGTGGAGGTGCTGCTTCCCGAC^GGCCTCrGAGGAGACGCT 

GCAGAGTCTCAAGGCGATGAGACCCTCCCCAACACAGTG AAGACCCCAGCCACCTCCACCT 

CCCCTTCAC 



In a search of public sequence databases, the NOV29b nucleic acid sequence, located 

on chromosome 22, has 906 of 1212 bases (74%) identical to a gb:GENBANK- 
10 ID:BC005759|acc:BC005759.1 mRNA from Mus musculus (Mus musculus, clone MGC:6302, 

mRNA, complete cds) (E = 2.0e" 137 ). 

A disclosed NOV29b polypeptide (SEQ ID NO: 124) encoded by SEQ ID NO: 123 has 

416 amino acid residues and is presented in Table 29D using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV29b has no signal peptide and is 
1 5 likely to be localized extracellularly with a certainty of 0.4500. Alternatively, NOV29b may 

also localize to the mitochondrial membrane spacewith a certainty of 0.1000, to the lysosome 

(lumen) with a certainty of 0.1 000, or to the microbody (peroxisome) with a certainty of 

0.0779. 



Table 29D. Encoded NOV29b protein sequence (SEQ ID NO:124). 

MM WEGLGAGL VAPEVMRAPPTI RS S S AQFRENLQDLLPI LPNADDY FLLRWLRARNFDLQKS EDMLRRHME F 

RKQQDLDNrVTWQPPEVTQLYDSGGLCGYDYEGCPW 

^LQTQKLGRKIEMALMVFDMEGLSLKHLWKPAVEVYQQ 

SFMSEETRRKIVILGDNWKQEI*TKFISPDQLPVEFGGTMTDPDGHPKCLTKINYGGEVPKSYyLCEQVRIK3y 
EHTRSVGRGSSLQVENEILFPGCVLRWQFASDGGDIGFGVFLKTKMGEQQSAREMTEVLPSQRYNAHMVPED 
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| GSLTCLQAGVYVLRFDNTYSRMHAKKLSYTVEVLLPDKASEETLQSLKAMRPSPTQ | 

A search of sequence databases reveals that the NOV29b amino acid sequence has 906 
of 1212 amino acid residues (74%) identical to, and 906 of 1212 amino acid residues (74%) 
similar to, the 2529 amino acid residue gb:GENBANK-ID:BC005759|acc:BC005759.1 protein 
5 from Mus musculus (Mus musculus, clone MGC:6302, mRNA, complete cds) (E = 2.0e~ 137 ). 

NOV29b is predicted to be expressed in at least Bone, liver, brain, and prostate. . The 
sequence is predicted to be expressed inbone because of the expression pattern of 
(GENBANK-ID: gb:GENBANK-ID:BC005759|acc:BC005759.1) a closely related Mus 
musculus, clone MGC:6302, mRNA, complete cds homolog. 
10 NOV29c ( 

A disclosed NOV29c nucleic acid of 1218 nucleotides (also referred to as CG56189- 
01) encoding a CRAL-TRIO-like protein is shown in Table 29E. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 1-3 and ending with a TAG 
codon at nucleotides 1216-1218. The start and stop codons are shown in bold in Table 29E, 
15 and the 5' and 3' untranslated regions, if any, are underlined. 



Table 29E. NOV29c nucleotide sequence (SEQ ID NO:125). 

ATGTTCCGGGAGAACATCCi^GATGTGCrATCTGCGCTGCCC^TCCTGATGACTACTTCCTCCTGCGCTGG 

CTC CAAG C T CGGAGCTTTG AC CTGCAG AAATCAGAGGACATGCTG AGGAAGCAT AT G GAGTT C CGGAAG CAA 

CAAGAC CTGG C CAACAT C CTTG CCTGGCAG CC C C CAGAGGTGGT CAGGCTGTACAAC GCTAACGGCATATG C 

GGCCACGACGGTGAGGGCAGCCCTGTCTGGTACCAC^TTGTGGGAAGCCTGGACCCCAAAGGCCTCTTGCTC 

TCAGCCTCCAAACAGGAGTTGCTCAGGGACAGCTTCCGGAGCTGCGAGCTGCTCCTGCGGGAGTGTGAGCTG 

C AG AGT CAG AAG CTGGGGAAGAGGGTGGAGAAAAT CATAGCTATTTTTGGTCTCGAAGGG CTGGGCCTGAGG 

GATCTGTGGAAGCC^GGAATAGAGCTTCTCCAGGAGTTTTTCTCAGCACTTG 

TTGAAGAGTTTAATTGTTGTGAGAGCCCCC^GCTATTCGCCGTAGCCTTCAACCTGCT 

AGTGAAGAGACACGCAGGAAGGTGGTGATTCTCGGAGATCTGATGGTTCCTGCATCCGAAGGTGTAGGGCAC 

CCAACTGGTGTTGAGGGCCCACTGCCTCX3TGGGCT 

AGCCCCGACCAGCTGCCCGTGGAGTTTGGGGGGACCAT^^ 

AAGATCAACTACGGGGGTGAGGTGCCCAAGAGCTACTACCTGTGCAAGCAGGTGAGGCTGCAGTATGAGCAC 
ACGAGGTCCGTGGGCCGCGGCTCCTCCCTGCAGGTGGAGAACGAGATCCTGTTCCCGGGCTGTGTGCTCAGG 
TGGCAGTTTGCTTCAGATGGTGGGGACATTGGCTTTGGGGTTTTCCTGAAGACCAAGATGGGGGAGCGGCA^ 
AGGGCTAGGGAGATGACAGAGGTGCTGCCCAGCCAGCGCTACAATGCCCACATGGTGCCTGAAGATGGGATT 
CTCACCTGCCTCCAGGCCGGCAGCTATGTCCTGAGGTTTTA^ 

ATGAGCTACACCGTGGAGGTACTGCTCCCAGACCAAACCTTCATGGAGAAGATGGAGAAATTCTAG 



In a search of public sequence databases, the NOV29c nucleic acid sequence, located 
on chromosome 22, has 41 8 of 532 bases (78%) identical to a gb:GENBANK- 
E):HS130H16A|acc:AL096881.1 mRNA from Homo sapiens (Novel human mRNA similar to 
20 Rattus norvegicus 45 kDa secretory protein, AJ132352) (E = 4.9e' 129 ). 

The disclosed NOV29c polypeptide (SEQ ID NO: 126) encoded by SEQ ID NO: 125 
has 405 amino acid residues and is presented in Table 29F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV29c has no signal peptide 
and is likely to be localized extracellularly with a certainty of 0.4500. Alternatively, NOV29c 
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may also localize to the microbody (peroxisome) with a certainty of 0.2010, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1 000. 



Table 29F. Encoded NOV29c protein sequence (SEQ ID NO:126). 

MFRENIQDVLSALPNPDDYFLLRWLQARSro 

GHDGEGSPVWYHIVGSLDPKGIiLSASKQELLRDSFRSCEI^ 

DLWKPGIELLQEFFSALEANYPEILKSLIVVRAPKLF 

PTGVEGPLPGGLPDNWKQELTKFISPDQLPVEFGGTMTDPDGNPKCLTKINYGGEVPKSYYLCK^ 
TRSVGRGSSLQVENEILFPGC^RWQFASDGGDIGFGVFT.KT^^ 

LTCLQAGSYVLRFYinTSLVHSKRISYTVEVLLPDQTFMEKMEKF 



A search of sequence databases reveals that the NOV29c amino acid sequence has 157 
of 176 amino acid residues (89%) identical to, and 166 of 176 amino acid residues (94%) 
similar to, the 406 amino acid residue ptnr:SPTREMBL-ACC:Q9UDX3 protein from Homo 
sapiens (Human) (WUGSC:HDJ0539M06.4 PROTEIN) (E = 2.6e 167 ). 
10 NOV29c is predicted to be expressed in at least Bone, liver, brain, and prostate . This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in Bone because of the 
15 expression pattern of (GENB ANK-ID: gb:GENBANK-ID:RNO!32352|acc: AJ132352.1) a 
closely related Rattus norvegicus mRNA for 45 kDa secretory protein, partial homolog. 

The disclosed NOV29a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 29G. 



Table 29G. BLAST results for NOV29a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 6624133 |gb|AAF19 
259.l|AC004832 4 
(AC004832) 


similar to 45 kDa 
secretory protein 
[Rattus 
norvegicus] ; 
similar to 
CAA10644.1 
(PlD:g4164418) 
[Homo sapiens] 


406 


387/398 
(97%) 


390/398 
(97%) 


0.0 


gi | 7110715 |ref|NP 0 

36561.1) 

<NM 012429) 


SEC14 (S. 
cerevisiae) -like 
2; tocopherol- 
associated 
protein [Homo 
sapiens] 


403 


269/394 
(68%) 


331/394 
(83%) 


e-165 
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gi | 16758646 |ref |NP_ 
446253. l| 
(NM_053801) 


SEC14 (S. 
cerevisiae) -like 
2 [Rattus 
norvegicus] 


403 


271/394 
(68%) 


329/394 
(82%) 


e-164 


gi 1 13543184 |gb | AAHO 

5759.l|AAH05759 

(BC005759) 


Unknown (protein 
for MGC:63 02) 
[Mus musculus] 


403 


273/394 
(69%) 


328/394 
(82%) 


e-164 


gi | 4164418 | emb | CAA1 
0644. l| (AJ132352) 


45 kDa secretory 
protein [Rattus 
norvegicus] 


400 


267/384 
(69%) 


326/384 
(84%) 


e-163 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 29H. In the ClustalW alignment of the NOV29 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (ie. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and , 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 29H. ClustalW Analysis of NOV29 

1) Novel NOV29a (SEQ ID NO: 122) 

2) Novel NOV29b (SEQ ID NO:124) 

3) Novel N0V29C (SEQ ID NO: 126) 

4) gi| 6624133 |.gb|AAF19259.l)AC004832_4 (AC004832) similar to 45 kDa secretory 
protein [Rattus norvegicus]; similar to CAA10644.1 (PID:g4164418) [Homo sapiens] 
(SEQ ID NO:447) 

5) gi|7110715|ref |NP_036561. 1 1 (NM_012429) SEC14 (S. cerevisiae) -like 2; 
tocopherol -associated "protein [Homo sapiens] (SEQ ID NO:448) 

6) gi | 16758646 | ref ]NP_446253 . 1 1 (NM_053801) SEC14 (S. cerevisiae) -like 2 [Rattus 
norvegicus] (SEQ ID NO: 449) 

7) gi|l3543184|gb|AAH05759.l|AAH05759 (BC005759) Unknown (protein for MGC:6302) 
[Mus musculus] (SEQ ID NO:450) 

8) gi|4164418|emb|CAAl0644.l| (AJ132352) 45 kDa secretory protein [Rattus 
norvegicus] (SEQ ID NO: 451) 



NOV29a 
NOV29b 
NOV29C 



gi 
gi 
gi 
gi 
gi 



6624133) 
7110715| 
16758646 | 
13543184| 
4164418! 



NOV29a 
NOV29b 
NOV29C 
gi | 6624133 | 
gi 1 7110715 | 
gi I 16758646 | 
gi | 13543184 j 
gi 1 4164418 | 
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140 



150 



160 



170 



180 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



NOV29a 
NOV29b 
NOV29C 



gi 
gi 
gi 
gi 
gi 



6624133 | 
7110715 j 
16758646 | 
13543184 j 
4164418) 



NOV29a 
NOV29b 
NOV29C 



gi 
gi 
gi 
gi 
gi 



6624133] 
7110715 j 
16758646 | 
13543184 j 
4164418 I 



NOV29a 


232 


NOV29b 


231 


NOV29C 


213 


gi 


6624133 | 


221 


gi 


7110715 j 


221 


gi 


16758646) 


221 


gi 


13543184) 


221 


gi 


4164418 | 


221 



NOV29a 


276 






NOV29b 


275 






NOV29c 


273 






gi 


6624133 | 


265 




: 


gi 


7110715 j 


265 






gi 


16758646) 


265 




Q 


gi 


13543184 j 


265 




Q 


gi 


4164418) 


265 


3i 





NOV29a 


336 


geSq 

GEgQ 


NOV29b 


335 


NOV29C 


333 


GERQ 


gi 


6624133) 


325 


GEgQ 


gi 


7110715) 


. 325 


GERQ 


gi 


16758646) 


325 


GERQ 


gi 


13543184) 


325 


GERQ 


gi 


4164418| 


325 


GERQ' 



N0V29a 
NOV29b 
HOV29c 



gi 
gi 
gi 
gi 
gi 



6624133) 
7110715) 
16758646| 
13543184 | 
4164418) 




394 
395 
393 
385 
385 
385 
385 
385 
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Tables 29I-J list the domain descriptions from DOMAIN analysis results against 
NOV29. This indicates that the NOV29 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 291 Domain Analysis of NOV29 

gnl| Smart |smart00516, SEC14, Domain in homologues of a S. cerevisiae 
phosphatidylinositol transfer protein (Secl4p) ; Domain in homologues 
of a S. cerevisiae phosphatidylinositol transfer protein (Secl4p) and 
in RhoGAPs, RhoGEFs and the RasGEF , neurof ibromin (NF1) . Lipid-binding 
domain. The SEC14 domain of Dbl is known to associate with G protein 
beta/gamma subunits. (SEQ ID NO: 822) 
CD-Length = 157 residues, 96.8% aligned 
Score « 131 bits (329), Expect » 9e-32 



NOV2 9 : 90 VI QLYDSGGLCGYDYEGCPVYFNI IGSLDPKGLLLSASKQDMIRKRIKVCELLLHECELQ 14 9 

I + I II II +1 II III I + ++++I + I I I I 
Sb j Ct : 4 VGKAYIPGGR- -YDKDGRPVLVFRAGRFDLK SVTLEELLRYLVYVLEKALQE 53 

10 NOV29: 150 TQKLGRKI EMALMVFDMEGLS LKHLWKPAVEVYQQFFS I LEANYPETLKNL I VIRAPKLF 209 

+1 II +II++III+ ♦ I + I ++ 11+ +111 I + +1 I I 
Sb jet : 54 - EKKTGGI EGFTTI FDLKGLSMSN PDLGVLRKILKILQDHYPERLGKVYI INPPWFF 109 

NOV29: 210 PVAFNLVKSFMSEETRRKIVILGDNWKQELTKFISPDQLPVEFGGT 255 
15 | + + + | |*|| + || || + | + |+|| ++I l+lll I III 

Sbjct: 110 RVLWKIIKPFLSEKTREKIRFVGPDSKEELLEYIDPEQLPEELGGT 155 



Table 29 J Domain Analysis of NOV29 



gnl|Pfam|pfam00650, CRALJTRIO, CRAL/TRIO domain.. The original profile 
has been extended to include the carboxyl domain from the known 
structure of Secl4. (SEQ ID NO: 823) 
CD-Length = 185 residues, 98.9% aligned 
Score » 120 bits (300), Expect m 2e-28 



20 



25 



30 



35 



N0V29: 


73 


Sb j Ct : 


3 


NOV29: 


132 


Sbjct: 


63 


NOV29: 


190 


Sbjct: 


121 


N0V29: 


250 


Sbjct: 


180 



RKQQDLDN I V - TWQPPEWI QLYDSGGLCGYDYEGCPVYFNI I GS LDPKGLLLSASKQDM 13 1 

|++ +| |+ |+ || +11 +1 III 11+ I +1 + + I 

RREFGVDTILEEATYPKEVIAKLYPQFIHGSDKDGRPVYLERRGQLNLKKMLFITTVERM 62 



+| + I 11+ ++I+I I + III++I+I+ III ++ +11 

VRNLVYEMEQALLYLLPACSRKVGTLINGSCTVFDLKGVSVSSANWVPGVL- - KKVLNIL 12 0 



+ III I +1 II II + l+l 1+ +11 II +11+ I II ++I I II 



Vitamin E (alpha-tocopherol) is an essential dietary nutrient for humans and animals. 
The mechanisms involved in cellular regulation as well as in the preferential cellular and 
tissue accumulation of alpha-tocopherol are not yet well established. We previously reported 
(Stocker, A., Zimmer, S., Spycher, S. E., and Azzi, A. (1999) IUBMB Life 48, 49-55) the 
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identification of a novel 46-kDa tocopherol-associated protein (TAP) in the cytosol of bovine 
liver. Here, we describe the identification, the molecular cloning into Escherichia coli, and the 
in vitro expression of the human homologue of bovine TAP, hTAP. This protein appears to 
belong to a family of hydrophobic ligand binding proteins, which have the CRAL (cis-retinal 
5 binding motif) sequence in common. By using a biotinylated alpha-tocopherol derivative and 
the IASys resonant mirror biosensor, the purified recombinant protein was shown to bind 
tocopherol at a specific binding site with K(d) 4.6 x 10(-7) m. Northern analyses showed that 
hTAP mRNA has a size of approximately 2800 base pairs and is ubiquitously expressed. The 
highest amounts of hTAP message are found in liver, brain, and prostate. In conclusion, hTAP 

10 has sequence homology to proteins containing the CRALTRIO structural motif. TAP binds to 
alpha-tocopherol and biotinylated tocopherol, suggesting the existence of a hydrophobic 
pocket, possibly analogous to that of SEC 14. Zimmer S. et al. (J Biol Chem. 2000 Aug 
18;275(33):25672-80.) 

The disclosed NOV29 nucleic acid of the invention encoding a CRAL-TRIO -like 

15 protein includes the nucleic acid whose sequence is provided in Table 29 A, 29C, 29E or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 29A, 29C, or 29E while 
still encoding a protein that maintains its CRAL-TRIO -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 

20 whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

25 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 26 percent of the bases may be so changed. 

The disclosed NOV29 protein of the invention includes the CRAL-TRIO -like protein 

30 whose sequence is provided in Table 29B, 29D, or 29F. The invention also includes a mutant 
or variant protein any of whose residues may be changed from the corresponding residue 
shown in Table 29B, 29D, or 29F while still encoding a protein that maintains its CRAL-TRIO 
-like activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 32 percent of the residues may be so changed. 
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The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this CRAL-TRIO -like protein 
(NOV29) is a member of a "CRAL-TRIO family". Therefore, the NOV29 nucleic acids and 

5 proteins identified here may be useful in potential therapeutic applications implicated in (but 
not limited to) various pathologies and disorders as indicated below. The potential therapeutic 
applications for this invention include, but are not limited to: protein therapeutic, small 
molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 

1 0 research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV29 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in brain disorders including epilepsy, eating disorders, 
schizophrenia, ADD, and cancer; heart disease; inflammation and autoimmune disorders 

1 5 including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, inflammatory skin 
disorders, allergies, blood disorders; psoriasis colon cancer, leukemia AIDS; thalamus 
. disorders; metabolic disorders including diabetes and obesity; lung diseases such as asthma, 
emphysema, cystic fibrosis, and cancer; pancreatic disorders including pancreatic 
insufficiency and cancer; and prostate disorders including prostate cancer, and/or other 

20 diseases and pathologies. 

NOV29 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV29 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

25 NOVX Antibodies" section below. The disclosed NOV29 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

30 NOV30 

A disclosed NOV30 nucleic acid of 717 nucleotides (also referred to as CG56191-01) 
encoding a novel Ryudocan-like protein is shown in Table 30A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 22-24 and ending with a 
TAG codon at nucleotides 658-660. Putative untranslated regions, if any, found upstream from 
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the initiation codon and downstream from the termination codon are underlined in Table 30A, 
and the start and stop codons are in bold letters. 

Table 30A. NOV30 Nucleotide Sequence (SEQ ID NO:127) 

~^GGCTGTTCACCCTCTCTGGA TGGCGGTAC^ 
TTTTGCAGGCACCCCCACCACCCCTGAGTCAATCCAAGAAACTGAGGTCATCAACCCAGGACCGCCTAGGG 
GCCCAAACTTCTCCAGATCCCTACTGGAAGACTCTGGATGTGGGTGTTGGGGGCAGGAACCTGATGACTCT 
GAGCTCTCTGGCTCTAGAGATATTGATGAGTCAAGGGACCCCAAGATCATCCCTGAAGTGATCCAACCCTT 
GGTGCTTCTAGATAACCACATCCCTGAGAGGGCAGGGCCTGGGAACCTGGTCGCCACTGAAACCAAGGAAC 
TG GAG G AC AACGAGGT CATCC CCAGG AGGATCTCACT CT CTGCGGGGGAC CAGG ATGTGT C CAATAAGGCA 
CCCATGTCCAACACTGCCCAGGGCAGCAACATCTTTGAGAGAATGGAGGTCGTGGCAGTCCTGATTGTGGA 
CAG CAT CG CGGGCATC CTCT CTGCTGTTTTCCTGATC CTG CTT CTGGTGAAC CATATGAAG AAGGATGAAG 
GCAG7VAACGACCTGAGCAGGAAGCCCATCTACi^AAAAGCCCCrAGCAAGGAGTT 

GAGC^CTGGTTTGGACTTTAG GGGATAGGGAAGTCGGAGGATl^rTGCAGAGTGGCCATTAGGATGCGGGAG 
GACAACC 

The NOV30 nucleic acid was identified on chromosome 22 and has 553 of 708 bases 
5 (78%) identical to a gb:GENBANK-ID:HUMRYUDO|acc:D13292.1 mRNA from Homo 
sapiens (mRNA for ryudocan core protein) (E = 2.2e" 82 ). 

A disclosed NOV30 polypeptide (SEQ ID NO: 128) encoded by SEQ ID NO: 127 is 
212 amino acid residues and is presented using the one-letter code in Table 30B. Signal P, 
Psort and/or Hydropathy results predict that NOV30 contains a signal peptide and is likely to 
10 be localized in the plasma membrane with a certainty of 0.4600. The most likely cleavage site 
for NOV30 is between positions 23 and 24: TPT-TP. 

Table 30B. Encoded NOV30 protein sequence (SEQ ID NO:128) 

MAVPTAPALLLLLLLLLFAGTPTTPESIQETEV^ 

DESRDPKI I PE VIQPLVLLDNHI PERAGPGNLVPTETKELEDNEVI PRRI SLSAGDQDVSNKAPMSNTAQGS 
NIFERMEWAVLI VPS IAGILS AVFLILLLVNHMKKDEGRNDLSRKPI YKKAPS KELLRFFYEHWFGL 

The disclosed NOV30 amino acid sequence has 121 of 198 amino acid residues (61%) 
identical to, and 140 of 198 amino acid residues (70%) similar to, the 202 amino acid residue 
1 5 ptnnS WISSPROT-ACC: P34901 protein from Rattus norvegicus (Rat) (Syndecan-4 Precursor 
(Ryudocan Core Protein)) (E = 1 .9e~ 5! ). 

NOV30 is predicted to be expressed in at least myeloid tissue, B-cell lymphoma, 
including B-cell precursor lymphoblastic leukemia, lymphoplasmacytoid, immunoblastic, . 
lymphocytic/CLL, hairy cell leukemia, large B-cell, mantle-cell, marginal zone and follicular, 
20 lymphomas, endothelia, Lymphopoietic and bone marrow (BM) plasma cells (PCs). This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, NOV30 is predicted to be expressed in the following tissues because of the 
25 expression pattern of (GENBANK-ID: gb:GENBANK-ID:HUMRYUDO|acc: D13292.1) a 

264 



WO 02/068649 PCT/US02/02785 



closely related Human mRNA for ryudocan core protein homolog in species Homo sapiens: 
myeloid tissue. 

NOV30 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 30C. 



Table 30C. BLAST results for NOV30 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 14771140) ref |XP_ 
009530. 3| 
(XMJD09530) 


syndecan 4 
(amphiglycan, 
ryudocan) [Homo 
sapiens] 


198 


119/197 
(60%) 


136/197 
(68%) 


9e-49 


gi | 4506861 | ref |NP_0 
02990. l| 
(NM_002999) 


syndecan 4 
(amphiglycan, 
ryudocan) [Homo 
sapiens] 


198 


120/197 
(60%) 


137/197 
(68%) 


2e-45 


gi | 6981522 | ref |NP_0 
36781. l| 
(KM 012649) 


ryudocan/ syndecan 
4 [Rattus 
norvegicus] 


202 


119/199 
(59%) 


139/199 
(69%) 


3e-45 


gi | 6755442 | ref |NP_0 
35651. l| 
(NM 011521) 


syndecan 4 [Mus 
musculus] 


198 


117/199 
(58%) 


136/199 
(67%) 


6e-41 


gi| 1351051 |sp|P4941 
6|SDC4 CHICK 


SYNDECAN- 4 
PRECURSOR 


197 


80/216 
(37%) 


105/216 
(48%) 


le-14 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 30D. 



10 



15 



20 



25 



30 



35 



Table 30D Clustal W Sequence Alignment 



1) NOV30 (SEQ ID NO: 128) 

2) gi | 14771140 | ref |XP_009530 .3 | (XM_009530) syndecan 4 (amphiglycan, ryudocan) 
[Homo sapiens] (SEQ ID NO:452) 

3) gi | 4506861 | ref |NP_002990 . 1 1 (NM_002999) syndecan 4 (amphiglycan, ryudocan) [Homo 
sapiens] (SEQ ID NO:453) 

4) gi 1 6981522 | ref |NP_03678l . 1 1 (NM_012649) ryudocan/ syndecan 4 [Rattus norvegicus] 
(SEQ ID NO: 454) 

5) gi | 6755442 | ref |NP_035651. 1 1 (NM_011521) syndecan 4 [Mus mus cuius] (SEQ ID 
NO:455) 

6) gi| 1351051 |sp|P49416|SDC4_CHICK SYNDECAN- 4 PRECURSOR (SEQ ID NO:456) 



NOV30 



gi 
gi 
gi 
gi 
gi 



14771140) 

4506861 

6981522 

6755442 

1351051 



NOV30 


58 


gi 


147711401 52 


gi 


4506861 


52 


gi 


6981522 


57 


gi 


6755442 


57 


gi 


1351051 


51 




GDTER-KMEGEKKNTM 108 
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10 



15 



NOV30 


113 


gi 


147711401 107 


gi 


4506861 


107 


gi 


6981522 


111 


gi 


6755442 


108 


gi 


1351051 


109 




NOV30 


173 


gi 


14771140 | 167 


gi 


4506861 


167 


gi 


6981522 


171 


gi 


6755442 


167 


gi 


1351051 


166 



190 



200 



210 



220 




20 Table 30E lists the domain description from DOMAIN analysis results against 

NOV30. This indicates that the NOV30 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 30E Domain Analysis of NOV30 

gnl |Pf amjpf am01034, Syndecan, Syndecan domain. Syndecans are 
transmembrane heparin sulfate proteoglycans which are implicated in 
the binding of extracellular matrix components and growth factors (SEQ 
ID NO: 824) 

CD-Length = 359 residues, 21.7% aligned 
Score = 41.6 bits (96), Expect * 5e-05 



25 NOV30: 115 NEVIPRRISLSAGDQDVSNKAPMSNTA QGSNIFERMEVVAVLIVDS I AGILS 166 

II I + + + + 1+ II I I 1 1 I l + l +1 + M 

Sb}Ct: 258 NETSPEOTAAANPEPLGRGQRPIDNTVDSGSSGAQQSQKILERKEVLAAVIAGGVVGLLF 317 

NOV30: 167 AVFLI LLL VNHM - KKDEG 183 

30 I I I I++ ++ I I I I I I 

Sbjct: 318 AVFLVMFMLYRMKKKDEG 335 

Kininogens, the high molecular weight precursor of vasoactive kinins, bind to a wide 
variety of cells in a specific, reversible, and saturable manner. The cell docking sites have 

35 been mapped to domains D3 and D5(H) of kininogens; however, the corresponding cellular 
acceptor sites are not fully established. To characterize the major cell binding sites for 
kininogens exposed by the endothelial cell line EA.hy926, intact cells were digested with 
trypsin and other proteases and found a time- and concentration-dependent loss of (125)1- 
labeled high molecular weight kininogen (H-kininogen) binding capacity (up to 82%), 

40 indicating that proteins are crucially involved in kininogen cell attachment. Cell surface 

digestion with heparinases similarly reduced kininogen binding capacity (up to 78%), and the 
combined action of heparinases and trypsin almost eliminated kininogen binding (up to 85%), 
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suggesting that proteoglycans of the heparan sulfate type are intimately involved. Consistently, 
. inhibitors such as p-nitrophenyl-beta-d-xylopyranoside and chlorate interfering with heparan 
sulfate proteoglycan biosynthesis reduced the total number of kininogen binding sites in a 
time- and concentration-dependent manner (up to 67%). In vitro binding studies demonstrated 
5 that biotinylated H-kininogen binds to heparan sulfate glycosaminoglycans via domains D3 
and D5(H) and that the presence of Zn(2+) promotes this association. Cloning and over- 
expression of the major endothelial heparan sulfate-type proteoglycans syndecan-1, syndecan- 
2, syndecan-4, and glypican in HEK293t cells significantly increased total heparan sulfate at 
the cell surface and thus the number of kininogen binding sites (up to 3. 3-fold). This gain in 

10 kininogen binding capacity was completely abolished by treating transfected cells.with 

heparinases. It was concluded that heparan sulfate proteoglycans on the surface of endothelial 
cells provide a platform for the local accumulation of kininogens on the vascular lining. This 
accumulation may allow the circumscribed release of short-lived kinins from their precursor 
molecules in close proximity to their sites of action (Renne et ah, J Biol Chem 2000, 

15 275(43):33688-96). 

Lymphopoietic cells require interactions with bone marrow stroma for normal 
maturation and show changes in adhesion to matrix during their differentiation. Syndecan, a 
heparan sulfate-rich integral membrane proteoglycan, functions as a matrix receptor by 
binding cells to interstitial collagens, fibronectin, and thrombospondin. Therefore, it was asked 

20 whether syndecan was present on the surface of lymphopoietic cells. In bone marrow, 

syndecan was only found on precursor B cells. Expression changes with pre-B cell maturation 
in the marrow and with B-lymphocyte differentiation to plasma cells in interstitial matrices. 
Syndecan on B cell precursors is more heterogeneous and slightly larger than on plasma cells. 
Syndecan 1) is lost immediately before maturation and release of B lymphocytes into the 

25 circulation, 2) is absent on circulating and peripheral B lymphocytes, and 3) is reexpressed 
upon their differentiation into immobilized plasma cells. Thus, syndecan is expressed only 
when and where B lymphocytes associate with extracellular matrix. These results indicate that 
B cells differentiating in vivo alter their matrix receptor expression and suggest a role for 
syndecan in B cell stage-specific adhesion (Sanderson et aL, Cell Regul 1989,l(l):27-35). 

30 Detection of abnormal numbers and/or distribution of bone marrow (BM) plasma cells 

(PCs) on trephine biopsies can be important in the differential diagnosis of multiple myeloma 
(MM) and other PC disorders. A variety of immunohistochemical markers can potentially 
improve the specificity and sensitivity of PC detection on routine histological sections 
obtained from trephine BM biopsies, but most of them are not completely satisfactory. In one 
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study, the antibody CD138/B-B4, which is an optimal marker for PC detection on BM 
aspirates by flow cytometry, was investigated to determine whether it can be used successfully 
for the identification of PCs also on formalin-fixed, decalcified biopsies. A series of samples 
including normal BM, MM, monoclonal gammopathies of undetermined significance, and B- 
5 cell lymphoma of various types, including B-cell precursor lymphoblastic leukemia, 

lymphoplasmacytoid, immunoblastic, lymphocytic/CLL, hairy cell leukemia, large B-cell, 
mantle-cell, marginal zone and follicular lymphomas, have been investigated for CD138 
expression using a sensitive immunohistochemical technique. Within the BM 
microenvironment, CD 138 was characterized by excellent sensitivity and specificity. Virtually 

10 all normal and neoplastic PCs expressed clear-cut membrane CD138 immunostaining, whereas 
all other cell types did not. All cases of MM, including plasmablastic and leukemic cases, 
showed strong immunoreactivity. Conversely, all B-cell lymphomas, including all cases 
characterized by secretive features, lymphoplasmacytoid, and immunoblastic lymphomas, 
were completely negative. These results demonstrate that CD138 is a highly sensitive and 

15 specific marker that is useful for the rapid and precise localization of normal and neoplastic 
PCs on routine BM sections. In addition, because of its clear-cut cell membrane localization, 
CD138 can be used successfully in double-marker immunostaining reactions to evaluate 
precisely nuclear prognostic markers such as Ki67 and p53 in MMs (Chilosi et ah, Mod Pathol 
1999, 12(12):! 101-6). 

20 Monoclonal antibody therapy has emerged as a viable treatment option for patients 

with lymphoma and some leukemias. It is now beginning to be investigated for treatment of 
multiple myeloma. There are relatively few surface antigens on the plasma cells that are 
suitable for antibody-directed treatment. Possible molecules include HM1 .24, CD38, ICAM-1 
(CD54), CD40, CD45, CD20, and syndecan 1. There is now some clinical experience with 

25 anti-CD38 antibody in lymphoma and myeloma. However, to date, there has been minimal 
clinical activity observed. Additional antibodies are entering clinical trials. A new approach 
involves the generation of an anti-CD38 single-chain variable fragment (scFv) construct that 
acts as the carrier of a toxin gene instead of being conjugated directly to the toxin itself. It is 
hoped that expression of the toxin by CD38+ plasma cells will promote suicide of the 

30 malignant cells without affecting normal cells or generating an immunologic response to the 
toxin. Ongoing clinical trials are also attempting to target B-cell antigens such as CD20. 
Although CD20 is present only on 20% of myeloma cells, it may be present on myeloma 
precursor cells. This treatment has met with success in follicular lymphoma and is now being 
evaluated in clinical trials in both Europe and the United States for myeloma. Although these 
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clinical trials are in very early stages, researchers are beginning to understand that antibody 
therapy can be used not only as a carrier molecule of radioisotopes and toxins, but also as 
molecules that can trigger tumor cells and promote growth arrest or apoptosis (Maloney et al., 
Semin Hematol 1999, 36(1 Suppl 3):30-3). 
5 The NOV30 nucleic acid of the invention encoding a Ryudocan-like protein includes 

the nucleic acid whose sequence is provided in Table 30A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 30A while still encoding a protein that maintains 
its Ryudocan-like activities and physiological functions, or a fragment of such a nucleic acid. 

10 The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

1 5 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 22% of the residues 
may be so changed. 

20 The NOV30 protein of the invention includes the Ryudocan-like protein whose 

sequence is provided in Table 30B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 30B while 
still encoding a protein that maintains its Ryudocan-like activities and physiological functions, 
or a functional fragment thereof. In the mutant or variant protein, up to about 63 of the bases 

25 may be so changed. 

The NOV30 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: brain disorders including epilepsy, 

30 eating disorders, schizophrenia, ADD, cancer, heart disease, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders, psoriasis, colon cancer, leukemia, 
AIDS, thalamus disorders, metabolic disorders including diabetes and obesity, lung diseases 
such as asthma, myelomas, emphysema, cystic fibrosis, and cancer, pancreatic disorders 
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including pancreatic insufficiency and cancer, and prostate disorders including prostate cancer 
and other diseases, disorders and conditions of the like. 

NOV30 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV30 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
1 0 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 

NOV31 

A disclosed NOV3 1 nucleic acid of 683 nucleotides (also referred to as CG56392-01) 
1 5 encoding a novel Sulfur-rich Keratin-like protein is shown in Table 31 A. An open reading 

frame was identified beginning with an ATG initiation codon at nucleotides 46-48 and ending 
with a TGA codon at nucleotides 652-654. Putative untranslated regions, if any, found 
upstream from the initiation codon and downstream from the termination codon are underlined 
in Table 3 1 A, and the start and stop codons are in bold letters. 

Table 31A. NOV31 Nucleotide Sequence (SEQ ID NO:129) 

AGGGGGCTCCACACCTGCACCTCCTTCTCACCTGCTCCTCTACCTGCTCCAC 
TGGGCTGCTGTGGCTGCrCCGGAGGCTGTGGCTCCAGCT^ 

GGCTCTGGCTGCAGGGGCTGTGGCCCCAGCTGCTGTGCACCCGTCTACTGCTGCAAGCCCGTGTGCTGCTG 
TGTTCCAGCCTGTTCCTGCTCTAGCTGTGGCAAGCGGGGCTGTGGCTCCTGTGGGGGCTCCAAGGGAGGCT 
GTGGTTCTTGTGGCTGCTCCCAGTGCAGTTGCTOCAA 

TGCTCCCAGTGCAGCTGCTGCAAGCCCTACTGCTCCCAGTCCAGCTGTTGTAAGCCCTGTTGCTGCT 
AGGCTGTGGATCATCCTGCTGCC^GTCCAGCTGCTGCAAGCCCTG 

CCGTGTGCTGCCAGTCCAGCTGCTGCAAGCCCTGTTGCTGCCAGTCCAACTGTTGTGTCCCTGTGTGCTGC 
CAGTGTAAGATCTGAGGCTCTAGTGGGAAACCTCAGGTAGCTCC 



The NOV31 nucleic acid was identified on chromosome 1 1 and has 654 of 683 bases 
(95%) identical to a gb:GENBANK-ID:HSA6693|acc:AJ0Q6693.1 mRNA from Homo sapiens 
(UHS KerA gene) (E - 3.3e" 136 ). 

A disclosed NOV31 polypeptide (SEQ ID NO:130) encoded by SEQ ID NO:I29 is 

202 amino acid residues and is presented using the one-letter code in Table 3 IB. Signal P, 

Psort and/or Hydropathy results predict that NOV3 1 contains a signal peptide and is likely to 

be localized to the cytoplasm with a certainty of 0.4500. The most likely cleavage site for a 

NOV31 peptide is between amino acids 32 and 33: TRT-MG. 
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Table 31B. Encoded NOV31 protein sequence (SEQ ID NO;130) 

MWDIKSRGSGGSTPAPPSHLLLYLLHPQSTRTMGCCGCSGGCGSSCGGCDSSCGSCGSGCRGCGPSCCAPVy 
CCKPVCCCVPACSCSSCGKRGCGSCGGSKGGCGSCGCSQCSCCKPCCCSSGCGSSCCQCSCCKPYCSQSSCC 
KPCCCSSGCGSSCCQSSCCKPCCCQSSCCVPVCCQSSCCKPCCCQSNCCVPVCCQCKI 



The disclosed NOV31 amino acid sequence has 158 of 170 amino acid residues (92%) 
identical to, and 158 of 170 amino acid residues (92%) similar to, the 169 amino acid residue 
ptnr:SWISSNEW-ACC:P26371 protein from Homo sapiens (Human) (Keratin, Ultra High- 
5 Sulfur Matrix Protein A (Uhs Keratin A) (Uhs Kera)) (E = 1 Ae ]0l y 

NOV31 is predicted to be expressed in at least Kidney, Pancreas, Testis and Whole 
Organism. This information was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, Public 
EST sources, Literature sources, and/or RACE sources. 
1 0 In addition, NOV3 1 is predicted to be expressed in the following tissues because of the 

expression pattern of (GENBANK-ID:gb:GENBANK-ID:HSA6693|acc:AJ006693.1) a 
closely related Homo sapiens UHS KerA gene homolog in species Homo sapiens: Kidney, 
Pancreas and Testis. 

NOV31 also has homology to the amino acid sequences shown in the BLASTP data 
15 listed in Table 3 1C. 



Table 31C. BLAST results for NOV31 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 12835376 | dbj | BAB 
23238. l| (AK004258) 


data source :SPTR, 
source 
key:Q64526, 
evidence : I SS -put a 
tive-similar to 
ULTRA-HIGH 
SULPHUR KERATIN 
[Mus mus cuius] 


195 


76/157 
(48%) 


91/157 
(57%) 


5e-l3 


gi| 2136964 |pir| |I46 
489 


cysteine -rich 
hair keratin 
associated 
protein - rabbit 


126 


53/120 
(44%) 


72/120 
(59%) 


3e-ll 


gi| 128446 00 | dbj |BAB 
26426. l| (AK009665) 


data source :SPTR, 
source 
key:Q28707, 
evidence : ISS-homo 
log to CYSTEINE 
RICH HAIR KERATIN 
ASSOCIATED 
PROTEIN-putative 
[Mus jnusculus] 


168 


59/116 
(50%) 


70/116 
(59%) 


le-10 


gi | 15082220 |ref|NP 
149048. l| 
(NM_033059) 


keratin 
associated 
protein 4.14 
[Homo sapiens] 


195 


56/122 
(45%) 


65/122 
(52%) 


2e-10 
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gi| 13386198|ref |NP 


RIKEN cDNA 


165 


53/106 


61/106 


2e-10 


081363.1) 


2300006N05 [MUS 




(50%) 


(57%) 




(NM 027087) 


musculus] 











The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 3 ID. 



Table 31D Clustal W Sequence Alignment 



1) NOV31 {SEQ ID NO: 130) 

2) gi | 12835376 | dbj |BAB23238.l| (AK004258) data source :SPTR f source key: Q64 52 6, 
evidence : ISS-put at ive -similar to ULTRA-HIGH SULPHUR KERATIN [Mus musculus] (SEQ ID 
NO:457) 

3) gi | 2136964 |pir | ) 146489 cysteine-rich hair keratin associated protein - rabbit 
(SEQ ID N0:458) 

4) gi|l2844600|dbj|BAB26426.l| (AK009665) data source :SPTR, source key: Q2 8707, 
evidence :ISS-homolog to CYSTEINE RICH HAIR KERATIN ASSOCIATED PROTEIN-putative [Mus 
musculus] (SEQ ID NO: 45 9) 

5) gi|l50B2220|ref |NP_JL49048 .1 1 (NM_033059) keratin associated protein 4 . 14 [Homo 
sapiens] (SEQ ID NO:460) 

6) gi | 13386198 | ref |NP_081363 . 1 | (NM_027087) RIKEN cDNA 2300006N05 [Mus musculus] 
(SEQ ID NO: 461) 



NOV31 


1 


gi 


12835376 


1 1 


gi 


2136964 | 


1 


gi 


12844600 


1 


gi 


15082220 


1 


gi 


13386198 


1 



NOV31 


47 


gi 


12835376 


30 


gi 


2136964) 


26 


gi 


12844600 


30 


gi 


15082220 


50 


gi 


13386198 


30 



10 

MWDIKSRGSGl 



20 

I.... I... 

PAPPSHLLLYLLHP 

[S 





G O 

iPQCCQSLCCQg 




120 



scssHgkrl 

IPJgCISS 

ass-— — -s 

RP|IHCRPS 

:4 SSBCKPr 



95 
85 
61 
70 
90 
65 



130 



140 



150 



160 



NOV31 

gi 
gi 
gi 
gi 
gi 



12835376 | 
2136964) 
12844600) 
15082220) 
13386198 | 



96 SCGGSKI 

86 gp 

62 £(P 

71 VS 

91 SSMCCQI 

66 QP 



NOV31 


150 


SSG-B 






gi 


12835376 | 141 


RPSgg 


s 




gi 


2136964) 


107 


RP--- 






gi 


12844600 


116 


GSSffi 


RP- 


gi 


15082220 


141 


issm 






gi 


13386198 


116 


Rp -SI 


3S 






NOV31 

gi 1 12835376 1 
gi j 2136964 | 
gi j 12844600 | 
gi j 15082220 | 



199 QCKI 202 

195 195 

126 126 

168 168 

195 195 
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g± | 13386198 | 165 165 

Insulin-like growth factor 1 (IGF-1) mediates many of the actions of growth hormone. 
5 Overexpression of IGF-1 has been reported to have endocrine and paracrine/autocrine effects 
on somatic growth in transgenic mice. To study the paracrine/autocrine effects of IGF-1 in hair 
follicles, transgenic mice were produced by pronuclear microinjection of a construct 
containing a mouse ultra-high sulfur keratin (UHS-KER) promoter linked to an ovine IGF-1 
cDNA. This UHS-KER promoter has previously been shown to direct expression of a reporter 

10 gene to the hair follicles of transgenic mice. Four transgenic mouse lines were established as a 
result of microinjection of 435 embryos. Transgene expression was found in skin at day 8 and 
day 15 of age in three of the lines. Progeny tests were carried out by mating two of the 
transgenic expressing males to nontransgenic females. Mice from one line were all 
nonexpressors while four of the 12 mice from the other showed integration of the transgene 

15 and three expressed transgene IGF-1 mRNA in the skin. Vibrissa growth at 1 1 -21 d of age was 
significantly greater in transgenic expressors than in their nontransgenic littermates. 
Specifically, the increase in vibrissa length for transgenics at days 11-16 (20.5%) is 
approximately 2-fold compared with days 16-21 (1 1 .9%). These results demonstrate that local 
overexpression of IGF-1 in transgenic mice is capable of stimulating vibrissa growth during 

20 the first neonatal hair cycle (Su et ah, J Invest Dermatol 1999, 1 12(2):245-8). 

The major histological components of the hair follicle are the hair cortex and cuticle. 
The hair cuticle cells encase and protect the cortex and undergo a different developmental 
program to that of the cortex. In one study, the molecular characterization of a set of 
evolutionarily conserved hair genes which are transcribed in the hair cuticle late in follicle 

25 development was reported. Two genes were isolated and characterized, one expressed in the 
human follicle and one in the sheep follicle. Each gene encodes a small protein of 16 kD, 
containing greater than 50 cysteine residues, ranging from 31 to 36 mol% cysteine. Their high 
cysteine content and in vitro expression data identify them as ultra-high-sulfur (UHS) keratin 
proteins. The predicted proteins are composed almost entirely of cysteine-rich and glycine-rich 

30 repeats. Genomic blots reveal that the UHS keratin proteins are encoded by related multigene 
families in both the human and sheep genomes. Tissue in situ hybridization demonstrates that 
the expression of both genes is localized to the hair fiber cuticle and occurs at a late stage in 
fiber morphogenesis (MacKinnon et aL, J Cell Biol 1990, 111(6 Pt l):2587-600). 

The NOV31 nucleic acid of the invention encoding a Sulfur-rich Keratin-like protein 

35 includes the nucleic acid whose sequence is provided in Table 3 1 A, or a fragment thereof. The 
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invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
fromthe corresponding base shown in Table 31 A while still encoding a protein that maintains 
its Sulfur-rich Keratin-like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
5 to those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of non-limiting example, modified bases, and nucleic acids 
whosesugar phosphate backbones are modified or derivatized. These modifications are carried 

1 0 out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 5% of 
the residues may be so changed. 

The NOV31 protein of the invention includes the Sulfur-rich Keratin-like protein 

1 5 whose sequence is provided in Table 3 1 B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
3 IB while still encoding a protein that maintains its Sulfur-rich Keratin-like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 56% of the bases may be so changed. 

20 The NOV3 1 nucleic acids and proteins of the invention are useful in potential 

diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: brain disorders including epilepsy, 
eating disorders, schizophrenia, ADD, cancer, heart disease, inflammation and autoimmune 

25 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders, psoriasis, colon cancer, leukemia, 
AIDS, thalamus disorders, metabolic disorders including diabetes and obesity, lung diseases 
such as asthma, emphysema, cystic fibrosis, and cancer, pancreafic disorders including 
pancreatic insufficiency and cancer, and prostate disorders including prostate cancer and other 

30 diseases, disorders and conditions of the like. 

NOV31 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- ' 
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NOVX Antibodies" section below. For example the disclosed NOV31 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
5 new drug targets for various disorders. 



NOV32 

A disclosed NOV32 nucleic acid of 1575 nucleotides (also referred to as CG56686-01) 
encoding a novel DNMT1 associated protein- 1 (DMAP)-like protein is shown in Table 32A. 
10 An open reading frame was identified beginning with an ATG initiation codon at nucleotides 
94-96 and ending with a TGA codon at nucleotides 1573-1575. Putative untranslated regions, 
if any, found upstream from the initiation codon and downstream from the termination codon 
are underlined in Table 32A, and the start and stop codons are in bold letters. 



Table 32A. NOV32 Nucleotide Sequence (SEQ ID NO:131) 

CTTGGAGGCTGCAGGTCCGGACCCAGGTGCGGAAGTGCGAGGGCC CAGGCACTGAC CCTTGACCTCCGGTG 
GCTCCCCCATCTCTCAGGCGCGA TGGCTACGGGCGCGGATGTACGGGACATTCrAGAACTCGGGGGTCCAG 
AAGGGGATG C AGC CTCTGGGAC CAT CAG CAAGAAGGACATTAT CAAC C CGGACAAG AAAAAATC CAAGAAG 
TCCTCTGAGACACTGACTTTCAAGAGGCCCGAGGGCATGC^CCGGGAAGTCTATGCCTTGCTCTACTCTGA 
CAAGAACAAGGGCTCCTGCTTGCTTAGCAGGATGCAGGAGGACCTGAAGT CTTTTG CTCCAGGACATGACT 
TTCTTGCTATAGGGGATGCACCCC CACTG CTACC CAGTGACACTGGCCAGGGATACCGTACAGTGAAGGCC 
AAGTTGGGCTCCAAGAAGGTGCGGCCTTGGAAGTGGATGCCATTCACCAACCCGGCCCGCAAGGACGGAGC 
AATGTTCTTCCACTGGCGACGTGCAGCGGAGGAGGGCAAGGACTACCCCTTTGCCAGGTTCAATAAGACTG 
TG CAGGTGCCTGTGTACTCGGAGCAGGAGTACCAGCTTTATCTCCACGATGATGCTTGGACTAAGG CAGAA 
ACTGACCACCTCTTTGACCTCAGCCGCCGCTTTGACCTGCGTTTTGTTGTTATCCATGACCGGTATGACCA 
CCAGCAGTTCAAGAAGCGTTCTGTGGAAGACCTGAAGGAGCGGTACTACCAC^TCTGTGCTAAGCTTGCCA 
ACGTGCGGGCTGTGCCAGGCACAGACCTTAAGATACCAGTATTTGATGCTGGGCACGAACGACX3GCGGAAG 
GAACAGCTTGAGCGTCTCTACAACCGGACCCCAGAGCAGGTGGCAGAGGAGGAGTACCTGCTACAGGAGCT 
GCGCAAGATTGAGGCC CGGAAGAAGGAGCGGGAGAAACGCAG CCAGG ACCTGC AG AAGCTG ATCACAGCGG 
CAGACACCAC TGCAGAGC AGCGGCG C ACGG AACG CAAGG CC C C CAAAAAG AAG CTACCCCAGAAAAAGGAG 
GCTGAGAAGCCGGCTGTTCCTGAGACTGCAGGCATCAAGTTTCCAGACTTCAAGTCTCCAGGTG 
GCGGAGCCAACGGATGAAGCTGCCAAGCTCTGTGGGACAGAAGAAGATCAAGGCCCTGGAACAGATGCTGC 
TGGAGCTTGGTGTGGAGCTGAGCCCGACACCTACGGAGGAGCTGGTGCACATGTTCAATGAGCTGCGAAGC 
GACCTGGTGCTCCTCTACGAGCTCAAGCAGGCCTGTGCCAACTGCGAGTATGAGCTGCAGATGCTC 
CCGTCATGAGGCACTGGCCCGGGCTGGTGTGCTAGGGGGCCCTGCCACACCAGCATCAGGCCCAGGCCCGG 
'CCTCTGCTGAGCCGGCAGTGACTGAACCCGGACTTGGTCCTGACCCCAAGGACACCATCATTGATGTGGTG 
GGCGCACCCCTCACGCCCAATTCGAGAAAGCGACGGGAGTCGGCCTCCAGCTCATCTTCCGTGAAGAAAGC 
CAAGAAGCCGTGA 



15 The NOV32 nucleic acid was identified on chromosome lp34 and has 1244 of 1273 

bases (97%) identical to a gb:GENBANK-ID:AF265228|acc:AF265228.1 mRNA from Homo 
sapiens (DNMT1 associated protein-1 (DMAP1) mRNA, complete cds) (E = 1 .Oe* 309 ). 

A disclosed NOV32 polypeptide (SEQ ID NO:132) encoded by SEQ ID NO:131 is 
493 amino acid residues and is presented using the one-letter code in Table 32B. Signal P, 

20 Psort and/or Hydropathy results predict that NOV32 does not contain a signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.9800. 
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Table 32B. Encoded NOV32 protein sequence (SEQ ID NO:132) 

MATGADVRDILEI/GGPEGDAASGTISKKDI^ 

SRMQEDL KS FAPGHDFLAI GDAPPLLPSDTGQG YRTVKAKLGS KKVRP WKWMP FTNP ARKDGAM F FHWRRAA 
EEGKDYPFARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFBLRF 

LKERYYHICAKLANVRAVPGTDLKI PVFDAGHERRRKEQLERL YNRTPEQVAEEEYLLQELRKI EARKKERE 
KRSQDLQKLI TAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPETAGI KFPDFKSAGVTLRSQRMKLPS S VG 
QKKIKALEQMLLELGVELSPTPTEELVHMFNELRSDL^ 

PATPASGPGPASAgPAVTEPGLGPDPKDTIIDWGAPLTPNSRKRRSSASSSSSVKKAKKP 



The disclosed NOV32 amino acid sequence has 401 of 401 amino acid residues 
(100%) identical to, and 401 of 401 amino acid residues (100%) similar to, the 467 amino acid 
residue ptnr: SPTREMBL-ACC:Q9NPF5 protein from Homo sapiens (Human) (Hypothetical 
5 53.0 Kda Protein (Dnmtl Associated Protein-1) (E = 1.3e~ 248 ). 

NOV32 is predicted to be expressed in at least Adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hijjpocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
10 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

NOV32 also has homology to the amino acid sequences shown in the BLASTP data 
15 listed in Table 32C. 



Table 32C. BLAST results for NOV32 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%> 


Expect 


gi| 7243231 |dbj |BAA9 
2663. l| (AB037846) 


KIAA1425 protein 
[Homo sapiens] 


495 


446/473 
(94%) 


446/473 
(94%) 


0.0 


gi | 13123776 |ref|NP 
061973. l| 
(NM_01910O) 


DNA 

methyl trans f erase 
1-associated 
protein 1 [Homo 
sapiens] 


467 


446/473 
(94%) 


446/473 
(94%) 


0.0 


gi | 12052838 | emb | CAB 
66592. l| (AL136657) 


hypothetical 
protein [Homo 
sapiens] 


467 


443/473 
(93%) 


445/473 
(93%) 


0.0 


gi | 12963557 |ref|NP 
075667.1 | 
(NM 023178) 


DNMT1 associated 
protein-1 [Mus 
musculus] 


468 


437/474 
(92%) 


438/474 
(92%) 


0.0 


gi|l2805675|gb|AAH0 

2321.l|AAH02321 

(BC002321) 


Unknown (protein 
for 

IMAGE: 3594236) 
[Mus musculus] 


451 


420/457 
(91%) 


421/457 
(91%) 


0.0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 32D. 
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Table 32D Clustal W Sequence Alignment 



10 



15 



20 



25 



30 



35 



40 



1) NOV32 (SEQ ID NO: 132) 

2) gi | 7243231 1 dbj (BAA92663.1) (AB037846) KIAA14 25 protein [Homo sapiens] {SEQ ID 
NO:462) 

3) gi|l3123776|ref |NP_061973 .1 1 (NM_019100) DNA methyl transferase 1-associated 
protein 1 [Homo sapiexis) (SEQ ID NO: 4 63) 

4) gi 1 12052838 1 emb | CAB66592.1) (AL136657) hypothetical protein [Homo sapiensl (SEQ 
ID NO:464) 

5) gi | 12963557 | ref |NP_075667 . 1 | (NM_023178) DNMT1 associated protein- 1 [Mus 
jnusculus] (SEQ ID NO:465) 

6) gi|l2805675|gb|AAH02321.l|AAH02321 (BC002321) Unknown (protein for 
IMAGE: 3594236) [MUS musculus) (SEQ ID NO: 466) 



NOV32 


1 


gi 


7243231] 


1 


gi 


13123776 


1 


gi 


12052838 


. 1 


gi 


12963557 


1 


gi 


12805675 


1 



NOV32 


. 33 


gi 


7243231) 


61 


gi 


13123776 


33 


gi 


12052838 


33 


gi 


12963557 


33 


gi 


12805675 


16 


NOV32 


$3 


gi| 7243231) 


95 


gi 13123776 


67 


gi 


12052838 


67 


gij 12963557 


67 


gij 12805675 


50 




GSCLLSRMQEDLKSFAPGHDFLAIG 92 

94 
66 
66 
66 
49 



130 



140 



150 



I 



I. 



160 



170 



180 



I. 



DAPPLLPSDTGQGYRTVKAKLGSKKVRPWKWMPFTNPARKDGAMFFKWRRAAEEGKDYPF 
DAPPLLPSDTGQGYRTVKAKLGSKKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
DAPPLLPSDTGQGYRTVKAKLGSKKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
DAPPLLPSDTGQGYRTVKAKLGSKKVRPWK^PFTNPARKDGAMFFHWRRAAEEGKDYPF 
DAPPLLPSDTGQGYRTVKAKLGSKKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
DAPPLLPSDTGQGYRTVKAKLGS KKVRPWKWMPFTNPARKDGAMFFffWRRAAEEGKDYP F 



152 
154 
126 
126 
126 
109 



190 



200 



210 



45 



50 



55 



60 



65 



NOV32 



gi 


7243231) 


155 


gi 


13123776 


127 


gi 


12052838 


127 


gi 


12963557 


127 


gi 


12805675 


110 


270 V3 2 


213 


gi 


7243231) 


215 


gi 


13123776 


187 


gi 


12052838 


187 


gi 


12963557 


187 


gi 


12805675 


170 


NOV32 


273 


gi 


7243231) 


275 


gi 


13123776 


247 


gi 


12052838 


247 


gi 


12963557 


247 


gi 


12805675) 230 



I 



220 
..I.. 



230 



153 



ARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFDLRFWIHDRYDHQQFKKR 
ARFNKTVQVPVYSEQEYQLXLHDDAWTKAETDHLFDLSRRFDLRFWIHDRYDHQQFKKR 
ARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFDLRFWIHDRYDHQQFKKR 
ARFNKTVQgpVYSEQEYQLYLHDjgAWTKAETDHLFDLSRRFDLRFWIHDRYDHQQFKKR 
ARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDIiSRRFDLRFWIHDRYDHQQFKKR 
ARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFDLRFVVIHDRYDHQQFKK' 



240 
.| 

212 
214 
186 
186 
186 
s 169 



250 



260 
..].. 



270 
..I.. 



280 



290 



I . 



300 
..I 



SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
S VEDL KER Y YHI CAKL ANVRAVPGTDLKI PVFDAGHERRRKE QLE RLYNRTPEQVAEEE Y 
S VEDL KER Y YH I C AKL ANVRA VPGTDL K I P VFDAGHERRRKE Q L ERL YNRT P E Q VAE EE Y 
S VEDLkERYYHTCAKL ANVRAVPGTDLKI PVFDAGHERRRKEQLE RLYNRTPEQVAEEE Y 



310 



I 



320 



330 



I. 



I 



340 



I. 



350 
..I.. 



360 



LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKER&KRSQDLQKLITAADTTASQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 



277 



WO 02/068649 



PCT/US02/02785 



10 



15 



20 



25 



NOV32 


333 


gi 


|7243231| 


335 


gi 


|13123776 


| 307 


gi |12052838 


307 


gi 


12963557 


307 


gi 


12805675 


290 


NOV32 


393 


gi 


7243231| 


395 


gi 


13123776 


367 


gi 


12052838 


367 


gi 


12963557 


367 


gi 


12805675 


350 



NOV32 



gi 
gi 
gi 
gi 
gi 



7243231) 
13123776 
12052838 
12963557 
12805675 



I- 



370 



380 



390 



400 



410 



420 



TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQr'lLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGWLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 



392 
394 
366 
366 
366 
349 



430 



440 



450 



I 



I 



LRSDLVLLYELKQACANCEYELQMLRHRHE 
LRSDLVLLYELKQACANCEYELQMLRHRHEAL 
LRSDLVLLYELKQACANCEYELQMLRHRHEAL 
LRSDLVLLYELKQACANCEYELQMLRHRHE AL 
LRSDLVLLYELKQACANCEYELQMLRHRHEAL 
LRS DL VLL YE L KQ AC ANCE YE LQMLRHRHEAL 




490 



500 




I. 



510 
. J. . 



520 



PNSRKRRESASSSSSVKKAKKP 
'PNSRKRRESASSSSSVKKAKKP 
'PNSRKRRESASSSSSVKKAKKP 
'PNSRKRRESASSS SS VKKAKKF 
'PNSRKRRESASSSSSVKKAKKP 
'PNSRKRRESASSSSSVKKAKKP! 



493 
495 
467 
467 
468 
451 



Methylation of CpG islands is associated with transcriptional silencing and the 

30 formation of nuclease-resistant chromatin structures enriched in hypoacetylated histones. 

Methyl-CpG-binding proteins, such as MeCP2, provide a link between methylated DNA and 
hypoacetylated histones by recruiting histone deacetylase, but the mechanisms establishing the 
methylation patterns themselves are unknown. Whether DNA methylation is always causal for 
the assembly of repressive chromatin or whether features of transcriptionally silent chromatin 

35 might target methyltransferase remains unresolved. Mammalian DNA methyltransferases 
(DNMT) show little sequence specificity in vitro, yet methylation can be targeted in vivo 
within chromosomes to repetitive elements, centromeres and imprinted loci. This targeting is 
frequently disrupted in tumour cells, resulting in the improper silencing of tumour-suppressor 
genes associated with CpG islands. Robertson et al. (Nat Genet 2000, 25:338-42) have shown 

40 that the predominant mammalian DNA methyltransferase, DNMT1 , co-purifies with the 
retinoblastoma (Rb) tumour suppressor gene product, E2F1, and HDAC1 and that DNMT1 
cooperates with Rb to repress transcription from promoters containing E2F-binding sites. 
These results establish a link between DNA methylation, histone deacetylase and sequence- 
specific DNA binding activity, as well as a growth-regulatory pathway that is disrupted in 

45 nearly all cancer cells. Recently, Rountree et al. (Nat Genet, 2000, 25:269-77) have shown that 
the non-catalytic amino terminus of DNMT1 binds to HDAC2 and a new protein, DMAP1 (for 
DNMT1 associated protein), and can mediate transcriptional repression. DMAP1 has intrinsic 
transcription repressive activity, and binds to the transcriptional co-repressor TSG101. 



WO 02/068649 



PCT/US02/02785 



DMAP1 is targeted to replication foci through interaction with the far N terminus of DNMT1 
throughout S phase, whereas HDAC2 joins DNMT1 and DMAP1 only during late S phase, 
providing a platform for how histones may become deacetylated in heterochromatin following 
replication. Thus, DNMT1 not only maintains DNA methylation, but also may directly target, 
5 in a heritable manner, transcriptionally repressive chromatin to the genome during DNA 
replication. 

The NOV32 nucleic acid of the invention encoding a DNMT1 associated protein- 1 
(DMAP)-like protein includes the nucleic acid whose sequence is provided in Table 32A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

10 bases may be changed from the corresponding base shown in Table 32A while still encoding a 
protein that maintains its DNMT1 associated protein-1 (DMAP)-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 

1 5 invention additionally includes nucleic acids or nucleic acid fragments, or complements 

thereto, whose structures include chemical modifications. Such modifications include, by way 
of non-limiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 

20 antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or variant 
nucleic acids, and their complements, up to about 3% of the residues may be so changed. 

The NOV32 protein of the invention includes the DNMT1 associated protein-1 
(DMAP)-like protein whose sequence is provided in Table 32B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 

25 residue shown in Table 32B while still encoding a protein that maintains its DNMT1 

associated protein-1 (DMAP)-like activities and physiological functions, or a functional 
fragment thereof. In the mutant or variant protein, up to about 9% of the bases may be so 
changed. 

The NOV32 nucleic acids and proteins of the invention are useful in potential 
30 diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: cancers such as breast cancer, colorectal 
cancers, lung cancer, liver cancer, pancreatic cancer, prostate cancer, stomach cancers, 
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developmental syndromes, Fragile X and Rett and other diseases, disorders and conditions of 
the like. 

NOV32 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV32 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
10 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV33 

A disclosed NOV33 nucleic acid of 7693 nucleotides (also referred to as CG56688-01) 
15 encoding a novel Notch 1 -like protein is shown in Table 33 A. An open reading frame was 

identified beginning with an ATG initiation codon at nucleotides 1-3 and ending with a TAA 
codon at nucleotides 7669-7671. Putative untranslated regions, if any, found upstream from 
the initiation codon and downstream from the termination codon are underlined in Table 33A, 
and the start and stop codons are in bold letters. 



Table 33 A. NOV33 Nucleotide Sequence (SEQ ID NO: 133) 

AT6CCGCC6CTCCTGGCGCCCCTGCTCTGCCTGGCGCTGCTGCCCGCGCTCGCCGCACGAGGCCCGCGATG 
CTCCCAGCCCGGTGAGACCTGCCTGAATGGCGGGAAGTGTGAAGCGGCCAATGGCACGGAGGCCTGCGTCT 
GTGGCGGGGCCTTCGTGGGCCCGCGATGCCAGGACCCCAACCCGTGCCTCAGCACCCCCTGCAAGAACGCC 
GGGACATGCCACGTGGTGGACCGCAGAGGCGTGGCAGACTATGCCTGCAGCTGT 

GCCCCTCTGCCTGACACCCCTGGACAACGCCTGCCTCACCAACCCCTGCCGCAACGGGGGCACCTGCGACC 
TGCTCACGCTGACGGAGTACAAGTGCCGCTGCCCGCCCGGCTGGTCAGGGAAATCGTGCCAGCAGGCTGAC 
CCGTGCGCCTCCAACCCCTGCGCCAACGGTGGCCAGTGCCTGCCCTTCGAGGCCTCCTACATCTGCCACTG 
CCCACCCAGCTTCCATGGCCCCACCTGCCGGCAGGATG 

GCC^CGGAGGCACCTGCCACAACGAGGTCGGCTCCTACCGCTGCGTCTGCCGCGCCACCCACACTGGCCCC 

AACTGCGAGCGGCCCTACGTGCCCTGCAGCCCCTCGCCCTGCCAGAACGGGGGCACCTGCCGCCCCACGGG 

CGACGTCACCCACGAGTGTGCCTGCCTGCCAGGCTTCACCGGCCAGAA 

GTCCAGGAAACAACTGCAAGAACGGGGGTGCCTGTGTGGACGGC 

CCAGAGTGGACAGGTCAGTACTGTACCGAGGATGTGGACGAGTGC 

CGGCGGGACCTGCCACAACACCCACGGTGGCTACAACTGCGTGTGTGTCAACGGCTGGACTGGTGAGGACT 
GCAGCGAGAACATTGATGACTGTGCCAGCGCCGCCTGCTTCCACGGCGCCACCTGCCATGACCGTGTGGCC 
TCCTTTTACTGCGAGTGTCCCC^TGGCCGCACAGGTCTG 

CCCCTGTAACGAGGGCTCCAACTGCGACACCAACCCTGTCAATGGCAAGGCCATCTGCACCTGCCCCTC 

GGTACACGGGCCCGGCCTGCAGCCAGGACGTGGATGAGTGCTCGCTGGGTGCCAACCCCTGCGAGCATGCG 

GGCAAGTGC^TCAACACGCTGGGCTCCTTCGAGTGCCAGTGTCTGCAGGGCTACACGGGCCCCCGATGCGA 

GATCGACGTCAACGAGTGCGTCTCGAACCCGTGCCAGAACX^OTC 

TCCAGTGCATGTGCATGCCCGGCTACGAGGGTGTGCACTGCGAGG 

CCCTGCCTGCAC^TGGCCGCTGCCTGGACAAGATC^TGAGTTCCAGTGCGAGTGCCCCACGGGCTTa^C 
TGGGCATCTGTGCCAGTACGATGTGGACGAGTGTGCCAGCACCCCCTGCAAGAATGGTGCCAAGTGCCTGG 
ACGGACCCAACACTOACACCTGTGTGTGCACGGAAGGGTACACGGG 

GAGTGCGACCCCGACCCCTGCCACTACGGCTCCTGCAAGGACGGCGTCGCCACCTTCACCTGC 
CCCAGGCTACACGGGCCACCACTGCGAGACCAACATCAACGAGTGCTCCAGCCAGCCCTGCCGCCACGGGG 
GCACCTGCCAGGACCGCGACAACGCCTACCTCTGCTTCTGCCTGAAGGGGACCACAGGACCCAACTGCG 
ATCAACCTGGATGACTGTGCCAGC^GCCCCTGCGACTCGGGCACCTGTCTGGACAAGATCGATGGCTACGA 
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GTGTGCCTGTGAGCCGGGCTACACAGGGAGCATGTGTAACATCAACATCGATGAGTGTGCGGGCAACCCCT 

GCCACAACGGGGGCACCTGCGAGGACGGCATCAATGGCTTCACCTGCCGCTGCCCCGAGGGCTACCACGAC 

CCCACCTGCCTGTCTGAGGTCAATGAGTGCAACAGCAACCCCTGCGTCCACGGGGCCTGCCGGGACAGCCT 

CAACGGGTACAAGTGCGACTGTGACCCTGGGTGGAGTGGGACCAACTGTGACATCAACAACAACGAGTGTG 

AATCCAACCCTTGTGTO^CGGCGGCACCTGCAAAGACATGACCAGTGGCTACGTGTGCACCTGCCGGGAG 

GGCTTCAG CGGTCCCAACTGC CAGACCAACAT CAACGAGTGTGCGTCCAACCCATGTCTGAACAAGGGCAC 

GTGTATTGACGACGTTGCCGGGTACAAGTGCAACTGCCTGCTGCCCTACACAGGTGCCACGTGTGAGGTGG 

TGCTGGCCCCGTGTGCCCCCAGCCCCTGCAGAAAGGGCGGGGAGTGCAGGCAATCCGAGGACTATGAGAGC 

TTCTCCTGTGTCTGCCCCACGGCTGGGGCCAAAGGGCAGACCTGTGAGGTCGACATCAACGAGTGCGTTCT 

GAGCCCGTGCCGGCACGGCGCATCCTGCCAGAACACCCACGGCGGCTACCGCTGCCACTGCCAGGCCGGCT 

ACAGTGGGCGCAACTGCGAGACCGACATCGACGACTGCCGGCCCAACCCGTGTCACAACGGGGG CTCCTG C ' 

ACAGACGGCATCAACACGGC CTTCTGCGACTGCCTG CCCGGCTTCCGGGGCACTTT CTGTGAGGAGGACAT 

CAACGAGTGTGCCAGTGACCCCTGCCGCAACGGGGCCAACTGCACGGACTGCGTGGACAGCTACACGTGCA 

CCTGCCCCGCAGGCTTCAGCGGGATCCACTGTGAGAACAACACGCCTGACTGCACAGAGAGCTCCTGCTTC 

AACGGTGGCACCTGCGTGGACGGCATCAACTCGTTCACCTGCCTGTGTCCACCCGGCTTCACC^ 

CTGCCAGCACGATGTCAATGAGTGCGACTC^CAGCCCrGCCTGC^TGGCGGCACCTGTC^GGACGGCTGCG 

GCTCCTACAGGTGCACCTGCCCCCAGGGCTACACTGGCCCC^^CTGCCAGAACCTTGTGCACTGGTGTGAC 

TCCTCGCCCTGCAAGAACGGCGGCAAATGCTGGCAGACCCACACCCAGTACCGCTGCGAGTGCCCCAGCGG 

CTGGACCGGCCTTTACTGCGACGTGCCCAGCGTGTCCTGTGAGGTGGCTGCGCAGCGACAAGGTGTTGACG 

TTGCCCGCCTGTGCCAGCATGGAGGGCTCTGTGTGGACGCGGGCAACACGCACCACTGCCGCTGCCAGGCG 

GGCTACACAGGC^GCTACTGTGAGGACCTGGTGGACGAGTGCTC^ 

CTGCACGGACTACCTGGGCGGCTACTCCTGCAAGTGCGTGGCCGGCTACCACGGGGTGAACTGCTCTGAGG 

AGATCGACGAGTGCCTCTCCCACCCCTGCCAGAACGGGGGCACCTGCCTCGACCTCCCCAACACCTACAAG 

TGCTCCTGCCCACGGGGCACTCAGGGTGTGC^CTGTGAGATCAACGTGGACGACTGCAATCCCCCCGTTGA 

CCCCGTGTCCCGGAGCCCCAAGTGCTTTAACAACGGCACCTGCGTGGACCAGGTGGGCGGCTACAGCTGCA 

CCTGCCCGCCGGGCTTCGTGGGTGAGCGCTGTGAGGGGGATGTCAACGAGTGCCTGTCCAATCCCTGCGAC 

GCCCGTGGCACCCAGAACTGCGTGCAGCGCGTCAATGACTTCCACTGCGAGTGCCGTGCTGGTCACACCGG 

GCGCCGCTGCGAGTCCGTCATCAATGGCTGCAAAGGCAAGCCCTGCAAGAATGGGGGCACCTGCGCCGTGG 

CCTCCAACACCGeCCGCGGGTTCATCTGCAAGTGCCCTGCGGGCTTCGAGGGCGCCACGTGTGAGAATGAC 

GCTCGTACCTGCGGCAGCCTGCGCTGCCTCAACGGCGGCACATGCATCTCCGGCCCGCGCAGCCCC^ 

CCTGTGCCTGGGCCCCTTCACGGGCCCCGAATGCCAGTTCCCGGCCAGCAGCCCCTGCCTGGGCGGCAACC 

CCTGCTAC^CCAGGGGACCTGTGAGCCCACATCCGAGAGCCCCTTCTACCGTTGCCTGTGCCCCGCCAAA 

TTCAACGGGCTCTTGTGCCACATCCTGGACTACAGCTTCGGGGGTGGGGCCGGGCGCGACATCCCCCCGCC 

GCTGATCGAGGAGGCGTGCGAGCTGCCCGAGTGCCAGGAGGACGCGGGCAACAAGGTCTGCAGCCTGCAGT 

GCAACAACCACGCGTGCGGCTGGGACGGCGGTGACTGCTCCCTCAACTTCAATGACCCCTGGAAGAACTGC 

ACGCAGTCTCTGCAGTG CTGG AAGTACTTCAGTGACGGCCACTGTGACAGCCAGTGCAACTCAG CCGGCTG 

CCTCTTCGACGGCTTTGACTGCCAGCGTGCGGAAGGCCAGTGCAACCCCCTGTACGACCAGTACTGCAAGG 

ACCACTTCAGCGACGGGCACTGCGACCAGGGCTGCAACAGCGCGGAGTGCGAGTGGGACGGGCTGGACTGT 

GCGGAGCATGTACCCGAGAGGCTGGCGGCCGGCACGCTGGTGGTGGTGGTGCTGATGCCGCCGGAGCAGCT 

GCGCAACAGCTCCTTCCACTTCCTGCGGGAGCTCAGCCGCGTGCTGCAC^CCAACGTGGTCTTCAAGCGTC 

ACGCACACGGC CAGCAGATGAT CTT C CCCT ACT ACGG C CGCGAGGAGGAG CTG CG CAAGCAC C C CAT CAAG 

CGTGCCGCCX5AGGGCTGGGCCGCACCTGACGCCCTGCTGGGCCAGGTGAAGGCCTCGCTGCTCCCTGGTGG 

CAGCGAGGGTGGGCGGCGGCGGAGGGAGCTGGACCCCATGGACGTCCGCGGCTCCATCGTCTACCTGGAGA 

TTGACAACCGGCAGTGTGTGCAGGCCTCCTCGCAGTGCITCCAGAGTGCC^ 

GGAGCGCTCGCCTCGCTGGGCAGCCTCAACATCCCCTACAAGATCGAGGCCGTGCAGAGTGAGACCGTGGA 

GCCGCCCCCGCCGGCGCAGCTGCACTTCATGTACGTGGCGGCGGCCGCCTTTGTGCTTCTGTTCTTCGTGG 

GCTGCGGGGTGCTGCTGTCCCGCAAGCGCCGG(^GCAGCATGGCCAGCTCTGGTTCCCTGAGGGCTTCAAA 

GTGTCTGAGGCCAGCAAGAAGAAGCGGCGGGAGCCCCTCGGCGAGGACTCCGTGGGCCTCAAGCCCCTGAA 

GAACG CTT CAGACGGTG C CCT CATGGACGACAAC CAGAATGAGTGGGGGGACG AGGACCTGGAGAC CAAGA 

AGTTCCGGTTCGAGGAGCCCGTGGTTCTGCCTGACCTGGACGACCAGACAGACCACCGGCAGTGGACTCAG 

CAGCACCTGGATGCCGCTGACCTGCGCATGTCTGCCATGGCCCCCACACCGCCCCAGGGTGAGGTTGACGC 

CGACTGCATGGACGTCAATGTCCGCGGGCCTGATGGCTTCACCCCGCTCATGATCGCCTCCTGCAGCGGGG 

GCGG CCTGGAGACGGGCAACAGCGAGGAAGAGGAGGACG CGCCGGCCGTCATCTCCGACTTCATCTACCAG 

GGCGCCAGCCTGCACAACCAGACAGACCGCACGGGCGAGACCGCCTTGCACCTGGCCGCCCGCTACTCACG 

CTCTGATGCCGCCAAGCGCCTGCTGGAGG CCAGCGCAGATGCCAACAT CCAGGAGIVAC ATGGGCCGCACCC 

CGCTG(^TGCGGCTGTGTCTGCCGACGCACAAGGTGTCTTCO\GATCCTGATCCGGAACAGGGCCACAGAC 

CTGGATGCCCGCATGCATGATGGCACAACTCCACTGATCCTGGCTGCCCGCCTGGCCGTGGAGGGCATGCT 

GGAGGACCTCATCAACTCACACGCCGACGTCAACGCCGTAGATGACCTGGGCAAGTCCGCCCTGCACTGGG 

CCGCCGCCGTGAAC^ATGTGGATGCCGCAGTTGTGCTCCTGAAGAACGGGGCTAACAAAGATATGCAGAAC 

AACAGGGAGGAGACACCCCTGTTTCTGGCCX3CCCGGGAGGGCAGCTACGAGACCGCCAAGGTGCrGCTGGA 

C^ACTTTGCC^CCGGGACATCACGGATC^TATGGACCGCCTGCCGCGCGA(^TCGC^CAGGAGCGCATGC 

ATCACGACATCGTGAGGCTGCTGGACGAGTACAACCrGGTGCGCAGCCCGCAGCTC 

GGGGGCACGCCCACCCTGTCXSCCCCCGCTCrGCTCGCCCAACGGCTACCTGGGCAGCCTCAAGCC 

GCAGGGCAAGAAGGTCCGCAAGCCCAGCAGCAAAGGCCTGGCCTGTGGAAGCAA 

AGGCACGGAGGAAGAAGTCCCAGGATGGCAAGGGCTGCCTGCTGGACAGCTCCGGCATGCTCTCGCCCGTG 
GACTCCCTGGAGTCACCCCATGGCTACCTGTCAGACGTGG CCTCX3CCGCCACTGCTGCCCTC CCCGTTCCA 
GCAGTCTCCGTCCGTGCCCCTCAACCACCTGCCTGGGATGCCCGACACCCACCTGGGCATCGGGCACCTGA 
ACGTGGCGGCCAAGCCCGAGATGGCGGCGCTGGGTGGGGGCGGCCGGCTGGCCTTTGAGACTGGCCCACCT 
CGTCTCTCCCACCTGCCTGTGGCCTCTGGC&CCAGCACCGTCCTGGGCTCC 
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TTTCACTGTGGGCGGGTCCACCAGTTTGAATGGTCAATGCGAGTGGCTGTCCCGGCTGCAGAGCGGCATGG 
TGCCGAACCAATACAACCCTCTGCGGGGGAGTGTGGCACCAGGCCCCCTGAGCACACAGGCCCCCTCCCTG 
C^GCATGGC^TGGTAGGCCCGCTGCACAGTAGCCTTGCTGCCAGCGCCCTGTCCCAGATGATGAGCTACCA 
GGGCCTGCCC^GCACCCGGCTGGCCACCCAGCCTCACCTGGTGCAGACCCAGCAGGTGCAGCCACAAAACT 
TAC^GATGCAGCAGCAGAACCTGCAGCCAGCAAACATCCAGCAGCAGCAAAGCCTGCAGCCGCCACCACCA 
CCACCACAGCCGCACCTTGGCGTGAGCTCAGCAGCCAGCGGCCACCTGGGCCGGAGCTTCCTGAGTGGAGA 
GCCGAGCCAGGCAGACGTGCAGCCACTGGGCCCCAG CAGC CTGGCGGTGCACACTATTCTG CCCCAGGAGA 
GCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCTGGTCCCACCCGTGACCGCAGCCCAGTTCCTGACGCCC 
CCCTCGCAGC ACAGCTACTCCTCGCCTGTGGACAACACCCCCAGCCAC CAGCTACAGGTGCCTGAG CACCC 
CTTCCTGACCCCTTCGCCGGAGTCG C CCGACCAATGGTCGTCCTCGTCGC CGCACTCTAATGTGTCTGACT 
GGTCTGAGGGCGTGTCGTCGCCCCCGACCTCCATGCAGTCCCAGATCGCGCGCATCCCGGAGGCGTTCAAG 
TAATAGCTCGAGGTGCCAGCAGCTC 



The NOV33 nucleic acid has 7670 of 7693 bases (99%) identical to a gbrGENBANK- 
ID:AF308602|acc:AF308602.1 mRNA from Homo sapiens (NOTCH 1 (Nl) mRNA, complete 
cds)(E = 0.0). 

A disclosed NOV33 polypeptide (SEQ ID NO:134) encoded by SEQ ID NO:133 is 
2556 amino acid residues and is presented using the one-letter code in Table 33B. Signal P, 
Psort and/or Hydropathy results predict that NOV33 contains a signal peptide and is likely to 
be localized to the plasma membrane with a certainty of 0.4600. The most likely cleavage site 
for a NOV33 peptide is between amino acids 1 8 and 1 9: ALA-AR. 



Table 33B. Encoded NOV33 protein sequence (SEQ ID NO:134) 

MPPLLAPLLCIJUjLPAI*AARGPRCSQPGETCLNGGKCEAANGTEACVC^GAFV 

TCHWDRRGVADYACS CALGFS GPLCLTPLDNACLTNPCRNGGTCDLLTLTEYKCRCPPGWS GKSCQQADPC 
ASNPCANGGQCLPFEAS YI CHCPPS FHGPTCRQDVNECGQKPGLCRHGGTCHNEVGS YRCVCRATHTGPNCE 
RPYVPCSPSPCQNGGTCRPTGDVTHECACLPGFTGQNCEENIDDCPG^CKNGGACVDGVNTYNCP 
GQYCTEDVDECQI^PNACQNGGTCHNTHGGYNCVC^ 
CPHGRTGLLCHLNDACISNPCNEGSNCDTNPWGKAICTCT 

LGSFECQCLQGYTGPRCEIDVNECTSNPCQNDATCLDQIGEFQCMCMPGYEGVHCEVNTD 

CLDKINEFQCECPTGFTGHLCQYDVDEC^STPCKNGAKCLIXSPNTYTCVCTEGYTGTHCE^ 

YGS CKDGVATFTCLCRPGYTGHHCETNINECS S QPCRHGGTCQDRDNAYLCFCIiKGTTGPNCEINLDDCASS 

PCDSGTCLDKIIXSYECACEPGYTGSMC^INIDECAGNPCHNGGTCEDGINGFTCRCPEGYHDPTCLSEWEC 

NSNPC^GACRDSLNGYKCDCDPGWSGTNCDINNNECESNPCVNGGTCmMTS 

NECASNPCLNKGTC IDDVAGYKCNCLLP YTGATCEWLAPCAP S PCRNGGE CRQSEDYES FS CVCPTAGAKG 

QTCEVDINECVLSPCRHGASCQNTHGGYRCHCQAGYSGRNCETDIDDCRPNPCHNGGSCTIX3INTAFCD 

GFRGTFCEEDINECASDPCRNGANCTDCVDSYTCTCPAGFSGIH^ 

LCPPGFTGSYCQHDVNECTSQPCLHGGTCQIXSCGSYRCTCPQGYTGPNCQNLVHWC^ 

QYRCECPSGWTGLYCDVPSVSCEVAAQRQGVDVARLCQHGGLCVDAGNTHHCRCQAGYTGSYCED^ 

SPCQNGATCTDYLGGYSCKCTAGYHGVNCSEEIDE<^ 

DCNPPVDPVSRSPKCFNNGTCVDQVGGYSCTCPPGFVGERCEGDVNECLSNPOTARGTQNCVQRVNDra 

RAGHTGRRCESVINGCKGKPCKNGGTCAVASNTARGFICKCPAGFEGATCENDARTCGSLRCLNGGTCISGP 

RSPTCLCIX3PFTGPECQFPASSPCLGGNPCYNQGTCEPTSESPFYRCLCPAKFNGLLCHILDYSFGGGAGRD 

IPPPLIEEACELPECQEI)AGNKVCSLQCNNHACGWDGGDCSIiNFNDPWKNCTQSLQCWKYFSDGHCDSQ 

AGCLFDGFDCQRAEGQCNPLYDQYCKDHFSDGHCDQGCNSAECEWDGLDCAEHVPERLAAGTLVVW 

QLRNSSFHFLRELSRVLHTNVVFKRDAHGQQMIFPYYGREEEIiRKHPIKRAM 

GSEGGRRRRELDPMDVRGSIVYLEIDNRQCTQASSQCFQSATTiVAAFLGALASLGSI^IPYK 

PPPPAQLHFMYVAAAAFVLLFFVGCGVLLSRKRRRQHGQLWFPEGFKVSEASKKKR 

ASDGALMDDNQNEWGDEDLETKXFRFEEPVVLPDLDDQTO 

MDVNVRGPDGFTPI^IASCSGGGLETGNSEEEEDAPAVISDFIYQGASLHNQTDRTGCT 
AKRLIxEl^ADANIQDNMGRTPLHAAVS ADAQGVFQ IL I RNRATDLDARMHDGTTPLI LAARLAVEGMLEDLI 
NSHADVNAVDDLGKSALHWAAAVNNVDAAVVLLKNGANKDMQN^ 
DITDHMDRLPRDIAQERMHHDIVRIJJ)EYNLVRSPQI^GAPLGGTPTLSPPLCSPN^ 

KPSSKGIACGSKEAKDLKARRKKSQDGKGCLLDSSGMLSPVDSLESPHGYLSDVASPPLLPSPFQQSPSVPL 
NHLPGMPDTHLGIGHLNVAAKPEMAA^ 

LNGQCEWLSRLQSGMVPNQYNPLRGSVAPGPLSTQAPSLQHGMVGPLHSSLAASALSQMM^ 

QPHLVQTQQVQPQNLQMQQQNLQPANIQQQQSLQPPPPPPQPHLGVSSAASGHLGRSFLSGEPSQADVQPLG 

PSSLAVHTILPQESPALPTSLPSSLVPPVTAAQFLTPPSQHSYSSPVDNTPSHQLQVPEHPFLTPSPESPDQ 
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| WSSSSPHSNVSDWSEGVSSPPTSMQSQIARIPEAFK , | 

The disclosed NOV33 amino acid sequence has 2543 of 2556 amino acid residues 
(99%) identical to, and 2545 of 2556 amino acid residues (99%) similar to, the 2556 amino 
acid residue ptnr:TREMBLNEW-ACC:AAG33848 protein from Homo sapiens (Human) 
(Notch 1)(E = 0.0). ' 

NOV33 is predicted to be expressed in at least Adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, NOV33 is predicted to be expressed in the following tissues because of the 
expression pattern of (GENBANK-ID: gb:GENBANK-ID:AF308602|acc:AF308602.1) a 
closely related Homo sapiens NOTCH 1 (Nl) mRNA, complete cds homolog in species Homo 
sapiens: brain. 

NOV33 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 33C. 



Table 33C. BLAST results for NOV33 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 112 75980 | gb | AAG3 
3848.l|AF308602 1 
(AF308 602) 


NOTCH 1 [Homo 
sapiens] 


2556 


2543/255 
6 (99%) 


2545/2556 
(99%) 


0.0 


gi|l07215|pir| |A4Q0 
43 


notch protein 
homolog TAN-1 
precursor - human 


2555 


2537/255 
6 (99%) 


2541/2556 
(99%) 


0.0 


gi| 1171777 |sp|P4653 
l|NTCl_HUMAN 


NEUROGENIC LOCUS 
NOTCH PROTEIN 
HOMOLOG 1 
PRECURSOR 
( TRANSLOCATION - 
ASSOCIATED NOTCH 
PROTEIN TAN-1) 


2444 


2429/244 
4 (99%) 


2431/2444 


0.0 


gi| 6093542 |sp|Q0700 
8 |NTC1_RAT 


NEUROGENIC LOCUS 
NOTCH HOMOLOG 
PROTEIN 1 
PRECURSOR 


2531 


2301/255 
7 (89%) 


2401/2557 
(92%) 


0.0 


gi| 112074 |pir| |S18l 
88 


notch protein 
homolog - rat 


2531 


2300/255 
7 (89%) 


2399/2557 
(92%) 


0.0 
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The homology of these sequences is shown graphically in the Clustal W analysis shown 
in Table 33D. 



Table 33D Clustal W Sequence Alignment 



1) NOV33 (SEQ ID NO: 134) 

2) gi|ll275980|gb|AAG33848.l|AF308602_l (AF308602) NOTCH 1 [Homo sapiens] (SEQ ID 
NO:467) 

3) gi|l07215|pir| |A40043 notch protein homolog TAN-l precursor - human (SEQ ID 
NO:468) 

4) gi| 1171777 |sp|P4653l|NTCl_HUMAN NEUROGENIC LOCUS NOTCH PROTEIN HOMOLOG 1 
PRECURSOR (TRANSLOCATION- ASSOCIATED NOTCH PROTEIN TAN-l) (SEQ ID NO: 469) 

5) gi|S093542|sp|Q07008|NTCl_RAT NEUROGENIC LOCUS NOTCH HOMOLOG PROTEIN 1 PRECURSOR 
(SEQ ID NO: 470) 

6) gi|ll2074|pir| |S18188 notch protein homolog - rat (SEQ ID NO:471) • 



NOV33 



gi 
gi 
gi 
gi 
gi 



11275980 | 
107215 | 
1171777 | 
6093542) 
1120741 



10 



20 



I I 



30 



40 
. I . . 



I 



50 



60 



I. 



MPPLLAPLLCLALLPALAARGPRCSQPGETCLNGGKCEAANGTEACVCGGAFVGPRCQDP 
MPPLLAPLLCLALLPALAARGPRCSQPGETCLNGGKCEAANGTEACVCGGAFVGPRCQDP 
MPPLLAPLLCLALLPALAARGPRCSQPGETCLNGGKCEAANGTEACVCGGAFVGPRCQD? 
MPPLLAPLLCLALLPALAARGPRCSQPGETCLNGGKCEAANGTEACVCGGAFVGPRCQDP, 
MPgLLAPLLCLgLLPALAARGgRCSQP^TCLNGGgCESANGTEACVC^AFVGfflRCQDPj 
MP§LLAPLLCL3LLPAL7^G3RCSQP^TCLNGGlaCE*^GTEACVcBGAFVrfflRronp! 



60 ■ 

60 

60 

60 

60 

60 



NOV33 

gi | 11275980 | 
gi | 107215 | 
gi j 1171777 | 
gi j 6093542 | 
gi j 112074 | 




NOV33 


121 


gi 


11275980) 


121 


gi 


107215) 


121 


gi 


1171777 | 


121 


gi 


6093542 j 


121 


gi 


112074 | 


121 



NOV33 


181 


gi 


11275980) 181 


gi 


107215) 


181 


gi 


1171777 


181 


gi 


6093542 


181 


gi 


112074) 


181 



JL 



13 0 



140 



L 



150 



160 



I 



170 



180 



TLTEYKCRCPPGWSGKSCQQADPCASNPCANGGQCLPFEASYICHCPPSPHGPTCRQDVN 
TLTEYKCRCPPGWSGKSCQQADPCASNPCANGGQCLPFEASYICHCPPSFHGPTCRQDVN 
TLTEYKCRCPPGWSGKSCQQADPCASNPCANGGQCLPFEA^YICHCPPSFHGPTCRQDVN 
TLTEYKCRCPPGWSGKSCQQADPCASNPCANGGQCLPFEASYICECPPSFHGPTCRQDVN 
TLTEYKCRCPPGWSGKSCQQADPCASNPCANGGQCLPFEgSYIcgCPPgFKGPTCRQDVN 
TLTEYKCRCPPGWSGKSCQQADPCASN?CANGGQCLPF5§SYIC£CPP§FHGPTCRQDV^ 



180 
180 
180 
180 
180 
180 



190 



JL 



200 



210 



| 



220 



I 



230 



240 



ecgqkpglcrhggtchnevgsyrcvcrathtgpncerpyvpcspspcqnggtcrptgdvt 
ecgqkpBlcrhggtchnevgsyrcvcrathtgpncerpyvpcspspcqnggtcrptgdvt 
ecgqkpSlcrhggtchnevgsyrcvcrathtgpncsrpyvpcspspcqnggtcrptgdvt 
ecgqkpHlcrhggtchnevgsyrcvcrathtgfncerpyvpcspspcqnggtcrptgdvt 

SCgQgpgLCRHGGTCHNEjjGSYRCgCRATKTGPgCEgPYVPCSPSPCQNGGTCRPTGDjjT 
5CgQ^PgLCRHGGTCHNEiiGSYRc3cRATHTGPt:CEgPYVPCSPSPCQNGGTCRPTGDST 



NOV33 


241 


gi 


11275980) 


241 


gi 


107215 | 


241 


gi 


1171777 | 


241 


gi 


6093542) 


241 


gi 


112074 | 


241 



NOV33 



gi 
gi 
gi 



11275980] 
107215) 
1171777 | 



301 
301 
301 
301 



250 



260 



| 



270 



280 
..I.. 



| 



290 



300 



kecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgfggqnceels^ddcpgnncknggacvdgvntyncgcppewtgqyctedvdecq 
^caclpgfbgqnceenaddcpgnncknggacvdgvntyncbcppewtgqyctedvdecq 



300 
300 
300 
300 
300 
300 



310 



320 



330 



340 



I. 



350 



360 



LM PNACQNGGT CHNTHGG YNC VC VNG WTGED CSENI DD C AS AACFHGATCHDRVASFYCE 
LMPNACQNGGTCHNTHGGYNCVCVNGWTGEDCSENIDDCASAACFHGATCHDRVASFYCE 
LMPNACQNGGTC^NTHGGYNCVCVNGWTGEDCSENIDDCASAACFHGATCHDRVASFYCE 
LMPNACQNGGTCHNTHGGYNCVCVNGWTGEDCSENIDDCASAACFHGATCFrDRVASFYCE 
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g±| 6093542 | 
gi | 112074 | 



lmpnacqnggtch^hggyncvcvngwtgedcsgniddcasaacfsgatchdrvasfyce 
lmpnacqnSgtchnShggyncvc vngwtgedcsSniddc as aac fBgatchdrvas fyce 



360 
360 



NOV33 

gi 1 11275980 | 
gi j 107215 | 
gi j 1171777 | 
gi j 6093542 j 
gi j 112074 | 



361 
361 
361 
361 
361 
361 



I 



370 



380 



390 



400 



410 
..I 



420 

4 



cphgrtgllcklndacisnpcnegsncdtnpvngkaictcps<sytgpacsqdvdecslg 
cphgrtgllchlndacisnpcnegsncdtnpvngkaictcpsgytgpacsqdvdecslga 
cphgrtgllchlndacisnpcnegsncdtnpvngkaictcpsgytgpacsqdvdecslga 
cphgrtgllchlndaciskpcnegsncdtnpvngkaictcpsgytgpacsqdvdecslga 
cphgrtgllchlndac i snpcnegsncdtnp vngkai ctcpj3gytgp acs qd vde c^lga 
cphgrtgllchlndacisnpcnegsncdtnpvngkaictcp"gytgpacsqdvdecSlg, 



42 0 
420 
420 
420 
420 
420 



NOV33 



gi 
gi 
gi 
gi 
gi 



11275980| 
107215 | 
1171777) 
6093542 j 
112074 | 



430 



440 



450 



460 



470 



480 




KOV33 


481 


gi 


11275980 | 


481 


gi 


107215) 


481 


gi 


1171777 | 


481 


gi 


6093542) 


481 


gi 


112074] 


481 



1 



490 
...I.. 



I 



500 



510 



520 
. . I • . 



530 



I, 



540 
..I 



gyegvhcevntdecasspclhngrcldkinefqcecptgftghlcqydvdecastpckng 
gyegvhcevntdecasspclhngrcldkinefqcecptgftghlcqydvdecastpckng 
gyegvhcevntdecasspclhngrcldkihefqcecptgftghlcqJ[dvdecastpckng 
gyegvhcevntdecasspclkngrcldkinefqcecptgftghlcqydvdecastpckng 

GYEG^CE^TDECASSPCLHNGRCgDKINEF[*CgCPjfc^ 

gyegvKce^tdecasspclhngrcSqkinefSce^ 



NOV33 


541 


gi 


11275980) 


541 


gi 


107215) 


540 


gi 


1171777 | 


541 


gi 


6093542) 


541 


gi 


112074) 


541 



550 



560 



570 



580 



) 



590 



600 



akcldgpntytcvctegytgthcevdidecdpdpchygsckdgvatftclcrpgytghhc 
akcldgpntytcvctegytgthcevdidecdpdpchygsckdgvatftclcrpgytghhc 
akcldgpntytcvctegytgthcevdidecdpdpchygsckdgvatftclcrpgytghhc 
akcldgpntytcvctegytgthcevdidecdpdpchygsckdgvatftclcrpgytghhc 
akcldgpntytcvctegytgthcevdidecdpdpchRgJckdgvatftclcHpgytghhc 
akcldgpntttcvctegytgthcevdidecdpdpchmgSckdgvatftclcSpgytghhc 



610 



.1 



620 

..I.. 



630 



640 

. . I . . 



650 

. . I 



I 



660 

. . I 



NOV33 


601 


ETNINECSSQPCR 


gi 


11275980 | 


601 


ETNINECSSQPCR 


gi 


107215 | 


600 


ETNINECSSQPCR 


gi 


1171777 | 


601 


ETNINECSSQPCR 


gi 


6093542) 


601 


ETNINECSSQPCR 


gi 


112074 | 


601 


ETNINECgSQPCRj 



sgtcqdgdnaylcfclkgttgpnceinlddcasspcdsgtcldkid 
SgtcqdBdnaylcfclkgttgpnceinlddcasspcdsgtcldkid 
SgtcqdHdnaylcfclkgttgpnceinlddcasspcdsgtcldkid 
|gtcqdBdnaylcfclkgttgpnce inlddcas spcdsgtcldkid 
ggtcqdgdnhylcgclkgttgpnceinlddcas^pcdsgtcldkid 
ggtcqd^dngylc^clkgttgpnce inlddcas §pcdsgtcldkid 



660 
660 
659 
660 
660 
660 



670 



KOV33 


661 


gi 


11275980) 


661 


gi 


107215) 


660 


gi 


1171777 | 


661 


gi 


6093542) 


661 


gi 


112074 | 


661 



GYECACE PGYTGSMC 
GYECACEPGYTGSMC 
GYECACEPGYTGSMC 
GYECACE PGYTGSMC 
GYECACEPGYTGSMC 
GYECACE PGYTGSMCNj 



S80 

if 



I. 



690 
. . I . . 



700 
).. 



I , 



710 



I. 



720 
..I 



idecagnpchnggtcedgingftcrcpegyhdptclsevnec 
idecagnpchnggtcedgingftcrcpegyhdptclsevnec 
idecagnpchnggtcedgingftcrcpegyhdptclsevnec 
idecagnpchnggtcedgingftcrcpegyhdptclsevnec 
idecagSpchnggtcedgi^gftcrcpegyhdptclsevnec 
•nidecagSpchnggtcedgiSgftcrcpegyhdptclsevnec 



720 
720 
719 
720 
720 
720 



N0V33 


721 


gi 


11275980) 


721 


gi 


107215) 


720 


gi 


1171777 | 


721 


gi 


6093542) 


721 


gi 


112074 | 


721 




NOV33 

gi 1 11275980 | 



790 



800 



810 



820 



83 0 



840 



EGFS GPNGQTNINEC AS NPCLNKGTC I DDVAGYKCNCLLP YTGATCE WL APCAPS PC RN 
EGFSGPNCQTNINECASNPCLNKGTCIDDVAGYKCNCLLPYTGATCEWLAPgAPSPCRN 
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gi | 107215 | 
gi | 1171777 | 
gi j 6093542 | 
gi j 112074 | 



780 
781 
781 
781 
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2460 
2460 
2459 
2444 
2434 
2434 



2480 



2490 



2461 
2461 
2460 
2444 
2435 
2435 




I. 



2500 



I. 



2510 
..I... 



PVDNTPSHQLQVPEHPFLTPSPESPD 
PVDNTPSHQLQVPEHPFLTPSPESPD 
PVkJnTPSHQLQVPEHPFLTPSPES PD 





ESgALPTSLPSSWPPgTM 
ES gALPTSLPSsSgVPP™rru 


SQFLTPPSQHSYSSj 
gQFLTPPSQHSYSS; 


| PVDNTPSHQLQVPEHPFLTPS PES PD 
; PVDNTPSHQLQVPEHPFLTPSPESPD 



2520 
. I 

2519 
2519 
2518 
2444 
2494 
2494 



2530 
_K .J . . . 



2540 



2550 



NOV33 


2520 


gi 


11275980) 


2520 


gi 


107215 | 


2519 


gi 


1171777 | 


2444 


gi 


6093542 | 


2495 


gi 


112074 | 


2495 



QWSSSSgHSNffiSDWSEGSSSPPTSMfflSQI^IPEAFK 

qwssss^sn5sdwsegSsspptsmssoi8Bi?eafk 

QWSSSSaSSNaSDWSEGfflSSPPTSMBSOlBBlPEAFK 



QWSSSSgHSNnSDWSEGgsSPPTSMjaSQigjjlPEAFK 

qwssssShsnmsdwseg?! sspptsmBsqiS Sipeafk 



2556 
2556 
2555 
2444 
2531 
2531 



60 



Tables 33E-I list the domain descriptions from DOMAIN analysis results against 
NOV33. This indicates that the NOV33 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 33E Domain Analysis of NO V33 

gnl | Smart [smart 00004, NL, Domain found in Notch and Lin- 12; The Notch 
protein is essential for the proper differentiation of the Drosophila 
ectoderm. This protein contains 3 NL domains. (SEQ ID NO: 825) 
CD-Length = 39 residues, 100.0% aligned 
Score = 45.1 bits (105), Expect = 5e-05 



NOV33: 1443 PPLIEEACELPECQEDAGNKVCSLQCNNHACGWDGGDCS 

i ii *i + i+ ii +in i mini 

Sb;j ct : 1 PQDPWSRCEDAQCWDKFGDGVCDEE CNNAECLWDGGDCS 



1481 



39 



Table 33F Domain Analysis of NOV33 

gnljSmart |smart00004, NL, Domain found in Notch and Lin- 12? The Notch 
protein is essential for the proper differentiation of the Drosophila 
ectoderm. This protein contains 3 NL domains. {SEQ ID NO: 825) 
CD-Length =39 residues, 94.9% aligned 
Score =44.7 bits (104), Expect = 7e-05 



10 



NOV33: 1486 DPWKNCTQSLQCWKYFSDGHCDSQCNSAGCLFDGFDCQ 1523 

III I III I II II +IM 1 1*1 1 II 

Sbjct: 3 DPWSRCE - DAQCWDKFGDGVCDEE CNNAECLWDGGDCS 39 



Table 33G Domain Analysis of NOV33 

gnl | Smart | smart 000 04, NL, Domain found in Notch and Lin- 12; The Notch 
protein is essential for the proper differentiation of the Drosophila 
ectoderm. This protein contains 3 NL domains. (SEQ ID NO: 825) 
CD-Length =39 residues, 97.4% aligned 
Score » 41.2 bits (95), Expect » 7e-04 



15 



N0V3 3 : 1522 CQRAEGQCNPLYDQYCKDHFSDGHCDQGCNSAECEWDGLDC 

I + l I I I I II 11+ 11*111 III II 

Sb3 ct : 1 PQDPWSRCE- - - DAQCWDKFGDGVCDEECNNAECLWDGGDC 



1562 



38 



Table 33H Domain Analysis of NOV33 



gnl|Pfam|pfam00023, ank, Ank repeat. Ankyrin repeats generally consist 
of a beta, alpha, alpha, beta order of secondary structures. The 
repeats associate to form a higher order structure. (SEQ ID NO: 826) 
CD-Length =33 residues, 97.0% aligned 
Score * 42.7 bits (99), Expect = 3e-04 



20 



NOV3 3 : 1929 GETALHLAARYSRSDAAKRLLEASADANIQDN 

i i Mini + i mi ii i +i 

SbjCt: 2 GNTPLHLAARNGHLEVVKLLLEAGADVNARDK 



1960 



33 
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Table 331 Domain Analysis of NOV33 

gnl | Pf am|pfam00066, notch, Notch (DSL) domain. The Notch domain is 
also called the 'DSL' domain. The notch proteins are transmembrane 
proteins with extracellular domains of repeated EGF domains and the 
notch (or DSL) domain N- terminal to that. These proteins are generally 
involved in lateral inhibition in developmental processes. (SEQ ID 
NO:826) 

CD-Length =38 residues, 81.6% aligned 
Score = 42.0 bits (97), Expect = 4e-04 



NOV33: 1533 YDQYCKDHFSDGHCDQGCNSAECEWDGLDCA 1563 

I ++I + I++I l + l I 1*1 I +|| || + 
Sb j Ct : 8 YRRHCAERFANGVCNQECNNAACGFDGGDCS 3 8 

Notch is a surface receptor. It transmits signals received from outside the cell to the 
cell's interior. Notch ligands, such as Delta, Serrate and Scabrous interact with epidermal 
growth factor repeats contained in Notch's extracellular domain. Notch plays an active role in 
the differentiation of glial cellsand Notch influences the length and organisation of neuronal 
processes. Several homologs of the Drosophila Notch receptor and its ligands, Delta/Serrate, 
have been cloned in man. Three human disorders including a neoplasia (a T-cell acute 
lymphoblastic leukemia/lymphoma), a late onset neurological disease (CADASIL) and a 
developmental disorder (the Alagille syndrome) are associated with mutations in, respectively, 
the Notchl, Notch3 and Jaggedl genes, pointing out the broad spectrum of Notch activity in 
humans (Joutel A, and Tournier-Lasserve E, 1998, Semin Cell Dev Biol, 9:619-25; Frisen J, 
and Lendahl U, 2001, Bioessays 23:3-7). 

In Drosophila, the intracellular domain of Notch binds Suppressor of hairless, a 
multifunction transcription factor that acts as a signal transducing molecule shuttling between 
the cytoplasm and the nucleus. A nuclear function has been documented for the mammalian 
Notch homolog (Lu, 1996), as well as for Drosophila Notch (Struhl and Adachi, 1998, Cell 
• 93:649-60). When Notch is bound by a ligand, a signal is passed across the cell membrane 
releasing the Suppressor of Hairless protein, freeing this protein to enter the nucleus and 
assume its role in activating transcription of enhancer of Split complex genes. E(spl)-C 
proteins act in turn to repress the adoption of neural and other differentiated states. Deltex, an 
intracellular docking protein, replaces Suppressor of Hairless as Su(H) leaves the site of 
interaction with the intracellular tail of Notch. 

The Notch receptor's function is called neurogenic, but this confusing nomenclature 
refers to the phenotype established in the absence of functional Notch. Notch's function is to 
repress the adoption of differentiation by cells that carry the Notch protein. A look at the 
principle ligand of Notch (Delta) and its function makes the anti-neural function of Notch 
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more easily understood. Delta is not secreted, but is cell bound. The Delta-Notch interaction 
serves a cell adhesive function between ligand and receptor bearing cells. The receptor bearing 
cell is inhibited in assuming a differentitated state, while the ligand bearing cell is free to do 
so. During neurogenesis, this latter cell delaminates, that is, it migrates out of the epithelial 
5 cell layer in which it formerly resided, and assumes the differentiated state of a neuroblast in 
its new physical location within the developing nervous system. Thus Notch is involved in 
neurogenesis with respect to cells that bears the ligands for Notch: Delta, Serrate and 
Scabrous. 

Lateral inhibition is one of the major themes of development. The process of lateral 

10 inhibition and cell selection is repeated hundreds of times in Drosophila, with differentiation 
that takes place in nearly every kind of tissue. For example, Notch is required to limit the 
number of neuronal precursors, limit the number of muscle precursors, limit the growth of 
malphigian tubules, and regulate the growth of the ovary. Notch also functions as receptor for 
both Serrate and Delta in organizing the dorsal-ventral boundary of the wing. One important 

1 5 target of Serrate and Notch in this context is wingless (Diaz-Benjumea and Cohen, 1995, 

Development 121:4215-25). Two extreme models can be envisioned for lateral inhibition. The 
first implicates the Notch pathway in the choice of a single precursor via a negative feedback 
loop. This process could be random in some cases. The second model postulates that the 
precursor is pre-determined by some mechanism other than Notch signaling, and that Notch 

20 signaling then serves only to mediate mutual, uniform repression of other cells and ensure 
development of a single precursor. Studies concerning the physical spacing of precursors for 
the microchaetes of the peripheral nervous system suggest the existence of a regulatory loop 
under transcriptional control between Notch and its ligand Delta. Activation of Notch leads to 
repression of the achaete-scute genes, which are themselves known to regulate transcription of 

25 Delta; this regulation may perhaps be direct (Seugnet et al., 1997, Dev Biol. 192:585-98). 
Neuroblast segregation was studied in embryos lacking both the maternal and the zygotic 
forms of either Notch or Delta. A seven-up-LacZ marker was used to follow neuralization of 
5-2 and 7-4 neuroblast groups. In the absence of Notch signaling, the cells with an equivalence 
group do not enter the neural differentiation pathway simultaneously. Neuralization within a 

30 group is progressive with two or three cells segregating early and several more later. This 
suggests that neural potential is not evenly distributed among these cells. A requirement for 
transcriptional regulation of Notch and/or Delta during neuroblast segregation in embryos was 
tested by providing Notch and Delta ubiquitously at uniform levels. Neuroblast segregation 
occurs normally under conditions of uniform Notch expression, suggesting that transcriptional 
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regulation of Notch is not necessary for many aspects of development of the larval CNS and 
PNS. In particular, it is dispensable both before and after neuroblast segregation, implying that 
it is not a necessary component of neuroblast segregation, per se. Under conditions of uniform 
Delta expression, a single neuroblast segregates from each proneural group in 80% of the 
5 cases; in the remaining 20%, more than one neuroblast segregates from a single group of cells. 
Thus transcriptional regulation of Delta is largely dispensable, with only a small percentage of 
multiple neurons segregating in each cluster. Genes such as achaete, scute, extramacrochaete, 
and wingless could be responsible for local differences in proneural activity. Notch signaling 
would enable all cells to mutually repress one another; only a cell with an elevated neural 

1 0 potential could overcome this repression (Seugnet et al.,. 1 997, Development 124:201 5-25). 

The development and patterning of the wing in Drosophila relies on a sequence of cell 
interactions molecularly driven by a number of ligands and receptors. Genetic analysis 
indicates that a receptor encoded by the Notch gene and a signal encoded by the wingless gene 
play a number of interdependent roles in this process and display very strong functional 

15 interactions. At certain times and places, during wing development, the expression of wingless 
requires Notch activity and that of its ligands Delta and Serrate. This has led to the proposal 
that all the interactions between Notch and wingless can be understood in terms of this 
regulatory relationship. This proposal has been tested by analyzing interactions between Delta- 
and Serrate-activated Notch signaling and Wingless signaling during wing development and 

20 patterning. Cell death caused by expressing dominant negative Notch molecules during wing 
development cannot be rescued by coexpressing Nintra. This suggests that the dominant 
negative Notch molecules cannot only disrupt Delta and Serrate signaling but can also disrupt 
signaling through another pathway. One possibility is the Wingless signaling pathway, since 
the cell death caused by expressing dominant negative Notch molecules can be rescued by 

25 activating Wingless signaling. Furthermore, the outcome of the interactions between Notch 
and Wingless signaling differs when Wingless signaling is activated by expressing either 
Wingless itself or an activated form of the Armadillo. For example, the effect of expressing 
the activated form of Armadillo with a dominant negative Notch on the patterning of sense 
organ precursors in the wing resembles the effects of expressing Wingless alone. This result 

30 suggests that signaling activated by Wingless leads to two effects: a reduction of Notch 
signaling and an activation of Armadillo (Brennan, 1999, Curr Biol, 9:707-10). 

Expression of a dominant negative Notch molecule (Extracellular Notch or ECN) 
throughout the developing wing mimics the effects of loss of Notch function. However, Nintra 
cannot rescue the cell death caused by overexpressing ECN. Since Nintra provides constitutive 
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signaling for Delta and Serrate during wing development and the effects of ECN are mediated 
by the sequestration of extracellular molecules that can interact with Notch, this suggests that 
the ECN molecule is sequestering extracellular molecules other than Delta and Serrate and 
attenuating signaling through another pathway. One candidate pathway is the Wingless 
5 signaling pathway, since the cell death caused by expressing the ECN can be rescued by 

activating Wingless signaling. Therefore, it is possible that the ECN molecule is sequestering 
the Wingless protein. The possibility that Wingless can bind the extracellular domain of Notch 
is supported by the following results, in particular, by two observations: first, that some of the 
deleterious effects of ECN can be suppressed by Wingless, but not Wingless signaling in the 
10 form of a constitutively active Armadillo molecule; and second, that this interaction requires 
specific EGF-like repeats of Notch, namely repeats 17-19 and 24-26 but not 10-12. Evidence 
for a physical interaction between Notch and Wingless has also been provided recently by 
Wesley (1999, Mol Cell Biol. 19:5743-58) who finds that the Wingless protein is enriched in a 
biopanning assay designed to identify proteins that interact with the extracellular domain of 

15 the Notch protein and that Wingless can be immunoprecipitated with Notch from embryo 

extracts and cultured cells. These experiments also show that the association of Wingless with 
Notch requires the integrity of a region of Notch centered around EGF-like repeats 24-26 
(Wesley, 1999, Mol Cell Biol. 19:5743-58) which these experiments indicate are essential for 
the interactions that are described between Wingless and ECN during wing development and 

20 patterning (Brennan et al., 1999, Curr Biol. 9:707-10). The interaction of Wingless and Notch 
signaling that has been observed might also be important during normal neural development. 
Wingless and Delta have opposite effects during neurogenesis; Wingless promotes while Delta 
suppresses the development of sense organs. Various experiments suggest that during the 
segregation of neural precursors a reduction of Notch signaling in the precursors themselves is 

25 as important as the Delta-mediated activation of Notch signaling in the surrounding cells. It is 
possible that, like the activation of Notch by Delta, the suppression of Notch signaling is an 
active process mediated by the interaction of Wingless and Dishevelled with Notch. If this 
were the case, since both Delta and Wingless have spatially and temporally regulated patterns 
of gene expression, their interactions with Notch could contribute to the well-documented bias 

30 in the appearance of precursors from clusters of cells with neural potential. This competitive 
interaction could also account for the observed increases in Wingless signaling associated with 
reductions in Notch signaling during lateral inhibition (Brennan et al., 1999, Curr Biol. 9:707- 
10; Brennan et al, 1999, Dev Biol. 216:210-29). 
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The NOV33 nucleic acid of the invention encoding a Notchl-like protein includes the 
nucleic acid whose sequence is provided in Table 33A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 33A while still encoding a protein that maintains its 
5 Notch 1 -like activities and physiological functions, or a fragment of such a nucleic acid. The 
invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

10 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the residues 

1 5 may be so changed. 

The NOV33 protein of the invention includes the Notchl-like protein whose sequence 
is provided in Table 33B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 33B while still 
encoding a protein that maintains its Notchl-like activities and physiological functions, or a 

20 functional fragment thereof. In the mutant or variant protein, up to about 11% of the bases 
may be so changed. 

The NOV33 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 

25 have efficacy for treatment of patients suffering from: neoplasia such as T-cell acute 

lymphoblastic leukemia/Iymphoma and mammary carcinomas, a late onset neurological 
disease like CADASIL and a developmental disorder such as the Alagille syndrome, familial 
and congenital cholestatic diseases, Hereditary vascular dementia, neurological diseases and 
other diseases, disorders and conditions of the like. 

30 NOV33 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV33 protein have multiple 
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hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 
5 NOV34 

A disclosed NOV34 nucleic acid of 935 nucleotides (also referred to as CG56715-01) 
encoding a novel Olfactory Receptor-like protein is shown in Table 34A. An open reading 
frame was identified beginning with an ACA codon, which codes for the amino acid 
Threonine, at nucleotides 2-4 and ending with a TGA codon at nucleotides 932-934. Putative 
10 untranslated regions, if any, found upstream from the initiation codon and downstream from 
the termination codon are underlined in Table 34A, and the start and stop codons are in bold 
letters. 

Table 34A. NOV34 Nucleotide Sequence (SEQ ID NO:135) 

CACACAAGAAGGCATCTACTTCATCCT 

TCCCCGTCTGCTGTCTCTA»CCATCTCCATCATG^ 

CCATCTGTCCACCAGCGCATGTATCTGTTTCTCTCCATGCT 

CACCCTACCCACAGTCATG(^GCTTCTCTGGTTCAACGTTCGTAGAATCAGCTCTGAGGCCCGTTTTGCTC 
AGTTTTTCTTCCTTCATGGATTCTCCTTTATGGACT 

GTGGCCATCTGCTGTCCCCTCCATTATGCCTCCATCCTCACCAATGAAGTCATTGGTAGAACTGGGTTAGC 
CATCATTTGCTGCTGTGTTCTGGCGGTTCTTCCCT CC CTTTT CTT ACT CAAGCGACTGCCTTTCTGCCACT 
CCCACCTTCTCTCTCGCTCCTATTGCCTCCACCAGGATATGATCCGCCrrGGTCTGTGCTGACATCAGGCTC 
AACAGCTGGTATGGATTTGCTCTTGCCTTGTTCATTATTATCGTGGATCCTCTGCTCATTGTGATCTCCTA 
TACACTrATTCTGAAAAATATCTTGGGCACAGCCACCT 

TGTCCCACATTCTGGCTGTCCTGGTCCTCTACATTCCCATGGTTGGTGTATCTATGACTCATC 
AAGCATGCCTCTCCACTGGTCCATGTTATCATGGCCAATATCTACCTGCTGGCACCCCCGGTGATGAACCC 
CATCATTTACAGTGTAAAGAACAAGCAG ATCCAATGG GGAATGTTAAATTTCCTTTC C CTCAAAAATATG C 
ATTCAAGATGAG 



The NOV34 nucleic acid has 578 of 903 bases (64%) identical to a gb:GENBANK- 

15 ID:AF137396|acc:AF137396.2 mRNA from Homo sapiens (ubiquilin 3, HOR 5'Betal4, . 

HOR5'Betal3, HOR5'Betal2, and HORS'Betal 1 genes, complete cds; HOR 5'BetalO and 

HOR5'Beta9 pseudogenes, complete sequence; HOR5'Beta8 and HOR5 f Beta7 genes, complete 

cds; CHR1 10RF1 and amphiphysin pseudogenes, complete sequence; HOR5'Beta6 and 

HOR5 ? Beta5 genes, complete cds; HOR5'Beta4 pseudogene, complete sequence; HOR 5TBeta3 

20 genes, complete cds; HOR5 r Beta2 pseudogene, complete sequence; and HOR 5'Betal gene, 

complete cds) (E = 6. 1 e" 50 ). 

A disclosed NOV34 polypeptide (SEQ ID NO:136) encoded by SEQ ID NO:135 is 

310 amino acid residues and is presented using the one-letter code in Table 34B. Signal P, 

Psort and/or Hydropathy results predict that NOV34 contains a signal peptide and is likely to 

25 be localized to the plasma membrane with a certainty of 0.6000. The most likely cleavage site 

for a NOV34 peptide is between amino acids 36 and 37: IMG-NT. 
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Table 34B. Encoded NOV34 protein sequence (SEQ ID NO:136) 

TQEGIYFILTDIPGFEASHIWISIPVCCLYTISIMGNTTILTVIRTEPSVHQRMYLFLSMIiALTDLGLTLTT 
LPTVMQLLWFNVRRISSEARFAQFFFIiHGFSFMESSVLI^ 

C CCVLAVLPSLFLL KRL PFCHSHLL SRS YCLHQDM I RLVCADI RLNS WYGFALAL FI I IVDPLLI VI SYTLI 
LKNILGTATWAERLRMjNNCLSHILAVLV^ 

VKNKQIQWGMLNFLSLKNMHSR 

The disclosed NOV34 amino acid sequence has 160 of 296 amino acid residues (54%) 
identical to, and 210 of 296 amino acid residues (70%) similar to, the 319 amino acid residue 
ptnr:TREMBLNEW-ACC: AAG4 1 684 protein from Mus musculus (Mouse) (MOR 3'BETA4) 
5 (E = 6.3e 83 ). 

NOV34 is predicted to be expressed in at least Apical microvilli of the retinal pigment 
epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 

10 umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 

1 5 spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

NOV34 also has homology to the amino acid sequences shown in the BLASTP data 

20 listed in Table 34C. 



Table 34C. BLAST results for NOV34 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 

(%) 


Expect 


gi | 17472775 | ref |XP_ 
061808. l| 
(XM_061808) 


similar to MOR 
3Beta4 (H. 
sapiens) [Homo 
sapiens] 


317 


298/300 
(99%) 


298/300 
(99%) 


e-136 


gi 1 17456767| ref |XP 

061618.1) 

(XM_061618) 


similar to 
prostate specific 
G-protein coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


879 


168/289 
(58%) 


215/289 
(74%) 


6e-81 


gi| 17456753 |ref|XP 
061614. 1| 
(XM_061614) 


similar to MOR 
3Beta4 (H. 
sapiens) [Homo 
sapiens] \ 


315 


154/294 
(52%) 


214/294 
(72%) 


5e-75 
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gi| 17472781 j ref |XP 

061811.1] 

(XM_061811) 


similar to 
OLFACTORY 
RECEPTOR 5112 
(HOR5BETA12) (H. 
sapiens) [Homo 
sapiens] 


312 


150/290 
(51%) 


204/290 
(69%) 


6e-74 


gi| 11908220 |gb|AAG4 
1684. 1| (AF133300) 


MOR 3 'Beta4 [Mus 
musculus] 


319 


159/294 
(54%) 


209/294 
(71%) 


2e-72 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 34D. 



Table 34D Clustal W Sequence Alignment 

5 

1) NOV34 (SEQ ID NO: 136) 

2) gi | 17472775 | ref |XP_061808.1 | (XM_061808) similar to MOR 3Beta4 (H. sapiens) 
[Homo sapiens] (SEQ ID NO:472) 

3) gi | 17456767 | ref |XP_061618.l| (XM_061618) similar to prostate specific G-protein 
10 coupled receptor (H. sapiens) [Hojho sapiens] (SEQ ID NO: 473) 

4) gi | 17456753 | ref |XP_061614 . 1 | <XMJ)61614) similar to MOR 3Beta4 (H. sapiens) 
[Homo sapiens] (SEQ ID NO: 474) 

5) gi 1 17472781 1 ref |XP_061811 . 1 1 (XM_06181l) similar to OLFACTORY RECEPTOR 5112 
(HOR5BETA12) (H. sapiens) [Homo sapiens] (SEQ ID NO:475) 

15 6) gi | 11908220 | gb| AAG41684 . 1 | (AF133300) M0R3'Beta4 [Mus musculus] (SEQ ID NO: 476) 



20 



25 



30 



35 



40 



NOV34 



gi 
gi 
gi 
gi 
gi 



17472775 
17456767 
17456753 
17472781 
11908220 



NOV34 



gi 
gi 
gi 
gi 
gi 



17472775 
174567 67 
17456753 
17472781 
11908220 



NOV34 

gi | 17472775 
gi j 17456767 
gi j 17456753 
gi j 17472781 
gi j 1190822 0 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|....|....|....|....| 

i -- - i 

i i 

1 MSLALDLCPLSQRLEAJPSSIVLFFQTAPAVRHPKGLLELHKTVPTSIKEELKGFFPTSD 60 

1 1 

1 1 

1 1 

70 . 80 90 100 HO 120 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 - - 1 

1 - 1 

61 HFI 1TDFIAKYHTDLKKAVLGIATPRQQFKALNTCISHI CAVLIFYVPTLSAAMLHQFAR 120 
1 j_ 

! ..... : t 

i -- i 

130 140 150 160 170 180 
....|....|....|....|....|....|....|....|....|....|....|....| 
1 . 1 

1 " - " - 1 

121 DVSPMIHVLMADIFLLVPPLLNPIVYCVKTHQIREKVVGKIiCPKNCFLKSKILPRCSFVP 180 

1 " " 1 

1 1 

1 - 1 



45 



50 



NOV34 



gi 
gi 
gi 
gi 



1747277S 
17456767 
17456753 
17472781 
11908220 



190 200 210 220 230 240 

| | | | | | | | | | | | 

1 -- 1 

1 - ~ - 1 

181 GFRLAYYYLPPHPSKVSFLDPVEKANRSAPTQFSPMPSADASLIiADLGTFSSLQHATFFL 240 

1 - - - 1 

1 - - - -— 1 

1 — 1 



250 260 270 280 290 300 

gi|l7472775| 1 ___ x 

gi 1 17456767 j 241 TGFQGLEGUIGWISIPFCFIYLTVILGNLTILHVICTDATLHGPMYYFLGMIiAVTDLGLC 3 00 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 





17456753 


1 


gi 


17472781 


1 


gi 


11908220 


1 


NOV34 


1 


gi 


17472775 


1 


gi 


17456767 


301 


gi 


17456753 


1 


gi 


17472781 


1 


gi 


11908220 


1 


NOV34 


1 


gi 


17472775 


1 


gi 


17456767 


361 


gi 


17456753 


1 


gi 


17472781 


1 


gi 


11908220 


1 


NOV34 


1 


gi 


17472775 


1 


gi 


17456767 


421 


gi 


17456753 


1 


gi 


1747278X 


1 


gi 


11908220 


1 



NOV34 

gi | 17472775 
gi | 17456767 
gi j 17456753 
gi 1 17472781 
gi j 11908220 



NOV34 



NOV34 



310 
..|.. 



320 



330 
..|....|. 



340 



350 



360 



370 



380 

|....|....|, 



390 



400 410 
..|....|....|.. 



420 



430 440 450 460 470 480 

|....|....|....|....|....|....|....|....|....|....|....| 



490 



500 
..|.. 



510 



520 



530 



540 



1 " — r- - - 1 

481 MIGLSLVHRFGEHLPR WHLFMS YVYLLVP PLMNPI I YS I KTKQIRQRI I KKFQFIKSLR 540 

1 1 

1 — - - " - 1 

1 - 1 



550 



560 



570 



gi 


17472775 


1 


gi 


17456767 


541 


gi 


17456753 


1 


gi 


17472781 


1 


gi 


11908220 


1 


NOV34 


26 


gi 


17472775 


33 


gi 


17456767 


601 


gi 


17456753 


33 


gi 


17472781 


31 


gi 


11908220 


33 


NOV34 


86 


gi 


17472775 


93 


gi 


17456767 


661 


gi 


17456753 


93 


gi 


17472781 


91 


gi 


11908220 


93 




580 

•■•-I 
■gQEGI 

!slg|ss|svsat 
©wnBsd-avepi 

LF-S- -VTHPj 
.TS-SssSlVSST 



590 



600 




610 



620 



630 



640 



650 



660 



146 
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10 



15 



20 



25 



gi 
gi 
gi 
gi 
gi 



17472775 
17456767 
17456753 
17472781 
11908220 



153 CCVliAfVLI 

721 fflvSiaf 

153 r 

151 feFITL: 
153 SNjL* 



NOV34 


206- 




gi 


17472775 


213 




gi 


17456767 


781 


s| 


gi 


17456753 


213 




gi 


17472781 


211 


LF 


gi 


11908220 


213 





NOV34 


266 




gi 


17472775 


273 




gi 


17456767 


841 




gi 


17456753 


273 


r8 


gi 


17472781 


271 


raj 


gi 


11908220 


273 






880 
I 

300 

SLKNMHSR- - 317 

CY 879 

FVLRRRF 315 

FRMFHHIKI 312 

iCSPKISSITM 319 



Table 34E list the domain descriptions from DOMAIN analysis results against 
NOV34. This indicates that the NOV34 sequence has properties similar to those of other 
proteins known to contain this domain. 



30 



35 



40 



45 



50 



Table 34E Domain Analysis of NOV34 



gnl|pfam|pfam00001, 7tm_l, 7 transmembrane receptor {rhodopsin 

family). (SEQ ID NO: 8 10) 

CD-Length = 254 residues, 100.0% aligned 

Score = 41.6 bits (96), Expect = 7e-05 



NOV4 0: 


36 


Sbjct : 


1 


NOV40: 


96 


Sbjct: 


61 


NOV40: 


156 


Sbjct: 


117 


NOV40: 


216 


Sb j ct : 


172 


NOV40: 


268 


Sbjct: 


232 



II II " + +11 11+ II II I + I 

GNLLVILVILRTKKLRTPTNIFIJjNIAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 60 



+ 1 l+M Ml II I I I 



MM I 



+ + + + + 



, - - , , ++ I + I++ II 

LPPLLFSWLRTVEEGNTWCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTR 171 

It, KNI USTATWAERIoRALNNCLSHILAVL VLY I PMVGVSMTHRFAKHAS PLV 267 

II + +| | ++ ++ ++| | + +| 

Sb j ct : 172 ILRTLRKRARSQRSLKRRS SSERKAAKMLL VVVVVFVLCWLP YHI VLLLDSLCLLS I WRV 231 

'IMANIYIxL- - - APPVMNPI IY 287 
I I +111 I I 



G-Protein Coupled Receptor (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
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classified into four major subfamilies. These receptors share a seven transmembrane domain 
structure with many neurotransmitter and hormone receptors. They are likely to be involved in 
the recognition and transduction of various signals mediated by G-Proteins, hence their name 
G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and belong to 
5 four gene subfamilies, displaying great sequence variability. These genes are dominantly 
expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large family of GPCRs in 
a number of species. As members of the GPCR family, these receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 

10 likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs they can be expressed in a variety of tissues where they are thought to be 
involved in recognition and transmission of a variety of signals. The human OR genes are 
typically intron-less and belong to four different gene subfamilies, displaying great sequence 
variability. These genes are dominantly expressed in olfactory epithelium. 

1 5 The NOV34 nucleic acid of the invention encoding a Olfactory Receptor-like protein 

includes the nucleic acid whose sequence is provided in Table 34A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 34A while still encoding a protein that maintains 
its Olfactory Receptor-like activities and physiological functions, or a fragment of such a 

20 nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
to those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of non-limiting example, modified bases, and nucleic acids 

25 whose sugar phosphate backbones are modified or derivatized. These modifications are carried 
out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 36% of 
the residues may be so changed. 

30 The NOV34 protein of the invention includes the Olfactory Receptor-like protein 

whose sequence is provided in Table 34B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
34B while still encoding a protein that maintains its Olfactory Receptor-like activities and 
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physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 49% of the bases may be so changed. 

The NOV34 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
5 below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: developmental diseases, MHCII and III 
diseases (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders, Cell Shape disorders, Feeding 

10 disorders, control of feeding, potential obesity due to over-eating, potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm, adenocarcinoma, lymphoma, prostate cancer, 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

15 hypertension, urinary retention, osteoporosis, Crohn's disease, multiple sclerosis, treatment of 
Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, ulcers, asthma, 
allergies, benign prostatic hypertrophy, psychotic and neurological disorders, including 
anxiety, schizophrenia, manic depression, delirium, dementia, severe mental retardation, 
Dentatorubro-pallidoluysian atrophy (DRPLA), Hypophosphatemic rickets, autosomal 

20 dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease or Gilles 
de la Tourette syndrome and other diseases, disorders and conditions of the like. 

NOV34 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

25 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. For example the disclosed NOV34 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 

30 new drug targets for various disorders. 



NOV35 

A disclosed NOV35 nucleic acid of 1 102 nucleotides (also referred to as CG5671 8-01) 
encoding a novel Olfactory Receptor-like protein is shown in Table 35A. An open reading 
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frame was identified beginning with an ATG initiation codon at nucleotides 92-94 and ending 
with a TGA codon at nucleotides 1049-1051. Putative untranslated regions, if any, found 
upstream from the initiation codon and downstream from the termination codon are underlined 
in Table 3 5 A, and the start and stop codons are in bold letters. 

Table 35A. NOV35 Nucleotide Sequence (SEQ ID NO:137) 

ATAACTCTACTAACTAAAAGAAAATGTATCTAAATATTTTACATAACACTT7ATGTATCACTTCTATAGGA ' 
TGGAAG C C TT TTGAGGACAAATGCTG CAAAAC CAGGACAC CATGGAAATC C TAAGCAACT CAACAT CT AAA 
TTTCCAACCTTCTTGTTGACCGGCATTCCTGGCCTAGAGTCTGCCCATGTCTGGATCTCCATTCCTTTCTG 
OTGTTTTTATGCCAOTGCCCTCTCTGGGAACAGCGTGATCCTGTT 

ATGAACCCATGTATTATTTCCTCTTCAGGCTATCAGCCACTGATCTGGGCTTGACTGTTTCTTCATTGTCA 
ACAACATTAGGTATCCTCTGGTTTGAGGCACGTGAAATCAGTCTATATAGCTGCATTGTCCAGATGTTTTT 
TCTTCATGGATTC^CTTTTATGGAATCTGGAGTGCrGGTGGCTACAGCCT 

GTGAC CCT CTGAGG CACACTACCATTCT CACT AATT CCAGAAT CATTCAAATG GGT CT T CTGATGATTACA 

CGTGCTATAGTACTAATATTGCCACTACTTTTGCTCCTTAAGCCTCTCTATTTCTGTAGAATGAATGCCCT 

TT C T CACT C CTATTGTTACCATCC AG ATGTGATTCAATTAG CATGTTCAGACATTCGGG CAAATAG CAT CT 

GTGGATTAATTGATCT CAT CCTGAC CA.CT GGAAT AG ATACACCATG CA CT TAATT 

ATTCACTCTGTCCTCA<GAATTGCCTCCCCTGAAGAATGGCACAAGGTCTrCAGCACCT 

GGGAGCAGTTGCTTTCTTCTACATCCACATGCTGAGCCTGTCCTTGGTGTATCGCTA 

CCAGAGTAGTCCATTCAGTGATGGCTAATGTATACCTGCTTTTACTC^ 

AGTGTAAAAACAAAACAAAT C CGC AAGGCTATGCTCAGT CTGCTG CTTACAAAATGAACAGACATAGTTTT 
ATTTGATACAAACCTGGCATGAATGACTTGCACTGTA 

5 

The NOV35 nucleic acid, located on chromosome 1 1, has 636 of 1006 bases (63%) 
identical to a gb:GENBANK-ID:AF133300|acc:AF133300.2 mRNA from Mus musculus 
(MOR 3'Betal, MOR 3'Beta2, MOR 3'Beta3, and MOR 3*Beta4 genes, complete cds; Cbx3 
pseudogene, complete sequence; and MOR 3'BetaS and MOR 3'Beta6 genes, complete cds) (E 
10 = 1.2e" 51 ). 

A disclosed NOV35 polypeptide (SEQ ID NO:138) encoded by SEQ ID NO:137 is 
319 amino acid residues and is presented using the one-letter code in Table 35B. Signal P, 
Psort and/or Hydropathy results predict that NO V3 5 contains a signal peptide and is likely to 
be localized to the plasma membrane with a certainty of 0.6000. 

Table 35B. Encoded NOV35 protein sequence (SEQ ID NO:138) 

MLQNQDTMEILSNSTSKFPTFLLTGIPGLESAHVWISIPFCCFYAIALSGNSVILFVIITQQSLHEPM^FL 
FRLS ATDLGLTVSS LSTTLGIL WFEARE I S LYSCI VQMFFLHGFTFMESGVLVATAFDRYVAI CDPLRHTTI 
LTNSRI IQMGLLMITRAIVLI LPLLUjLKPLY FCRMNALSHSYCYHPDVIQLACSDIRANS I CGLIDLILTT 
GIDTPCIVLSYILI IHSVLRIASPEEWHKVFSTCTSHVGAVAFFYIHMLSLSL 

YLLLPPVLNPI IDS VKTKQIRKAMLSLLLTK 

15 

The disclosed NOV35 amino acid sequence has 160 of 309 amino acid residues (51%) 
identical to, and 219 of 309 amino acid residues (70%) similar to, the 321 amino acid residue 
ptnr:TREMBLNEW-ACC:AAG42364 protein from Homo sapiens (Human) (odorant receptor 
HOR3 f BETAl)(E = 2.5e 81 ). 
20 NOV35 is predicted to be expressed in at least Apical microvilli of the retinal pigment 

epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell'lines, corpus 
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callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

NOV35 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 35C. 



15 



Table 35C. BLAST results for NOV35 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


. gi| 17456801 |ref |XP 
061626. 1| 
(XMJ>61626) 


similar to 
OLFACTORY 
RECEPTOR 5112 
(HOR5BETA12) (H. 
sapiens) [Homo 
sapiens] 


342 


196/312 
(62%) 


251/312 
(79%) 


le-95 


gi 1 17456777 |ref|XP 
061621.1 | 
(XM_061621) 


similar to 
olfactory 
receptor- like 
protein COR3beta 
(H. sapiens) 
[Homo sapiens] 


327 


165/312 
(52%) 


218/312 
(68%) 


6e-77 


gi 1 17456767 |ref|XP 
061618. l| 
<XM_061618) 


similar to 
prostate specific 
G- protein coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


879 


158/304 
(51%) 


218/304 
(70%) 


8e-74 


gi | 17472781 |ref|XP 
061811. 1| 
(XM_061811) 


similar to 
OLFACTORY 
RECEPTOR 5112 
(HOR5BETA12) (H. 
sapiens) [Homo 
sapiens] 


312 


155/295 
(52%) 


209/295 
(70%) 


le-73 


gi| 18202242 |sp|0886 
28|0XE2_RAT 


Olfactory 
receptor 51E2 (G- 
protein coupled 
receptor RAlc) 


320 


147/305 
(48%) 


206/305 
(67%) 


4e-73 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 35D. 



5 
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Table 35D Clustal W Sequence Alignment 



1) NOV35 (SEQ ID NO: 13 8) 

2) gi 1 17456801 1 ref |XP_0€1626 . 1 | (XM_061626) similar to OLFACTORY RECEPTOR 5112 
5 (HOR5BETA12) (H. sapiens) [Homo sapiens] (SEQ ID NO:477) 

3) gi | 17456777 | ref | XP_061621 . 1 | (XM_06162l) similar to olfactory receptor-like 
protein COR3beta (H. sapiens) [Homo sapiens] (SEQ ID NO:478) 

4) gi 1 17456767 | ref |XP_061618 . l | (XM_061618) similar to prostate specific G-protein 
coupled receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 479) 

0 5) gij 17472781 1 ref |XP_061811 . 1 | (XM_061B11) similar to OLFACTORY RECEPTOR 5112 
( HOR5 BETA1 2 ) (H. sapiens) [Homo sapiens] (SEQ ID NO:480) 

6) gi| 18202242 |sp|08B628|OXE2_RAT Olfactory receptor 51E2 (G-protein coupled 
receptor RAlc) (SEQ ID NO: 481) 



NOV35 



gi 
gi 
gi 
gi 



174568011 
17456777 j 
17456767| 
17472781) 
18202242 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 -- 1 

1 --- - l 

1 " - 1 

1 MSLALDLCPLSQRLEAFPSSIVLFFQTAPAVRHPKGLLELHKTVPTSIKEELKGFFPTSD 60 

1 — " - - - 1 

1 - " 1 



5 



NOV35 



17456801 
17456777 
17456767 
17472781 
18202242 



70 80 90 100 110 120 

....|...-|....|....|....|....|....|....|....|....|....|....| 

1 - 1 

1 - 1 

1 — -'- 1 

61 HFI ITDFIAKYHTDLKWAVLGIATPRQQFKALNTC I SHICAVLIFYVPTLS AAMLHQFAR 120 

1 --- - 1 

1 -- 1 



5 



NOV35 



gi 
gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



130 140 150 160 170 180 

-...|.-..|....|....|....|....|-...|....|-...|....|....|....| 

1 " " — - 1 

1 ' " 1 

1 - " - 1 

121 DVS PMIHVLMADI FLLVPPLLNP I VYC VKTHQ I RE KWGKLCP KNCFLKSK IL PRCS FVP 180 

1 " " — 1 

1 --- 1 



190 200 210 220 230 240 

....| | | | | | |....| |....| | | 

NOV35 1 1 

5 gi|l745680l| 1 -- ! 

gi|l7456777| 1 --- ! 

gi 1 17456767 j 181 GFRIAYYYLPPHPSKVSFLDPVEKANRSAPTQFSPMPSADASLLADLGTFSSLQRATFFL 240 

gi|l747278l| 1 --- - 1 

gi|l8202242| 1 i 



NOV35 



gi 
gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



250 260 270 280 290 300 

....|.-..|....|.-..|....|....|....|....|....|....|....|....| 

1 - - 1 

1 - - 1 

1 _ - 1 

241 TGFQGLEGLHGWI S I PFCF I YLTVILGNLTILHVI CTDATLHGPMYYFLGMLAVTDLGLC 300 

1 - — - - 1 

1 " — 1 



J 



NOV35 



gi 
gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



310 320 330 340 350 360 

-...|--..|....|....|....|-....|....|....|.-..|....|....|....| 

1 " - 1 

1 1 

1 - - 1 

3 01 LSTLPTVLGIFWFDTREIGIPACFTQLFFIHTLSSMESSVLLSMSIDRYVAVCNPLHDST 360 

1 - 1 

1 i 
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10 



15 



NOV35 



gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



NOV35 



gi 
gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



370 380 390 400 410 420 

....|...-|..--|....|....|....|....|....|....|-...|....|....| 

1 1 

1 - 1 

1 i 

361 VLTPACIVKMGLSSVLRSALLILPLPFIJjKRFQYCHSHVLAHAYCLHLEIMKLACSSIIV 420 
X --- 1 

1 - - --- - 1 



430 440 450 460 470 

|....|....|....|....|....|....|..:.|....|....|... 



480 



1 - — 1 

1 " - — - 1 

421 NHI YGLFWACT VG VDSLL I FLS YALI LRTVLS I ASHQERLRALNTCVSHI CAVLLFYI P 480 

1 — 1 

1 — - — i 1 



20 



25 



NOV35 



gi 
gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



490 500 510 520 530 540 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 " - 1 

1 — - - 1 

1 ---- --- 1 

481 MIGLSLVHRFGEHLPRVVHLFMSYVYLLVPPIJ^IIYSIKTKQIRQRIIKKFQFIKSLR 540 

1 — 1 

1 - 1 



550 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV35 



gi 
gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 
18202242 



NOV35 

gi 1 17456801 
gi 1 17456777 
gi 1 1745 67 67 
gi j 17472781 
gi j 18202242 



NOV35 



gi 
gi 
gi 
gi 



17456801 
17456777 
17456767 
17472781 



1 
1 
1 

541 

1 

1 



560 
..|....|... 

MLQNQDT 

MTETSLSSQC- FPR 



CNHQYCLNLLQDFGGHPPSPLSPHT 



570 580 
llM-- -EPLI 

FjgNgTs---ss[ 

,GNSSSSVS$ 

sscjjj: 



590 



600 



610 




39 

44 

32 

600 

30 

29 



NOV35 


40 


gi 


17456801 


45 


gi 


17456777 


33 


gi 


17456767 


601 


gi 


17472781 


31 


gi 


18202242 


30 


N0V35 


100 


gi 


17456801 


105 


gi 


17456777 


93 


gi 


17456767 


661 


gi 


17472781 


91 


gi 


18202242 


90 
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gi 1 18202242 | 210 VMF0S| 



»P 269 



850 



860 



10 



NOV35 


280 


gi 


17456801 


285 


gi 


17456777 


273 


gi 


17456767 


841 


gi 


17472781 


271 


gi 


18202242 


270 




870 880 890 

I 

.SELL 1 
IKVLL 

§§lqs| 

STTHAFCY 

---giB|UMFHHIKI 

~~IEAGGNT 



900 



319 

KSNHQLFLIRDKAIY 341 
SWGFNVRGLRGRWD-- 327 

879 

312 

320 



NOV35 


319 




319 


gi 


17456801 


342 


E 


342 


gi 


17456777 


327 




327 


gi 


17456767 


879 




879 


gi 


17472781 


312 




312 


gi 


18202242 


320 




320 



25 



Table 35E list the domain descriptions from DOMAIN analysis results against 
NOV35. This indicates that the NOV35 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 35E Domain Analysis of NOV35 

gnl|pfam|pfaraQ0O01, 7tm_l, 7 transmembrane receptor (rhodopsin family) 
(SEQ ID NO: 810) 

CD-Length m 254 residues, 40.9% aligned 
Score = 56.2 bits (134), Expect =* 3e-09 



30' 



35 



40 



45 



NOV35: 50 GNSVILFVIITQQSLHEPMYYFLFRLSATDLGLTVSSL^^ 109 

II *++ II* + I I II 1+ II II I + 

Sbjct: 1 GNLLVILVILRTKXLRTPTNIFLLJ^ 60 

NOV35: 110 MFFLHGFTFMESGVLVATAFDR YVAI CDPIiRHTT ILTNSR I X QM 153 

+ +1 I * 111-11 111+ I I I + 
Sbjct: 61 GALFVVNGYASILLI/TAI S IDRYLAI VHPLRYRRIRTPRRAKVL 104 

G-Protein Coupled Receptor (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
classified into four major subfamilies. These receptors share a seven transmembrane domain 
structure with many neurotransmitter and hormone receptors. They are likely to be involved in 
the recognition and transduction of various signals mediated by G-Proteins, hence their name 
G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and belong to 
four gene subfamilies, displaying great sequence variability. These genes are dominantly 
expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large family of GPCRs in 

a number of species. As members of the GPCR family, these receptors share a seven 

transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
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likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs they can be expressed in a variety of tissues where they are thought to be 
involved in recognition and transmission of a variety of signals. The human OR genes are 
typically intron-less and belong to four different gene subfamilies, displaying great sequence 
5 variability. These genes are dominantly expressed in olfactory epithelium. 

The NOV35 nucleic acid of the invention encoding a Olfactory Receptor-like protein 
includes the nucleic acid whose sequence is provided in Table 3 5 A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 35 A while still encoding a protein that maintains 

10 its Olfactory Receptor-like activities and physiological functions, or a fragment of such a 

nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
to those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 

15 modifications include, by way of non-limiting example, modified bases, and nucleic acids 

whose sugar phosphate backbones are modified or derivatized. These modifications are carried 
out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 37% of 

20 the residues may be so changed. 

The NOV35 protein of the invention includes the Olfactory Receptor-like protein 
whose sequence is provided in Table 35B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
35B while still encoding a protein that maintains its Olfactory Receptor-like activities and 

25 physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 52% of the bases may be so changed. 

The NOV35 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 

30 have efficacy for treatment of patients suffering from: developmental diseases, MHCII and III 
diseases (immune diseases), Taste and scent detectability Disorders, Burkitfs lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders, Cell Shape disorders, Feeding 
disorders, control of feeding, potential obesity due to over-eating, potential disorders due to 
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starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV- 1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm, adenocarcinoma, lymphoma, prostate cancer, 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

5 hypertension, urinary retention, osteoporosis, Crohn's disease, multiple sclerosis, treatment of 
Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, ulcers, asthma, 
allergies, benign prostatic hypertrophy, psychotic and neurological disorders, including 
anxiety, schizophrenia, manic depression, delirium, dementia, severe mental retardation, ^ 
Dentatorubro-pallidoluysian atrophy (DRPLA), Hypophosphatemic rickets, autosomal 

0 dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease or Gilles 
de la Tourette syndrome and other diseases, disorders and conditions of the like. 

NOV35 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

5 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. For example the disclosed NOV35 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 

0 new drug targets for various disorders. 



NOV36 

NOV36 includes two novel cadherin 1 1 -like proteins disclosed below. The disclosed 
sequences have been named NOV36a and NOV36b. 

NOV36a 

A disclosed NOV36a nucleic acid of 2476 nucleotides (also referred to as CG56729- 
01) encoding a novel cadherin 1 l-like protein is shown in Table 36A. An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 46-48 and ending with a 
TGA codon at nucleotides 2389-2391. Putative untranslated regions, if any, found upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 
36A, and the start and stop codons are in bold letters. 

Table 36A. NOV36a Nucleotide Sequence (SEQ ID NO:139) 
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TCAGCACCC^GGGCCAGTGAACAGAGCCCTGGCTGGAGTCCAAACA TGTGGGGCCTGGTGAGGCTCCTGC 

TGGCCTGGCTGGGTGGCTGGGGCTGCATGGGGCGTCTGGCAGCCCCAGCCCGGGCCTGGGCAGGGTCCCG 

GGAACACCCAGGGCCTGCTCTGCTGCGGACTCGAAGGAGCTGGGTCTGGAACCAGTTCTTTGTCATTGAG 

GAATATGCTGGTCCAGAGCCTGTTCTCATTGGCAAGCTGCACTCGGATGTTGACCGGGGAGAGGGCCGCA 

CCAAGTACCTGTTGACCGGGGAGGGGGCAGGCACCGTATTTGTGATTGATGAGGCCACAGGC^TAT 

TGTTACCAAGAGCCTTGACCGGGAGGAAAAGGCGCAATATGTGCTACTGGCCCAAGCCGTGGACCGAGCC 

TCCAACCGGCGCCTGGAGCCCCCATCAGAGTTCATCATCAAAGTGCAAGACATCJUVCGACAATCCACCCA 

TTTTTCCCCTTGGGCCCTACCATGCCACCGTGCCCGAGATGTCCAATGTCGGTACATCAGTGATCCAGGT 

GACTGCTCACGATGCTGATGACCCCAGCTATGGGAACAGTGCCAAGCTGGTGTACACTGTTCTGGATGGA 

CA<^GGAGGAGTTCTTGGTGGTGATCCAGGCC^GGA(^TGGGCGGCC^CATGGGGGGGCTGTCAGGCAG 

CACTACGGTGACTGTCACGCTCAGCGATGTCAACGACAACCCCCCCAAGTTCCCACAGA 

TTCTCCGTGGTGGAGACAGCTGGACCTGGCACACTGGTGGGCCGGCTCCGGGCCCAGGACCCAGACCTGG 

GGG AC AACGC C CTGATGG CAT AC AGC ATC CTG G ATGGGGAGGGGTCTGAGG CCTT CAG CAT CAG CACAGA 

CTTGCAGGGTCGAGACGGGCTCCTCACTGTCCGCAAGGTTCTAGACTTTGAGAGCCAGCGCTCCTACTCC 

TTCCGTGTCGAGGCCACCAACACGCTCATTGACCCAGCCTATCTGCGGCGAGGGCCCTTCAAGGATGTGG 

CCTCTGTGCGTGTGGCAGTGC^U^GATGCCCCAGAGCCACCTGCCTTCACCCAGGCTGCCTACCACCTGAC 

AGTGCCTGAGAACAAGGCCCCGGGGACCCTGGTAGGCCAGATCTCCGCGGCrGACCTGGACTCCCCTGCC 

AGCCCAATCAGGTACTCCATCCTCCCCCACTCAGATCCGGAGCGTTGCTTCTCTATCCAGCCCGAGGAAG 

GCACCATCCATACAGCAGCACCCCTGGATCGCGAGGCTCGCGCCTGGCACAACCTCACTGTGCTGGCTAC 

AGAGCTCGGTGAGGACTCCCAGGCCTCGCGCGTGCAAGTGGCCATCCAGACCCTGGATGAGAATGACAAT 

GCTCCCCAGCTGGCTGAGCCCTACGATACTTTTGTGTGTGACTCTGCAGCTCCTGGCCAGCTGATTCAGG 

TCATCCGGGCCCTGGACAGAGATGAAGTTGGCAACAGTAGCCATGTCTCCTTTCAAGGTCCTCTGGGCCC 

TGATGCCAACTTTACTGTCCAGGACAACCGAGATGGCTCCGCCAGCCTGCTGCTGCCCTCCCGCCCTGCT 

CCACCCCGCCATGCCCCCTACTTGGTTCCCATAGAACTGTGGGACTGGGGGCAGCCGGCGCTGAGCAGCA 

CTGCCACAGTGACTGTTAGTGTGTGCCGCTGCCAGCCTGACGGCTCTGTGGCATCCTGCTGGCCTGAGGC 

TCACCTCTCAGCTGCTGGGCTCAGCACCGGCGCCCTGCTTGCCATCATCACCTGTGTGGGTGCCCTGCTT 

GCCCTGGTGGTGCTCTTCGTGGCCCTGCGGCGGCAGAAGCAAGAAGCACTGATGGTACTGGAGGAGGAGG 

ACGTCCGAGAGAACATCATCACCTACGACGACGAGGGCGGCGGCGAGGAGGACACCGAGGCCTTCGACAT 

CACGGCCTTGCAGAACCCGGACGGGGCGGCCCCCCCGGCGCCCGGCCCTCCCGCGCGCCGAGACGTGTTG 

CCCCGGGCCCGGGTGTCGCGCCAGCCCAGACCCCCCGGCCCCGCCGACGTGGCGC^CTCCTGGCGCTGC 

GGCTCCGCGAGGCGGACGAGGACCCCGGCGTACCCCCGTACGACTCGGTGCAGGTGTACGGCTACGAGGG 

CCGCGGCTCCTCTTGCGGCTCCCTCAGCTCCCTGGGCTCCGGCAGCGAAGCCGGCGGCGCCCCCGGCCCC 

GCGGAGCCGCTGGACGACTGGGGTCCGCTCTTCCGCACCCTGGCCGAGCTGTATGGGGCCAAGGAGCCCC 

CGGCCCCCTG AGCGCCCGGGCTGGCCCGGCCCACCGCGGGGGGGGGGCAGCGGGCACAGGCCCTCTGAGT 

GAGCCCCACGGGGTCCAGGCGGGCGG 



The disclosed NOV36a nucleic acid sequence, located on chromosome 14, has 992 of 
1514 bases (65%) identical to a gb:GENBANK-ID:HUMCAl lA|acc:L34056.1 mRNA from 
Homo sapiens (cadherin-1 1 mRNA, complete cds) (E = 7.3e' 145 ) . 
5 A disclosed NOV36a polypeptide (SEQ ID NO: 140) encoded by SEQ ID NO: 1 39 is 

781 amino acid residues and is presented using the one-letter amino acid code in Table 36B. 
Signal P, Psort and/or Hydropathy results predict that NOV36a contains a signal peptide and is 
likely to be localized in the mitochondria! inner membrane with a certainty of 0.8227 in one 
embodiment and to the plasma membrane with a certainty of 0.4400 in an additional 
10 embodiment. The most likely cleavage site for a NOV36a polypeptide is between amino acids 
16 and 17: GWG-CM. 



Table 36B. Encoded NOV36a protein sequence (SEQ ID NO:140). 

MWGLVRLLLAWLGGWGCMGRLAAPARAWAGS REHPGP ALLRTRRS WVWNQFFVIEE YAGPEPVL IGKMSDVDRG 
EGRTKYLLTGEGAGTVFVIDEATGNIHVTKSL^^ 

PLGP YHATVPEMSNVGTS VI QVTAHDADDPS AKLVYTVLDGLPFFS VDPQTGVVRTAI PNMDRETQEEFL V 
VIQAKDMGGHMGGLSGSTTVTVTLSDVNDNP PKFPQSLYQFS WETAGPGTLVGRLRAQDPDIiGDNALMAYS ILD 
GEGSEAFSISTDLQGRDGLLTVRKVLDFESQRSYSFRV 

QAAYHLTVPENKAPGTLVGQI SAADLDS PASPIRYSILPHSDPERCFS IQPEEGTIHTAAPLDREARAWHNLTVL 
ATE LGED 5 QAS RVQVAI QTLDSNDNAPQLAE P YDTFVCDS AAPGQLI QVI RALDRDE VGNS SHVS FQGP LGPDAN 



310 



WO 02/068649 



PCTYUS02/02785 



FTVQDl^GSASLLLPSRPAPPRHAPYLVPIEL^ 

TGALLAI ITCVGALLALVVLFVALRRQKQEALMVIiEEEDVRENI ITYDDEGGGEEDTEAFDITALQNPDGAAPPA 
PGPPAIUlDVLPRARVSRQPRPPGPADVAQIilJtfjRLR]^ 

GAPGPAEPLDDWGPLFRTLAELYGAKEPPAP 



The disclosed NOV36a amino acid sequence has 434 of 746 amino acid residues 
(58%) identical to, and 552 of 746 amino acid residues (73%) similar to, the 796 amino acid 
residue ptnr:SWISSPROT-ACC:P55287 protein from Homo sapiens (Human) (CADHERIN- 
5 1 1 precursor (osteoblast-cadherin) (OB-cadherin) (OSF-4)) (E = 2.3c" 229 ). 

NOV36a is predicted to be expressed in at least Cerebral Medulla/Cerebral white 
matter, Gall Bladder, Retina, Temporal Lobe and Uterus. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
10 sources. 

In addition, NOV36a is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-ID: gb:GENBANK-ID:HUMCAl 1 A|acc:L34056.1) a 
closely related Homo sapiens cadherin-1 1 mRNA, complete cds homolog in species Homo 
sapiens: osteoblasts. 

15 

NOV36b 

A disclosed NOV36b nucleic acid of 2476 nucleotides (also referred to as CG56729- 
02) encoding a novel Cadherin 1 1-Iike protein is shown in Table 36C. An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 46-48 and ending with a 
20 TGA codon at nucleotides 2389-2391. Putative untranslated regions, if any, found upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 
36C, and the start and stop codons are in bold letters. 



Table 36C NOV36b Nucleotide Sequence (SEQ ID NO:141) 

TCAGCACCC^GGGCCAGTGAACAGAGCCCTGGCTGGAGTCC^^CATGTGGGGCCTGGTGAGGCTCCTGC 

TGGCCTGGCTGGGTGGCTGGGGCTGCATGGGGCGTCTGGCAGCCCCAGCCCGGGCCTGGGCAGGGTCCCG 

GGAACACCCAGGGCCTGCTCTGCTGCGGACTCGAAGGAGCTGGGTCTGGAACC7VGTTCTTTGTCATTGAG 

GAATATGCTGGTCCAGAGCCTGTTCTCATTGGCAAGCTGCACrrCGGATGTTGACCGGGGAGAGGGCCGCA 

CCAAGTACCTGTTGACCGGGGAGGGGGCAGGCACCGTATTTGTGATTGATGAGGCCACAGGCAATA 

TGTTACCAAGAGCCTTGACCGGGAGGAAAAGGCGCAATATGTGCTACTGGCCCAAGCCGTGGACCGAGCC 

TCCAACCGGCCCCTGGAGCCCCCATCAGAGTTCATCATC^ 

TTTTTCCCCTTGGGCCCTACCATGCCACCGTGCCCGAGATGTCCAATGTCGGTACATCAGTGATCCAGGT 

GACTGCTCACGATGCTGATGACCCCAGCTATGGGAACAGTGCCAAGCTGGTGTACACTGTTCTGGATGGA 

CTGCCTTTCTTCTCTGTGGACCCCCAGACTGGTGTGGTGCGTAC^GCCATCCCCAACATGGACCGGGAGA 

(^CAGGAGGAGTTCTTGGTGGTGATCCAGGCCAAGGACATGGGCGGCCACATGGGGGGGCTGTC^GGCAG 

CACTACGGTGACTGTC^CGCTCAGCGATGTCAACGACAACCCCCCCAAGTTCCCACAGAGTCTATAC 

TTCTCCGTGGTGGAGA(^GCTGGACCTGGCACACTGGTGGGCCGGCTCCGGGCCCAGGACCCAGACCTGG 

GGGACAACIGCCCTGATGGCATACAGCATCCTGGATGGGGAGGGGTCTGAGGCCTTCA^ 

CTTGCAGGGT CGAGACGGG CT CCT CACTGTC CGTAAG CCCCT AGACTTTGAGAGC CAG CG CTC CTACTC C 

TTCCGTGTCGAGGCCACCAACACGCTCATTGACCCAGCCTATCTGCGGCGAGGGCCCT 

CCTCTGTGCGTGTGGCAGTGCAAGATGCCCCAGAGCCACCTGCCT 

AGTGCCTGAGAA(^GGCCCCGGGGACCCTGGTAGGCC^ATCTCCGCGGCTGACCTGGACTCCCCTGGC 
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AGCCCAATCAGGTACTCCATCCTCCCCCA<^CAGATCCGGAGCGTTGCTTCTCTATCCAGCCCGAGGAAG 
GCACCATCCATACAGCAGCACCCCTGGATCGCGAGGCTCGCGCCTGGCACAACCTCACTGTGCTGGCTAC 
AGAGCTCGGTGAGGACTCCCAGGCCTCGCGCGTGCAAGTGGCCATCCAGACCCTGGATGAGAATGACAAT 
GCTCCCCAGCTGGCTGAGCCCTACGATACTTTTGTGTGTGACTCTGCAGCTCCTGGCCAGCTGATTCAGG 
TCATCCGGGCCCTGGAC7VGAGATGAAGTTGGCAACAGTAGCCATGTCTCCTTTCAAGGTCCTCTGGGCCC 
TGATGCCAACTTTACTGTCCAGGACAACCGAGATGGCTCCGCCAGCCTGCTGCTGCCCTCCCGCCCTGCT 



CTGCCACAGTGACTGTTAGTGTGTGCCGCTGCCAGCCTGACGGCTCTGTGGCATCCTGCTGGCCTGAGGC 
TCACCTCTCAGCTGCTGGGCTCAGCACCGGCGCCCTGCTTGCCATCATCACCTGTGTGC^TGCCCTGCTT 
GCCCTGGTGGTGCTCTTCGTGGCCCTGCGGCGGCAGAAGCAAGAAGCACTGATGGTACTGGAGGAGGAGG 
ACGTCCGAGAGAACATCATCACCTACGACGACGAGGGCGGCGGCGAGGAGGACACCGAGGCCTTCGACAT 
CACGGCCTTGCAGAACCCGGACGGGGCGGCCCCCCCGGCGCCCGGCCCTCCCGCGCGCCGAGACGTGTTG 
CCCCGGGCCCGGGTGTCGCGCCAGCCCAGACCCCCCGGCCCCGCCGACX5TGGCGCAGCTCCTGGCGCTGC 
GGCTCCGCGAGGCGGACGAGGACCCCGGCGTACCCCCGTAQ3ACTCGGTGCAGGTGTACGGCTACGAGGG 
CCGCGGCTCCTCTTGCGGCTCCCTC^GCTCCCTGGGCTCCGGCAGCGAAGCCGGCGGCGCCCCCGGCCCC 
GCGGAGCCGCTGGACGACTGGGGTCCGCTCTTCCGCACCCTGGCCGAGCTGTATGGGGCCAAGGAGCCCC 
CGGCCCCCTG AGCGCCCGGGCTGGCCCGGCCCACCGCGGGGGGGGGGCAGCGGGCACAGGCCCTCTGAGT 
GAGCCCCACGGGGTCCAGGCGGGCGG 



The disclosed NOV36b nucleic acid sequence, located on chromosome 1 1, has 1 100 of 
1 109 bases (99%) identical to a gb:GENBANK-ID:AK025342|acc:AK025342.1 mRNA from 
Homo sapiens (cDNA: FLJ21689 fis, clone COL09459) (E - 2.3e" 240 ) . 

A disclosed NOV36b polypeptide (SEQ ID NO:142) encoded by SEQ ID NO:141 is 
781 amino acid residues and is presented using the one-letter amino acid code in Table 36D. 
Signal P, Psort and/or Hydropathy results predict that NOV36b contains a signal peptide and is 
likely to be localized to the mitochondrial inner membrane with a certainty of 0.8227 in one 
embodiment and to the plasma membrane with a certainty of 0.4400 in an additional 
embodiment. The most likely cleavage site for a NOV36b peptide is between amino acids 16 
and 17: GWG-CM. 



Table 36D. Encoded NOV36b protein sequence (SEQ ID NO:142). 



MWGLVRLLLAWIiGGWGMGRLAAPARAWAGSREHPGPALLRTRRSWVWNQF 

EGRTKYLLTGEGAGTVFVIDEATGNIHVTKSLDREE 

PLGPYHATVPEMSNVGTSVIQOTAHDADDPSYGNSAKLVYTVL 

VI QAKDMGGHMGGLSGS TTVTVTLSDVNDNPPKFPQSLYQFS VVETAGPGTLVGRLRAQDPDLGDNALMAY S I LD 

GEGSEAFSISTDLQGRDGLLTTOIO'LDFESQRSYSFRVEATNTLIDPAYLRRGPFKDVASVRVAVQDAPEPPAFT 

QAAYHLTVPENKAPGTLVGQI SAADLDSPASPIRYSILPHSDPERCFS IQPEEGTIHTAAPLDREARAWHNLTVL 

ATELGEDSQASRVQVAIQTLDE2TONAPQLAEPYDTFVCD 

FTVQDNRDGSASLLLPSRPAPPRHAPYLVPIELWDWGQPALSSTA^ 

TGALLAIITCVGALLALVVLFVALRRQKQEAL!^ 

PGPPARRDVLPRARVSRQPRPPGPADVAQLIJ^RLREADEDPGTO^ 

GAPGPAEPLDDWGPLFRTLAELYGAKEPPAP 



The disclosed NOV36b amino acid sequence has 435 of 746 amino acid residues 
(58%) identical to, and 553 of 746 amino acid residues (74%) similar to, the 796 amino acid 
residue ptnr:SWISSNEW-ACC:P55287 protein from Homo sapiens (Human) (cadherin-1 1 
precursor (osteoblast-cadherin) (OB-cadherin) (OSF-4)) (E = 2.5c* 230 ). 

NOV36b is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
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nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. Expression information was derived from the tissue sources of the 
5 sequences that were included in the derivation of the NOV36b sequence. 

In addition, NOV36b is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-ID: gb:GE]SBANK-ID:AK025342|acc:AK025342.1) a 
closely related Homo sapiens cDNA: FLJ21689 fis, clone COL09459 homolog in species 
Homo sapiens: ostoeblasts. 
10 NOV36a also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 36K 



Table 36E. BLAST results for NOV36a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 16553903 | dbj |BAB 
71613.1 | (AK057922) 


unnamed protein 
product [Homo 
sapiens] 


493 


459/466 
(98%) 


461/466 
(98%) 


0.0 


gi 13626134 |sp|0933 
19 CADB_CHICK 


CADHERIN-11 
PRECURSOR 


792 


430/749 
(57%) 


552/749 
(73%) 


0.0 


gi 3377485 |gb|AAC28 
073. 1| (AF002983) 


cadherin 
precursor 
[Xenopus laevis] 


794 


429/751 
(57%) 


547/751 
(72%) 


0.0 


gi| 1705549 |sp|P5528 
8|CADBJ40USE 


CADHERIN- 11 
PRECURSOR 
(OSTEOBIiAST- 
CADHERIN) (OB- 
CADHERIN) (OSF-4) 


796 


432/753 
(57%) 


552/753 
(72%) 


0.0 


gi | 1377894 | dbj | BAA0 
4798.1 | (D21254) 


OB- cadherin- 1 
[Homo sapiens] 


796 


434/753 
(57%) 


552/753 
(72%) i 


0.0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
15 in Table 36R 



Table 36F Information for the ClustalW proteins 

1) NOV36a (SEQ ID NO: 140) 

2) NOV36b (SEQ ID NO:142) 

3) gi (16553903 | dbj |BAB71613.l| (AK057922) unnamed protein product [Homo sapiens] 
(SEQ ID NO:482) 

4) gi 1 13626134 | sp 1 093319 | CADB_CHICK CADHERIN-11 PRECURSOR (SEQ ID NO: 483) 

5) gi|3377485|gb|AAC28073.l| (AF002983) cadherin precursor [Xenopus laevisj (SEQ ID 
NO:484) 

6) gi| 1705549 |sp|P55288|CADB_MOUSE CADHERIN-11 PRECURSOR (OSTEOBLAST- CADHERIN) (OB- 
CADHERIN) (OSF-4) (SEQ ID NO:485) 

7) gi 1 1377894 1 dbj |BAA04798.l| (D21254) OB-cadherin-1 [Homo sapiens] (SEQ ID NO: 486) 



30 NOV36a 1 
NOV36b l 
gi | 16553903 | 1 
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gi | 13626134 | 
gi | 3377485 
gi | 1705549 
gi j 1377894 



1 MKEDNi 

1 MKKDFCLHGL[ 

1 MKENYCLQj 

1 MKENYCLQ- 




iMLYYSHAITTEKflNHVRPjSl 
ti^CSHATSLRkNNKLR^SFHS 
ifcraSQAFA^RSHLHPSF^ 
^CHSHAFAPE^GHLRPS|Hg 




90 100 
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NOV36a 
NOV36b 



9i 

gi 
gi 
gi 
gi 



16553903 | 
13626134 | 
3377485 
1705549 
1377894 



NOV36a 
NOV36b 
gi | 16553903 
gi | 13626134 
gi | 3377485 
gi | 1705549 
gi j 1377894 



NOV36a 
NOV36b 



gi 
gi 
gi 
gi 
gi 



16553903 

13626134 

3377485 

1705549 

1377894 



NOV36a 
NOV36b 



gi 
gi 
gi 
gi 
gi 



16553903 

13626134 

3377485 

1705549 

1377894 



NOV36a 
NOV36b 



gi 
gi 
gi 
gi 
gi 



16553903) 
13626134| 
3377485) 
1705549 j 
1377894 



NOV36a 
NOV36b 



gi 
gi 
gi 
gi 
gi 



16553903 | 
13626134 j 
3377485 
1705549 
1377894 



NOV36a 
NOV36b 
gi | 16553903 | 



52 
52 
52 
61 
61 
61 
61 



232 
232 
232 
241 
241 
241 
241 



I 



FVIEEY f 

fvieeyI 
fvieeyI 

FVIESYi 
FVIEEYi 
FVIEEYI 




110 



120 
••I 



gnihStkBldree 
gnihStkSldree 
gnihBtkSldree 

GNIHgTKjjLDREE 
GNIHgTKyLDREE 

gnihBtkBldree 



111 
111 
111 
120 
120 
120 
120 



130 



140 



150 



160 



170 



180 




250 260 270 280 

.J. . . A . . . - I — I — I . . .-| . 



290 



300 



MGGMGGLSGgTgVTffiTLSDVNDNPPKFPQSw: 

mgghmgglsgS tBvtStlSdvndnppkfpqsS." 
mgghmgglsg| tBvt|tlSdvndnppkfpqsS/ 

MGGHMGGLSGjj TBvTMTL^DVNDNPPKFPQsS , 

wgghmgglsgutSvtotludvndnppkfposS' 



mgghmgglsgStBvtj tl5dvndnppkfpqs§y] 

MGGHMGGLSGQtBvT[| TLyDVNDNPPKFPQSgYQfJiSV 
MGGHMGGLSGy^RvTP TLgDVNDNPPKFPQSaYQBsv 



jYQgSV 
|YQ|SV 

!yq|sv 
|YqSsv 
IySBsv 
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gi | 13626134 | 
gi | 3377485 
gi j 1705549 
gi (1377894 



NOV36a 472 
NOV36b 472 
gi | 16553903 | 467 
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Tables 36G-P list the domain description from DOMAIN analysis results against 
NOV36. This indicates that the NOV36 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 36G. Domain Analysis of NOV36 

gnl|Pfam|pfam01049, Cadherin_C_term, Cadherin cytoplasmic region. 
Cadherins are vital in cell -cell adhesion during tissue 
differentiation. Cadherins are linked to the cytoskeleton by catenins. 
Catenins bind to the cytoplasmic tail of the cadherin. Cadherins 
cluster to form foci of homophilic binding units. A key determinant to 
the strength of the binding that it is mediated by cadherins is the 
juxtamembrane region of the cadherin. This region induces clustering 
and also binds to the protein pl20ctn. (SEQ ID NO: 827) 
CD-Length s 150 residues, 98.7% aligned 
Score =99.8 bits (247), Expect = 5e-22 



NOV36: 625 RRQKQEALMVLEEEDVRENI I TXDDBGGGEEDIEhFDlTALQNPDGAAPPAPGPPARRDV 684 

11*1*1 I** 1*1 1*1 I I I I i I t I I I I I I I I * I 1 1 I * I I — * I I 

Sbjct: 1 RRRKKEPLI IDEDEDI RENI INYDDEGGGEEDTDAFDI SALRSGGNPKP I EELKLRRDI K 60 

NOV36: 685 LPRARVSRQPRPPGPADVAQLLAIiRLREADEDPGVPPYDSVQVYGY^^ 744 

♦ I MM hi * +1*111 II 11111*1 II II +1 

Sbjct : 61 PELQSLPRPRRPPAPDDI ADFINEKIiKEADNDPTAPP YDSLQTYAY- - EGSGSVAGSLSS 118 

NOV36: 745 SGSEAGGAPGPAEPJ^DWGPLFRTLAELYG 774 

I * * 1*1 I I I I* M**H 
Sbjct: 119 LNSSTTDSDQDYDYLNDWGPRPKKIiADMYG 148 



Table 36H. Domain Analysis of NOV36 

gnl|Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 97.8% aligned 
Score = 81.3 bits (199), Expect = 2e-16 



NOV36 : 379 YHLTVPENKAPGTIiVGQI SAADLDSPA- SPIRYSILPHSDPERCFSIQPEEGTIHTAAPI* 437 

I *IIM II I **l I I I Nil I I I I* I * I II 

Sbjct: 1 YSASVPENAPVGTEVLTVTATDADLGPNGRI FYS I LGG- GPGGWFRIDPDTGDLSTTKPIi 59 

NOV36: 438 DREARAWHNLTVLATELGEDSQASRVQVAIQ 468 

III* * llllll* 1*11 
Sbjct: 60 DRESIGEYELTVLATDSGGPPLSGTTTVTIT 90 



Table 361. Domain Analysis of NOV36 

gnl|Pfam|pfam0002Q, cadherin, Cadherin domain (SEQ ID NO: 828) 
CD-Length » 92 residues, 100.0% aligned 
Score =72.8 bits (177), Expect = 7e-14 



NOV36: 264 YQFSWETAGPGTLVGRLRAQDPDLGDNALMAYS I LDGEGSEAFS I STDLQGRDGLLTVR 323 

I II I I III * I I III I * llll I III II* 

Sb j Ct : 1 YSASVPENAPVGTEVLTVTATDADLGPNGRI FYS ILGGGPGGWPRIDPD- - - -TGDLSTT 56 

p 

NOV36: 324 KVLDFESQRS YSFRVEATNTLI DPAYLRRGPFKDVASVRVAVQ 366 
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Sbjct: 57 



I II II I III- I 

KPLDRESIGEYELTVLATDSGGPPLS - 



+ 1 + I 
-GTTTVTITVL 92 



Table 36 J. Domain Analysis of NOV36 



gnl|Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 85.9% aligned 
Score = 63.9 bits (154), Expect m 3e-ll 



10 



NOV36: 155 YHATVPEMSOTGTSVIQVTAHDADDPSYGNSAKLVYTVLDGLP- - FFSVDPQTGWRTAI 212 

I Mil + III |+ III III I + + + |++| || +| +|| || + | 

Sb}Ct: 1 YSASVPENAPVGTEVLTVTATDAD LGPNGRI FYS ILGGGPGGWFRIDPDTGDLSTTK 57 

NOV3 6: 213 PNMDRETQEE FL WIQAKDMGGH 235 

I +111+ 1+ + + I I II 
Sbjct: 58 P -LDRES IGEYELTVLATDSGGP 79 



Table 36K. Domain Analysis of NOV36 

gnl|Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 97.8% aligned 
score a 59.3 bits (142), Expect = 8e-l0 



15 



20 



NOV36: 52 FVIEEYAGPEPVLIGKLHSDVDRG - EGRTKYLLTGEGAGT VTVIDEATGNIHVTKSLDRE 110 

+ I I ++ *1 I I II I + I I I I II ||++ II llll 

Sbjct: 3 AS VPENAP VGTEVLTVTATDADLGPNGRI FYS ILGGGPGGWFRIDPDTGDLSTTKPLDRE 62 

NOV36: 111 EKAQYVLLAQAVDRASNRPLEPPSEFIIKVQ 141 

+11 II 11+11 
Sbjct: 63 SIGEYELTVLATDSGG-PPLSGTTTVTITVL 92 



Table 36L. Domain Analysis of NOV36 



gnl|Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 100.0% aligned 
Score « 44.7 bits (104), Expect = 2e-05 



25 



30 



NOV42: 483 YDTFVCDSAAPGQL I QVI RALDRDEVGNS SHV S FQGPLGPDANFT VQDNRDGSASLLLPS 542 

I I ++ l I + + I I I | || | + + + | 

Sbj ct : 1 YS AS VPENAP VGTEVLTVTATDADLGPNGRI FYS I LGGGPGGWFRIDPD- - -TGDLSTTK 57 

NOV42: 543 RPAPPRHAJPYLVPIELWDWGQPALSSTATVTVSVC 577 

I + + I I I || | ||| ++ | 
Sbjct: 58 PLDRES IGEYELTVLATDSGGP PLSGTTTVTITVL 92 



Table 36M. Domain Analysis of NOV36 



gnl | Smart | smart 00112, CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell-cell contact when bound to calcium. (SEQ ID 
NO: 829) 

CD-Length - 82 residues, 100.0% aligned 
Score « 79.7 bits (195), Expect = 6e-i6 
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NOV36 : 396 I S AADLDS PA - S P IRYS I L PHS DPERCFS I QPEEGT IHTAAPLDREARAWHNLTVLATEL 454 

+11 I II +1111 + III II I I I Mill +++ III 11+ 

Sbjct : 1 VSATDADSGENGKVTYS ILS - GNDGGLFS IDPETGI ITTTKPLDREEQSEYTLTVEATDG 59 

NOV3 6: 455 GEDSQASRVQVAIQTLDENDNAP 477 

I +11+11 Mill 
Sbjct: 60 GGPPLS STATVTVTVLDVNDNAP 82 



Table 36N. Domain Analysis of NOV36 

gnl | Smart | smart 00 112, CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+ -mediated cell- cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell -cell contact when bound to calcium. <SEQ ID 
NO:829) 

CD- Length = 82 residues, 96.3% aligned 
Score = 73.9 bits (180), Expect » 3e-l4 



10 



15 



NOV36: 
Sbjct: 
NOV36: 
Sbjct: 



70 



129 



63 



SDVDRGE - GRTKYLLTGEGAGTVFVIDEATGN IHVTKSLDREEKAQYVLLAQAVDRASNR 12 8 

+1 I II 1+ I + I +1 II II I II IIIII+++I I +| I 

TDADSGENGKVTYSILSGNDGGLFSIDPETGI ITTTKPLDREEQSEYTLTVEATDGGGP - 62 



PLE'PPSEFIIKVQDINDNPP 

II + + I l + MI I 
PL S STATVTVTVLDVNDNAP 



148 



82 



Table 360. Domain Analysis of NOV36 

gnl | Smart | smart 00112, CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+ -mediated cell -cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell- cell contact when bound to calcium. (SEQ ID 
NO.-829) 

CD-Length = 82 residues, 98.8% aligned 
Score * 63.5 bits (153), Expect = 4e-ll 



NOV36: 281 LRAQDPDLGDNALMAYSILDGEGSEAFS I STDLQGRDGLLTVRXVLDFESQRS YSFRVEA 340 
20 + I I I l+l + MM I III + |++| | || I I 1+ III 

Sb}Ct: 1 VSATDADSGENGKVTYS I LSGNDGGLFS I DPE TGI ITTTKPLDREEQSEYTLTVEA 56 



NOV36: 341 



25 Sbjct: 57 



TNTL I DP AYLRRG P FKD VAS VR VAVQDAP E P P 

1+ I l+l I I I + 

TDGGGP PLS S TATVTVTVLD VNDNA 



372 
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Table 36P. Domain Analysis of NOV36 

gnl | Smart | smart 00112, CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell -cell contact when bound to calcium. (SEQ ID 
N0:829) 

CD-Length = 82 residues, 74.4% aligned 

Score = 55.8 bits (133), Expect = 9e-09 



30 



NOV36: 172 VTAHDADDPSYGNSAKLVYTVLDGLPF- - FSVDPQTGWRTAIPNMDRETQEEFLWIQA 229 

l + l III 11+ I++I I IMMI++ I I +111 I |+ + *+| 

Sbjct: 1 VSATDADSGENG KVTYS ILSGNDGGLFS IDPETGI ITTTKP - LDREEQSEYTLTVEA 56 



NOV36: 230 KDMGG 234 
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I II 

Sbjct: 57 TDGGG 61 
Sbjct: 57 TDGGG 61 



5 Cadherins, first discovered in mouse teratocarcinoma cells, are a family of animal 

glycoproteins responsible for calcium-dependent cell-cell adhesion. Cadherins preferentially 
interact with themselves in a homophilic manner in connecting cells; thus acting as both 
receptor and ligand. There are a number of different isoforms distributed in a tissue-specific 
manner in a wide variety of organisms. Cells containing different cadherins tend to segregate 

1 0 in vitro, while those that contain the same cadherins tend to preferentially aggregate together. 
This observation is linked to the finding that cadherin expression causes morphological 
changes involving the positional segregation of cells into layers, suggesting they may play an 
important role in the sorting of different cell types during morphogenesis, histogenesis and 
regeneration. They may also be involved in the regulation of tight and gap junctions, and in 

1 5 the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins 
which are component of intercellular desmosome junctions involved in the interaction of 
plaque proteins. The first three cadherins to be described were E-cadherin is present on many 
types of epithelial cells; N-cadherin on nerve, muscle, and lens cells; and P-cadherin on cells 
in the placenta and epidermis. 

10 The NOV36 proteins bear close resemblance to cadherin-1 1, a member of the cadherin 

family of proteins, expressed in osteoblasts. The tissue expression in brain, uterus and retina of 
these NOV36 proteins indicate they might play an important role during organogenesis and 
development. 

The disclosed NOV36 nucleic acid of the invention encoding a cadherin 1 1 -like 
15 protein includes the nucleic acid whose sequence is provided in Table 36A, 36C or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 36A or 36C while still encoding a 
protein that maintains its cadherin 1 l-like activities and physiological functions, or a fragment 
of such a nucleic acid. The invention further includes nucleic acids whose sequences are 
10 complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
5 derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
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binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, in one embodiment up to about 35% of the NOV36a residues 
may be so changed and in an additional embodiment up to about 1% of the NOV36b residues 
may be so changed. 

5 The disclosed NOV36 protein of the invention includes the cadherin 1 1-like protein 

whose sequence is provided in Table 36B or 36D. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 36B or 36D while still encoding a protein that maintains its cadherin 1 1-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 

10 up to about 43% of the NOV36a and NOV36b bases may be so changed. 

The above defined information for this invention suggests that these cadherin 1 1-like 
proteins (NOV36) is a member of a "cadherin 1 1 family". Therefore, the NOV36 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 

15 therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

20 The nucleic acids and proteins of NOV36 are useful in Von Hippel-Lindau (VHL) 

syndrome, cirrhosis, transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, 
autoimmune disease, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 
allergies, immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular 
dystrophy, Lesch-Nyhan syndrome, myasthenia gravis, and/or other pathologies and disorders. 

25 NOV36 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV36 protein have multiple 

30 hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



320 



WO 02/068649 



PCT/US02/02785 



NOV37 

A disclosed NOV37 nucleic acid of 8575 nucleotides (also referred to as CG56733-01) 
encoding a novel Ten«M2-like protein is shown in Table 37A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 199-201 and ending with a 
5 TAA codon at nucleotides 8476-8478. Putative untranslated regions, if any, found upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 
37A, and the start and stop codons are in bold letters. 



Table 37A. NOV37 Nucleotide Sequence (SEQ ID NO:143) 

TAAAGTACCTGTCATCTTGACAAGTGGCG^^ 

AGCTCGC CCCGTGATTGGGCTCTCCCGCGATCTGCACCGGGGGAAGCGCATGAGAGGCCAATGAGACTTGA 
ACCCTGA GCCTAAGTTGTCACCAGCAGGACTGATGTGCACACAGAAGGAATGAAGTA TGGATGTGAAAGAA 
CGCAGGCCTTACTGCTCCCTGACCAAGAGCAGACGAGAGAkGGAACGGCGCTAC^CAAATTCCTCCGCAGA 
CAATGAGGAGTGCCGGGTACCCACAGA.GAAGTCCTACAGTTCCAGCGAGACATTG 

ATT CCT CG CGGCTGCTTT ACGG CAACAGAGTGAAGGATTTGGTT CACAGAGAAGCAGACGAGTTCACTAGA 

CAAAGCAGGATGC ACTATGGAAAC CGAGTCACAG AC CTCAT C CACCGG GAGT CAG ATGAGTTTC CTAGACA 

AGGTATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTCCGACACCGAGGGAGGGATGT 

CTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCGGCCTGTCCAGTCGT 

GAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCAT 

TCCACGTACATCCTCGCGTAGTCTCCTCCCATTTGTTCAGCTGCCrAGCTCCCATAATCCT 

GCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATGAGGAATTC 

TCCCCCAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGTCCTCCGAACCA 

CCACAGCCAGTCGACTCTGAGGCCCCCTCTCC^ 

CCGCGAACTCCCTCAACAGGAACTGACTGACCAATCGGCGGAGTCAGATCCACGC^ 

AATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTAAACAGCAACGTGCCACT 

GGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTCCCCGGGAT 

ACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAG 

GCHTTCAAGCTGAAGAAGCCCTCCAAATACrGCAGCTGGAAATGTGCTGCCOTCTCCG 

CCTCCTCTTGGCTATTTTGCTGXSCGTAT^ 

CTGCAGATGGG CACACCTTTAACAATGGGATAAGGAC CGG CTTAC CAGGAAACGATGATGTGGCAACAATG 
CCATC TGGAGGCAAAGTG C C CTGGT C GTTGAA&AACAGCAGCATAGACAGTGGTGAAGCAG AAGTTG GTCG 
GCGGGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCAC^ 

CATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCTCCCAGGGA 

ACGCCGGAGC^TACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGC 

AT CTGG CC TTCTACAATGATGGAAAAGAC AAAGAGATGGTTT CCT TCAATACTGTTGTCCTAGATTCAGTG 

CAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGGATT 

TCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACAATATTCTAAAGGGA 

CX5TGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCCCATGAATCAGTGCATCGATCCTTCC 

TGCGGGGGCCACX5GCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGA 

GGAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCC 

CTGGCTGGGGTGGTCTGAACTGTCAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACG 

TACCTGCCTGACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGW 

AGTAGACTGTGGCACTC^CGGCGTCTGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCAG 

CGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGAGCACGGGACCTGTAAAGATGGCAAATGTGAATGC 

CGAGAGGGCTGGAATGGTGAACACTGCACCATTGATGGCTGCCCTGACTTGTGCAACGGTAACGGGAG 

CACACTGGGTCAGAACAGCTGGC^GTGTGTCTGCC^GACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCA 

TGGAAACTTCCTGTGCTGATAACAAGGATAATGAGGGAGATGGACTCATTGACTGCATGGATCCCGATTGC 

TGCCTACAGAGTTCCTGCCAGAATCAGCCCTATTGTCGGGGACTGCCGGATCCTCAGGACATCATTA 

AAGCCTTCAATCGCCTTCT(^GC^^GCTGCCAAATCCTTTTATGATCGAATCAGT 

ATAGCACCCATGTTATACCTGGAGAAAGTCCTTTCAATAGCTTGGTTTOTCT 

ACTACAGATGGAACTCCCCTGGTCGGTGTGAACGTGTCTTTTGTCJtflGTACCC^ 

<^CCCGCCAGGATGGCACGTACTCCCTCTCCAGGTTC^^ 

TACACTTTGAGCGAGCCCC^TTCATGAGCC^GGAGCGCACTGTGTGGCTGCCGTGGAAC^GCTrTTACGCC 
ATGGACACCCTGGTGATGAAGACCGAGGAGAACTCCATCCCCAGTTGTGACCTCAGTGGTTTTTGTCGGCT 
TGATCCCATCATCATCTCCTCCCCTCTGTCCACTTTCT^^ 

CTGAGACC CAGGTACTTCATG AAGAAATCGAGCTC C CTGGT T CCAATGTGAAACTTCG CTAT CTGAG C T CT 
AGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTC 
GGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCT 
ACAC C TTCAT CTGGGACAAGACAG ATGCGTATGGC CAAAGGGTGTATGGACT CTCAGATGCTGTTG GTATG 
TTTTGGTTTOUkAGGACAGCCCTCCrTCa 

AGACAAACACCACATCCTCAATGTTAAAAGTGGTATCCTACACAAAGGCACTGGGG^ 
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CCCAGCAGCCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTC^ 

AACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTA 

TGTGGGTGACTTCAATTACATCCGACGCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGGA 

GAAATAAAGAGTTTAAACATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTCCGGC 

TCGCTCTACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCT 

GGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATGAAGCCCGCTGCGGGG 

ATGGAGGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGAATGGGCTCATG 

TACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAA 

TGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAAGTTCGTCTGGAGTGGC 

CAACAGACCTTGCTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATC 

ACCGAGAACCACCAAGTCAGC^TCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTC 

ACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTCC 

TCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGC 

CTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAAAACGATGTC 

TGCCTACGCGACTGATGCC^TCTTGAATTCCCCATC^TCCTTAGCTGTAGCTCCAGATGGTACCAOT 

TTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCTTAATGCC^ 

TATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCCACCAATACACTGT 

GAGCCTGGTGACAGGGGAGTA(OTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAATTGATTG 

ACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGAC 

AACCAGATCATCACCCTCACCGTGGGCTICCAATGGAGGCCTCAAAGTCGTGTCCAC^CAGAACCTGGAGCT 

TGGTCTCATGACCTATGATGGCAACACTGGGCTCGTGGCCACCAAGAGCGATGAAACAGGATGGACGACTT 

TCTATAGCTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCAC 

CGGGAAATGGAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTGATGATGACGTCACTGTCATCAC 

CAACCTCT CTTCAGTAGAGG C CTC CT ACACAGTGGTACAAG AT CAAGTTCGGAACAG CTACCAGCT CTGTA 

ATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTA 

GCGGGCACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAA 

TGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCGAGGTCCATG 

GAAGAAATCTCITGTCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCTATGATGACCACCGGAA^ 

TTCACCCTGAGGATCATTTATGACCAGGTGGG CCGCqpCTT CCTCTGGCTGCCCAGCAGCGGGCTGGCAGC 

TGTCAACGTGTCATACTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAG 

ACATCGACAAGCAAGGCCGCATCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCTT 

GACAAGATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCT 

TGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCGGCTACATCCGTAATA 

TTTACAACCCGCCTGAAAGC^TGCTTCGGTCATCTTTGACTACAGTGATGACGGCOSCATCCTGAAGACC 

TCCTTTTTGGGCACCGGACGCCAGGTGTTCTACMGTATGGGAAACTCrCCAAGTTATCAGAGATTGTCTA 

CGACAGTACCGCCGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTC 

GGGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCC 

GAGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATC^ 

GCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCrrCTACCGCTATGATGAGATTTCTGGCAAGGTGGAAC 
ACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACCAGATCATCACCACTGC 

AAACACTTCGACACCCATGGGCGGATCAAGGAGGTCCAGTATGAGATGTTCCGGTCCCT<^TGTACrrGGAT 
GACGGTGCAATATGACAGCATGGGCAGGGTGATCAAGAGGGAGCTAAAACT^ 

CGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGCCGGCCGTCAATGACCGCCCGACC 

TGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGCGCCTCATGCC 

CTTGCGCTATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGATGGCT 

AT CTGTGC CAGAGAGGGT CTG ACATC TT CG AATACAATT C C AAGGG CCTCCTAACAAG AGCCTACAACAAG 

GCCAGCGGGTGGAGTGTCC^GTACCGCTATGATGGCGTAGGACGGCGGGCTTCCT^ 

CCACCACCTGCAGTACTTCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAATCACT 

ACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGCAGTGGGGAG 

GAGTACTATGCTGCCTCTGATAACACAGGGACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAA 

A»GCTGCAGTACACGGCCTATGGGGAGATTTATTATGACnr 

TCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGCA 

GGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCTGTA 

TATGTTCAAGAGCAAGAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACT 

GGCTOGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTT^ 

CCTCCTCCCTATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATCACAGGTGC 

AACAGAGAGACATAACCAGGCTTTCATGGCTCTGGAAGGACAGGTCATTACTAAAAAG 

TCCGAGAGAAAGCAGGTCATTGGTTTGCCACCACCACGCCCATCATTC 

AAAGAAGGGCGGGTGACCACGGGCGTGTCCAGC^TCGC(^GCGAAGATAGCCGCAAGGTGGCATCTGTGCT 
GAACAACGCCTACTACCTGGACAAGATGCACTAGAGCATCGAGG 

TTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTA^ 
AACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAA^ 

GTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCG 
TCCTGGAGCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGC^ 
AGAGAGGGGAGCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTA 
CGAGGGATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTCCAGACAGTAGCAGCAACATCCAGTTTT 
TAAGACAGAATGAGATGGGAAAGAGGTAACAAAATAATCTG CTG C CAT TC CT TGTCTGAATGG CTCAG CAG 
GAGT AACTGTTAT CT C CT CTC CTAAGGAG ATG AAGACCTAACAGGGG CACTGCGG 
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The NOV37 nucleic acid, located on chromosome 5, has 4965 of 5004 bases (99%) 
identical to a gb:GENBANK-ID:AB032953|acc:AB032953.I mRNA from Homo sapiens 
(mRNA for KIAA1 127 protein, partial cds) (E = 0.0). 

A disclosed NOV37 polypeptide (SEQ ID NO:144) encoded by SEQ ID NO:143 is 
5 2759 amino acid residues and is presented using the one-letter code in Table 37B. Signal P, 
Psort and/or Hydropathy results predict that NOV37 does not contain a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.7900 in one embodiment, to the 
plasma membrane with a certainty of 0.7900 in another embodiment and to the nucleus with a 
certainty of 0.6000 in an additional embodiment. 



Table 37B. Encoded NOV37 protein sequence (SEQ ID NO:144) 

MD VKERRP Y CSLTKSRREKERRYTNS SADNEECRVPTQKSY SS SETLKAFDHDS SRLLY GNRVKDLVHREAD 

EFTRQSRMHYGNRVTDLIHRESDEFPR^^ 

SSRENSMtfLTDSDNENKSDDENGRPIPRTSSRS^ 

EFS PNS YLLRACSGPQQASSSGPPNHHSQSTLRPPLPP PHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPA 
PNDIiATTPESVQLQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSS S PGYPLTSGTVYTPPPRLLPRNTFSRK 
AFKLKXPSKYCSWKCAALSAIAAALLLAILLAYFAAMHL^ 

SGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFIJCFNISLGKDALFGVYIRRGLPPSHA 
QYDFMERLDGKEKWSVVESPRERRSIQTLVQNEAVF^ 

PRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQYSKGTCQCYSGTO 

GS CIDGNCVCSAGYKGEHCEEVDCLDPTCSSHGVCVNGECLCS PGWGGLNCELARVQCPDQCSGHGTYXiPDT 
GLCSCDPNWMGPDCSW CS VDCGTHGVCI GGACRCEEGWTGAACDQR VCHPRCI EHGTCKDGKCE CREGWNG 
EHCTIDGCPDLOTGNGRCTLGQNSWQCTCQTGWRGPGCNVAfffi 

NQPYCRGLPDPQDIISQSLQSPSQQAAKSFYDRISFLIGSDSTHVIPGESPFNSLVSLIRGQVVTTDGTPLV 
G VNVS FVKYP K YGYTI T RQDGTYS LSRFDL I ANGGAS LTLHFE RAP FM S QERTVWL P WNS FY AMD TL VMKTE 

ENSIPSCDLSGFCRLDPIIISSPLSTFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSLLK 
I TMTQSTVPLNLIRVHLMVAVEGHLFQKSFQAS PNLAYTFI WDKTDAYGQRVYGLSDAVGMFWFQRTALLQG 
FELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAI ITSIMGNGRRRS ISCPSCNGLAEGNKLLAP 
VALAVGIDGSLYVGDFNYI RRI FPSRNVTS ILEIJIRNKEFKHSNNPAHKYYLAVDP VSGSLYVSDTNSRRIY 
RVKSLSGTKDLAGNSEVVAGTGEQCLPFDEARCGDGGK^ 

GI I S TLLGSNDLTAVRPLS CDS SMDVAQVRLEWPTDLAVNPMDNS L YVLENNVI LRI TENHQVS I IAGRPMH 
CQVPGIDYSLSKLAIHSALESASAIAI SHTGVLYITETDEKKINRLRQVTTNGEI CLLAGAASDCDCKNDVN 
CNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSK^ 

GIHQYTVSLOTGEYLYNFTYSTDNDVTELIDl^GNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVS 
TQNI^LGLMTYDGNTGLl^ 

VTVITNLSSVEASYTWQDQVRNSYQLCNNGTLRWA^ 

LNS IEWRLRKEQIKGKVTI FGRKLEVHGRNLLS IDYDRNIRTEKI YDDHRKFTLRI I YDQVGRPFLWLPS SG 
LAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKW^^ 

LLAVTMPSVARHSMSTHTS IGYIRNI YNPPESNAS VIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSE IV 

YDSTAVTFG YDETTGVLKMVNLQS GGFS CT IRYRKIGPLVDKQ I YRFSEEGMVNARFDYTYHDNS FRI AS I K 

PVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVOTLSKHFT)THGRIKCT 

VQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSTO^ 

YDI,RDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYN^ 

QYFYSDLHNPTRITHVYNHSNSEITSLYYDLQ^^ 

TAYGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKW 

NPLS SELDLKNYVTDVKS WLVMFGFQL SNI I PGFPRAKMYFVP PP YELSESQAS ENGQL I TGAHQTTERHNQ 
AFMALEGQVI TKKLHASIREKAGHWFATTTP I IGKGIMFAI KEGRVTTGVS S IAS EDSRKVASVLNNAYYLD 
KMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGV^ 

RYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYTVLPVE 
QYPELADSSSNIQFLRQNEMGKR 



The disclosed NOV37 amino acid sequence has 2634 of 2764 amino acid residues 
(95%) identical to, and 2679 of 2764 amino acid residues (96%) similar to, the 2764 amino 
acid residue ptnr:SPTREMBL-ACC:Q9WTS5 protein from Mm muscutus (Mouse) (TEN- 
M2)(E = 0.0). 
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NOV37 is predicted to be expressed in at least Amygdala, Brain, Bronchus, Cerebral 
Medulla/Cerebral white matter, Cochlea, Coronary Artery, Epidermis, Hair Follicles, 
Hippocampus, Hypothalamus, Kidney, Left cerebellum, Lung, Lymph node, Parietal Lobe, 
Pineal Gland, Retina, Right Cerebellum, Substantia Nigra, Vulva and Whole Organism. This 
5 information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, NOV37 is predicted to be expressed in the following tissues because of the 
expression pattern of (GENB ANK-ID: gb:GENBANK-ID:AB032953|acc: AB032953.1) a 
10 closely related Homo sapiens mRNA for KIAA1 127 protein, partial cds homolog in species 
Homo sapiens: Amygdala, Brain, Bronchus, Cerebral Medulla/Cerebral white matter, Cochlea, 
Coronary Artery, Epidermis, Hair Follicles, Hippocampus, Hypothalamus, Kidney, Left 
cerebellum, Lung, Lymph node, Parietal Lobe, Pineal Gland, Retina, Right Cerebellum and 
Substantia Nigra. 

1 5 NO V37 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 37C. 



Table 37C. BLAST results for NOV37 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7657415 |ref|NP 0 
35986. 2| 
(NM_011856) 


odd Oz/ten-m 
homolog 2 
(Drosophila) ; odd 
Oz/ten-m homolog 
3 (Drosophila) 
[Mus musculus) 


2764 


2633/277 
7 (94%) 


2677/2777 
(95%) 


0.0 


gi | 9910320 |ref|NP o 
64473.1 | 
(NM 020088) 


neurestin alpha 
[Rattus 
norvegicus] 


2765 


2625/277 
8 (94%) 


2676/2778 
(95%) 


0.0 


gi | 10241574 | emb | CAC 
09416.1) (AJ279031) 


teneurin-2 
[Gallus gallus] 


2802 


2525/281 
8 (89%) 


2639/2818 
(93%) 


0.0 


gi | 5307761 |dbj (BAA8 
1892. 1| (AB026979) 


ten-m3 [Danio 
rerio] 


2590 


1707/260 
2 (65%) 


2097/2602 
(79%) 


0.0 


gi | 6760369 | gb| AAF28 

316.l|AF195418_l 

(AF195418) 


ODZ3 [MUS 
musculus) 


2346 


1665/236 
0 (70%) 


1993/2360 
(83%) 


0.0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
20 in Table 37D. 



Table 37D Clustal W Sequence Alignment 



1) N0V37 (SEQ ID NO: 144) 

2) gi| 7657415 |rex|NP_035966. 2 | (NM_011856) odd Oz/ten-m homolog 2 (Drosophila); odd 
Oz/ten-m homolog 3 (Drosophila) [Mus musculus) (SEQ ID NO: 487) 

3) gi|9910320|ref |NP_064473.1| (NM_020088) neurestin alpha [Rattus norvegicus] (SEQ 
ID NO:488) 



324 



WO 02/068649 



PCT/US02/02785 



4) gi) 10241574 )emb|CAC09416.l| (AJ279031) teneurin-2 [Gallus gallus] (SEQ ID 
NO: 489) 

5) gi | 5307761 |dbj)BAA81892.l| (AB026979) ten-m3 [Daniorerio] (SEQ ID NO -490) 

6) gi) 6760369 |gb|AAF283l6.l|AF195418_l (AF195418) ODZ3 [Mus musculus] (SEQ ID 
NO:491) 



NOV37 


1 


gi 


7657415| 


1 


gi 


9910320| 


1 


gi 


10241574) 


1 


gi 


5307761) 


1 


gi 


6760369) 


1 




NOV37 


61 


gi 


7657415) 


59 


gi 


9910320) 


59 


gi 


10241574) 


59 


gi 


5307761) 


1 


gi 


6760369 | 


1 



NOV37 


119 


gi 


7657415 | 


119 


gi 


9910320) 


119 


gi 


10241574) 


119 


gi 


5307761) 


1 


gi 


6760369| 


1 




80 90 100 

J....I I. 

- srmhygnr-|t|lihreHdefpr 

STNFTIiAELGlCj^PSPHRg GYCSDM 

sanfti^lg^cspsphr| GYCSDM 

3TNFTLAELGjCj|pSPHRg GYCSDI 



110 



120 



jgilhqgyslstgsdadsd 
|gilhqgyslstgsdadsd 
gilhqgyslstgsdadsd 
gilhqgyslstgsdadsd 



118 

118 

118 

118 

1 

1 



13 0 



140 



150 
I . . 



160 



I , 



TEGGMSPEHAIRLWGRGIKSgRSSGLSSRENSALTLTDSDNENKSDi 
TEGGMSPEHAIRLWGRGIKSgRSSGLSSRENSALTLTDSDNENKSDiW 
TEGGMSPEHAIRLWGRGIKSgRSSGLSSRENSALTLTDSDNENKSD^r 
TEGGMSPEHAIRLWGRGIKS§RSSGLSSRENSALTLTDSDNENKSD& 



170 180 
|....|....| 

167 

167 

167 

DFHTHLSEKLK 178 

1 

1 



190 



200 



210 



220 



230 



240 



NOV37 


167 


gi 


7657415) 


167 


gi 


9910320) 


167 


gi 


10241574) 


179 


gi 


5307761) 


1 


gi 


6760369) 


1 



NOV37 


211 


gi 


7657415 | 


211 


gi 


9910320) 


211 


gi 


10241574) 


239 


gi 


5307761) 


41 


gi 


6760-369) 


1 



NOV37 


270 


gi 


7657415 | 


270 


gi 


9910320) 


270 


gi 


10241574 | 


298 


gi 


5307761) 


96 


gi 


6760369) 


1 



NOV37 


329 


gi 


7657415) 


329 


gi 


9910320) 


329 


gi 


10241574) 


357 


gi 


5307761) 


156 


gi 


6760369) 


1 




430 



440 



450 



460 



470 
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NOV37 

gi | 7657415 | 
gi j 9910320 j 
gi | 10241574 | 
g±|530776l| 
gi j 6760369 | 




10 



490 



500 



510 



520 



530 



540 



NOV37 


446 


gi 


7657415 | 


446 


gi 


9910320) 


446 


gi 


10241574 | 


•474 


gi 


5307761) 


273 


gi 


6760369 | 


30 



NOV37 

gi | 7 657415 | 
gi j 9910320 j 
gi j 10241574 | 
gi|530776l) 
gi | 6760369 j 



NOV37 

gi | 7657415 | 
gi j 9910320 | 
gi j 10241574 | 
gi j 5307761 1 
gi | 6760369 




NOV37 


621 


gi 


[7657415 I 


621 


gi| 99103201 


621 


gi 


1 10241574) 


649 


gi 


5307761| 


453 


gi| 6760369) 


210 


NOV37 


681 


gi 


7657415| 


681 


gi 


9910320| 


681 


gi 


10241574 | 


709 


gi 


5307761) 


513 


gi 


6760369) 


270 


NOV37 


740 


gi 


7657415) 


740 


gi 


9910320) 


741 


gi 


10241574) 


769 


gi 


5307761| 


573 


gi 


6760369| 


330 



670 



£80 
..I... 



I. 



G90 
..I.. 



TCffiCYSGWKGgECDVPjgNi 

tcBcysgwkgBecdvpSnqcidpscgghgsciiTgncvc 

TcScYSGWKGgECDVP.gNQCIDPSCGGHGSCIgGNCVC 
TCfflC YSG WKG@E CD V?feo;C IDPS CGGHG SCI §GNC VC 



qcidpscgghgsciSgncvc 



qcidpscgghgsciSgncvc 



cScYSGWKGjjECDVpS 

cScBsgwkgSecdvp! 



SqcidpRi 



TO c iGHGgClSG|cgc| 

•pBcggSgqc uSblcSo 





N0V37 


797 


gi 


7657415 | 


797 


gi 


9910320) 


798 


gi 


10241574) 


829 


gi 


5307761) 


630 


gi 


6760369[ 


387 



850 



860 

■ 1. 



Jb 



870 



880 



890 



I. 



900 



QTTGTE1 



dgcpdlcngngrctlgqnswqcvcqtgwrgpgcnvametscadnkdnegdgl l 
dgcpdlc^gngrctlgqnswqcvcqtgwrgpgcwametscadnkdnegdglv 
dgcpdlcngngrctlgqnswqcvcqtgwrgpgcnvametscadnkdneg'dglv 
dgcpdlcngngrctlgqnswqcvcqtgwrgpgcnvametscadnkdnegdglv 

^GGpgLC^GRCTLgQI^wgcV^ 
£GCPgkCN§NGRCTL2QN§^^ 
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NOV37 



gi 
gi 
gi 
gi 
gi 



7657415) 
9910320 | 
10241574) 
5307761) 
6760369) 




940 



950 



960 



QSPSQI 

JedwpI 

iQ--£DWP 
--jSGSPj 
[QPAjSPQi 
[TPSi 




1020 



NOV37 


911 


gi 


7657415) 


909 


gi 


9910320 | 


910 


gi 


10241574 | 


947 


gi 


5307761) 


744 


gi 


6760369) 


501 



MOV37 


970 


gi 


7657415 | 


964 


gi 


9910320) 


965 


gi 


10241574) 


1002 


gi 


5307761| 


799 


gi 


6760369 j 


556 



NOV37 


1030 


gi 


7657415 | 


1024 


gi 


9910320) 


1025 


gi 


10241574 | 


1062 


gi 


5307761) 


859 


gi 


6760369) 


616 




NOV37 


1090 


gi 


7657415) 


1084 


gi 


9910320) 


1085 


gi 


10241574 | 


1122 


gi 


5307761) 


919 


gi 


6760369 j 


676 



1150 



1160 

..I I. 



1170 



1180 



1190 



1200 



LNL IjWHLMVAVEGHLFQKS FWASPNLAYTF I WDKTDAYGQRVYGLSDAV 
LNL IgVHLMVAVEGHLFQKS FWAS PNLAYTF I WDKTDAY GQRVY GLSDAWS VGFE YEuC 
LNLIgVHLMVAVEGHLFQKSFfflASP^LAYTFIWDKTDAYGQRVYGLSDAVVSVGFEYESc 



1140 

1143 

1144 

1181 

978 

735 



1210 



1230 



NOV37 



gi 
gi 
gi 
gi 
gi 



7657415) 
9910320) 
10241574 | 
5307761) 
6760369 



NOV37 

gi 
gi 
gi 
gi 
gi 



7657415) 
9910320 j 
10241574 | 
5307761 | 
6760369 



1240 




1270 
..|.-. 



1280 

..I... 



I. 



1290 
..|... 



| . 



1300 1310 

..I....L...I... 



1320 



1198 
1204 
1205 
1242 
1039 
796 



imgngrrrs i scpscnglaegnkll apvalavgi dgslqvgdfnyirrifps rnvtsi le 
imgngrrrsiscpscnglaegnkllapvalavgidgslfvgdfnyirrifpsrnvtsile 
imgngrrrsiscpscnglaegnkllapvalavgidgslfvgdfnyirrifpsrnvtsile 
imgngrrrsiscpscnglaegnkllapvalavgidgslfvgdfnyirrifpsrnvtsile 
imgngrrrsiscpscng^gnkllapvaij^g|dgslfvgdfnyirrifps^ts2Je 

t MGNGRRRSISCPSCNgSU5gNKLLAPVA^ 



AgGgDGSLF VGDFNYIRRI FPSgNVTSgjH 

aSgidgslWvgdfotErrifpsSwtsSle 



1257 
1263 
1264 
1301 
1098 
855 



1330 



NOV37 


1258 


LR^NKEFKHSlSj 


gi 


7657415| 


1264 


lrInkefkhsnS 


gi 


9910320| 


1265 


lrInkefkhsnS 


gi 


10241574 1- 


1302 


lrBnkefkhsn" 



1340 



1350 



1360 



1370 



1380 



kdlagnsewagt 
kdlagnsewagt 



LRjNKEFKHSI^gPgHKYYLAVDPVTGSLYVSDTNSRRIYRVKSLBGgKDLAGNSEWAGT 
LR^KEF-KHSNigPAHKYYLAVDPVgGSLYVSDTNSRRIYi^KSLSGSKDLAGNSEVVAGT 
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g±| 5307761 | 


1099 


gij 6760369 j 


856 


NOV37 


1318 


gi| 7657415) 


1323 


gi j 9910320 j 


1324 


gij 10241574) 


1361 


gi j 5307761 1 


1151 


gij 6760369) 


908 



NOV37 

gi | 7657415 | 
gij 9910320 j 
gij 10241574 | 
gij 5307761) 
gij 6760369 j 



NOV37 



gi 
gi 
gi 
gi 



7657415) 
9910320 j 
10241574) 
5307761) 
67603691 



NOV37 

gi | 7657415 | 
gij 9910320 | 
gij 10241574 | 
gij 5307761 | 
gij 6760369 | 



PAHgYYLAgDPjiTGffiLYVSDTNSRRla 



EWAGT 
EWAGT 



1150 
907 



1390 



1400 
■■I... 



1410 
-.1... 



_L 



1420 



1430 



1440 



I 



GEQCLPFDEARCGDGGKAgDATIiMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGS 
GEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYPVDATMIRKVDQNGIISTLLGS 
GEQCLPFDEARCGDGGKAVDATLMS PRGI AVDKNGLM YFVDATM I RKVDQNGI I STLLGS 
GEQCLPFDEARCGDGGKAVDATLMSPRGI AVDK0GLMYFVDATMIRKVDQNGI I STLLGS 
GEQCLPFDEARCGDGGKAiEABLgSpgGIAVDKNGt^FVDgTMIRKVD^GIISTLLGSl 
GE Q C LP FDE ARCGDGGKAV?3aTLM S pBgJaSd KNGLy Y F VdStM I RKVBQNG 1 1 STLLGS 



1377 
1382 
1383 
1420 
1210 
967 



1450 



1460 



1470 
.■|....| 



1480 



1490 



1500 



MOV37 


1378 


Hi 


7657415) 


1383 


gi 


9910320) 


1384 


gi 


10241574) 


1421 


gi 


5307761) 


1211 


gi 


6760369) 


968 



NDLTAVRPLS CDS SMDVgQVRLE WPTDLAVNPMDNS L YVLENNVILRI TENHQVS 1 1 AGR 
^LTAWPLSCDSSMDVgQVRLEWPTDLAVNPMDNSLWLENNVILRITENHQVSIIAGR 
iSTOLTAWPLSCDSSMDvSQVRLEWPTDLAVNPMDN^ 

CroLTAVRPLSCDSSMDVpQVRLEWPTDLAvSPMDNSLYVLENNVILRITENHQVSIIAGR 
NDLTSgRPLgCD ^SM^||qVRLE WPTDL AgNPMDNS nYVDTNNVffiLffllTENgQVglSAGR 
NTDLtJ^RPlE^ 



1437 
1442 
1443 
1480 
1270 
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20 Table 37E lists the domain description from DOMAIN analysis results against 

NOV37. This indicates that the NOV37 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 37E. Domain Analysis of NOV37 

gnl |Pfam|pfam02068, Metallothio_PEC, Plant PEC family met alio thionein. 
(SEQ ID NO: 830) 

CD-Length = 77 residues, 97.4% aligned 
Score = 41.6 bits (96) , Expect « 6e-04 



25 NOV37: 738 CSVDCGTHGVCIGG-ACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCT 796 

I II I II HII I I II II II I Mil 
Sbjct: 2 CDDKCGCP S PCPGGNS CRCTS GGEAGAGDQ EHTTC PC -GEHC- 42 

NOV37: 797' IDGC-PDLCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCA 839 

30 II I I I + I I III I III 

Sbjct: 43 --GCNPCTCPKTQTPTGRKGRANCSC GAGCTCA SCA 76 

Neurestin shows homology to a neuregulin gene product, human gamma-heregulin, a 
Drosophila receptor-type pair-rule gene product, Odd Oz (Odz) / Ten(m), and Ten(a), 

35 suggesting a possible function in synapse formation and morphogenesis. A mouse neurestin 
homolog has independently been cloned as DOC4 from the NIH-3T3 cell line. Northern blot 
analysis showed that neurestin is highly expressed in the brain and also in other tissues at 
much lower levels. In situ hybridization studies showed that neurestin is expressed in many 
types of neurons, including pyramidal cells in the cerebral cortex and tufted cells in the 

40 olfactory bulb during development. In adults, neurestin is mainly expressed in olfactory and 

hippocampal granule cells, which are known to be generated throughout adulthood. 

Nonetheless, in adults the expression of neurestin was experimentally induced in external 

tufted cells during regeneration of olfactory sensory neurons. These results suggest a role for 

neurestin in neuronal development and regeneration in the central nervous system 
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Neurestin is a putative transmembrane protein whose expression is developmental^ 
regulated in neurons. Neurestin expression pattern were examined in mitral/tufted cells in the 
developing rat olfactory bulb. In the main olfactory bulb, neurestin expression was segregated 
in the dorso-rostral area and in the ventro-caudal area, but not in between. In the accessory 
5 olfactory bulb, neurestin expression was found only in the far caudal area. This area did not 
completely correspond to a caudal half of the vomeronasal nerve and glomerular layers 
positive for a G-protein Go alpha. These spatio-temporal expression patterns suggest that 
neurestin functions as a target recognition molecule that specifies zonal projection patterns of 
olfactory and vomeronasal sensory neurons 
1 0 The NOV37 nucleic acid of the invention encoding a Ten-M2-like protein includes the 

nucleic acid whose sequence is provided in Table 37A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 37A while still encoding a protein that maintains its Ten- 
M2-like activities and physiological functions, or a fragment of such a nucleic acid. The 

. 1 5 invention further includes nucleic acids whose sequences are complementary to those just 

described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

20 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the residues 
may be so changed. 

25 The NOV37 protein of the invention includes the Ten-M2-like protein whose sequence 

is provided in Table 37B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 37B while still 
encoding a protein that maintains its Ten-M2-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 35% of the bases 

30 may be so changed. 

The NOV37 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: brain disorders including epilepsy, 
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eating disorders, schizophrenia, ADD, cancer, heart disease, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders, psoriasis, colon cancer, leukemia, 
AIDS, thalamus disorders, metabolic disorders including diabetes and obesity, lung diseases 
5 such as asthma, emphysema, cystic fibrosis, and cancer, pancreatic disorders including 

pancreatic insufficiency and cancer, and prostate disorders including prostate cancer and other 
diseases, disorders and conditions of the like. 

NOV37 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 

10 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV37 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 

1 5 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV38 

A disclosed NOV38 nucleic acid of 1090 nucleotides (also referred to as CG56737-01) 
encoding a novel Activin Beta C Chain-like protein is shown in Table 38A. An open reading 
20 frame was identified beginning with an ATG initiation codon at nucleotides 3-5 and ending 
with a TAG codon at nucleotides 1068-1070. Putative untranslated regions, if any, found 

upstream from the initiation codon and downstream from the termination codon are underlined 

» 

in Table 38A, and the start and stop codons are in bold letters. 



Table 38A. NOV38 Nucleotide Sequence (SEQ ID NO:145) 

C^TGACCTCCTCATTGCTTCTGGCCTTTCTCCTCCTGGCTCCAACCA^GTGGCCACTCCCAGAGCTGG 
CGGTCAGTGTCC^GCATGTGGGGGGCCC^CCTTGG 
GCCMGAGAAGCATCTTGGACAAGCTGCACCTCArc 
CTGCTTTGAGGACTGCACTGCAGC^CCTCCACGGGGT 

ACAGGAATGTGAAATCATCAGCITTGCTGAGACAGACTCCACTTCAGCCTACAGCTCCCTGCTCACTTTT 
CACCTGTCCACTCCTCGGTCCCACCACCTGTACCATGCCCGCCTGTGGCTGCACGTGCTCCCCACCCTTC 
CTGGCACTCTTTGCTTGAGGATCrrCCGAT^ 

GGCTGAGCACCACATCACCAACCTGGGCTGGCATACCTTAACTCTGCCCrCTAGTGG 

AAGTCCGGTGTCCTGAAACTGC^CTAGACTGCAGACCCCTAGAAGGCAACAGCACAGTTACTGGAC^C 

CGAGGCGGCTCTTGGACACAGCAGGACACCAGCAGCCCTTC^ 

TGGAGCAGGCCGGGCCAGGAGGAGGACCCCCACCTGTGAGCCTGCGACCCCCTTATGTTGCAGGCGAGAC 
CATTACGTAGACTTCCAGGAACTGGGATGGCGGGACTGGATACTGCAGCCCGAGGGGTACCAGCTGAATT 
ACTGGAGTGGGCAGTGCCCTCCCGACCTGGCTGGCAG 
CTTCAGCCTCCTCAAAGCCAACAATCCTTGGCCTGCCAGTACCTCCT 

CCCCTCTCTCTCCrCTACCrrGGATCATAGTGGCAATGTGGTCAAGACGGATGTGCCAGATATG 
AGGCCTGTGGCTGCAGCTAGCAAGAGGACCTGGGGCTTTG 
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The disclosed NOV38 nucleic acid sequence, located on chromosome 12ql3.1, has 
748 of 935 bases (80%) identical to a gb:GENBANK-ID:MMU96386|acc:U96386.1 mRNA 
from Mus musculus (activin beta E subunit mRNA, complete cds) (E = 5.2e~ 120 ) . 

A disclosed NOV38 polypeptide (SEQ ID NO:146) encoded by SEQ ID NO: 145 is 
355 amino acid residues and is presented using the one-letter amino acid code in Table 38B. 
Signal P, Psprt and/or Hydropathy results predict that NOV38 contains a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.5135. The most likely cleavage site 
for a NOV38 peptide is between amino acids 18 and 19: TVA-TP . 

Table 38B. Encoded NOV38 protein sequence (SEQ ID NO:146). 

MTSSLLLAFLLLAPTTVA^ 

LQHLHGVPQGALLEDNREQECEI I SFAETDSTSAYSSLLTFHLSTPRSHHLYHARLWLHVIiPTLPGTLCIjRI FRW 

GPRRRRQGSRTLLAEHHITNLGWHT^ 

LKIRANEPGAGRARIUITPTCEPATPLCCRRDHV^ 

SAVFSLLKANNPWPj^TSCCVPTAittlP^ . 

The disclosed NOV38 amino acid sequence has 217 of 355 amino acid residues (61%) 
identical to, and 253 of 355 amino acid residues (71%) similar to, the 352 amino acid residue 
ptnnSWISSPROT-ACC:P55103 protein from Homo sapiens (Human) (inhibin beta C chain 
precursor (activin beta-C chain)) (E = 7.6e" 103 ). 

NOV38 is predicted to be expressed in at least ovary and liver. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

Possible small nucleotide polymorphisms (SNPs) found for NOV38 are listed in Table 

38C. 



Table 38 


IC: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


PAF 


95 


14 


T>C 


N/A 



NOV38 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 38D. 



Table 38D. BLAST results for NOV38 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
. (%) 


Positives 
(%) 


Expect 


gi | 13899338 |ref|NP 
113667.1 | 
(NM 031479) 


hypothetical ; 
protein MGC4638 
[Homo sapiens] 


350 


291/348 
(83%) 


307/348 
(87%) 


e-153 
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gi| 1471453 9 |gb|AAHl 

0404.l|AAH10404 

(BC010404) 


inhibin beta E 
[Mus musculus] 


350 


253/352 
(71%) 


280/352 
(78%) 


e-129 


gi | 6680453 |ref|NP 0 

32408.1) 

(NM 008382) 


inhibin beta E 
[Mus musculus] 


350 


253/352 
(71%) 


280/352 
(78%) 


e-129 


gi) 13929160 |ref|NP_ 
114003. l| 
(NM 031815) 


activin beta E 
[Rattus 
norvegicus] 


350 


250/352 
(71%) 


279/352 
(79%) 


e-125 


gi j 4809189 | gb | AAD30 
133 .l|AF140032 1 
(AF140032) 


activin beta E 

[Rattus 

norvegicus] 


350 


248/352 
(70%) 


279/352 
(78%) 


e-124 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 38E. 



Table 38E Information for the ClustalW proteins 

1) N0V38 (SEQ ID NO: 146) 

2) gi 1 13899338 | ref |NP_113667 . 1 1 (NM_031479) hypothetical protein MGC4 63 8 [Homo 
sapiens] (SEQ ID NO:492) 

3) gi|l4714539|gb|AAH10404.l|AAH10404 (BC010404) inhibin beta E [Mus musculus] (SEQ 
ID NO:493) 

4) gi 1 6680453 |ref|NP_03240B.l| (NM_008382) inhibin beta E [Mus musculus] (SEQ ID 
NO:494) 

5) gi 1 13929160 | ref |NP_114003 . 1 1 (NM_031815) activin beta E [Rattus norvegicus] (SEQ 
ID NO:495) 

6) gi|4809189|gb|AAD30133.l|AF140032_l (AF140032) activin beta E [Rattus norvegicus] 
(SEQ ID NO:496) 
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- 1 
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4809189 | 


1 
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59 
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59 



70 80 



SRPRI TjJjpgPQAALTRALRRLQ 
SRPRITRPgPQAALTRALRRLQ 
SRPRITRPgPQAALTRALRRLQ 
SRPRITRPgPQAALTRALRRLQ 
SRPRITRpSpQAALTRALRRLQ 




NOV38 


117 






TP! 


gi 


13899338) 


112 






TPI 


gi 


14714539) 


113 


Q 




PLV 


gi 


6680453) 


113 






PL* 


gi 


13929160 | 


113 






PLV 


gi 


4809189) 


113 






PLV 



13 0 



140 



150 



160 



170 



"SHHLYHARLWLHVgP 
S HHL YHARL WLHvBp 
'sHHLYHARLWLHVgPS 
SHHLYHARLWLHvJjP 

shhlyharlwlhvHp 
s hhlyharlwlhvBp 



180 



NOV38 


177 


gi 


13899338) 


172 


gi 


14714539) 


172 


«ri 


6680453 | 


172 


gi 


13929160) 


172 


gi 


4809189) 


172 




335 



WO 02/068649 



PCT/US02/02785 



10 



15 



NOV38 


237 


gi| 13899338 | 


232 


gi | 14714539 j 


232 


gij 6680453 | 


232 


gij 13929160 | 


232 


gij 4809189 | 


232 


NOV38 


297 



gi 
gi 
gi 
gi 
gi 



13899338) 
14714539 | 
6680453) 
13929160 | 
48091891 



292 
292 
292 
292 
292 




310 



320 



I 



330 



L 



340 



I 



350 



I. 



SFHSAVFSLLKANNPWP 
A.SFHSAVFSLLKANNPWP 
ASFHSAVFSLLKANNPWP 
.^SFHSAVFSLLKANNPWP 
.^SFHSAVFSLLKANNPWP 
kSFHSAVFSLLKANNPWPAS 



SCCVPTARRPLSLLYLDHgGNWKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
S CC VPT ARRPL SLL YLDHNGNWKTD VPDM WE AC G C S 
SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
gSCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 



355 
350 
350 
350 
350 
350 



20 Tables 38F-G list the domain description from DOMAIN analysis results against 

NOV38. This indicates that the NOV38 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 38F Domain Analysis of NO V38 

gnl | Smart | smart 002 04, TGFB, Transforming growth factor-beta (TGF-beta) 
family; Family members are active as di sulphide -linked homo- or 
heterodimers . TGFB is a multifunctional peptide that controls 
proliferation, differentiation, and other functions in many cell types 
(SEQ ID NO: 831) 

CD-Length = 102 residues, 100.0% aligned 
Score = 134 bits (336) , Expect = le-32 



25 



30 



NOV38: 
Sbjct: 
NOV38: 
Sbjct: 



252 CCI^HYVDFQELGWRDWILQPEGYQI^CSGQCPPHLAGSPGIAASFHSAVPSLLKANN 311 

i I I 1111**111 111+ 1*11 II |+|| |+ 1+ |+ | I I* I * 

1 CRRHDLYVDFKDLGVTODWIIAPKGYNAYYCEGECPFPLSERLN- - ATNHAIVQSLVHALD 5 8 



312 



59 



PWPASTSCCTOTAMiPLSIJiYLDHSGNWKTDVPDMVVEACGCS 

I HIM 111*11 I I I I I * |*|||| III 

PGAVPKPCCVPTKLSPLSMLYYDDDGNWLRNYPNMWEECGCR 



355 



102 



Table 38G. Domain Analysis of NOV38 

gnl|pfam|pfam000l9, TGF-beta, Transforming growth factor beta like 
domain (SEQ ID NO: 832) 

CD-Length = 105 residues, 97.1% aligned 
Score = 114 bits (286), Expect » 7e-27 



35 



40 



N0V38: 
Sbjct: 
NOV38: 
Sbjct: 



252 



312 



62 



CCRRDHYVDFQELGWIUWILQPEGYQLOTCSGQCPPH^ 

I I 1111**111 III* Mil Mill III | |* * + |* + | 

CRLRSLYVDFKDLGWGDWIIAPEGYIANYCSGSCPFPLRDDLN- - LSNHAI LQTLVRLRN 



311 



61 



PWPASTSCCVPTARRPLSLLYLDHSGNWKTDVPDMWEACGCS 

I Mill 111*1111 * III 1*1 I* III 

PRAVPQPCCVPTKLSPIiSMLYLDDNSNVVLRLYPNMSVKECXSCR 
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105 
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Activins are homo- or heterodimers of related beta subunits (see 147290) while 
inhibins are dimers composed of an alpha subunit (147380) and an activin beta subunit 
(summarized in Schmitt et al., Genomics 1996, 32(3):358-66). Activin proteins belong to the 
TGF-beta superfamily (see 190180), the members of which have important roles in cell 
5 determination, differentiation, and growth. 

TGFB is a multifunctional peptide that controls proliferation, differentiation, and other 
functions in many cell types. It was first identified by its ability to cause phenotypic 
transformation of rat fibroblasts. TGFB is chemically distinct from TGFA. It has essentially no 
sequence homology with TGFA or with epidermal growth factor, of which TGFA is an 

10 analog. Members of the same gene family as TGFB include inhibin, which inhibits pituitary 
secretion of follicle stimulating hormone, and Mullerian inhibitory substance, which is 
produced by the testis and is responsible for regression of the Mullerian ducts (anlagen of the 
female reproductive system) in the male embryo. Many cells synthesize TGFB and almost all 
of them have specific receptors for this peptide. Alpha and beta TGFs are classes of 

15 transforming growth factors. TGFB acts synergistically with TGFA in inducing 
transformation. It also acts as a negative autocrine growth factor. 

TGF-beta plays an important role in wound healing. A number of pathologic 
conditions, such as idiopathic pulmonary fibrosis, scleroderma, and keloids, which share the 
characteristic of fibrosis, are associated with increased TGF-beta- 1 expression. 

20 The disclosed NOV38 nucleic acid of the invention encoding a Activin Beta C Chain- 

like protein includes the nucleic acid whose sequence is provided in Table 38A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 38A while still encoding a protein 
that maintains its Activin Beta C Chain-like activities and physiological functions, or a 

25 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of non-limiting example, 

30 modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
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acids, and their complements, in one embodiment up to about 20% of the NOV38 residues 
may be so changed. 

The disclosed NOV38 protein of the invention includes the Activin Beta C Chain-like 
protein whose sequence is provided in Table 38B. The invention also includes a mutant or 
5 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 38B while still encoding a protein that maintains its Activin Beta C Chain-like 
activities and physiological functions, or a functional fragment thereof. In one embodiment a 
mutant or variant protein of NOV38, up to about 39% of the bases may be so changed. 

The above defined information for this invention suggests that these Activin Beta C 

1 0 Chain-like proteins (NOV38) is a member of a "Activin Beta C Chain family". Therefore, the 
NOV38 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

1 5 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of NOV38 are useful in Alzheimer disease-5, Myxoid 
liposarcoma, Stickler syndrome, type I (3), SED, Alpha-ketoglutarate dehydrogenase 

20 deficiency, Cerebral cavernous malformations-2, Greig cephalopolysyndactyly syndrome, 
Hyperinsulinism, familial, MODY, type 2, Pallister-Hall syndrome, Polydactyly, postaxial, 
types Al and B, Polydactyly, postaxial, type IV, Retinitis pigmentosa-9, Charcot-Marie-Tooth 
neuropathy-2D, Colton blood group, Deafness, autosomal dominant 5, Macular dystrophy, 
dominant cystoid, Radioulnar synostosis with amegakaryocytic thrombocytopenia, Von 

25 Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation, fertility and/or other pathologies 
and disorders. 

NOV38 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
30 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. For example the disclosed NOV38 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
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disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV39 

NOV39 includes novel Activin Beta C Chain-like and Inhibin Beta E Chain Precursor- 
5 like proteins disclosed below. The disclosed sequences have been named NOV39a and 
NOV39b. 



NOV39a 

A disclosed NOV39a nucleic acid of 1 1 12 nucleotides (also referred to as CG56737- 
10 02) encoding a novel Inhibin Beta E Chain Precursor-like protein is shown in Table 39A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 40- 
42 and ending with a TAG codon at nucleotides 1090-1092. Putative untranslated regions, if 
any, found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 39A, and the start and stop codons are in bold letters. 



Table 39A. NOV39a Nucleotide Sequence (SEQ ID NO:147) 

CCATCCGAGGCTCCTGAACCGGGGCCATTCACC&GGAGCATGCGGCT 
TGCTGCTGTGGGGACTGGTGCGAGGAC^GGGGACAGG^ 

GGCACCCCAAGCAGAACGAGCTCTGGTGCTGGAGCTAGCCAAGCAGCAAATCCTGGATGGGT^ 
ACCAGTCGTC CCAGAATAACT CAT CCTCCACCCCAGGCAGCGCTGACCAGAGCCCTCCGGAGACTACAG C 
CAGGGAGTGTGG CTC CAGGGAATGGG GAGGAGGT CAT CAG CTTTGCTACTGTCACAGACTCCACTT GAG C 
CTACAGCTCCCTGCTCACrTTTraC^^ 
CTGCACGTGCTCCCCACCCTTCCTGGCACTCTTTGC 

GCCAAGGGTCCCGCACTCTCCTGGCTGAGCACCACATCACCAACCTGGGCTGGC^ 

CTCTAGTGGCTTGAGGGGTGAGAAGT CTGGTGTCCTGAAACTGCAACTAGACTGCAGACCCCTAGAAGG C 
AACAGCACAGTTACTGGACAACCGAGGCGGCTCTTGGAC^CAGCAGGACAC CAGCAGCCCTTCCTAGAG C 
TTAAGATCCGAGCCAATGAGCCTGGAGCAGGCCGGG CCAGGAGGAGGACC CCCAC CTGTGAGCCTGCGAC 
CCCCTTATGTTGCAGGCGAGACCATTACX3TAGACTTCCAGGAACTGGGATGGCGGGACTGGATACTGCAG 
CCCGAGGGGTACCAGCTGAATTACTGCAGTGGGCAGTGCCCTCCC 
CTGCCTCTTTCCATTCTGCCGTCTTCAGCCTCCTCAAAG^ 

TTGTGTCCCTACTGCCCGAAGGCCCCTCTCTCTCCTCTACCTGGATCATAATGGCAATGTGGTCAAGACG 
GATGTGCCAGATATGGTGGTGGAGGCCTGTGGCTGCAGCTAGCAAGAGGACCTGGGGCTTTG 



15 ' 

The disclosed NOV39a nucleic acid sequence, located on chromosome 7pl3-15, has 
923 of 1 1 10 bases (83%) identical to a gb:GElsBANK-ID:MMU96386|acc:U96386.1 mRNA 
from Mus musculus (activin beta E subunit mRNA, complete cds) (E = 2.1e~ 165 ) . 

A disclosed NOV39a polypeptide (SEQ ID NO: 148) encoded by SEQ ID NO: 1 47 is 
20 350 amino acid residues and is presented using the one-letter amino acid code in Table 39B. 
Signal P, Psort and/or Hydropathy results predict that NOV39a contains no signal peptide and 
is likely to be localized extracellularly with a certainty of 0.3700. The most likely cleavage site 
for a NOV39a peptide is between amino acids 19 and 20: VRA-QG. 
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Table 39B, Encoded NOV39a protein sequence (SEQ ID NO:148). 

MRLPDVQLWLVLLWAIjVRAQGTGSVCPSCGGSKLAPQAERALVLEIJUC 

LRRLQPGS VAPGNGEEVI SFATVTDSTSAYSSLLTFHLSTPRSHHLYHARLWLHVLPTLPGTLCLR I FRWGPRRR 
RQGSRTLLAEHHI TNLGWHTLTLPSSGLRGEKSGVLKLQLDCRPLEGNSTVTGQPRRIiLDTAGHQQPFLEL KIRA 
NEPGAGRARRRTPTCEPATPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHPAGSPGIAASFHSAVFS 
LLKANNPWPASTSCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 



The disclosed NOV39a amino acid sequence has 287 of 350 amino acid residues 
(82%) identical to, and 301 of 350 amino acid residues (86%) similar to, the 350 amino acid 
residue ptnr:SWISSPROT-ACC:O08717 protein from Mus musculus (Mouse) (Inhibin Beta E 
5 Chain Precursor (Activin Beta-E Chain)) (E = 5.0e' 154 ). 

NOV39a is predicted to be expressed in at least the following tissues: ovary and liver. 
Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of the NOV39a sequence. 



NOV39b 



10 A disclosed NOV39b nucleic acid of 1 1 12 nucleotides (also referred to as CG56647- 

03) encoding a novel Inhibin Beta E Chain-like protein is shown in Table 39C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 40-42 and 
ending with a TAG codon at nucleotides 1090-1092. Putative untranslated regions, if any, , 
found upstream from the initiation codon and downstream from the termination codon are 

1 5 underlined in Table 39C, and the start and stop codons are in bold letters. 



Table 39C. NOV39b Nucleotide Sequence (SEQ ID NO:149) 

CCATC CGA GG CTCCTGAACCGGGGCCATTCACCAGGAGCA TGCGK5 

TGCTGCTGTGGGCACTGGTGCGAGCACAGGGGACAGGGTCTGTGTGTCCCTCCTGTGGGGGCTCCAAACT 

GGCACCCCAAGCAGAACGAGCTCTGGTGCTGGAGCTAGCCAAGCAGCAAATCCTGGATGGGTTGCACCTG 

ACCAGTCGTCCCAGAATAACTCATCTTCCACCCCAGGCAGCGCTGACCAGAGCCCTCCGGAGACTACAGC 

CAGGGAGTGTGGCTCCAGGGAATGGGGAGGAGGTCATCAGCTTTGCTACTGTCACAGACTCC^ 

CTACAGCTCCCTGCTCACTTTTCACCTGTCCACTCCTCGGTCCCACCACCTGTACCATGCCCGC 

CTGCACGTGCTCCCCACCCOTCCTGGCACTCOTTGCTTG 

GCCAAGGGTCCCGCACTCTCCTGGCTGAGCACCACATCACCAACCTGGGCTGGCATACCTTAACTCTGCC 

CTCTAGTGGCTTGAGGGGTGAGAAGTCTGGTGTCCTGAAACTGCAACTAGACTGCAGACCCC 

AACAGCACAGTTACTGGACAACCGAGGCGGCTOT 

TTAAGATCCGAGCCAATGAGCCTGGAGCAGGCCGGGCCAGGAGGGGGACCCCCACCTGTGAGCCCGCGAC 
CCCCTTATGTTGCAGGCGAGACCATTACGTAGACTTCCAGGAACT 

CCCGAGGGGTACCAGCTGAATTACTGCAGTGGGCAGTGCCCTCCCCACCTGGCTGGCAGCCCAGGCATTG 
CTGTCTCTTTCCMTCTGCCGTCTTC^ 

TTGTGTCCCTACTGCCCGAAGGCCCCTCTCTCTCCTCTACCTGGATCATAATGGCAATGTGGTCAAGA 
GATGTGCCAGATATGGTGGTGGAGGCCTGTGGCTGCAGCTAGCAAGAGGACCTGGGGCTTTG 



The disclosed NOV39b nucleic acid sequence, located on chromosome 7pl3-15, has 
920 of 1 1 10 bases (82%) identical to a gb:GENBANK-ID:MMU96386|acc:U96386.1 mRNA 
from Mus musculus (activin beta E subunit mRNA, complete cds) (E = 3.7e" 164 ) . 
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A disclosed NOV39b polypeptide (SEQ ID NO: 150) encoded by SEQ ID MO: 149 is 
350 amino acid residues and is presented using the one-letter amino acid code in Table 39D. 
Signal P, Psort and/or Hydropathy results predict that NOV39b contains a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.3700. The most likely cleavage site 
5 for a NOV39b peptide is between amino acids 19 and 20: VRA-QG. 

Table 39D. Encoded NOV39b protein sequence (SEQ ID NO:150). 

MRLPDVQLWLVLLWMjVRAQGTGSVCPSCGGSKLAP . 

LRIOjQPGSVAPGNGEEVISFATvTDSTSAYSSLLT 

RQGSRTLLAEHHITNLGWHTLTLPSSGLRGEKSGVLKLQ 

NEPGAGRARRGTPTCEPATPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHIAGSPGIAVSFHSAVFS 
LLKANNPWPASTSCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 

The disclosed NOV39b amino acid sequence has 285 of 350 amino acid residues 
(81%) identical to, and 299 of 350 amino acid residues (85%) similar to, the 350 amino acid 
residue ptnr:SWISSPROT-ACC:O08717 protein from Mus musculus (Mouse) (Inhibin Beta E 
1 0 Chain Precursor (Activin Beta-E Chain)) (E = 1 ,5e~ 152 ). 

NOV39b is predicted to be expressed in at least the following tissues: ovary and liver. 
Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of the NOV39b sequence. 

Possible small nucleotide polymorphisms (SNPs) found for NOV39a are listed in 
15 Table 39E. 



Table 39E: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


PAF 


1095 


11 


T>C 


N/A 



Possible small nucleotide polymorphisms (SNPs) found for NOV39b are listed in 
Table 39F. 



Table 39F: SNPs 


Consensus 


Depth 


Base 


PAF 


Position 




Change 




933 


9 


OT 


0.222 



20 NOV39a also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 39G. 



Table 39G. BLAST results for NOV39a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 
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gi | 13899338 |ref |NP_ 
113667.1 | 
(NM 031479) 


hypothetical 
protein MGC4638 
[Homo sapiens] 


350 


349/350 
(99%) 


349/350 
(99%) 


0 . 0 


gi | 14714539 | gb| AAH1 

0404.1|AAH10404 

(BC010404) 


inhibin beta E 
[Mus musculus] 


350 


288/351 
(82%) 


302/351 
(85%) 


e-153 


gi | 6680453 |ref |NP_0 
32408. 1| 
(NM 008382) 


innxbin beta E 
[Mus musculus] 


350 


287/351 
(81%) 


301/351 
(84%) 


e-152 


ni 1 1 1QOD1 CO 1 rof IMP 
gi | lj7«7XOU | tcl j Vic 

114003.1 | 
(NM 031815) 


aCLlVxXi JJcta Cm 

[Rattus 
norvegicus] 




281/351 
(80%) 


294/351 
(83%) 


e-146 


gi | 4809189 |gb| AAD30 

133.l|AF140032_l 

(AF140032) 


activin beta E 

[Rattus 

norvegicus] 


350 


279/351 
(79%) 


294/351 • 
(83%) 


e-146 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 39H. 

Table 39H Information for the ClustalW proteins 

1) NOV39a (SEQ ID NO: 148) 

2) NOV39b (SEQ ID NO: 150) 

3) gi 1 13899338 | ref |NPJL13667 . 1 1 (NM_031479) hypothetical protein MGC4638 [Homo 
sapiens] (SEQ ID NO: 497) 

4) gi|l4714539|gb|AAH10404.l|AAH10404 (BC010404) inhibin beta E [Mus musculus] (SEQ 
ID NO:498) 

5) gi | 6680453 |ref|NP_032408.l| (NM_008382) inhibin beta E [Mus musculus] (SEQ ID 
N0:499) 

6) gi 1 13929160 | ref |NP_114003 .1 1 (NM_031815) activin beta E [Rattus norvegicus] (SEQ 
ID NO: 500) 

7) . gi 1 4809189 |gb|AAD30133.l|AF140032_l (AF140032) activin beta E [Rattus 
norvegicus] (SEQ ID NO: 501) 



NOV39a 


1 


N0V39b 


1 


gi 


13899338) 


1 


gi 


14714539| 


1 


gi 


6680453 | 


1 


gi 


13929160| 


1 


gi 


4809189) 


1 



N0V39a 

N0V39b 

gi | 13899338 | 

gi j 14714539 | 

gi | 6680453 | 

gi j 13929160 | 

gi j 4809189 | 




170 




180 

..I 



LTLPSSGLF 
LTLPSSGLR 
LTLPSSGLE 
LTLPSSGLR 
LTLPSSGLR 
LTLPSSGLR 
LTLPSSGLR 



179 
179 
179 
179 
179 
179 
179 



190 



200 



210 
....|.. 
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220 



230 



240 



|....|....|. 
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NOV39a 
NOV39b 



gi 
gi 
gi 
gi 
gi 



13899338 | 
14714539| 
6680453 | 
13929160) 
4809189| 




10 



15 



NOV39a 


240 


NOV39b 


240 


gi 


13899338) 


240 


gi 


14714539 | 


240 


gi 


6680453 | 


240 


gi 


13929160 | 


240 


gi 


4809189 | 


240 



250 



260 



270 



I. 



280 



I. 



290 



300 



cepgtplccrrdhyvdfqelgwrdwilqpegyqlnycsgqcpphgagspgiaasfhsav; 

cepStplccrrdhyvdfqelgwrdwilqpegyqlnycsgqcpphlagspgia^sfhsavf 

cepBtplccrrdhyvdfqelgwrdwilqpegyqlnycsgqcpphlagspgiaasfhsavf 



CSPgTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CEPJTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CEggTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CE™TPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 



299 
299 
299 
299 
299 
299 
299 



20 



25 



NOV39a 


300 


NOV39b 


300 


gi 


13899338) 


300 


gi 


14714539 | 


300 


gi 


6680453) 


300 


gi 


13929160 | 


300 


gi 


4809189) 


300 



310 
..|... 

m 



320 



330 



340 
. . I . . 



350 



SLLKANNPWPAgiSCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
S LL KANNP W P AjSjS C C VP T ARRP L S LL YLDHNGNWKTDV PDM WE AC G CS 
SLLKAm T PW?ASsCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 
SLLKAJSn^PWPA^SCCVPTARRPLSLLYLDHNGI^VVKTDVPDMVVEACGCS 
SLLKANNPWPA^SCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 
S LL KANNP WPA^Ss C C VP T ARRPL S LL YLDHNGNWKTDV PDM WE AC GC S 
SLLKANNPWPA^SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 



350 
350 
350 
350 
350 
350 
350 



30 Tables 3 91- J list the domain description from DOMAIN analysis results against 

NOV39. This indicates that the NOV39 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 391 Domain Analysis of NOV39 

gnl [Smart | smart 002 04, TGFB, Transforming growth factor-beta (TGF-beta) 
family; Family members are active as disulphide- linked homo- or 
heterodimers . TGFB is 7 a multifunctional peptide that controls 
proliferation, differentiation, and other functions in many cell 
types. (SEQ ID NO: 833) 

CD-Length = 102 residues, 100.0% aligned 
Score « 133 bits (335), Expect = le-32 



35 NOV39: 247 CCM^HYTOFQELGWRDWILQPEGYQLNYCSGQCPPHP^ 306 

I I I llllt+lll 111+ Ml II Ml ♦ 1+ 1+ I 11+ I + 

Sbjct: 1 CRRHDLYVDFKDLGVTODWI IAPKGYNAYYCEGECPFPLSERLN- - ATNHAIVQSLVHALD 58 

PWPASTSCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 350 

i inn in+ii i +IIM + miii mi 

PGAVPKPCCVPTKLS PI*SMLYYDDDGNWLRNY PNMWEECGCR 1 02 



Table 39 J. Domain Analysis of NOV39 

gnl|Pfam|pfam00019, TGF-beta, Transforming growth factor beta like 
domain. (SEQ ID NO: 834) 

CD-Length = 105 residues, 97.1% aligned 
Score = 114 bits (286) , Expect =* 7e-27 



NOV39: 307 

40 

Sbjct: 59 
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NOV46: 247 CCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHPAGSPGIAASFHSAVFSIiLKANN 306 

I I IIII++III 111+ I I I I I I I I I 1 1 I 1+ + +I++ I 

Sb j c t : 4 CRLRS L YVDFRDLGWGDWI IAPEGYIANYCSGSCPFPLRDDLN- - L SNHAI LQTLVRLRN 6 1 

5 NOV46: 3 07 PWPASTSCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 350 

I Mill lll+llll I III M 1+ III 

Sbjct: 62 PRAVPQPCCVPTKLSPLSMLYLDDNSNWLRLYPNMSVKECGCR 105 

Activins are homo- or heterodimers of related beta subunits (see 147290) while 

1 0 inhibins are dimers composed of an alpha subunit (147380) and an activin beta subunit 

(summarized in Schmitt et al., Genomics 1996, 32(3):358-66). Activin proteins belong to the 
TGF-beta superfamily (see 190180), the members of which have important roles in cell 
determination, differentiation, and growth. 

TGFB is a multifunctional peptide that controls proliferation, differentiation, and other 

1 5 functions in many cell types. It was first identified by its ability to cause phenotypic 

transformation of rat fibroblasts. TGFB is chemically distinct from TGFA. It has essentially no 
sequence homology with TGFA or with epidermal growth factor, of which TGFA is an 
analog. Members of the same gene family as TGFB include inhibin, which inhibits pituitary 
secretion of follicle stimulating hormone, and Mullerian inhibitory substance, which is 

20 produced by the testis and is responsible for regression of the Mullerian ducts (anlagen of the 
female reproductive system) in the male embryo. Many cells synthesize TGFB and almost all 
of them have specific receptors for this peptide. Alpha and beta TGFs are classes of 
transforming growth factors. TGFB acts synergistically with TGFA in inducing 
transformation. It also acts as a negative autocrine growth factor. 

25 TGF-beta plays an important role in wound healing. A number of pathologic 

conditions, such as idiopathic pulmonary fibrosis, scleroderma, and keloids, which share the 
characteristic of fibrosis, are associated with increased TGF-beta- 1 expression. 

The disclosed NOV39 nucleic acid of the invention encoding a Activin Beta C Chain- 
like protein includes the nucleic acid whose sequence is provided in Table 39A, 39C or a 

30 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 39A, or 39C while still 
encoding a protein that maintains its Activin Beta C Chain-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 

35 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
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derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, in one embodiment up to about 17% of the NOV39a residues 
5 may be so changed, and in an additional embodiment up to about 1 8% of the NOV39b 
residues may be so changed. 

The disclosed NOV39 protein of the invention includes the Activin Beta C Chain-like 
protein whose sequence is provided in Table 39B, or 39D. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
1 0 residue shown in Table 39B, or 39D while still encoding a protein that maintains its Activin 
Beta C Chain-like activities and physiological functions, or a functional fragment thereof. In 
one embodiment a mutant or variant protein of NOV39a, up to about 39% of the bases may be 
so changed. 

The above defined information for this invention suggests that these Activin Beta C 

15 Chain-like proteins (NOV39) is a member of a "Activin Beta C Chain family". Therefore, the 
NOV39 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

20 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of NOV39 are useful in Alzheimer disease-5, Myxoid 
liposarcoma, Stickler syndrome, type I (3), SED, Alpha-ketoglutarate dehydrogenase 

25 deficiency, Cerebral cavernous malformations-2, Greig cephalopolysyndactyly syndrome, 
Hyperinsulinism, familial, MODY, type 2, Pallister-Hall syndrome, Polydactyly, postaxial, 
types Al and B, Polydactyly, postaxial, type IV, Retinitis pigmentosa-9, Charcot-Marie-Tooth 
neuropathy-2D, Colton blood group, Deafness, autosomal dominant 5, Macular dystrophy, 
dominant cystoid, Radioulnar synostosis with amegakaryocytic thrombocytopenia, Von 

JO Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation, fertility and/or other pathologies 
and disorders. 

NOV39 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
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known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV39 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
5 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV40 

A disclosed NOV40 nucleic acid of 1 606-nucleotides (also referred to as CG56097-01) 
10 encoding a UDP glycosyltransferase-like protein is shown in Table 40A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TAG codon at nucleotides 1600-1602. The start and stop codons are shown in bold in 
Table 40A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 40A. NOV40 nucleotide sequence (SEQ ID NO:151). 

ATGGCTATGAAATGGACTOCAGTCCTTCTGTTGA 

AATGTGCCG CTGTG GCCCATGGAAT AT AGTCCT TGG ATGAAT AT AAAGACAATCCTGG ATAAACTT ATG CAG 
ATAAGT CATGAGGTGACTGTTCT AACATTGTCAGC TTC CATTCTTGITGAT C CCAACATAACATCTGTTACT 
AAATTTGAGGTTTATTCTATATCTGTAATTAAAGATGATTT^ 

AAATGGATACATGATCTTC CAAAACAT ATATTTTGGTTTAAATGTG TTC CC GGAA 
T ATTCTGGT TAT ACTGAGAAGTT CTTTAAAGATGTAGTTTTGAAGAAGAAACT 
TCAAGGTCTGATGTCGTTCATGCAAATGCCATTGGTCCCTTTGGA 

TCCTTTGTGTACAGTCTCCACTTCTCTCCTGGCTACAGATTTGAGAAATACAGTGGAGGATTTCT^ 

C CTT CC TATGGAGCTGT TATT CTGT CAGAATTAAGTGGTTCGATGACATTCATGGAGAC AGT AAGAAATATT 

AT ATATGTG TT TTATTTTG ACT TTTGGTTC CAAACATTTGATATGAAGAAGGGAGACCAGTTTT ACAGTG AA 

GTTCTAGGTAAGTCATGTTTTTTATCTGAGATAATGGGAAAAGCTGAAATGTGGCTCATT 

TATTTGGAATTTCCTCGCCCACTCTTACCTAATTIT^ 

CCCCTGCCTAAGGAGAAAATGGAAGAATTTGCCCAGAGCTCTGATGAAGACGGTGTTGTGT^ 
TCAGCTGTGCAAAACCTTACAGAAGAAAAAGCTGATCTTATGACTTCGGCC 

GTCATGAAGTTCGGAAGGAAACCAAATACCTTAAGATCCAATACTCAGTGGCATAGGTGGATCC CACAGAAT 

GAATGTCTTATCCTAGATCATCCCCAAACCAAAGCCTTTATAACTTATGGTGG^CAAAT^ 

ATGATCTACCGTGGAGTCCCTTCCATGGGCArrcCTTTGTTTGCGGACCAACATGATAACATTGCTCACAT^ 

AAGGCCAAGGGAGCAGCTGTTATATTGGACTTGAGCACAAAGTCAAGTACAGATTTGCTCGATATATCTGTG 

TTCGT ATCT TT ATTTTTATC CTTC AGATATAAAGAG AGTGTTATGAAATTATCAAGAAT TCAACATGATC AA 

CC AGTG AAGC C CCT GGATCGAGCAGT CTTCTGG ATTGAATTTGTCATG CGC CACAAAGGAG C CAAACAC CTT 

CGAGTTGCAGCCCGTGACCTCACCTGGTTCCAGTACCACTCTTTGGATGTGATTGGGT^ 

GTGGCAACTGTGACATTTATCATCACAAAGTGTTGTCTGTTTTGTTO 

AAGGAAAAAAGGGATTAGTTAT 



In a search of public sequence databases, the NOV40 nucleic acid sequence, located on 
chromosome 4, has 1305 of 1606 bases (81%) identical to a gb:GENBANK- 
ID:HUMUDPGTA|acc:J05428.1 mRNA from Homo sapiens (Human 3,4-catechol estrogen 
UDP-glucuronosyltransferase mRNA, complete cds) (E = 6.4e~ 217 ). 

The disclosed NOV40 polypeptide (SEQ ID NO: 152) encoded by SEQ ID NO:151 has 
533 amino acid residues and is presented in Table 40B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV40 has no signal peptide and is 
likely to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.8200. 
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Alternatively, NOV40 may also localize to the plasma membrane with a certainty of 0.4600, to 
the microbody (peroxisome) with a certainty of 0.3012, or to the lysosome (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV40 is between positions 20 and 21 : 
SSS-GS. 

5 



Table 40B. Encoded NOV40 protein sequence (SEQ ID NO:152). 

MAMKWTSVLLLIQLSYYSSSGSCGNVPLWPMEYSPWMNI KTILDKLMQISHEVTVLTLSAS ILVDPNITSVT 
KFEVYSISVIKDDFAGFFFTQQITKWIHDLPKHI FWFKCVPFKNI LWEYSGYTEKFFKDWLNKKLMTNLQE 
SRSDVVHANAIGPFGELLAELLKISFWSLHFSPGYTFE 

I YVFYFDFWFQTFDMKKGDQyy SEVLGKS CFLSE IMGKAEMWLIROT WHjEFPRPLLPNFEFVVRLYCKPVN 
PLPKEKMEEFAQSSDEDGWFSLESAVQNLTEEKADLITSA^ 

ECLILDHPQTKAFITYGGTNS I YEMI YRGVPSMGIPLFADQHDNI AHMKAKGAAVILDLSTKSSTD SV 
FVSLFLS FRYKES VMKLSRI QHDQP VKPLDRAVFW I E FVMRHKGAKHLRVAARDLTWFQYHS LDVIGFLLAC 
VATVTFI ITKCCLFCFWKFTRKVKKEKRD 



A search of sequence databases reveals that the NOV40 amino acid sequence has 353 
of 533 amino acid residues (66%) identical to, and 412 of 533 amino acid residues (77%) 
similar to, the 529 amino acid residue ptnr:SWISSPROT-ACC:P16662 protein from Homo 
1 0 sapiens (Human) (UDP-Glucuronosy ltransferase 2b7 Precursor, Microsomal (EC 2.4. 1.17) 
(UDPGT) (3,4-Catechol Estrogen Specific) (UDPGTH-2)) (E - 72e~ 185 ). 

NOV40 is predicted to be expressed in at least the following tissues: liver tissue. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
1 5 Literature sources, and/or RACE sources. 

NOV40 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 40C. 



Table 40C BLAST results for NOV40 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 4507825 |ref|NP 0 

01065.1) 

(NM_0*01074) 


UDP 

glycosyl trans f era 
se 2 family, 
jpolypeptide B7 


529 


353/536 
(65%) 


412/536 
(76%) 


0.0 


gi| 6175083 |sp|P0613 
3 jUDB4_HUMAN 


UDP- 

GLUCURONOSYLTRANS 
FERASE 2B4 
PRECURSOR, 
MICROSOMAL 
(UDPGT) 

(HYODEOXYCHOLI C 
ACID) (HLUG25) 
(UDPGTH-1) 


528 


355/536 
(66%) 


409/536 
(76%) 


0.0 


gi|484383 |pir| | JN06 
19 


glucuronosyl trans 
ferase (EC 
2.4.1.17) 2B-4 
precursor - human 


528 


354/536 
(6S%) 


408/536 
(76%) 


0.0 
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gi 1 3153832 |gb | AAC95 
002. 1| (AF064200) 


UDP- 

glucuronosyltrans 
f erase 2B4 
precursor [Homo 
sapiens] 


528 


354/536 1 
(66%) 


409/536 
(76%) 


0.0 


gi|4079707|gb|AAC98 
726. 1| (AF016310) 


UDP- 

glucuronosyltrans 
ferase [Macaca 
fascicularis] 


529 


351/536 
(65%) , 


410/536 
(76%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 40D. In the ClustalW alignment of the NOV40 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 40D. ClustalW Analysis of NOV40 

1) Novel NOV40 (SEQ ID NO: 152) 

2) gi|4507825|ref |NP_001065 .1 1 (NM_001074) UDP glycosyltransf erase 2 family, 
polypeptide B7 (SEQ ID NO: 502) 

3) gi| 6175083 |sp|P06133|UDB4_HUMAN UDP-GLUCURONOSYLTRANSFERASE 2B4 PRECURSOR, 
MICROSOMAL (UDPGT) (HYODEOXYCHOLIC ACID) (HLUG25) (UDPGTH-1) (SEQ ID NO: 503) 

4) gi (484383 |pir | | JN0619 glucuronosyl trans ferase (EC 2.4.1.17) 2B-4 precursor - 
human (SEQ ID NO: 504) 

5) gi |3153832 |gb|AAC95002 .1 1 (AF064200) UDP-glucuronosyltransf erase 2B4 precursor 
[Homo sapiens] (SEQ ID NO: 505) 

6) gi| 4079707 |gb|AAC98726.l| (AF016310) UDP-glucuronosyl transferase [Macaca 
fascicularis] (SEQ ID NO: 506) 



NOV40 

gi 1 4507825 | 
gi j 6175083 | 
gi j 484383 | 
gi [3153832 ) 
gi j 4079707 j 



1 
1 
1 
1 
1 
1 




MgMKWTS 
MSgKWTS 
MSMKWTS 
MSMKWTS 
MSMKWTS 
MSMKWTS 



NOV40 


61 




LVgHJjl 


gi 


4507825| 


61. 


SI 


IFDPNjS 


gi 


6175083| 


61 


SI 


?fdpn£ 


gi 


484383 | 


61 


SI 


Sfdpi# 


gi 


3153832 | 


61 


SI 


' FDPK& 


gi 


4079707 j 


61 


SI 


Ifdpnk 




110 



120 



fLPKDTFWHYFSQVQEIK 
ELPKDTFWgYFSQVQEIMWj 

elpkdtfwByIJsqvqeinfa 7 
elpkdtfwSyfsqvqeimk 

ELPKDTFVIgYFSQlQEIMW! 



120 
118 
118 
118 
118 



C 118 



130 

I— 1 



140 



150 



I 



160 
- • I . ■ 



170 



I 



180 
•-I 



NOV40 


121 


gi 


4507825| 


119 


gi 


6175083 j 


119 


gi 


484383 | 


119 


gi 


3153832 | 


119 


gi 


4079707 j 


119 



, , ^^^xa KF jjj KDjSVjjN KKLM JJJjLi Q E S RgD VWlAgjAp P FG E L LAE L L K I gF V Y S Lg F S P G V 
FgDI»RKFCKD^VSNKI®gKKS3QESRFDVE BADpfepEELLAEL5SlPFVYSL§FSPGY 
FSDlflRKFCKDpVSNKKLMKKLQESRFDWjJADJ^ 

f|diS.rkfckd^snkklmkklqesrfd\a/9ada|fpfg 
f|diSrkfckd^snkklmkklqssrfdvv2admf 

FHDlBRjSFCKD^VSNiagjj^ 



180 
178 
178 
178 
178 
178 
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190 



200 



210 



220 



230 



240 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



NOV40 



gi 
gi 
gi 
gi 
gi 



4507 82S I 
6175083 I 
484383 | 
3153832 | 
4078707 



NOV40 


241 


gi 


4507825) 


239 


gi 


6175083) 


239 


gi 


484383| 


239 


gi 


3153832 | 


239 


gi 


4079707| 


239 



NOV40 


301 


gi| 4507825 | 


298 


gij 6175083 | 


298 


gi | 484383 | 


298 


gij 3153832| 


298 


gij 4079707 j 


298 



NOV40 


358 


gi 


4507825) 


358 


gi 


6175083 | 


358 


gi 


484383) 


358 


gi 


3153832] 


358 


gi 


4079707] 


358 



NOV40 



gi 
gi 
gi 
gi 
gi 



4507825) 
6175083 j 
484383 | 
3153832) 
4079707 



NOV40 


478 


gi 


4507825) 


474 


gi 


6175083 j 


474 


gi 


484383 | 


474 


gi 


3153832) 


474 


gi 


4079707) 


474 




60 



Table 40E lists the domain descriptions from DOMAIN analysis results against 
NOV40. This indicates that the NOV40 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 40E Domain Analysis of NOV40 

gnl|Pfam|pfam00201, ODPGT, UDP-glucoronosyl and UDP-glucosyl 

transferase. (SEQ ID NO: 835) 

CD-Length = 501 residues, 100.0% aligned 

Score = 587 bits (1514), Expect = 4e-169 



10 



15 



20 



25 



30 



35 



40 



45 



NOV40: 


24 


Sbjct: 


1 


NOV40: 


84 


Sbjct: 


61 


NOV40: 


137 


Sbjct: 


110 


NOV40: 


197 


Sbjct: 


170 


NOV40: 


257 


Sbjct: 


229 


NOV40: 


316 


Sbjct: 


288 


NOV40: 


374 


Sbjct: 


346 


NOV40: 


434 


Sbjct: 


405 


NOV40: 


494 


Sbjct: 


464 



GNVPLWPME YS PWMNI KTI LDKLMQ I SHEVTVLTLSAS I LVDPNI TS VTKFEVYS I SVI K 83 

I I +111+ I lll + l II +1 + 1 MINI IIIII+ I I III I I I 

GK^WPMDGSHWMNMKGI LLELVQRGHEVTVLRPSAS ILI GPAKPSNLKFETYPDSATK 60 

DDFAGFFFTQQITKWIHDLPKHI FWFKCVPFKNILW EYSGYTEKFFKDWLNK 136 

++ I I + + II +1 III I++I II 

EELENLF PKRVMN WFMEAAEAGTVWSYFSALQEYSDGARVSCKELVGNK 109 



II 



l + ll 



+ ++ + 



1 1 Mill I II III 11+11+ 11 + 11 



404 



IMIII llllllllll 11+ I III lll+l II I III I III III I I 

AWLLRNYWDLEFPRPLLPNMEFI GGLNCKPAKPLPQE -MEAFVQSSGEHGVWFSLGSMV 
QNLTEEKADLITSAIiAQI PQKVM- KF-GRKPNTLRSNTQWHRWIPQNECLILDHPQTKAF 

1+ 1111+ I IIIIIIIIII+ +1 I ll+M +11+ +I+III+ +1 ll+l+ll 

SNI PEEKANE I ASALAQI PQKVLWRFDGTKPSTLGNNTRLVKWLPQND - - LLGHPKTRAF 

I TYGGTNS I YEMI YRGVPSMGI PLFADQHDNIAHMKAKGAAV JItDLSTKS5IZ!DXiLD J5VF 

+ 1+ l+l +11 I .III +I + III II II ll + llllll I++ I +1 111 + 
VTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAK^ 

VSLFLS FRYKESVMKLSRI QHDQPVKPI^RAVFWI EFVMRHKGAKHLRVAARDLTWFQYH 

III++I + II I Mill I IMIII Mltlltl I II II 1 1 1 II llll + lll 

-TVINDPSYKENIMRLSSIHHDQPVKPLD^ 463 

SLDVIGFLLACVATVTFIITKCCLFCFWKFTRKVKKEK 531 

llllllillllllll II Mill +11 11+ I 
SLDVIGFLLACVATVAFITFKCCLFGYRKFVGKKKRVK 501 

The UDP-glucuronosyltransferases, a group of isoenzymes located primarily in hepatic 
endoplasmic reticulum and nuclear envelope, are encoded by a large multigene family that has 
evolved to produce catalysts with differing but overlapping substrate specificities. Two 
subfamilies are recognized by sequence identities. UGT1 consists of at least 4 isoenzymes that 
catalyze the glucuronidation of phenols and bilirubin. All 4 map to chromosome 2 and 
probably derive from the same gene (UGT1). The UGT2 family contains at least 5 members 
catalyzing steroid or bile acid glucuronidation. Members of the subfamily share 65 to 90% 
amino acid sequence identity. However, unlike the phenol UGT cDNAs, where the high 
degree of identity is concentrated in the 3-prime region of the cDNA, the steroid UGTs have a 
high degree of sequence homology throughout the cDNA. The disclosed NOV40 nucleic acid 
of the invention encoding a UDP Glycosyltransferase -like protein includes the nucleic acid 
whose sequence is provided in Table 40A or a fragment thereof. The invention also includes a 
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mutant or variant nucleic acid any of whose bases may be changed from the corresponding 
base shown in Table 40A while still encoding a protein that maintains its UDP[ 
Glycosyltransferase -like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are 

5 complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

10 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 19 percent of the bases may be so changed. 

The disclosed NOV40 protein of the invention includes the UDP Glycosyltransferase - 

15 like protein whose sequence is provided in Table 40B. The invention also includes a mutant 
or variant protein any of whose residues may be changed from the corresponding residue 
shown in Table 40B while still encoding a protein that maintains its UDP Glycosyltransferase 
-like activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 35 percent of the residues may be so changed. 

20 The invention further encompasses antibodies and antibody fragments, such as F ab or 

(F a b)2,that bind immunospecificaily to any of the proteins of the invention. 

The above disclosed information suggests that this UDP Glycosyltransferase -like 
protein (NOV40) is a member of a "UDP Glycosyltransferase family". Therefore, the NOV40 
nucleic acids and proteins identified here may be useful in potential therapeutic applications 

25 implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 

30 and cell types composing (but not limited to) those defined here. 

The NOV40 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in brain disorders including Crigler-Najjar syndrome, 
Gilbert syndrome, and/or other diseases and pathologies. 
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NOV40 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV40 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV40 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
10 NOV41 

NOV41 includes three novel adrenal secretory serine protease-like proteins disclosed 
below. The disclosed sequences have been named NOV4 la and NOV4 1 b. 

NOV41a 

A disclosed NOV41a nucleic acid of 2155 nucleotides (also referred to as CG56680- 
15 01) encoding a Sodium/Hydrogen Exchanger 4-like protein is shown in Table 41 A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 16-18 and 
ending with a TAG codon at nucleotides 2140-2142. The start and stop codons are shown in 
bold in Table 41A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 41 A. NOV41a nucleotide sequence (SEQ ID NO: 153). 

GAGAAGCCCACAGGAATGGCTCTGCAGAT^ 

GCT CTTG AGTGTT CTGAAG CATCTT CTGATTTGAATGAAT CTG CAAATT C CACTGCT CAGTATG CAT CTAAC 
GCTTGGTTTG CTGCTGC CAGCTCAGAG CCAGAGGAAGGGATAT CTGTTTTTGAACTGGATTATGACTATGTG 
CAAATTCCTTATGAGGTCACTCTCTGGATACTTCTAGC^^ 
AGGCTGCCAGGCCTCATGCCAGAAAGCTGCCTCCTCATCCTG^ 

GGCACCGACCACAAATCGCCTCCGGTCATGGACTCCAGCATCTACTTCCTGTATCTCCT 
CTGGAGGGCGGCTACTTCATGCCCACCCGGCCCTTCTTTGAG 

TTGGGGGCCCTGATCAACGCCTTGGGCATTGGCCTCTCCCTCTACCTCATCTGCCAGGTGAAGGCCTTTGGC 
CTGGGCGACGTCAACCTGCTGCAGAACCTGCTGTTCGGCAGCCTGATCTCCGCCGTGGACCCAGTGGCCGTG 
CTAGCCGTGTTTGAGGAAGCGCGCGTGAACGAGCAGCTCTACAT 
GATGGCATTACTGTGGTGGTCTITATACAATATGTOAAT 

GAAACTGTCGACATTTTGGCTGGATGTGCCCGATTCATCGTTGTGGGGCTTGGAGOT 

GTTTTTGGATTTATTTCTGCATTTATCACACGTTTCACTCAGAATATCTCT 

TTCATGTTCAGCTATTTGTCTTACTTAGCTGCTGAAACCCTCTATCTCTCCGGCATCCT 

TGCGCAGTAACAATGAAAAAGTACGTGGAAGAAAACGTGTCCCAGACATCATACACGACCATCAAGTACTTC 

ATGAAGATGCrGAGCAGCGTCAGCGAGACCTTGATCTTCATCTTCATGGGTGTGTCCACTGTGGGC^AGAAT 

CACGAGTGGAACTGGGCCTTCATCTGCTTCACCCTGGCCTTCTGCCAAATCTGGAGAGC 

GCTCTCTTCTATATCAGTAACCAGTTTCGGACTTTCCCCTTCTCCATCA^ 

AGTGGTGTTCGAGGAGCTGGAAGTTTTTCACTTGCATTTTTGCTTCCTCTGTCTCTTTT 

ATGTTTGTCACTGCTACTCTAGTAGTTATATACTTTACTGTATTTATTCAGGGAATCACAGTTGGCCCTCTG 

GTCAGGTACCTGGATGTTAAAAAAACCAATAAAAAAGAATCCATCAATGAAGAGCTTCATATTCGTCTGATG 

GAT CACTT AAAGGCTGG AATCG AAGATGTGTGTGGG CACTGGAGT CACT AC CAAGTG AG AGACAAGT TTAAG 

AAGTTTGATCATAGATACTTACGGAAAATCCTCATCAGAAAGAACCTACC 

T AC AAG AAG CTGGAAATGAAGCAAGC CAT CGAGATGGTGGAGACTGGGATACTG AG CT CTACAG CTTT CTC C 

ATACCCCATCAGGCCCAGAGGATACAAGGAATCAAAAGACTTTCCCCTGAAGATGTGGAGTCCATAAGGG 

ATT CTGACATC CAACATGTACCAAGTT CGGCAAAGGAC C CTGT CCTACAACAAATACAACCT CAAAC C CCAA 

ACAAGTGAGAAGCAGGCTAAAGAGATTCTGATCTOCCGCCAGAACACCTTAAGGGAGAGC^TGAGGAAAG^ 

C^CAGCCTGCCCTGGGGAAAGCCGGCTGGCACCAAGAATATCCGCTACCTCTCCTACCCCTACGGGAATCCT 

CAGTCTGCAGGAAGAGACACAAGGGCTGCTGGGTTCTCAGGTAAGCTGCCCACCTGGCTGCTCT 

TCTGTAGAGTCAGGTGGTAAATATCTGGGGGTGTGGGCCAAGAGGCAACATTAAGAACATT 
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In a search of public sequence databases, the NOV41a nucleic acid sequence, located 
on chromosome 2, has 1820 of 2156 bases (84%) identical to a gb:GENBANK- 
DD:RATNHEXIV|acc:M85301 .1 mRNA from Rattus norvegicus (Rat sodium-hydrogen 

5 exchange protein-isoform 4 (NHE-4) mRNA, complete cds) (E = 6.4e~ 217 ). 

The disclosed NOV41a polypeptide (SEQ ID NO: 154) encoded by SEQ ID NO: 153 
has 708 amino acid residues and is presented in Table 4 IB using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV41a has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 

10 NOV41a may also localize to the Golgi body with a certainty of 0.4600, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3700, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV41a is between positions 26 
and 27: SEA-SS. 



Table 41B. Encoded NOV41a protein sequence (SEQ ID NO:154). 

MAXjQMFVTySPWNCLLLLVAIi^ 

VTLWILLASIiAKI GFHLYHRLPGLMPESCLLILVGALVGGI IFGTDHKSPPVMDS S I YFLYLLPPI VLEGGY 
FMPTRPFFENI GS I LWWAVLGAL INALGI GLSL YLI CQVKAFGLGDVNLLQNLLFGSLI S AVDP VAVLAVFE 
EARVNEQLYMMIFGEALLNDGITVVVLYN^ 
SAFITRFTQNISAIEPLIVFMFSYLSYLAAETLYLSGI 

SVSETLIFIFMGVSTVGKNHEWNWAFICFTIAFCQIWRAISVFALFYISNQFRTFPFSIKDQCIIFYSGVRG 
AGSFSLAFLLPLSLFPRKKMFVTATLVVIYFTVFIQGITVGPLVRYLDV^ 

GI EDVCGHWSHYQVRDKFKKFDHRYLRKI LIRKNLPKSS I VSLYKKLEMKQAIEMVETGILSSTAFS IPHQA 
QRIQGI KRLSPEDVESI I^ILTSNMYQTOQRTLSYNKYNLKPQTSEKQAKEI LIRRQNTLRESMRKGHSLPW 
GKPAGTKNIRYLSYPYGNPQSAGRDTRAAGFSGKLPTWLLCCFSVESGGKYLGVWAKRQH 

15 

A search of sequence databases reveals that the NOV41a amino acid sequence has 599 
of 688 amino acid residues (87%) identical to, and 63 1 of 688 amino acid residues (91%) 
similar to, the 717 amino acid residue ptnr:SWISSPROT-ACC:P26434 protein from Rattus 
. norvegicus (Rat) (Sodium/Hydrogen Exchanger 4 (NA(+ )/H(+) Exchanger 4) (NHE-4)) (E = 
20 0.0). 

NOV41a is predicted to be expressed in at least the stomach. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 
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In addition, the sequence is predicted to be expressed in stomach, colon and small 
intestine; lesser amounts in kidney, brain, uterus and skeletal muscle because of the expression 
pattern of (GENBANK-ID: gb:GENBANK-ID:RATNHEXIV|acc:M85301.1) a closely related 
Rat sodium-hydrogen exchange protein-isoform 4 (NHE-4) mRNA, complete cds homolog. 
5 NOV41b 

In the present invention, the target sequence identified previously, NOV41a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 

1 0 examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in • 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 

15 related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

20 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 

25 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV41b. This differs from the previously 
identified sequence (NOV4 1 a) in having 1 7 different aminoacids. 

A disclosed NOV41b nucleic acid of 2436 nucleotides (also referred to as CG56680- 

30 02) encoding a Sodium/Hydrogen Exchanger 4-like protein is shown in Table 41C. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 86-88 and 
ending with a TAA codon at nucleotides 2369-2371. The start and stop codons are shown in 
bold in Table 41C, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 41C. NOV41b nucleotide sequence (SEQ ID NO:155). 

ATGCAGTCACTCTCTAGAAGCCTCCCCGACTTCAGATGTGTGGCACACATCCACACAGGGGTGTAGGTAGGA 
GAAGCCCACAGGAA TGGCTCTGCAGATGTTCGTGACTTACAGTCCTTGGAATTGTTTGCTACTGCTAGTGGC 
TCTTGAGTGTTCTGAAGCATCTTCTGATTTGAATGAATCTGCAAATTCCACTGCTCAGTATGCATCTAACGC 
TTGGTTTGCTGCTGCCAGCTCAGAGCCAGAGGAAGGGATATCTC 

AATTCCTTATGAGGTCACTCTCTGGATACTTCTAGCATCCCTTGCAAAAATAGGCTTCCACCTCTACCACAG 
GCTGCCAGGCCTCATGCCAGAAAGCTGCCTCCTCATCCTGGTGGGGGCGCTGGTGGGCGGCATCATCTTCGG 
C ACCGACC ACAAATCGCCT CCGGTC ATGG ACTCCAGCATCTACTTCCTGTATCT C CTGCCACCCATCGTT CT 
GGAGGGCGGCTACTTCATGCCCACCCGGCCCTTCTT TGAGAACATCGGCT CCATCCTGTGGTGGGCAGTATT 
GGGGGCCCTGATCAACGCCTTGGGCATTGGCCTCTCCCTCTACCTCATCTGCCAGGTGAAGGCCTTTGGCCT 
GGGCGACGTCAACCTGCTGCAGAACCTGCTGTTCGGCAGCCTGATCTCCGCCGTGGACCCAGTGGCCGTGCT 
AGCCGTGTTTGAGGAAGCGCGCGTGAACGAGCAGCTCTACATGATGATCTTTGGGGAGGCCCTGCTCAATGA 
TGGC^TTACTGTGGTGTTATACAATATGTTAATTGCCTTTAGAAAGATGCATAAATTTGAAGACATAGAAAC 
TGTCGACATTTTGGCTGGATGTGCCCGATTCATCGTTGTGGGGCTTGGAGGGGTATTGTTTGGCZATCGTTTT 
TGGATTTATTTCTGCATTTATCACACGTTTCACTO^ 

GTT CAG CTATTTGTCTTACTTAGCTGCTGAAACCCTCT ATCTCTCCGGCATCCTGGCGATCACAGCCTGCGC 
AGTAACAATGAAAAAGTACGTGGAAG AAAACGTGT C C CAGACATCAT ACACGAC CAT CAAGTACTT CATG AA 
GATGCTGAGCAGCGTCAGCGAGACCTTGATCTTCATCTTCATGGGTGTGTCCACTGTGGGCAAGAATCACGA 
GTGGAACTGGGCCTTCATCTGCTTCACCCTGGCCTTCTGCCAAATCTGGAGAGCCATCAGTGTATTTGCTCT 
CTTCTATATCAGTAAC CAGTTTCGGACTTTCCCCTTCTCCATCAAGGACCAGTGCAT CATTTTCTACAGTGG 
TGTT CGAGGAGCTGGAAGT 'IT TrCACTTGCATT TT TGCTT C CTCTGT CT C T TTTT CC TAGGAAGAAAATGTT 
TGTCACTGCTACTCTAGTAGTTATATACTTTACTC^ 

GTACCTGGATGTTAAAAAAACC^TAAAAAAGAATCCATC^TGAAGAGCTTC^TATTCGTCrGATGGATC^ 
CTTAAAGGCTGGAATCGAAGATGTGTGTGGGCACTGGAGTCACTACCAAGTGAGAGACAAGTTTAAGAAGTT 
TGATCATAGATACTTACGGAAAATCCTCATCAGAAAGAACCTACCCAAA^ 

GAAGCTGGAAATGAAGC^GCCATCGAGATGGTGGAGACTGGGATACTGAGCTCTACAGCTTTCTCCATACC 
C CAT C AGGC C CAGAGGATACAAGGAAT CAAAAGAC TTTCC C CTGAAGATGTGGAGT C CATAAGGG AC ATT CT 
G AC ATC CAACATGTAC CAAGTTCGGCAAAGGACC CTGTC CTACAAC AAATACAAC CT CAAAC CCCAAACAAG 
TGAGAAGCAGG CT AAAGAGATTCT GATCCG CCGC CAGAACACCTT AAGG GAG AG CATGAGG AAAG GTCACAG 
CCTGCCCTGGGGAAAGCCGGCTGGCACCAAGAATATCCGCTACCTCTCCTACCCCTACGGGAATCCTCAGTC 
TGCAGGAAGAGACACMGGGCTGCTGGGTTCTCAGGTAAGCTGCCCACCTGGCTGCTCCTTTGGTTGAGGTT 
CGGTCGAGGTGGACAGCTGACCATGGAC^CGGCAGGGACC^TCAC^GGTCCCATAGTCCTTTGCrCCAAA^ 
AAATAG TGTT ATTG T C CACAAGATTGTTT TGGTGTTTCTCAAG AGT CTGTCTT C CTAT AACTGTG AAAGGAG 
GAT TT CT GGAAT TCAGAAGAG AG CT ATTGAGTTTG CTGTGT T G AAG CT ATT AAACAT GG ATC T AT AAGCAGC 
AGGAAGATTTTTTCCAAGGACTGGGAGCAAACTTGCAGGCTCTGCCATGTACTTATTGTG 



In a search of public sequence databases, the NOV41b nucleic acid sequence, located 
on chromosome 2, has 1818 of 2163 bases (84%) identical to a gb:GENBANK- 
ID:RATNHEXIV|acc:M85301.1 mRNA from Rattus norvegicus (Rat sodium-hydrogen 
5 exchange protein-isoform 4 (NHE-4) mRNA, complete cds) (E = 0.0). 

The disclosed NOV41b polypeptide (SEQ ID NO:156) encoded by SEQ ID NO:155 
has 761 amino acid residues and is presented in Table 41 D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV41b has no signal peptide 
and is likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
1 0 NOV4 1 b may also localize to the Golgi body with a certainty of 0.4600, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3700, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV41b is between positions 26 
and 27: SEA-SS. 



Table 41D. Encoded NOV41b protein sequence (SEQ ID NO:156). 

MALQMFVTYSPWNCLIJ^LVALECSE 
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VTIiW I LLASLAKI GFHLYHRL PGLMP ESCLL ILVGALVGGI I FGTDHKS PPVMDS S I Y FLYLLPP I VLEGGY 
FMPTRPFFENI GS ILWWAVLGALINALGIGLSLYL ICQVKAFGLGDVNLLQNLLFGSL ISAVDPVAVLAVFE 
EARVNEQL YMM I FGEALLNDGITVVLYNMLIAFTKMHKFEDIETVDILAGCARFI WGLGGVLFGIVFGFI S 
AFITRFTQNISAIEPLIVFMFSYLSYLAAETLYLSGIL 

VS ETL I F I FMGVSTVGKNHE WNWAFI CFTLAFCQI WRAI SVFALFY I SNQFRTFP FS I KDQC 1 1 FY SGVRG A 
GSFSLAFLLPLSLFPRKKMFVTATLVVTYFTVFIQGITVGPLVRYLDVKK^ 

IEDVCGHWSHYQVRDKFK KFDHR YLRKI LI RKNL PKS S I VSL YKKLEMKQ AI EMVETG I LS S TAFS I PHQAQ 
RIQGIKRLSPEDVESIRDILTSNMYQVRQRTLSYNKYNLKPQTSEKQA^ 
KPAGTKNIRYLSYPYGNPQSAGRDTRAAGFSGKLPTWLLLWLRFGRGGQLTMDTAGTITC 
VHKIVLVFLKSLSSYNCERRISGIQKRAIEFAVLKLIiNMDL 



A search of sequence databases reveals that the NOV41b amino acid sequence has 606 
of 717 amino acid residues (84%) identical to, and 641 of 717 amino acid residues (89%) 
similar to, the 717 amino acid residue ptnr:SWISSPROT-ACC:P26434 protein from Rattus 
5 norvegicus (Rat) (Sodium/Hydrogen Exchanger 4 (NA(+)/H(+) Exchanger 4) (NHE-4)) (E = 
0.0). 

NOV41b is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
10 mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. . 

NOV41a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 41E. 



Table 41E. BLAST results for NOV41a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 127814 |sp|P26434 
|NAH4_RAT 


SODIUM / HYDROGEN 
EXCHANGER 4 
<NA(+)/H<+) 
EXCHANGER 4) 
(NHE-4) 


717 


599/688 
(87%) 


631/688 
(91%) 


0.0 


gi| 1346658 |sp|P4876 
3 |NAH2_RAT 


SODIUM /HYDROGEN 
EXCHANGER 2 
(NA(+)/H(+) 
EXCHANGER 2) 
(NHE-2) (H7) 


813 


421/659 
(63%) 


523/659 
(78%) 


0.0 


gi| 1709222 |sp|P5048 
2 |NAH2_RABIT 


SODIUM / HYDROGEN 
EXCHANGER 2 

(NA(+)/H(+) 
EXCHANGER 2) 

(NHE-2) 


809 


419/659 
(63%) 


522/659 
(78%) 


0.0 


gi 1 15529998 |ref|NP 

003039.2] 

(NMJ303048) 


solute carrier 
family 9 
( sodium/hydrogen 
exchanger) , 
i so form 2 [Homo 
sapiens] 


812 


405/611 
(65%) 


499/611 
(81%) 


0.0 
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gi | 6981560 |ref |NP_0 


solute carrier 


697 


372/560 


457/560 


0.0 


36785. l| 


family 9 




(66%) 


(81%) 




(NM 012653) 


{ sodium/hydrogen 
exchanger 2 ) , 
antiporter 2, 
Na+/H+ (Na+/H+ 
exchanger 2) 

[Rat tus 
norvegicus] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 4 IF. In the ClustalW alignment of the NOV41 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



Table 41F. ClustalW Analysis of NOV41 



1) Novel NOV41a (SEQ ID NO: 154) 

2) Novel N0V41D (SEQ ID NO: 15 6) 

3) gi 1 127814 | Sp | P26434 |NAH4_RAT SODIUM/HYDROGEN EXCHANGER 4 (NA(+)/H(+) EXCHANGER 

4) (NHE-4) (SEQ ID NO:507) 

4) gij 1346658 |sp|P48763|NAH2_RAT SODIUM/HYDROGEN EXCHANGER 2 (NA(+)/H( + ) EXCHANGER 
2) (NHE-2) (H7) (SEQ ID NO:508) 

5) gi|l709222|sp|P50482|NAH2JRABIT S ODIUM/ HYDROGEN EXCHANGER 2 (NA(+)/H( + ) 
EXCHANGER 2) (NHE-2) (SEQ ID NO:509) 

6) gi 1 15529998 | ref |NP_003 039 . 2 | (NM_003048) solute carrier family 9 
(sodium/hydrogen exchanger)., isoform 2 [Homo sapiens] (SEQ ID NO: 510) 

7) gi 1 6981560 1 ref |NP_036785 . 1 1 (NM_012653) solute carrier family 9 (sodium/hydrogen 
exchanger 2), antiporter 2, Na+/H+ (Na+/H+ exchanger 2) [Rattus norvegicus] (SEQ ID 
NO:511) 



25 



30 



35 



40 



45 



10 



NOV41a 


1 


NOV41b 


1 


gi 


127814| 


1 


gi 


1346658) 


1 


gi 


1709222 | 


1 


gi 


15529998) 


1 


gi 


6981560] 


1 



NOV41a 


50 


NOV41b 


50 


gi 


127814 | 


50 


gi 


1346658) 


61 


gi 


1709222) 


60 


gi 


15529998] 


60 


gi 


6981560] 


1* 



- - -MALQMFIWTYSPWNC 

- - - MALQMFVT YS PWNC 

- - -MGPAMLRAFSSWKW 
MGPSGTAHRMRAPLSWL 

mes agtgrslrtp pprl 
meplgnwrs|raplppm 




NOV41a 
NOV41b 
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gi 
gi 
gi 
gi 
gi 



127814) 
1346658 
1709222 
15529998 
6981560 | 



NOV41a 
NOV41b. 



gi 
gi 
gi 
gi 
gi 



127814 | 
1346658] 
1709222) 
15529598 | 
6981560) 



NOV41a 

NOV41b 

gi | 127814 | 

gi | 1346658 | 

gi | 1709222 j 

gi j 15529998 | 

gi | 6981560 | 



NOV41a 

NOV41b 

gi | 127814 | 

gi j 1346658 | 

gi 1 1709222 j 

gi|l5529998| 

gi j 6981560 | 



N0V41a 
N0V41b 



gi 
gi 
gi 
gi 
gi 



127814 | 

1346658) 

1709222) 

15529998) 

6981560) 



NOV41a 
NOV41b 

gi 
gi 
gi 
gi 
gi 



127814 | 
1346658) 
1709222 j 
15529998) 
6981560 | 



NOV41a 

NOV41b 

gi | 127814 | 

gi j 1346658 | 

gi j 1709222 j 

gi j 15529998 | 

gi j 6981560 | 



NOV41a 
NOV41b 




370 



380 
..|. 



390 



400 



410 



420 



349 
348 
348 
357 
356 
356 
241 



YTTIKYFMKMLSSVSETLIFIFMQVS 
YTT I KY FM KML S S VS ETLI F I FMGVS 
YTTI KYFMKMLS S VSETLI FIFMGVS 
YTTIKYFMkMLSSVSETLI FIFMGVS 
YTT I K Y ?M KML S S VS ETL I F I FMGVS 1 
YTTIKYFMKMLSSVSETLI FIFMGVS 1 
YTTIKYFMKMLSSVSETLI FIFMGVS' 




550 
. . t . . 



560 
..!.. 



570 
. . I . . 



^YLRKgLI R|j[Ng P"£SS I VS LYKKLeSk SaIEMEETG? 

ylrkHlir^Hpkssivslykkle'SkPj^iemSetg? 



358 
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TstafsiphqaqBi 

gSTAFS I PHQAQgl j 
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10 



15 



gi 


127814 | 


527 


gi 


1346658 | 


537 


gi 


1709222) 


536 


gi 


1S529998| 


536 


gi 


6981560 | 


421 



NOV41a 

NOV41b 

gi 1 127814 | 

gi 1 1346658 | 

gi 1 1709222 j 

gi | 15529998 | 

gi | 6981560 | 




20 



25 



NOV41a 

NOV41b 

gi | 127814 | 

gi | 1346658 | 

gi j 1709222 | 

gi | 15529998 | 

gi |6981560 | 




700 



710 



720 



s sdsdmdg - ttvlnlqpj 
issdseadagttvlnlqp) 
ssdsdadagttvlnlqpS 

SSDSDMDG-TTVLNLQj 




30 



35 



NOV41a 
NOV41b 



gi 
gi 
gi 
gi 



127814 | 
1346658 | 
1709222 j 
15529998) 
6981560| 



.|....|. 



730 740 

... ^afel 

687 LLWLRFGRGGQLTjjgTAGTgTG- 
687 LHFLLCRj-AMVp- 
716 PDQ^KKASPAY 
716 PEP^KpiSQAY 
716 PEQ^I^PQSY 

600 pdq^k^Mspay 



750 760 

j^YLj 

-jgiVLCSKKSsVIVIffil 

EI' 



tfKNElSDVGS ARApHsVT PAPRlSKEGGTQTp| 
i ^KNEjbAGSGQGQ |SPPAAPRiKEGGT<3TP, 
SWIQJEVDVDSGRDM |STPPTPtfgREKGTdTS| 
^"""^DVGSARAP§SVTPAPRkKEGG^'"~ 




40 



45 



NOV41a 


708 


KOV41b 


738 


gi 


127814| 


713 


gi 


1346658 | 


776 


gi 


1709222) 


772 


gi 


15529998 | 


775 


gi 


6981560] 


660 



790 



800 



810 




jEtbdvppkppp; 
IsBt^ggrpkppp: 
jgippkppp) 
i@t§dgppkpp: 




708 

761 

717 

PGNRKGRLGNEKP 813 

PGNRKSRLGSDKP 809 

IEPGSRKARFGSEKP 812 

! EPGNRKGRLGNEKP 697 



Na+/H+ antiporters are key transporters in maintaining the pH of actively metabolizing 
cells. Na+/H+ exchange proteins eject protons from cells, effectively eliminating excess acid 

50 from actively metabolising cells. Na+/H+ exchange activity is also crucial for the regulation of 
cell volume, and for the reabsorption of NaCl across renal, intestinal, and other epithelia. 
These antiports exchange Na+ for H+ in an electroneutral manner, and this activity is carried 
out by a family of Na+/H+ exchangers, or NHEs. In mammalian cells, Na+/H+ exchange 
activity is found in both the plasma membrane and inner mitochondrial membrane. To date, 

55 six mammalian isoforms have been identified (designated NHE1 -NHE6). These exchangers 
are highly-regulated (glyco)phosphoproteins, which, based on their primary structure, appear 
to contain 10-12 transmembrane regions at the N-terminus and a large cytoplasmic region at 
the C-terminus. The transmembrane regions M3-M12 share identity with other members of the 
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family. The M6 and M7 regions are highly conserved. Thus, this is thought to be the region 
that is involved in the transport of sodium and hydrogen ions. The cytoplasmic region has little 
similarity throughout the family. There is some evidence that they may exist in the cell 
membrane as homodimers, but the molecular mechanisms of antiport are unclear. Na+/H+ 
5 antiporters play an important role in signal transduction. 

The disclosed NOV41 nucleic acid of the invention encoding a Sodium/Hydrogen 
Exchanger 4 -like protein includes the nucleic acid whose sequence is provided in Table 41 A, 
41C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 41 A or 41C while 

1 0 still encoding a protein that maintains its UDP[ Glycosyltransferase -like activities and 

physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 

15 thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject In the mutant or 

20 variant nucleic acids, and their complements, up to about 16 percent of the bases may be so 
changed. 

The disclosed NOV41 protein of the invention includes the Sodium/Hydrogen 
Exchanger 4 -like protein whose sequence is provided in Table 41B or 41D. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
25 corresponding residue shown in Table 41B or 41D while still encoding a protein that 

maintains its Sodium/Hydrogen Exchanger 4 -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 37 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
30 (F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Sodium/Hydrogen Exchanger 4 - 
like protein (NOV41) is a member of a "Sodium/Hydrogen Exchanger 4 family". Therefore, 
the NOV41 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
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below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
5 and cell types composing (but not limited to) those defined here. 

The NOV41 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in brain disorders including hypercalcemia, ulcers, 
inflammatory bowel disease, diverticular disease; diseases of the kidney including diabetes, 
autoimmune disease, renal artery stenosis, interstitial nephritis, and others; diseases of the 

10 brain including Von Hippel-Lindau (VHL) syndrome , Alzheimer's disease, epilepsy, and 
others; endometriosis, fertility, muscular dystrophy, Lesch-Nyhan syndrome, myasthenia 
gravis, and/or other diseases and pathologies. 

NOV41 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV41 substances for use in 

1 5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV41 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

20 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV42 

NOV42 includes three novel Kupffer Cell Receptor -like proteins disclosed below. 
The disclosed sequences have been named NOV42a, NOV42b, NOV42c, and NOV42d. 
25 NOV42a 

A disclosed NOV42a nucleic acid of 1760 nucleotides (also referred to as CG56682- 
01) encoding a Kupffer Cell Receptor -like protein is shown in Table 42A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 16-18 and ending 
with a TGA codon at nucleotides 1661-1663. The start and stop codons are shown in bold in 
30 Table 42A, and the 5' and 3' untranslated regions, if any, are underlined. 



361 



WO 02/068649 



PCT/US02/02785 



Table 42A. NOV42a nucleotide sequence (SEQ ED NO:157). 

TGGCTGGGAGCAGTGCTGGAGGATGAAGGAAGCAGAGATGGACGGTGAGGCAGTCCGCTTCTGCACAGATAA 
CCAGTGTGTCTC CCTGCACCCCCAAGGTGTGGACTCTGTGGCAATGGCT CCTGCAGC CCCCAAGATACCGAG 
GCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACCTTGGTCTTCTCTCTTGTGACTCTCTTTGTAGTGGG 
TAAGCCCCCAGK5TGACCCAAATCTCACTAACTTO 

CACACTCGATCATCACCACTTTGGCAGGGAGGCAGAAATGCGAGAGCTTATCCAGACATTTAAAGGCCACAT 
GGAGAATTCCAGTGCCTGGGTAGTAGAAATCOVGATGTTGAAGTGCAGAGTGGACAATGTCAATTCGCAGCT 
CCAGGTGCTCGGTGATCATCTGGGAAACACC^TGCTGAC^TCCAGATGGTAAMGGAGTTCTAAAGGATGC 
CACTACATTGAGTTTGCAGACACAGATGTTAAGGAGTTCCCTGGAGGGAAC CAATGCTGAGATCCAGAGG CT 
CAAGG AAGACC TTG AAAAGG CAGATGCTTT AACTTT C CAGACG CTG AATTT CTT AAAAAGCAGTTTAGAAAA 
C^CCAGCATTGAGCTCCACGTGCTAAGCAGAGGCTTAGAAAATGCAAACTCTGAAATTCAGATGTTGAATGC 
CAGTTTGGAAACGGCAAATGCTTTAAACTCCCAGACCCAGGCCTTTATAAAAAGC^ 
TGCTGAGATCCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAAT^ 

GAAAATGGTCACAG CCCAG ACC CAAAAAGCAAATGGCCGTCTGGACCAGACAGATACTCAGATTCAGGTATT 

CAAGT CAGAGATGGAAAATGTGAATACCTTAAATGCCCAG ATTCAGGTC TTAAATGG TCATATGAAAAATGC 

C AGCAGAGAGAT ACAGAC C CTAAAACAAGG AATGAAGAATGCT TCAGC CTT AACTTC CCAGACC CAGATGTT 

AGACAG CAATCTGCAGAAGGCCAGTG C CGAGAT CCAG AGGTTAAGAGGGGATCTAGAGAACACCAAAGCTCT 

AACCATGGAAATCCAGCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGTCATTACr^ 

AC AAAGAAC CCAAAGTAAGCAGCTT CT CCAGATGGT CCTGC AAGGCTGGAAGTTCAATGGTGGAAGC TT ATA 

TTATTTTTCTAGTCTCAAGAAGTCTTGGCATGAGGCTGAGCAGOT 

ATCTGTGGCCTCCAAGGAGGAGCAGGCATTTCTGGTAGAGTTCACAAGTAAAGTGTACTACTGGATC 
CACTGACAGGGGCACAGAGGGCrrCCTGGCGCTGGAC!AGATGGGACACCATTC^ 

GT TTTGGGAAAAGAATCAGTCTGACAACTGG CGG CACAAGAAT GGGCAG ACTGAAGACTGTGTCC AAATT CA 
GCAGAAGTGGAATGACATGACCTGTGACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGGCCAGGGTGT 
GGCC^GAGGGCAGGCCAGAGCTGAGGGGCTGOTCCTGCTTGCCAATACTGACCCTCCTCCT 
GAGCCTCTGAGCTCTGCTTGTTCTCTGGGACC 



In a search of public sequence databases, the NOV42a nucleic acid sequence, located 
. on chromosome 2, has 1214 of 1730 bases (70%) identical to a gb:GENBANK- 
ID:D88577|acc:D88577. 1 mRNA from Mus musculus (mRNA for Kupffer cell receptor, 
5 complete cds) (E = 3.9e 162 ). 

The disclosed NOV42a polypeptide (SEQ ID NO: 158) encoded by SEQ ID NO: 157 
has 546 amino acid residues and is presented in Table 42B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV42a has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
10 NOV42a may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV42a is between positions 65 and 66: 
VVG-KP. 



Table 42B. Encoded NOV42A protein sequence (SEQ ID NO:158). 

LT^LSFQHKVPHGPRCTLDHHHFGREAEM^ 

GNTNADIQMVKGVLKDATTLSLQTQMLRS SLEGTNAEIQRLKEDLEKADALTFQTLNFIiKSSLENTS IELHV 
LSRGLENANSEIQMLNASLETANALNSQTQAF 

QKANGRLDQTDTQIQVFKSEMENVNTLNAQIQVLNGHMKNASREIQTLKQGM 

SAEIQRLRGDLEOTKALTMEIQQEQSRLKTLHVV^ 

SWHEAEQFCVSQGAHLASVASKEEQAFLVEFTSKVYYWIGL^ 

DNWRHKNGQTEDCVQI QQKWNDMTCDTPYQWVCKKPMGQGVA 
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A search of sequence databases reveals that the NOV42a amino acid sequence has 301 
of 546 amino acid residues (55%) identical to, and 396 of 546 amino acid residues (72%) 
similar to, the 548 amino acid residue ptnr:SWISSPROT-ACC:P70194 protein from Mus 
musculns (Mouse) (Kupffer Cell Receptor) (E = 0.0). 
5 NOV42a is predicted to be expressed in at least cartilage. This information was derived 

by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

In addition, the sequence is predicted to be expressed in :Kupffer cells (liver) because 
10 of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:D88577|acc:D88577.1) a 
closely related Mus musculus mRNA for Kupffer cell receptor, complete cds homolog. 
NOV42b 

In the present invention, the target sequence identified previously, NOV42a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 

15 starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 

20 silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 

25 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

30 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
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the sequence reported below, which is designated NOV42b. This differs from the previously 
identified sequence (NOV42a) in having 3 less aminoacids and 26 different ones. 

A disclosed NOV42b nucleic acid of 1 769 nucleotides (also referred to as CG56682- 
02) encoding a Kupffer cell receptor-like protein is shown in Table 42C. An open reading 
5 frame was identified beginning with a ATG initiation codon at nucleotides 23-25 and ending 
with a TGA codon at nucleotides 1670-1672. The start and stop codons are shown in bold in 
Table 42C, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 42C. NO V42b nucleotide sequence (SEQ ID NO: 159). 

TC^CTGGGAGCAGTGCTGGAGGATGAAGGAAGC^GAGATGGACGGTGAGGCAGTCCGCTTCTGCAr'AGATAA 
CCAGTGTGTCTCCCTGCACCCCCAAGAGGTGGACTCTGTG 

GCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACCTTGGTCTTCTCTCTT6TGACTCTCT 

TCAACAGCAGACAAGACCTGCTCCGAAGCCTGTGCAAGCCGTAAT^ 

ACCTTTTGAACCCAACAATCATCACCACTTTGGCAGGGA 

AGGTCACATGGAGAATTCCAGTGCCTGGGTAGTAGAAATCCAGATGTTGAAGTGCAGAGTGGACAATGTCAA 
TTCGCAGCTCCAGGTGCTCGGTGATCATCTGGGAAACACCAATGCTG 

AAAGGATGC CACTACATTGAGTTTG CAG ACACAGATGTTAAGG AGTTCC CTGGAGGG AACCAATGCTGAG AT 
CCAGAGGCT CAAGG AAG AC CTT GAAAAGGC AGATGCTTT AACTTT C CAGACG CT GAATTTCTT AAAAAGCAG 
TTT AGAAAACAC CAGCATTGAG CTC C ACGTG CTAAGCAGAGGCTTAGAAAATGCAAACT CTG AAATT CAG AT 
GTTGAATGC CAGTTTGGAAACGG CAAATGCTTTAAACT C C CAGAC C CAGG CCTTTATAAAAAG CAGT TTTGA 
(^CACTAGTGCTGAGATCCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAATTCACGTGTTAAA 
AAGGGATTTGAAAATGGTCACAGCCCAGACCCAAAAAGCAAATGGCCGTCTGGACCAGACAGATACT 
TCAGGTATTCAAGTCAGAGATGGAAAATGTGAATACCTTAAATGCCCAGATT^ 

GAAAAATGC CAG CAGAG AG ATACAG AC CC T AAAACAAGGAATG AAGAATGCTTCAG C CTTAACTTC CCAG AC 
C CAGATGT TAG ACAGCAATCTG CAGAAGG C CAGTGCCGAGATC CAGAGGTTAAGAGGGGATC TAGAGAACAC 
CAAAGCTCTAACCATGGAAATC(^GCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGT(^TTACTTCACA 
GGAACAGCTACAAAGAACCCAAAGTAAGCAGCTTCTCCAG^ 

AAG CTT ATATT AT1TT TCTAGTGTCAAGAAGTCTTGGCATG AGG CTGAG CAGTT CTG CGTGT CCCAGGG AGC 

CCATCTGGCATCTGTGGCCTCCAAGGAGGAGCAGGCATTTCTGGTAGAGT^ 

GATCGGTCTCACTGACAG^GGC^CAGAGGGCTCCTGGCGCTGGAC 

GAACAAAGGGT Tl'TGGGAAAAG AAT CAGTCTGACAACTGGCGG CACAAGAATGGG CAGACTGAAGAC TGTGT 
CCAAATTCAGCAGAAGTGGAATGACATGACCTGTGACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGG 
CCAGGGTGTGGCCTGAGGGCAGGCCAGAGCTGAGGGGCTGCTCCTGCTTGCCAATACT 
ATGCCTTCGGAGCCrCTGAGCTCTGCTTGTTCTCTGGGACC 



In a search of public sequence databases, the NOV42b nucleic acid sequence, located 
1 0 on chromosome 2, has 1 054 of 1 469 bases (7 1 %) identical to a gbrGENB ANK- 

ID:D88577|acc:D88577.1 mRNA from Mus musculus (mRNA for Kupffer cell receptor, 
complete cds) (E = Lie" 161 ). 

The disclosed NOV42b polypeptide (SEQ IDNO:160) encoded by SEQ ID NO:159 
has 549 amino acid residues and is presented in Table 42D using the one-letter amino acid 
1 5 code. Signal P, Psort and/or Hydropathy results predict that NOV42b has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
NOV42b may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV42b is between positions 67 and 68: 
20 VQQ-QT. 
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Table 42D. Encoded NOV42b protein sequence (SEQ ID NO:160). 



MKEAEMDGEAVRFCTDNQCVS LHPQEVDS VAMAP AAPKI PRL VQATPAFMAVTLVFSLVTLFVWQQQT RP V 
PKPVQAVILGDNI TGHLPFEPNNHHHFGREAEMRELI QTFKGHMENS SAWWE IQMLKCRVDNVNSQLQVliG 
DHLGNTNADI QMVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKA0ALTFQTLNFLKS SLENTS I E 
LHVLS RGLENANS E I QMLNAS LETANALNS QTQAF I KS S FDNTS AE I QFLRGHLERAGDEIHVLKRDLKMVT 
AQTQKANGRLDQTDTQIQVFKSEMENVNT^ 

QKASAEIQRLRGDLENTKALTMEIQQEQSRLKTLHWI TSQEQLQRTQSKQLLQMVLQGWKFNGGSLYYFS S 
VKKSWHEAEQFC^SQGAHLASVASKEEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTIXSTPFNAAQNKGFWE^ 
NQSDNV?RHKNGQTEDCVQIQQKWNDMTCDTPYQWVCKKPMGQGVA 



A search of sequence databases reveals that the NOV42b amino acid sequence has 304 
of 549 amino acid residues (55%) identical to, and 401 of 549 amino acid residues (73%) 
5 similar to, the 548 amino acid residue ptnr:SWISSPROT-ACC:P70194 protein from Mus 
musculus (Mouse) (Kupffer Cell Receptor) (E = 3.7e" 158 ). 

NOV42b is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain 7 cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
10 mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. 

NOV42c 

A disclosed NOV42c nucleic acid of 1 874 nucleotides (also referred to as CG56682- 
03) encoding a Kupffer cell receptor-like protein is shown in Table 42E. An open reading 
1 5 frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TAA codon at nucleotides 1 702-1 704. The start and stop codons are shown in bold in 
Table 42E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 42E. NOV42c nucleotide sequence (SEQ ID NO:161). 



ATGGACGGTGAGGC^GTCCGCTTCTGCACAGATAACCAGTGTGTCTCCCTGCACCCCCAAGAGGTGGACTCT 
GTGGCAATGGCTCCTGCAGCCCCCAAGATACCGAGGCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACC 
TTGGTCTTCTCTCTTGTGACTCTCTTTGTA 

GCCGTAATTCTGGGAGACAACATTACrGGGCATTTACCTTTTGAACCCAACAA 

GAGGCAGAAATGCAAGAG C TTATC C AG ACATTT AAAGGCCACATGGAGAAT T C CAGTGCCTGGGTAGTAGAA 

ATCCAGATGTTGAAGTGCAGAGTGGACAATGTCAATTCGCAGCTCCAGGTGCTCGGTGATCATCTGGGAAAC 

ACCAATGCTGACATCCAGATGGTAAAAGGAGTTCTAAAGGATGCCACTACATTC 

TTAAGGAGTTCCCTGGAGGGAACGAATGCTGAGATCCAGMGCT 

TTAACTTTCCAGACGCTGAATTTCTTAAAAAGCAGTTTAGAAAACACCAGCATTGAG 

AGAGGCTTAGAAAATGCAAACTCTGAAATTCAGATGTTGAAT^ 

CAGTTAGCCAATAGCAGTTTAAAGAACGCTAATGCTGAGATCTATC 

AATGACTTGAGGACCCAGAACCAGGTTTTAAGAAATAGTTTGGAAG 

AAGGAAAATTTG CAGAACACAAATGCTTT AAACTCCCAGAC CCAGGCCT TT ATAAAAAGCAGTTTTGACAAC 
ACTAGTGCTGAGATCCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAATTCACGTGTTAAAAAGG 
GATTTGAAAATGGTCACAGCCCAGACCCAAAAAGCAAATGGCC^TCTGGACCAGACAGATACTCAGATTCA^ 
GTATTCAAGTCAGAGATGGAAAATGTGAATACCTTAAATC^ 

AATGCCAGC^GAGAGATACAGACCCTAAAACAAGGAATGAAGAATGCTTCAGCCTTAACT^ 
ATGTTAGACAGCAATCTGCAGAAGGCCAGTGCCGAGATCCAGAGGTTAAGAGGGGATCTAGAGAACACCAAA 
GCTCTAACCATGGAAATCCAGCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGTCATTACTTCACAGGAA 
CAG CTACAAAGAAC CCAAAGTCAGCTT CTC CAGATGGT CCTGCAAGGCTGGAAGTTCAATGGTGGAAGCTTA 

GCATCTGTGGCCTCCAAGGAGGAGCAGGCATTTCTGGTAGAGTTCACAAGTAAAGTGTACTACT 
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CTCACTGACAGGGGCAC^GAGGGCTCCTGGCGCTGGAC^GATGGGACACCATTCAACGCCGCCCAGAACAAA 
GCCTCCCTAGGAGCCACAGCACGAGGAAGGGATGCTGCCTTCATCTA AC^GTATAAAGCCCTGTTGTCTTCG 
GGTTTTGGGAAAAGAATCAGTCGACAACTGGCGGCACAAGAATGGG(^ 

GCAGAAGTGGAATGACATGACCTGTGACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGGCCAGGGTGT 
GG 



In a search of public sequence databases, the NOV42c nucleic acid sequence, located 
on chromosome 2, has 689 of 993 bases (69%) identical to a gb:GENBANK- 
ID:D88577|acc:D88577.1 mRNA from Mus musculus (mRNA for Kupffercell receptor, 
5 complete cds) (E = 1 .6e 120 ). 

The disclosed NOV42c polypeptide (SEQ ID NO: 162) encoded by SEQ ID NO: 161 
has 567 amino acid residues and is presented in Table 42F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV42c has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
1 0 NOV42c may also localize to the microbody (peroxisome) with a certainty of 03000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV42c is between positions 62 and 63: 
VQQ-QT. 



Table 42F Encoded NOV42c protein sequence (SEQ ID NO:162). 

AVILGDNITGHLPFEPNNHHHFGREAEMQELI QTFKGHMENSS AWWEI QMLKCRVDNVNSQLCfVLGDHlrGN 

TNADIQMVKGVLKDATT^SLQTQMLRSSLEGTNAEI^ 

RGLENANSEIQMLNASLETANTQAQLANSSLKN^ 

KENLQNTNALNSQTQAFIKSSFDOTSAEIQFLRGH 

VFKSEMENVNTLNAQIQVLNGHMKNASREIQT^ 

ALTMEIQQEQSRLKTLHWITSQEQLQRTQSQLLQMVLQGWKKNGGSLYYFSSVKKSWHEAEQFCVSQGAHL 
^ I ASVASKEEQAFLVEFTSKVYYWIGLTDRGTEGSWRW^ 

A search of sequence databases reveals that the NOV42c amino acid sequence has 191 
of 412 amino acid residues (46%) identical to, and 273 of 412 amino acid residues (66%) 
similar to, the 548 amino acid residue ptnr:SWISSNEW-ACC:P70194 protein from Mus 
musculus (Mouse) (Kupffer Cell Receptor) (E = 5.3e -92 ). 

20 NOV42c is predicted to be expressed in at least the following tissues: adrenal gland, 

bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 

25 trachea and uterus. . 
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NOV42d 

A disclosed NOV42d nucleic acid of 1985 nucleotides (also referred to as CG56682- 
04) encoding a Kupffer cell receptor-like protein is shown in Table 42G. An open reading 
frame was identified beginning with a GTC initiation codon at nucleotides 2-4 and ending 
5 with a TAA codon at nucleotides 1658-1660. The start and stop codons are shown in bold in 
Table 42G, and the 5 9 and 3' untranslated regions, if any, are underlined. Because the starting 
codon is not a traditionl initiation codon, NOV42d could be a partial reading frame extending 
further into the 5' 

Table 42G. NOV42d nucleotide sequence (SEQ ID NO:163). 

AGTCCGCTTCTGCACAGATAACCAGTGTGTCTCCCTGCACCCCa^GAGGTGGACTCTGTGGC^TGGCTCC 
TGCAGCCCCCy^ATACCGAGGCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACCTTGGTCTTCTCTCT 
TGTGACTCTCTTTGTAGTGGTTCAACAGCAGACAAGACCTGTTCCGAAGCCTGTGCAAGCCGTAAl^CTG^ 
AGACAACAT T ACTGGG CATTT AC CT TTTGAACC CAACAATCATCACCAC TTTGG CAGGG AGGCAG AAATG CA 
AGAGCTTATCCAGACATTTAAAGGCCACATGGAGAATTCCAGTGCCTGGG 

GTGCAGAGTGGACAATGTCAATTCGCAGCTCCAGGTGCTCGGTGATCATCTGGGAAACACCAATGCTGACAT 
CCAGATGGTAAAAGGAGTTCTAAAGGATGCCACTACATTGAGTTTGCAGACACAGATGTTAAGGAGTTCCCT 
G G AGGG AAC C AATG CTG AG AT C CAG AG G CTCAAGG AAG ACC T TGAAAAGG CAGATG CTTTAAC TT TC CAGAC 
GCTGAAT TT CTT AAAAAGC AGTTTAGAAAACACCAG CATTGAGCTCCACG TG C T AAG CAGAGG CTT AGAAAA 
TG CAAACTCTGAAATTCAGATGTT GAATG C CAGTTTGGAAACGGCAAAT ACC CAGG CTCAGTTAGCCAATAG 
CAGTTT AAAGAACG CTAATGCTGAGATCTATGTTTTGAGAGGC CATCTAGATAGTGTCAATGACTTGAGGAC 
CCAGAAC CAGGTTTTAAGAAATAGTTTGGAAGGAG CC^AATGCTGAGATCCAGGGACTAAAGGAAAA 
G AACAC AAATGCTT TAAACTC C CAGAC C GAGGCCT TTATAAAAAGCAGTTTTGGCAAGACTAGTGCTGAG AT 
CCAGTT CTT AAGAGGT CAT TTGGAAAG AG CTGGTGATG AAATT CACGTG TTAAAAAGGGATTTGAAAATGGT 
CACAGC CCAG ACCCAAAAAGCAAATGGCCGTCTGGACCAG ACAGAT ACT CAGATT CAGGT ATT CAAGT CAG A 
GATGGAAAATGTGAATACCTTAAATGCCCAGATTCAGGTCTTAAATGGTCATATGAAAAA 
G ATACAGAC C CT AAAACAAGGAATGAAGAATGCTTCAG CCTTAACTTCC CAGAC C CAGATGTT AGACAG CAA 
T CTGCAGAAGG C CAGTGCCGAG AT C CAG AGGTT AAGAGGGGAT CTAGAG AACAC CAAAG CTCTAAC CATG GA 
AAT C CAG CAGGAGCAGAGTCG C CTGAAG AC CCTCC ATGTGGTC ATT ACT T CACAGGAACAGCTAC AAAG AAC 
C CAAAGT CAGCTT CTCCAGATGGTC CTGCAAGG CTGGAAGTTC AATGGTGG AAG CT T AT ATTATTTTTCTAG 
TGTCAAGAAGTCTTGGCATGAGGCTGAGCAGTTCTGCGTGTCCC^GGG 

CAAGGAGGAG CAGG CATTT CTGGTAGAGTT CACAAGTAAAGTGTACTACTGG AT CGGTCTCACTGACAGGGG 
CACAGAGGGCTCCTGGCGCTGGACAGATGGGACACCATTCAACGCCGCCCAGAACAAAGCGGCCACTAGGGG 
ATG AAGGAC CC AT CTCAAGT CAG CT CC CT AGACT C ATC CCATGT CAGCT C C CT AGGAG C CACAG CAC CAGGA 
AG^GATGCTGCCTTCATCTAACAGTATj^GCCCTGTTGTCTTCGGGTTTTGGGAAAAGAATCAGTCTGACA 
ACTGGCGGCACAAGAATGGGCAGACTGAAGACTGTGTCCAAATTCAGCAGAAGTGGAATGACATGACCTGTG 
ACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGG^CCAGGGTGTGGCCTGAGGGCAGGCCAGAGCTGAGG 
GGCTGCTCCTGCTTGCCAATACTGACCCTCCTCCTCGATGC 



10 In a search of public sequence databases, the NOV42d nucleic acid sequence, located 

on chromosome 2, has 705 of 1023 bases (68%) identical to a gb:GENBANK- 
ID:D88577|acc:D88577.1 mRNA from Mm musculus (mRNA for Kupffer cell receptor, 
complete cds) (E = 3.7e 124 ). 

The disclosed NOV42d polypeptide (SEQ ID NO: 164) encoded by SEQ ID NO: 163 

1 5 has 552 amino acid residues and is presented in Table 42H using the one-letter amino acid 

code. Signal P, Psort and/or Hydropathy results predict that NOV42d has a, signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
NOV42d may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
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certainty of 0.2000. The most likely cleavage site for NOV42d is between positions 57 and.58: 
VQQ-QT. 



Table 42H. Encoded NOV42d protein sequence (SEQ ID NO:164). 

VRFCTDNQCV SI^PQEVDSVAMAPAAPKI PRLVQATPAFMAVTLVFSLVTLFVWQQQTRPVPKPVQAVILG 
DNITGHLPFEPNNHHHFGREAEMQELIQTFKGHMENSSAWVVE 

QMVKGVLKDATTLSLQTQMLRS SLEGTNAEIQRIiKEDLEKADAIiTFQTLNFLKSSLEtrTSI ELHVLSRGLEN 
ANSEIQMLNASLETANTQAQIANSSLKNANAEIYVLRGHLDSVND 

NTNALNSQTQAFI KS S FGNTSAE IQFLRGHLERAGDE IHVLKRDLKMVTAQTQKANGRLDQTDTQ I QVFKS E 
MENVNTLNAQ I QVLNGHMKNASRE I QTLKQGMKNASALTSQTQMLDSNLQKASAE I QRLRGDLENTKALTME 
I QQEQSRLKTLHWI TSQEQLQRTQSQLLQMVLQGWKFNGGSLYYFS SVKKSWHEAEQFCVSQGAHLAS VAS 
KEEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTDGTPFNAAQNKAATRG 

5 A search of sequence databases reveals that the NOV42d amino acid sequence has 1 87 

of 404 amino acid residues (46%) identical to, and 269 of 404 amino acid residues (66%) 
similar to, the 548 amino acid residue ptnr:SWISSNEW-ACC:P70194 protein from Mus 
musculus (Mouse) (Kupffer Cell Receptor) (E = 1.2c* 9 ). 

NOV42d is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
10 amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. . 

NOV42a also has homology to the amino acid sequences shown in the BLASTP data 
15 listed in Table 421. 



Table 421. BLAST results for NOV42a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7949066 |ref|NP 0 
58031. l| 
(NMJ)16751) 


C-type (calcium 
dependent , 
carbohydrate 
recognition 
domain) lectin, 
superfamily 
member 13; 
kupffer cell 
receptor; Kupffer 
cell c-type 
lectin receptor 
[Mus musculus] 


548 


300/548 
(54%) 


395/548 
(71%) 


e-153 


gi 1 16758588 |ref|NP 
446205. 1| 
(NM_053753) 


Kupffer cell 
receptor [Rattus 
norvegicus] 


550 


293/547 
(53%) 


382/547 
(69%) 


e-147 


gi | 7657291 |ref|NP 0 
56532. 1| 
(NMJU5717) 


Langerhans cell 
specific c-type 
lectin; 1 anger in 
[Homo sapiens] 


328 


92/261 
(35%) 


143/261 
(54%) 


2e-4l 


gi| 17426713 |emb|CAC 
85632. 1| (AJ313164) 


langerin [Mus 
musculus] 


326 


91/264 
(34%) 


140/264 
(52%) 


le-40 



368 
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gi | 17059581 | emb | CAC 
82936.1 | (AJ302711) 



C type lectin 
[Mus musculus] 



331 


91/264 


140/264 


2e-40 




(34%) 


(52%) 





The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 42J. In the ClustalW alignment of the NOV42 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (ie. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 42J. ClustalW Analysis of NOV42 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Novel NOV42a 
Novel NOV42b 
Novel NOV42C 
Novel NOV42d 



(SEQ ID NO: 158) 
(SEQ ID NO: 160) 
(SEQ ID NO: 162) 
(SEQ ID NO: 164) 



1) 
2) 
3) 
4) 

5) ' gij 7949066 1 ref |NP_058031.1 1 (NM_016751) C-type (calcium dependent, carbohydrate, 
recognition domain) lectin, superf amily member 13; kupffer cell receptor; Kupffer 
cell c-type lectin receptor [Mus musculus] (SEQ ID NO: 512) 

6) gi | 16758588 | ref |NP_446205.l| (NMJ)53753) Kupffer cell receptor [Rattus 
norvegicus] (SEQ ID NO: 513) 

7) gi | 7657291 | ref |NP_C56532 . 1 | (NM_015717) Langerhans cell specific c-type lectin; 
langerin [Homo sapiens] (SEQ ID NO: 514) 

8) gi 1 17426713 | emb |CAC85632.l| (AJ313164) langerin [Mus musculus] (SEQ ID NO: 515) 

9) gi 1 17059581 1 emb | CAC82936.1] (AJ302711) C type lectin [Mus musculus) (SEQ ID 
N0:516) 



NOV42a 


1 


N0V42b 


1 


NOV42C 


1 


NOV42d 


1 


gi 


7949066| 


1 


gi 


16758588 | 


1 


gi 


7657291| 


1 


gi 


17426713 | 


1 


gi 


17059581 


1 




70 



N0V42a 


59 


N0V42b 


59 


NOV42C 


54 


N0V42d 


49 


gi 


7949066| 


59 


gi 


16758588 | 


59 


gi 


7657291| 


58 


gi 


17426713 | 


56 


gi 


17059581 


61 




80 

|....|....-| 

kppgd- -pnltnplsfqh: 
trpvpkpvqavi 
trpvpkpvqavi 

iTRPVPKPVQAVI 
jPWRPEWNKEPPSLL 
iPWRQKQNEDHPVKAG- -] 
iFMG- 




110 



120 



115 
|GHMEg 118 
IgHMeS 113 
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ww 


NOV42b 


119 
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WW 
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J^jDNVigS 

jp|DNvj*|S 

MpNVpS 
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)ATT 
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>1tq 
|tq 
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NOV42a 
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NOV42c 
NOV42d 
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7949066 I 
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17426713 | 
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NOV42C 


473 


NOV42d 


468 


gi 


7949066) 


416 


gi 


16758588 | 


416 


gi 


7657291| 


199 


gi 


17426713) 


197 


gi 


17059581 


202 



NOV42a 


477 e 


J a 




s 


NOV42b 


480 


II 3 




s 


NOV42C 


533 


f 3 




s 


NOV42d 


528 








gi 
gi 


7949066 | 
16758588] 


476 


1 2 




I 


476 








gi 
gi 


7657291 | 
17426713) 


259 








257 








gi 


17059581 


262 I 









NOV42a 
NOV42b 
NOV42c 
NOV42d 



gi 
gi 
gi 
gi 
gi 



7949066 | 

16758588) 

7657291] 

17426713) 

17059581 





610 

PMGQGVA 546 

MGQGVA 549 

567 

552 

ISTGWSAARVG-- 548 

iSTDWSVARTDQS 550 

PYVPSEP 328 

PYVQTTE 326 

•YVQTTE 331 



Tables 42K-N list the domain descriptions from DOMAIN analysis results against 
35 NOV42. This indicates that the NOV42 sequence has properties similar to those of other 
proteins known to contain this domain- 



Table 42K Domain Analysis of NOV42a 

gnl | Smart | smart 0003 4, CLECT, c-type lectin (CTL) or carbohydrate- 
recognition domain (CRD) Many of these domains function as calcium- 
dependent carbohydrate binding modules. (SEQ ID NO: 83 6) 
CD-Length = 124 residues, 98.4% aligned 
Score = 124 bits (311) , Expect = le-29 



40 



45 



50 



NOV42 : 


415 


Sb j ct : 


3 


N0V42 : 


471 


Sbjct: 


63 


N0V42: 


527 


Sb j Ct : 


113 



II +11 III* ll+l +1+ II I lllllk Ml 11+ III 

SGWVSYPGGKCYKFSTEKKTWADAQAFCQSLGAHLASIHSEEENDFLLSLLKHSNSDYYW 62 



III* + 111*1+11+ I I * 
IGLSRPDSNGSWQWSDGSGPVDYSN WAPGEPGG- 



* +11 + I I I I 

- SGNCWLSTSGGGKWND 112 



++I + 
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Table 42L Domain Analysis of NOV42a 

gnl|Pfam|pfam00059, lectin_c, Lectin C-type domain. This family 
includes both long and short form C-type (SEQ ID NO: 837) 
CD-Length = 107 residues, 99.1% aligned 
Score = 115 bits (288) , Expect » 6e-27 



NOV42: 431 KKSWHEAEQFCVSQGAHLASVASKEEQAFLVEFT- - S KVYYWIGLTDRGTEGS WRWTDGT 488 

l + l 11+ I I " I 1+ I III II III I I I I I I I I 1*1 Mil* 
Sbjct: 2 SKTWAEAQAACQKLGGGLVSIQSAEEQDFLTSLTKASNSYAWIGLTDIOTEGTWVWTDGS 61 

NOV42 : 489 PFNAAQNKGFWEKNQSDNWRHKNGQTEDCVQIQ- - -QKWNDMTCDTPYQWVCKK 539 

II I + +1 +1 llll+l I I I I I + +11 + 

Sbjct: 62 PVNYT NWAPGEPNNRGNK EDCVEIYTDGNKWNDEPCGSKLPYVCEF 107 
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Table 42M Domain Analysis of NOV42a 

gnl|Pfam|pfam01576 / Myosin_tail, Myosin tail. The myosin molecule is a 
multi-subunit* complex made up of two heavy chains and four light 
chains it is a fundamental contractile protein found in all eukaryote 
cell types. This family consists of the coiled-coil myosin heavy chain 
tail region. The coiled-coil is composed of the tail from two 
molecules of myosin. These can then assemble into the macromolecular 
thick filament. The coiled-coil region provides the structural 
backbone the thick filament. (SEQ ID NO: 838) 
CD-Length - 860 residues, 29.4% aligned 
Score =43.1 bits (100), Expect = 4e-05 
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20 
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30 



NOV42: 


121 


Sbjct: 


187 


NOV42: 


181 


Sbjct: 


247 


NOV42t 


241 


Sbjct: 


300 


NOV42 : 


301 


Sbjct: 


. 357 


N0V42 : 


361 


Sbjct: 


411 



++ 1+ ++I + II I 



I +1 I* 



I++I 



hi 



l ++ I +11 II + I I III 

-LQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANA 299 



I ++ +1 



+++ + |+| 



I 1+ 11+ +11 I 



I++ I* I + 1+ 



II 



+ 1 



35S 



+ | ++ + 

- RKVDELQAELDTAQREARNL 410 



+++| |h 
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Table 42N Domain Analysis of NO V42c 

gnl jpf am|pfam01576, Myos install, Myosin tail. The myosin molecule is a 
multi-subunit complex made up of two heavy chains and four light 
chains it is a fundamental contractile protein found in all eukaryote 
cell types. This family consists of the coiled-coil myosin heavy chain * 
tail region. The coiled- coil is composed of the tail from two 
molecules of myosin. These can then assemble into the macromolecular 
thick filament. The coiled-coil region provides the structural 
backbone the thick filament. (SEQ ID NO: 838) 
CD-Length = 860 residues, 30.2% aligned 
Score m 39.3 bits (90), Expect = 6e-04 



SQLQVliGDHLGNTNADIQMVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADA 194 

++I++ II 11+ I 1+ I + k+ +1 I II +11 + I 

NELEIALDHANKANAEAQ KNVKKYQQQVKELQTQVE EEQRAREDAREQLA 605 

LTFQTLNFLKSSLENTSIELHVLSRGLENANSEI^ 254 

+ + I++ II I I + I +1 I" ♦ I I I +1 + I 

VAERRATALEAELEELRSALEQAERARKQAETE - - - LAEASERVNELTAQNSSLIAQK - R 661 

NTSAEIQFLRGHLERAGDEIHVXKRDLKM^ 3 14 

KLEGELAALQSDLDEAVNELKAAEE RAKKAQADAARLAEELRQEQEHSQHLER 714 

LNAQIQVXNGHMKNASREIQT--LKQGMKNASALTSQTQMLDSNL QKASAEIQR-LR 368 

I ++ I + III I I ++ + I++ I 1+ II 1+ II 

LRKQLESQVKELQVRLDEAEAAALKGGKKM I QKLEARVRELEAELDGEQRRHAETQKNIjR 774 

GDLENTKALTMEIQQEQSRLKTLHWITSQEQLQRTQSKQL 409 

| | ++++++ |+ | ++ + +| +| | 
KMERR VKELQFQVEEDKKNLERLQDLVDKLQAKI KTYKRQL 815 

Kupffer cells are found in the linings of the liver sinusoids, and are phagocytic. A 
receptor uniquely found on the surface of rat Kupffer cells binds oligosaccharides terminating 
in galactose, N-acetylgalactosamine, and fucose. A number of different families of proteins 

25 share a conserved domain which was first characterized in some animal lectins. Animal lectins 
display a wide variety of architectures. They are classified according to the carbohydrate- 
recognition domain (CRD) of which there are two main types, S-type and C-type. C-type 
lectins (CTL) display a wide range of specificities and function as a calcium-dependent 
carbohydrate-recognition domain. They are found predominantly but not exclusively in 

30 vertebrates. CTLs can be classified into a number of subgroups based on their function and 
structure: 1) Collectins, represented by the soluble mannose-binding proteins of mammalian 
serum and liver; 2) Selecting membrane-bound proteins involved in inflammation; and 3) 
Endocytic lectins, membrane-bound receptors that mediate endocytosis of glycoproteins. 
Endocytic lectins are type-II membrane proteins where the CTL domain is located at the C- 

35 terminal extremity of the proteins, and include the Kupffer Cell Receptor. 

The disclosed NOV42 nucleic acid of the invention encoding a Kupffer Cell Receptor- 
like protein includes the nucleic acid whose sequence is provided in Table 42A, 42C, 42E, 
42G , or a fragment thereof. The invention also includes a mutant or variant nucleic acid any 
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Sbjct: 


662 
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Sbjct: 
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NOV42: 


369 


Sbjct: 


775 



WO 02/068649 



PCT/US02/02785 



of whose bases may be changed from the corresponding base shown in Table 42A, 42C, 42E, 
or 42G while still encoding a protein that maintains its Kupffer Cell Receptor -like activities 
and physiological functions, or a fragment of such a nucleic acid. The invention further 
includes nucleic acids whose sequences are complementary to those just described, including 
5 nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
10 chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 32 percent of the bases may be so 
changed. 

The disclosed NOV42 protein of the invention includes the Kupffer Cell Receptor - 

15 like protein whose sequence is provided in Table 42B,42D, 42F, or 42H. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 42B, 42D, 42F, or 42H while still encoding a protein 
that maintains its Kupffer Cell Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 66 percent of the 

20 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Kupffer Cell Receptor -like protein 
(NOV42) is a member of a "Kupffer Cell Receptor family". Therefore, the NOV42 nucleic 

25 acids and proteins identified here may be useful in potential therapeutic applications 

implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

30 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV42 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 
transplantation, arthritis, tendinitis, and/or other diseases and pathologies. 
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NOV42 nucleic acids and polypeptides are farther useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV42 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV42 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

10 NOV43 

A disclosed NO V43 nucleic acid of 1 1 08 nucleotides (also referred to as CG56690-0 1 ) 
encoding a P2Y Purinoceptor -like protein is shown in Table 43A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 12-14 and ending with a 
TAA codon at nucleotides 1095-1097. The start and stop codons are shown in bold in Table 

15 43 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 43A. NOV43 nucleotide sequence (SEQ ID NO:165). 



GTCATGATGTT& TGCTGTCCATTTTGCCTCCT 
TCCTGGAGGGAGCCTCCCGGGACATGGAGAAGGTGGAC 

TCTCAGAGAAGTACAAGCAAGTCTACCTCTCCCTGGCCTACAGTATCATCTTTATCCTAGGGCTGCCACT 

ATGGCACTGTCTTGTGGCACTCCTGGGGCC^AACCAAGCGCTGGAGCTGTGCCACC^CCTATCTGGTGAACC 

TGATGGTGGCCGACCTGCTTTATGTGCTATTGCCCTTCCTCATCATCACCTACTCACTAGATGACAGGTGGC 

CCTTCGGGGAGCTGCTCTGCAAGCTGGTGCACTTCCTGTTCTATATCAACCTTTACGGCAGCATCCTGCTGC 

TGACCTGCATCTCTGTGCACCAGTTCCTAGGTGTGTGCCACCCACTGTGTTCGCTGCCCTACCGGACCCGCA 

GGCATGCCTGGCTGGGCACCAGmCCACCTGGGCCCTGGTGGTCCTCCAGCTGCTGCCCACACTGGCCTTCT 

C CC ACACGGACTACAT C AATGG CCAGATGAT CT GGTATGACATGAC CAG CCAAGAGAATTTTGAT CGGCTTT 

TTGCCTACGGCATAGTTCTGACATTGTCTGGCTTTCTTTCCCTCCTTGGTCATTTTGGTGTGTAT^ 

TGGTCAGGAGCCTGAT CAAGCCAGAGGAG AACCTCATGAGGACAGGCAACACAGCCCGAGCCAGGTCCAT CC 

GGACCATCCTACTGGTGTGTGGCCTCTTCACCCTCTGTTTTGTGCCCrTCCATATCACTCGCTCCTTCTACC 

TCACCATCTGCTTTCTGCTTTCTCAGGACTGCCAGCTCTTGATGGCAGCCCAGTGTGGCCTACAAGATATGG 

AGGCCTCTGGTGAGTGTGAGCAGCTGCCTCAACCCAGTCCTGTACTTTC^ 

T CAGGCT CCT CCAGAAACTGAGGCAGAACAAGTTGGGTG AGCATCCAG CTGGGAGGAAGAGATGC C CAGGGT 
TGAACAGATCTGGGTAATGCCAAGGTGA 



In a search of public sequence databases, the NOV43 nucleic acid sequence, located on 
chromosome 2, has 585 of 924 bases (63%) identical to a gbrGENBANK- 
ID:GDP2Y3|acc:X98283.1 mRNA from Gallusgallus (G.domesticus mRNA for G protein- 
20 coupled P2 receptor) (E = 3.6c 45 ). 

The disclosed NOV43 polypeptide (SEQ ID NO: 166) encoded by SEQ ID NO: 165 has 
361 amino acid residues and is presented in Table 43B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV43 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
25 NOV43 may also localize to the mitochondrial inner membrane with a certainty of 0.5 862, to 
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the mitochondrial intermembrane space with a certainty of 0.41 14, or to the Golgi body with a 
certainty of 0.4000. The most likely cleavage site for NOV43 is between positions 13 and 14: 
SRS-GS. 



10 



15 



Table 43B. Encoded NOV43 protein sequence (SEQ ID NO:166). 



MLSILLPSRGSRSGSRRGALLLEGASRDMEKVDMNTSQEQGLCQFSEKYKQVYLSIA 

LWHS WGQTKRWS CATTYL VNLMVADLLYVLL PFL 1 1 T YS LDDRWP FGELLCKLVHFLFYINLYGS I LLLTCI 

SVHQFLGVCHPLCSLPYRTRRHAWLGTSTTWALVVLQLLPTLAFSHTD 

IVLTLSGFLSLKHFGVYSLWTC^^ 

FLLSQDCQLLMAAQCGLQDMEASGECEQLPQPSPVLSFKGGKNRVI^LQKLRQNKI^ 
G 



A search of sequence databases reveals that the NOV43 amino acid sequence has 105 
of 261 amino acid residues (40%) identical to, and 153 of 261 amino acid residues (58%) 
similar to, the 328 amino acid residue ptnr:SWISSNEW-ACC:Q98907 protein from Gallus 
gallus (Chicken) (P2Y Purinoceptor 3 (P2Y3) (Nucleoside Diphosphate Receptor)) (E = 0.0). 

NOV43 is predicted to be expressed in brain because of the expression pattern of 
(GENBANK-DD: gb:GENBANK-ID:GDP2Y3|acc:X98283.1) a closely related G.domesticus 
mRNA for G protein-coupled P2 receptor homolog in species Gallus gallus.. 

NOV43 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 43C. 



Table 43C BLAST results for NOV43 


Gene Index/ 
Identifier 


Protein/. Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|2829680|sp|P7992 
8 | P2Y8_XENLA 


P2Y PURINOCEPTOR 
8 (P2Y8) 


537 


111/259 
(42%) 


154/259 
(58%) 


5e-49 


gi| 2707256 |gb|AAC60 
339. l| (AF031897) 


G protein coupled 
P2Y nucleotide 
receptor 
[Meleagris 
gallopavo] 


374 


107/277 
(38%) 


158/277 
(56%) 


4e-46 


gi|2495017|sp|Q9890 
7|P2Y3_CHICK 


P2Y PURINOCEPTOR 
3 (P2Y3) 
(NUCLEOSIDE 
DIPHOSPHATE 
RECEPTOR) 


328 


105/261 
(40%) 


153/261 
(58%) 


6e-45 


gi 10720180 |sp|0933 
61 P2Y3_MELGA 


P2Y PURINOCEPTOR 
3 (P2Y3) 
(NUCLEOSIDE 
DIPHOSPHATE 
RECEPTOR) 


328 


105/269 
(39%) 


155/269 
(57%) 


2e-44 
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gi | 13928944 | ret |NP_ 
113868. 1| 
(NM 031680) 



purmergic 


OCT 


1 1 q / n n 
XXo/ iiU 


Id?/ J -LU 




receptor P2Y, 6- 




















4; pyriraidinergic 










receptor P2Y, G- 










protein coupled, 










4 [Rattus 










norvegicus] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 43D. In the ClustalW alignment of the NOV43 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 43D. ClustalW Analysis of NOV43 



1) Novel NOV43 (SEQ ID NO:166) 

2) gi | 2829680 sp P79928 ) P2Y8_XENLA P2Y PURINOCEPTOR 8 (P2Y8) (SEQ ID NO: 517) 

3) gi|2707256 gb AAC60339.l| (AP031897) G protein coupled P2Y nucleotide receptor 
[Meleagris gallopavo] (SEQ ID N0:518) 

4) gi| 2495017 |sp|Q98907|P2Y3_CHICK P2Y PURINOCEPTOR 3 (P2Y3) (NUCLEOSIDE 
DIPHOSPHATE RECEPTOR) (SEQ ID NO: 519) 

5) gi| 10720180 |sp|09336l|P2Y3_MELGA P2Y. PURINOCEPTOR 3 (P2Y3) (NUCLEOSIDE 
DIPHOSPHATE RECEPTOR) (SEQ ID NO: 520) 

6) gi | 13928944 | ref |NP_113868 . 1 1 (NM_031680) purinergic receptor P2Y, G-protein 
coupled, 4; pyrimidinergic receptor P2Y, G-protein coupled, 4 [Rattus norvegicus] 
(SEQ ID NO: 521) 
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2707256 
2495017 
10720X80 | 
13928944| 



NOV43 



gi 
gi 
gi 
gi 
gi 



2829680 
2707256 
2495017 
10720180 | 
13928944| 




130 



140 



150 



160 
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180 




377 



WO 02/068649 



PCT7US02/02785 



230 



10 



15 



20 



25 
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35 



40 



45 



50 



55 



60 



NOV43 

gi|2829680 | 
gi I 2707256 j 
gi j 2495017 | 
gi j 10720180 | 
gi | 13928944 | 



179 VLQ 
172 TL< 
170 Tr 
151 IAQC 
151 IAQC 
160 AG 



NOV43 



gi 
gi 
gi 
gi 
gi 



2829680 

2707256 

2495017 

10720180 

13928944 



NOV43 



gi 
gi 
gi 
gi 
gi 



28296801 
2707256| 
2495017 | 
10720180 | 
13928944 | 




3 60 
I 

>N 342 

RRSSVPNRRCMHTNHP 349 

>QR P 334 

308 
308 

IS 320 
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NOV43 




342 




gi 


2829680 




350 


QTEPHMTAG] 


gi 


2707256 




335 


RPVP TSj 


gi 


2495017 




308 


Y ] 


gi 


10720180 | 


308 


Y ] 


gi 


13928944) 


321 


KPKPRTAASi 




380 390 400 410 420 

|....|....|....|....|....| J_...J V ..|....| 

SGRKRCPGIiNR 359 

!EIPSNGSMVRDENGEGSREHRVEwffoT&^ 409 

„„rr*~„r„T„r«~^ |^ES||GMGT|7WS 370 

^KWjgQDHC&SY- 326 

|s!KWgHDHC|jrY 326 

WSDTHQDSTFSA 355 



PSVDSSWGSCCN- 



-LHEESISR- 
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gi 
gi 
gi 
gi 



2829680 

2707256 

2495017 

10720180 

13928944 




440 



450 



460 



470 



480 



|....|....|....|....|. 



361 

PYVEWEKEDYETKRENRKTTEQSSKTNAEQDELQTQIDSRLK 469 

374 

32 e 

328 

_ 361 



NOV43 



I 



490 



I 



500 



510 



520 



|....| 



530 
|....|.. 
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I---I 



gi 
gi 
gi 
gi 
gi 



2829680 

2707256 

2495017 

10720180 

13928944 



361 



NOV43 

B±]2829680\ 
gi|2707256| 
gi | 2495017 | 
gi j 10720180 | 
gi j 13928944 | 



361 

470 RGKWQLSSKKGAAQENEKGHMEPSFEGEGTSTWl^^ 529 

374 - 374 

32 8 " " --- 328 

328 - - 328 

361 « - 361 



....|... 

361 361 

530 ELQNFPKA 537 

374 374 

328 328 

328 328 

361 361 



65 Tables 43E lists the domain descriptions from DOMAIN analysis results against 

NOV43. This indicates that the NOV43 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 43E Domain Analysis of NOV43 

gnl |Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 85.0% aligned 

Score = ill bits (277) , Expect = 8e-26 



NOV43 : 69 NGTVLWHS WGQTKRWSCATTYLVNLMVADLLYVL - LPFLI ITYSLDDRWPFGELLCKLVH 127 

I 1+ I + +1+11 II + I + III- lllll 

5 Sbjct: 2 NLLVILVILRTKKLRTPTNIFLLNLAVADUiFLLTLPPWALYYLVGGDW 61 

NOV43 : 128 FLFYINLYGS ILLLTCI S VHQFLGVCHPLCSLPYRTRRHAWLGTSTTWALVVLQ- LLPTL 186 

II +1 I llllll 11+ ++| MM II I I + II +1 III 

Sbjct: 62 ALFWNGYAS ILLLTAI S IDR YLAXVHPLRYRRI RTPERAKVLI LLVWVLALLLSLPPLL 121 

L0 

NOV43: 187 AFSHTDYINGQMIWYDOTSQENFDRLFAYGIVLrLSGFLSLLGHFGVYSLMVRSLIKPE- 245 

I + I + + + I II 1+ ++I+I I 

Sbjct: 122 FSWLRTVEEGOTTVCLIDFPEESVKRSYVIJjSTLVG 181 

15 NOV43: 246 - ENLMRTGNTARARS I RT ILL VCGLFTLCFVPFHI T 280 
+ ++ +++ ++ + +|+| +| | |++|+| | 
Sbjct: 182 SQRSLKRRSSSERKAAKMLLWVWFVLCWLPYHIV 217 

The P2Y Purinoreceptor belongs to the family of G-Protein Coupled Receptors. G- 

20 protein-coupled receptors (GPCRs) constitute a vast protein family that encompasses a wide 
range of functions (including various autocrine, paracrine and endocrine processes). They 
show considerable diversity at the sequence level, on the basis of which they can be separated 
into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of 
families for which there are indications of evolutionary relationship, but between which there 

25 is no statistically significant similarity in sequence [1]. The currently known clan members 
include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal 
matingpheromone receptors, and the metabotropic glutamate receptor family. The rhodopsin- 
like GPCRs themselves represent a widespread protein family that includes hormone, 
neurotransmitter and light receptors, all of which transduce extracellular signals through 

30 interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands 
vary widely in structure and character, the amino acid sequences of the receptors are very 
similar and are believed to adopt a common structural framework comprising 7 
transmembrane (TM) helices. See CMKRL2 (601805). Using degenerate PCR to find cDNAs 
encoding new G protein coupled-receptors in human B cells, Owman et al. (1996) identified a 

35 CMKRL1 cDNA which encodes a 352-amino acid polypeptide with a calculated mass of 43 
kD. The nearest homologs of this novel sequence are the chemoattractant leukocyte receptors, 
such as the C5a anaphylatoxin receptor and the FMLP receptor. Northern blotting revealed 
transcripts of 5 kb and 7.5 kb in several tissues of the immune system including spleen, 
thymus, and lymph node. Owman et al. (1996) considered the high level of expression in 
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lymphoid tissues suggestive of the role of CMKRL1 in the regulation of the inflammatory 
system. The authors mapped the CMKRL1 gene to 14ql 1.2-ql2 by fluorescence in situ 
hybridization. Akbar et al. (1996) used a chicken P2Y3 cDNA to screen a human 
erythroleukemia (HEL) cell cDNA library and cloned a purinoceptor cDNA, which they 
5 termed P2Y7. Sequencing revealed an open reading frame coding for a polypeptide of 352 
amino acids having 7 putative transmembrane domains. The P2Y7 receptor has 23 to 30% 
identity to other P2Y receptors, but forms a unique branch within the P2Y family. Northern 
blot analysis showed that the P2Y7 gene produced a 1 .6-kb transcript which is expressed at 
highest levels in human heart, human skeletal muscle, rat heart, and rat cardiomyocytes and at 

10 lower levels in human brain and human liver. Akbar et al. (1996) noted that its expression in 
HEL cells is below the threshold of detection by Northern blot. Binding and displacement 
assays in COS-7 cells showed that P2Y7 has a high affinity for ATP and much less for UTP 
and ADP. The rank order of affinities in the binding series was distinct from any known for 
the P2Y1-P2Y6 receptors. Like other P2Y receptors, P2YR is coupled to phospholipase C and 

1 5 not to adenylate cyclase. Akbar et al. (1 996) speculated that P2Y7 may be the cardiac P2Y 
receptor involved in the regulation of cardiac muscle contraction through modulation of L- 
type calcium currents. Akbar et al. (1996) used PGR on a panel of mouse-rodent somatic cell 
hybrids to localize the P2RY7 gene to human chromosome 14. Somers et al. (1997) did 
sequence tagged site (STS) mapping of the P2RY7 gene using the National Center for 

20 Biotechnology Information (NCBI) database. In this way, they positioned the P2RY7 gene 
between D14S283 and D14S264. Leukotriene B4 (LTB4) is a potent chemoattractant that is 
primarily involved in inflammation, immune responses, and host defense against infection 
(Samuelsson et al., 1987; Chen et al., 1994). LTB4 activates inflammatory cells by binding to 
its cell surface receptor, BLTR. LTB4 can also bind and activate the intranuclear transcription 

25 factor PPAR-alpha, resulting in the activation of genes that terminate inflammatoiy processes 
(Devchand et al., 1996). Yokomizo et al. (1997) cloned the cDNA encoding a cell surface 
LTB4 receptor that is highly expressed in human leukocytes. Two cDNA clones isolated from 
retinoic acid-differentiated HL-60 cells contained identical open reading frames encoding a 
protein of 352 amino acids and predicted to contain 7 membrane-spanning domains, but 

30 different 5-prime untranslated regions. In Chinese hamster ovary (CHO) cells stably 

expressing this receptor, LTB4 induced increases in intracellular calcium, accumulation of D- 
myo-inositol-l,4,5-triphosphate, and inhibition of adenylyl cyclase. Furthermore, CHO cells 
expressing exogenous BLTR showed marked chemotactic responses toward low 
concentrations of LTB4 in a pertussis-toxin-sensitive manner. Yokomizo et al. (1997) found 
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that the putative purinoceptor P2Y7 has a primary structure identical to that of one of the 
BLTR clones, HL-5. To determine whether BLTR also functions as a purinoceptor, they 
established stable transformants of BLTR in glioma cells that possess negligible amounts of 
intrinsic purinoceptors. In these cells, up to 300 microM caused no change in intracellular 
5 calcium levels, but significant increases in the calcium concentrations were induced by 

exposure to 10 nanoM LTB4. These results were interpreted to indicate that this receptor is not 
a purinoceptor, but a BLTR. 

The disclosed NOV43 nucleic acid of the invention encoding a P2Y Purinoceptor - 
like protein includes the nucleic acid whose sequence is provided in Table 43 A or a fragment 
10 thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 43A while still encoding a protein 
that maintains its UDP[ Glycosyltransferase -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
1 5 complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
20 stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV43 protein of the invention includes the P2Y Purinoceptor -like 
protein whose sequence is provided in Table 43B. The invention also includes a mutant or 
25 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 43B while still encoding a protein that maintains its P2Y Purinoceptor -like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 62 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
30 (Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this P2Y Purinoceptor -like protein 
(NOV43) is a member of a "P2Y Purinoceptor family". Therefore, the NOV43 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 
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therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
5 (but not limited to) those defined here. 

The NOV43 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome , Alzheimer's 
disease, Stroke, Tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, 
Cerebral palsy, EpiIepsy,Lesch-Nyhan syndrome, Multiple sclerosis,Ataxia- 

10 telangiectasia,Leukodystrophies,Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, and/or other diseases and pathologies. 

NOV43 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV43 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

15 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV43 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

20 disorders. 

NOV44 

A disclosed NOV44 nucleic acid of 934 nucleotides (also referred to as CG56692-0 1 ) 
encoding a G Protein Coupled Receptor-like protein is shown in Table 44A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 15-17 and ending 
25 with a TAA codon at nucleotides 921-923. The start and stop codons are shown in bold in 
Table 44A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 44A. NOV44 nucleotide sequence (SEQ ID NO:167). 

TGACCTTGGAATCT ATGGACATACC&CAAAATATCACAGAA a r TP an 

AGGTACAGAGAGTTCTCTTTGTGGTCTTTTTGCTGATCTATGTGGTCACGGTTTGTGGCAACATGCTCATO 

TGGTCACTATCACCTCCAGCCCCACGCTGGCTTCCCCTGTG 

ACAC CT TTT ATT CTT C TT C TATGGCT CCTAAAC T CATTGCT GACT CAT TGTATGAGGGGAGAA.CC AT CT CTT 
ATGAGTGCTGCATGGCTC^GCTCTTTGGAGCTCATTTTTTGGGAGGTGTTGA 

TGGCTTATGACCGCTATGTGGCCATCTGTAAGCCCCTGCAC^TACTACCATCATGACC^GGCATCTCTGTG 
CCATGCTTGTAGGGGTGGCTTGGCTTGGGGGCTTCCTGCATTCATTGGT^ 

TGCCCTTCTGTGGGCCCAATGTGATCAATCACTTTGCCTGTGACTTGTACCCTTTGCTGGAAGT^ 
CCAATACGTATGTCATTGGTCTGCTGGTGGTTGCCAACAGTGGTTTAATCTGCCTGTTGAACTTCCTCATG 
TGGCTGCCTCCTACATTGTCATCCTGTACTCCTTGAGGTCCCACAGTGCAGATGGGAGATGC^ 
CCACCTGTGGAGCCCACTTCATTGTTGTTGCCTTGTTCTTTGTGCCCT 

TT TCTACTTTACCT ATAGACAAAAATATGGCATT ATTTTATGGTATTCTGACACCTATGTTGAAT C CACT CA 
TTTATACCCTGAGAAATGAAGAGGTAAAAAATGCCATGAGAAAGCTC^ 
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In a search of public sequence databases, the NOV44 nucleic acid sequence, located on 
chromosome 7, has 783 of 920 bases (85%) identical to a gb:GENBANK- 
ID:AB030895|acc:AB030895.1 mRNA from Mus musculus (gene for odorant receptor 
MORI 8, complete cds) (E = 4.5e" 146 ). 
5 The disclosed NOV44 polypeptide (SEQ ID NO:168) encoded by SEQ ID NO:167 has 

302 amino acid residues and is presented in Table 44B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV44 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV44 may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
1 0 reticulum (membrane) with a certainty of 0.3000, or to the mitochondrial inner membrane with 
a certainty of 0.0300. The most likely cleavage site for NOV44 is between positions 39 and 
40: VCG-NM. 



Table 44B. Encoded NOV44 protein sequence (SEQ ID NO:168). 

MDI PQNI TEFFMLGIjSQNSEVQRVLFWFLLI YWTVCGNMLI WTI TS SPTIoASPVYFFLANLSFIDTFYS 

SSMAPKLiI ADSLYEGRTI S YECCMAQLFGAHFLGGVE X I LLTVMAYDRYVAI CKPLHNTTIMTRHLCAMLVG 

VAWLGGFItfSLVQLLLVLWLPFCGPNV^ 

I\TCLYSIjRSHSADGRCKALSTCG&HFIWALF 

NEEVKNAMRKLFTW 



15 A search of sequence databases reveals that the NOV44 amino acid sequence has 257 

of 301 amino acid residues (85%) identical to, and 280 of 301 amino acid residues (93%) 
similar to, the 308 amino acid residue ptnr:SPTREMBL-ACC:Q9R0K2 protein from Mus 
musculus (Mouse) (Odorant Receptor MORI 8) (E = 5.0e-138). 

NOV44 also has homology to the amino acid sequences shown in the BLASTP data 

20 listed in Table 44C. 



Table 44C. BLAST results for NOV44 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 17472367 |ref |XP_ 
061659. 1| 
(XM_061659) 


similar to 
odorant receptor 
16 (H. sapiens) 
[Homo sapiens] 


324 


302/302 
(100%) 


302/302 
(100%) 


e-143 


gi 1 11496249 1 ref | NP_ 

067343.1) 

(NM_021368) 


odorant receptor 
16 [Mus musculus] 


308 


257/301 
(85%) 


280/301 
(92%) 


e-127 


gi 1 11464995| ref |NP_ 
065261.1 | 
(NM_020515) 


gene for odorant 
receptor A16 [Mus 
musculus] 


302 


234/300 
(78%) 


262/300 
(87%) 


e-111 


gi 1 17459946 1 ref |XP 
062088. l| 
(XM 062088) 


similar to 
odorant receptor 
16 (H. sapiens) 
[Homo sapiens] 


316 


191/295 
(64%) 


232/295 
(77%) 


9e-95 
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gi| 17472365 |ref.|XP_ 


similar to 


544 


183/296 


230/296 


le-91 


061658. 1| 


odorant receptor 




(61%) 


(76%) 




(XM_061658) 


16 (H. sapiens) 
[Homo sapiens] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 44D. In the ClustalW alignment of the NOV44 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (/.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 44D. ClustalW Analysis of NOV44 



1) Novel N0V44 (SEQ ID N0:168) 

2) gi 1 17472367 | ref |XP_061659.1 1 (XMJ561659) 
sapiens) [Homo sapiens] (SEQ ID NO: 522) 

3) gi 1 11496249 | ref |NPJ)67343 .1 1 (NM_021368) 
ID NO: 523) 

4) gi | 11464995 | ref |NP_065261 . 1 | (NM_020515) 
musculus] (SEQ ID NO: 524) 

5) gi | 17459946 | ref | XP_062088 . 1 | (XM_062088 ) 
sapiens) [Homo sapiens] (SEQ ID NO: 525) 

6) gi | 17472365 | ref | XP_061658 . 1 | (XM_061658 ) 
sapiens) [Homo sapiens] (SEQ ID NO: 526) 



similar to odorant receptor 16 (H. 
odorant receptor 16 [Mus musculus] (SEQ 
gene for odorant receptor A16 [Mus 
similar to odorant receptor 16 (H. 
similar to odorant receptor 16 (H. 



NOV44 


1 


gi 


17472367 


1 


gi 


11496249 


1 


gi 


11464995 


1 


gi 


17459946 


1 


gi 


17472365 


1 



10' 



20 



|....|....|....|.,...| 



30 



40 



50 



60 
•I 



MTQI SSNAFSRDFQNSNAFEVQVKDP IHVEDVPGPKSEFCSVFPSTPQASGNFQNQI FQD 



1 
1 
1 
1 
1 

60 



NOV44 


1 


gi 


17472367 


1 


gi 


11496249 


1 


gi 


11464995 


1 


gi 


17459946 


1 


gi 


17472365 


61 




steaipldedqrkinypntkld: 



NOV44 



gi 
gi 
gi 
gi 
gi 



17472367 
11496249 
11464995 
17459946 
17472365 



I 



130 



140 



GNMLIWTITgSPjl 

gnmliwtit|sp| 
gnmliwt@t|s@« 

GNMLl||jTITgSP£ 
GNMLIVYTl2|SP" 

SInE^iwtitIiisP; 



150 



160 



170 



180 




N0V44 

gi | 17472367 | 



190 



98 
120 



200 



210 



220 



230 



240 



QLFgAHFLGGVE 1 1 LLTVMAYDR YVAI CKPLhStt] 
QLF§AHFLGGVE 1 1 LLTVMAYDRYVAI CKPLHi" 



gTTnMTRHLCg^LV^AWLGGFLHSL|C 

SttRmtrklcS^lvSvawlggflhsuIq 
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gi (11496249 
gi I 11464995 
gi | 17459946 
gi | 17472365 



98 
98 
99 
180 



_ j FgAHF LGGVE 1 1 LLT VMAYDR YVAI C KPLHYTTyMTRHLCL. ... . - . . 

q5f^2lgg5eiilltvklayd^^ 

OLF^HF[^GVE|lgLT^yiAYDRYVAICKPLHY!^gMiSRi2LC^D|^A^ 
iHFf^GVSIILLffiVMAYDBYVAICKPLg^ 



VAWLGGFLHSLvflC 
VAWLGGHLHSggC 

IjawSggflhSlSq 



157 
157 
158 
239 



250 



260 



270 



280 



290 



300 



NOV44 


158 


gi| 17472367 


180 


gij 11496249 


158 


gij 11464995 


158 


gij 17459946 


159 


gij 17472365 


240 


N0V44 


218 


gi 


17472367 


240 


gi 


11496249 


218 


gi 


11464995 


218 


gi 


17459946 


219 


gi 


17472365 


300 


N0V44 


278 


gi 


17472367 


300 


gi 


11496249 


278 


gi 


11464995 


278 


gi 


17459946 


279 


gi 


17472365 


352 


NOV44 


302 


gi 


17472367 


324 


gi 


11496249 


308 


gi 


11464995 


302 


gi 


17459946 


316 


gi 


17472365 


412 


N0V44 


302 


gi 


17472367 


324 


gi 


11496249 




gi 


11464995 


302 


gi 


17459946 


316 


gi 


17472365 


472 


N0V44 


302 


gi 


17472367 


324 


gi 


11496249 


308 


gi 


11464995 


302 


gi 


17459946 


316 


gi 


17472365 


532 




310 



320 



330 



340 



350 



360 




370 



380 



PMLNPLIYTLRN|EVKNAMRgLp 
PMLNPLIYTLR^EVKNAMR|Lp| 



390 

..|....|. 



400 



410 

|....|....|. 



420 
••I 



PMLNPL I YTLRNjgEVKjjjAMR&L^gr 
PMBNPLIYTLR^EVKNAM!^L&j3& 



SEV- 



-VGA- 



PMLNPLIYTLR 



jEVKNAM^gLt?^ 



302 
324 
308 

302 

LMV VSDEKENIKL 316 

I IWGNLFISVI I FI SPALEHFFEAAEI ILLSVMAY 411 



430 440 
..|....|....|.. 



450 

..|....| 



460 



470 



480 

|....|. 



302 
324 
308 
302 
316 



490 500 510 

|....|....|....|....|....|....| 



520 



530 
|....|....| 



540 
•-I 



302 
324 
308 
302 
316 



550 



302 
324 
308 
302. 
316 

532 KKLWKQIMTTDDK 544 



Table 44E lists the domain description from DOMAIN analysis results against 
NOV44. This indicates that the NOV44 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 44E Domain Analysis of NOV44 

gnl |Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family) . (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 

Score = 82.0 bits (201), Expect = 4e-17 



NOV43: 39 GNMLIVVTITSSPTLASPVYFFIiANLSFIDTFYSSSMAPKIiIi^SLYEGRTISYECCMAQ 98 

+ I ♦ I +1 II I 1+ I + ++ I + + | 

^ Sbjct: 1 GNLLVI LVI LRTKKLRTPTNI FLLl^VADLLFIiLTLPPWALYyLVGGDVA^FGDALCKLV 60 

NOV43: 99 LFGAHFLGGVEI ILLTVMAYDRYVAICKPLHNTTIMTRHLCAMLV^ 158 

I MM lll + ll II II +1+ + |+ | || | 

Sb j C t : 61 GALFVVNGYAS ILLLTAI S IDRYLAI VHPLRYRRIRTPRRAKVLILLVWVIiALIjIiSLPPL 120 

10 NOV43: 159 LLVLF^PFCXjPOTINHFACDLYPIjLEVACTNTW 218 
| | ++ + + )+++++ 

Sbjct: 121 LFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFV^ 180 

NOV43: 219 ILYSLRSHSADGRCKALSTCGAHFI WALFFVPC- 1 FTYVHPF STLPIDKNMAL 271 

15 + || +1 I ++I I + || + | 

Sbjct: 181 RSQRSLKRRSSSERKAAKMLLVVVWFVLCWL 240 

NOV43: 272 FYGILTPMLNPLIY 285 

+ + HIHI 
20 Sbjct: 241 WIAYVNSCLNPIIY 254 

G-protein-coupled receptors (GPCRs) constitute a vast protein family that 
encompasses a wide range of functions (including various autocrine, paracrine and endocrine 
processes). They show considerable diversity at the sequence level, on the basis of which they 

25 can be separated into distinct groups. We use the term clan to describe the GPCRs, as they 
embrace a group of families for which there are indications of evolutionary relationship, but 
between which there is no statistically significant similarity in sequence. The currently known 
clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP 
receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor 

30 family. 

The rhodopsin-like GPCRs themselves represent a widespread protein family that 
includes hormone, neurotransmitter and light receptors, all of which transduce extracellular 
signals through interaction with guanine nucleotide-binding (G) proteins. Although their 
activating ligands vary widely in structure and character, the amino acid sequences of the 

35 receptors are very similar and are believed to adopt a common structural fiamework 
comprising 7 transmembrane (TM) helices 

The disclosed NOV44 nucleic acid of the invention encoding a G Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 44A or 
a fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

40 bases may be changed from the corresponding base shown in Table 44A while still encoding a 

protein that maintains its UDP[ Glycosyltransferase -like activities and physiological 
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functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
5 include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 

10 acids, and their complements, up to about 15 percent of the bases may be so changed. 

The disclosed NOV44 protein of the invention includes the G Protein Coupled 
Receptor -like protein whose sequence is provided in Table 44B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 44B while still encoding a protein that maintains its G Protein Coupled 

15 Receptor -like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 39 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fabk, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G Protein Coupled Receptor -like 

20 protein (NOV44) is a member of a "G Protein Coupled Receptor family". Therefore, the 
NOV44 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

25 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV44 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Systemic lupus erythematosus , Autoimmune disease, 

30 Asthma, Emphysema, Scleroderma, allergy, ARDS, and/or other diseases and pathologies. 
NOV44 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV44 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
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NOVX Antibodies" section below. The disclosed NOV44 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
5 disorders. 

NOV45 

A disclosed NOV45 nucleic acid of 994 nucleotides (also referred to as CG56694-01) 
encoding a Mas Proto-Oncogene-like protein is shown in Table 45 A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 17-19 and ending with a 
10 TGA codon at nucleotides 980-982. The start and stop codons are shown in bold in Table 
45 A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 45A. NOV45 nucleotide sequence (SEQ ID NO:169). 

ACTAGGGTTCCTGAGCA TGGATCC^CCM^ 

GGAGACTCCTTGCTACAAGCAAACCCTGAGCCTCACAGGGCTGACGTGCATCGTTTCCCTTO 
AGGAAATGCAGTCGTGCTCTGGCTCCTGGGCTTCCGCATGCGC^ 
CCTGTCCATGGCCGACTTCCTCTTTCTCAGAAGCCA 
CCATCCCATCTTCAAAATCCrCAGCCCTGTGATGATGT 

CATGAGCACCGAGCGCTGCCTGTACGTCCTGTGGCCCATCTGGGAGCGCTGCCGCCCCCGCCCCTACACCTG 
TCAGCGGTCGTGTGTGTCATGCTCTGGGCCCTGTCTCTGCTGCGGAGCGTCCTGGAGTGGAGTTTCTGTGAC 
TTCCTGTTT AGTGGTG CTG ATT CTGTT TGGTGTAAAAC ATCAG ATTT C ATCATAGTAGGGGGG CTGATTTTT 
TTATGTGTGGCTCTCTGTGGTTCCAGCCTGGTCCTGCTGGTCAGGATCCTTTGTGGGTCCCGGAAGATGCCT 

CGGTTTTTC CTATTTTCATGGAACCACGTGGATTTGGAAGT CTTATATTGT CACGTT CAT CTAGTTTC CATT 
TTCCTTTCCTCTCTTJ^CGGCCAACCCCAACM 

GCAGAACCTGAAGCTGGTTCTCCAGAGGGCTCTGCAGGACACGACTGAGGTGAATGAAGGTGGACGATGGCT 
TCCTGAGGAAACCCTGGAGCTGTCAGGAAGCAGATTTGGGCAGTGAGGAAGAGCCTCT 

In a search of public sequence databases, theNOV45 nucleic acid sequence, located on 
chromosome 7, has 353 of 580 bases (60%) identical to a gb:GENBANK- 
15 ID:I08606|acc:I08606.1 mRNA from Unknown. (Sequence 1 from Patent WO 8707472) (E = 
1.2e 10 ). 

The disclosed NOV45 polypeptide (SEQ ID NO: 1 70) encoded by SEQ ID NO: 169 has 
319 amino acid residues and is presented in Table 45B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV45 has no signal peptide and is 
20 likely to be localized to the plasma membrane with a certainty of 0.6400. Alternatively, 
NOV45 may also localize to the Golgi body with a certainty of 0.4600, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3700, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. 



Table 45B. Encoded NOV45 protein sequence (SEQ ID NO:170). 

MIQLLITSGLCSQGHQTRVPEHGSNHPSLGYRTDTNQOT 
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PGLPHAQERIiLHLHLQPVHGRLPLSQKPHYTFSV 

RPVAHLGMiPPPPLHLSAWC^/MLWALSLLRSVLEWSFCDFLFSGADSWC 
SLVLLVRILCGSRKMPLTRLYVTILLIALVFLLCGLPFGIRFFLFSWNHVDLEVLYCHV^ 
PQHLLLRGLIiKAVSKKAEPEAGSPEGSAGHD ; 



A search of sequence databases reveals that the NOV45 amino acid sequence has 50 of 
168 amino acid residues (29%) identical to, and 87 of 168 amino acid residues (51%) similar 
to, the 378 amino acid residue ptnr:SWISSPROT-ACC:P35410 protein from Homo sapiens 
5 (Human) (Mas-Related G Protein-Coupled Receptor MRG) (E = 5.0e-138). 

NOV45 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 45C 



Table 45C. BLAST results for NOV45 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|l5546062|gb|AAK9 
1804.1) (AY042213) 


MrgXl G protein- 
coupled receptor 
[Homo sapiens] 


322 


108/130 
(83%) 


116/130 
(89%) 


2er56 


~gi 1 17472340 | ref | XP__ 
061650.1) 
(XMJ>61650) 


similar to MrgXl 
G protein- coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


1589 


108/130 
(83%) 


116/130 
(89%) 


3e-56 


gi |16876453| ref |NP_ 

473372.1) 

(NM 054031) 


G protein-coupled 
receptor MRGX3 
[Homo sapiens] 


322 


104/130 
(80%) 


118/130 
(90%) 


2e-53 


gi | 17461239) ref |XP_ 
062249. 1| 
(XM_062249) 


similar to MrgX3 
G protein- coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


322 


103/130 
(79%) 


118/130 
(90%) 


7e-53 


gi | 16876455 | ref |NP_ 

473373.1) 

(NM 054032) 


G .protein- coupled 
receptor MRGX4 
[Homo sapiens] 


322 


97/124 
(78%) 


104/124 
(83%) 


2e-46 



10 The homology between these and other sequences is shown graphically in the 

ClustalW analysis shown in Table 45D. In the ClustalW alignment of the NOV45 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (Le., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 

1 5 can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 45D. ClustalW Analysis of NOV45 

1) Novel NOV45 (SEQ ID NO:170) 
20 2) gi | 15546062 |gb|AAK91804.l| (AY042213) MrgXl G protein-coupled receptor [Homo 
sapiens] (SEQ ID NO:527) 
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3) gi | 17472340 | ref |XP_061650.l| <XM_0616S0) similar to MrgXl G protein-coupled 
receptor (H. sapiens) [Homo sapiens) (SEQ ID NO:S28) 

4) gi | 16876453 | ref |NP_473372.l| <NM_054031) G protein-coupled receptor MRGX3 (Homo 
sapiens] (SEQ ID NO: 529) 

5) gi 1 17461239 I ref |XP_062249 .1 1 (XM_062249) similar to MrgX3 G protein- coupled 
receptor (H. sapiens) [Homo sapiens) (SEQ ID NO:530) 

6) gi | 16876455 | ref |NP_473373 .1 1 (NM_054032) G protein- coupled, receptor MRGX4 [Homo 
sapiens] (SEQ ID NO:531) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



NOV45 


1 


gi 


15546062 


1 


gi 


17472340 


1 


gi 


16876453 


1 


gi 


17461239 


1 


gi 


16876455 


1 



NOV45 


61 


gi 


15546062 


61 


gi 


17472340 


61 


gi 


16876453 


61 


gi 


17461239 


61 


gi 


16876455 


61 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



121 
121 
121 
121 
121 
121 



NOV45 


181 


gi 


15546062 


181 


gi 


17472340 


181 


gi 


16876453 


181 


gi 


17461239 


181 


gi 


16876455 


181 


NOV45 


241 


gi 


15546062 


241 


gi 


17472340 


241 


gi 


16676453 


241 


gi 


17461239 


241 


gi 


16876455 


241 



NOV45 


301 


gi 


15546062 


301 


gi 


17472340 


301 


gi 


16876453 


301 


gi 


17461239 


301 


gi 


16876455 


301 


NOV45 


319 


gi 


15546062 


322 


gi 


17472340 


361 


gi 


16876453 


322 




310 



I 



320 



Sskkaepeagsp egsaBhd 

gLEQ 



330 
..|.. 



340 



|....|....|. 



350 
..|.. 



360 

-I 



319 

322 

301 GTVTAEYRRRNLNAHAHHIGPNFSETLQ^ 360 



ffiflLELSGSRg ^HmWMHIM 


322 


'BeRlelsgsrS? *fl 


322 


ffiHSmaWtefciaiar.flP 


322 



370 



380 



390 



400 



410 



420 
••I 



319 
322 

361 TESQKGPPPLAQGKS SPLDKREKDKGKYQELSSS IVSYGFCTKTVSVFPPRDTRLGFLSV 420 

322 

390 
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60 



gi| 17461239 


322 


322 


gi 


16876455 


322 


322 








430 440 450 460 470 480 
t t 1 1 1 1 i 1 1 1 1 1 


NOV45 


3 19 


319 


gi 


15546062 


322 


322 


gi 


17472340 


421 


DPTT PAT\frrF/r.TPTNRP 48 0 


gi 


16876453 


"3 o*> 

J 44 


3 44 


gi 


17461239 


322 


322 


gi 


16876455 


322 


322 








490 500 510 520 530 540 
1 1 t t 1 1 1 1 1 I 1 1 


NOV45 


J ±P 




gi 


15546062 


322 


322 


gi 


17472340 


•iOl 


QTVT17WT.'^AnT?T.TrT.PQWTTPFPT»GT.T?JTT.^^ ca n 
ox x xr j^xjoi v iMXsr ur uivo.nx xixr xruoxixixxiixir' xrnxxio.tr v i*u*ir 0 ijjrtoiior xjohi*io idkl d»u 


gi 


16876453 




■5 4£ 


gi 


17461239 


322 


322 


gi 


16876455 


322 


322 








550 560 570 580 590 600 
1 1 1 1 1 I 1 1 1 1 1 1 


NOV45 


■SI Q 


31 Q 

-3 x y 


gi 


15546062 


322 


344 


gi 


17472340 


DfiX 


xjxj VxjxWKXx4L.^oi<JtUnr^Xili<^ 1 Hi VJNJwtoxCWJjirJ&B 1 IjoJjSIjoK bUU 


gi 


16876453 


O 44 


loo 
344 


gi 


17461239 


322 


322 


gi 


16876455 


322 


322 








610 620 630 640 650 660 
I I 1 1 1 I I 1 1 1 1 1 


NOV45 


J X*7 


319 


gi 


15546062 


"* *3*3 

34& 


"3*3*1 
344 


gi 


17472340 


bUl 


T?/~*r\ C\ FLXCT T.Ttrf'CT»TDr >, /^/^CT.T MCfT T gTrTrKTT^TT , GOT*TI?TTTg'TJT?TJR/^tJP^TrPeta'*^Tfn^PT flTJT f*» cet\ 

r tj^a VJlLXiWaiiiiWKUuUbJjJjMauXjXj ott.VJNlJx 1 as Wr nXi\xi.&riAtjHir\jVr SRKV iKiiGr IjS 66 0 


gi 


16876453 




322 


gi 


17461239 


322 


322 


gi 


16876455 


322 


322 








670 680 690 700 710 720 
1 t 1 1 1 1 1 1 1 1 1 > 


NOV45 




jxy 


gi 


15546062 


"3*3*3 
344 


"3 *3 *3 


gi 


17472340 


661 


MTiPTT QTT.n*T17T.TOTM'^TPP'TT.r r VTrn*TT.GT.*r T i7T.Tr , T VQT.TfnT.TflKTJV^nrr WT T /T r DVTDDKT71 "7*3 n 
riL/rr X XOXXJLJXiUJXr' XJMoliLCi IXjUXXV^IIjOXjI VX1XL.X VDIjV \jxj x \jx\±\ V VljVyijXJ^UiUaltKXNi* i 4\J 


gi 


16876453 


322 




gi 


17461239 


322 


322 


gi 


16876455 


322 


322 








730 740 750 760 770 780 
1 1 1 1 1 I 1 t 1 1 1 1 


NOV45 


319 


31 Q 


gi 


15546062 


322 


322 


gi 


17472340 


721 


f o x x x Xii.i iinnnu c xic xjovtxiujx x piiiior x ox rni xorcxui jt v 1*111 r 0 x rAuiior Jjonvoliut / 0 U 


gi 


16876453 | 322 


344 


gi 


17461239| 322 


322 


gi 


16876455| 322 


322 








790 800 810 820 830 840 
....|....|....|....|....|....|....|....|....|....|....|....| 


NOV45 


319 


319 


gi 


15546062 


322 


322 


gi 


17472340 


781 


CLSVLWPIWYRCHRPTHLSAWCVflxLW 84 0 


gi 


16876453 


322 


322 


gi 


17461239 


322 


322 


gi 


16876455 


322 


322 



NOV45 

gi 1 15546062 I 



319 
322 



850 



860 



870 



880 



890 



I I 



900 
••I 



319 
322 
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gi 1 17472340 | 841 VAWLIFLCVVLCGSSLVIjLIRILCGSRKIPLTRLyVTILLTVLVFLLCGLPFGIQFFLFL 900 

gi 1 1687 6453 j 322 322 

gi j 17461239 j 322 322 

gi j 16876455 j 322 322 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
1687 6455 



910 920 930 940 950 960 

....|....|....|....|....|....|....|....|....|....|....|.,..| 
319 319 
322 322 
901 WIHVDREVLFCHVHLVSIFLSALNSSANPIIYFFVGSFRQRQNRQNLKLVLQRALQDASE 960 
322 322 
322 322 
322 322 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



970 980 990 1000 1010 1020 

319 ' 319 

322 322 

961 VDEGGGQLPEEIIiELSGSRLEQGTRLGFLSMDPTIPVLGTELTPINGTEETPCYNQTLSF 1020 

322 322 

322 322 

322 322 



NOV45 

gi 
gi 
gi 
gi 
gi 



1030 1040 1050 1060 1070 1080 

....|....|....|....|....|....|....|..-.|..-.|....|....|....| 

319 319 

15546062 | 322 322 

17472340 j 1021 TVLTCIVSLVALTGNAVVLWLLGFRMCRNAVS I Y ILNLVAANFLLLS SHI IHPCYTSS IT 1080 

16876453| 322 322 

17461239| 322 322 

16876455] 322 322 



N0V45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



1090 1100 1110 1120 1130 1140 

....|....|....|....|....|....|....|....|....|....|....|....| 
319 319 
322 322 
1081 PEYSECHEHQALPVNPVAHLPGPSGQDPLWIPEDAATOAVHDHLLTVLVFLLCGLPIGIQ 1140 
322 322 
322 322 
322 322 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



1150 1160 1170 1180 1190 1200 

...-|....|....|....|....|....|..,.|....|....|....|....|....| 
319 319 
322 322 
1141 WALFSRIHMDWEVLYSHVHLPS I FLSSLNSSANP 1 I YFFMGFVRQHQNWQNLKLVLQRDL 1200 
322 322 
322 322 
322 322 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062) 
17472340 j 
16876453) 
17461239| 
16876455) 



1210 3.220 1230 1240 1250 1260 

| | | | | .... | | .... | | | | | 

319 319 

322 322 

1201 QDTPEVDEEQNNVLRYIILYIMYTLDTMSLHKNSLGNWFYGKSSAVICLQNGDMLALVSS 1260 

322 322 

322 322 

322 322 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



1270 1280 1290 1300 1310 1320 

| | | | ^| | | | | | | | 

319 319 

322 322 

1261 KDNHGPFPLRTDIGGTGLLFLAVTAEYRRRNLNAHAHH^ 1320 

322 322 

322 322 

322 322 



1330 
|....|... 



1340 



1350 



1360 



1370 



|....|....|. 



1380 
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NOV45 



10 



15 



20 



25 



30 



35 



40 



9i 

gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



NOV45 

gi | 15546062 
gi j 17472340 
gi | 16876453 
gi | 17461239 
gi | 16876455 



NOV45 



gi 
gi 
gi 
gi 



15546062) 
17472340 | 
16876453 | 
17461239 | 
16876455 



NOV45 

gi 
gi 
gi 
gi 
gi 



15546062| 
17472340 j 
16876453 | 
17461239 j 
16876455 | 



319 319 

322 322 

1321 ISFLSVYVEISPKLHOTKGSSEWESQKGSTSI^ 1380 

322 322 

322 322 

322 322 



1390 



1400 



1410 
..|... 



1420 



1430 



1440 



NOV45 



gi 
gi 
gi 
gi 
gi 



15546062 
17472340 
16876453 
17461239 
16876455 



319 319 
322 322 
1381 GTLWNIiRGTRLGFLSMNPTI P ALDTE I AP I SDTEETHPHRC6MEVLVLI VLILI IDLVG 1440 

322 322 
322 322 
322 322 



1450 



1460 



|....| 



1470 
..|... 



1480 
..|... 



1490 



1500 
...| 

319 319 
322 322 
1441 LAGNAVMLWLLGFCMHSNTFSLYILNLARADFLCTCFQIITFINFFSDEVSSLSIHFS 1500 

322 322 
322 322 
322 322 

1510 1520 1530 1540 1550 1560 

....|....|....|....|....|....|....|....|....|....|....|.-..| 

319 3" 
322 322 
1501 VTTVLFSACITGLSMLSTI STEHRLSVLWPI CSANPI I YFFMGSFRQLQNRKTLKLVLQR 1560 
322 322 
322 322 
322 322 

1570 1580 
....|....|....|....|....|.... 

319 319 

322 322 

1561 ALQDMLEVDEGGGQLPEETLKLSGSRLGP 1589 

322 322 

322 322 

322 322 



45 



50 



Table 45E lists the domain description from DOMAIN analysis results against 
NOV45. This indicates that the NOV45 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 45E Domain Analysis of NOV45 

gnl|Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length » 254 residues, only 55.9% aligned 
Score = 41.2 bits (95), Expect = 9e-05 



NOV44: 165 CVMLWALSLLRSVLE . WSFCDFLFSGADSVWCKTSDFI IVGGLI FLCV 211 

+++| |+|| |+ +| | +11 ++ I I 

Sbjct: 105 ILLVWVLAIjLLSLPPLLFSWLRTVEEGOT 164 

55 
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NOV44 : 212 ALCGSSLVLL VRILCGSRKMPL TRLYVTILLIALVFLLCGLPFGIRFFLFSWN 264 

I + +1 II + + I++ +11+11 ll+lll 

Sbjct: 165 I LVC YTR I LRTLRKRARS QRS LKRRSS SE RKAAKMLL VVVVVFVLCWLP YHI VLLLDSLC 224 

5 NOV44: 265 HVDLEVLYCHVHLVSIFLSSLN 286 
+ + + |++++|+ + | 

Sbjct: 225 LLS I WRVL PT ALL I TL WLAYVN 246 

1 0 The Mas Proto-Oncogene belongs to the family of G-Protein Coupled Receptors.G- 

protein-coupled receptors (GPCRs) constitute a vast protein family that encompasses a wide 
range of functions (including various autocrine, paracrine and endocrine processes). They 
show considerable diversity at the sequence level, on the basis of which they can be separated 
into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of 

15 families for which there are indications of evolutionary relationship, but between which there 
is no statistically significant similarity in sequence [1]. The currently known clan members 
include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal 
mating pheromone receptors, and the metabotropic glutamate receptor family. 

The human mas oncogene was originally detected by its ability to transform NIH 3T3 

20 cells. We previously showed that the protein encoded by this gene is unique among cellular 
oncogene products in that it has seven hydrophobic potential transmembrane domains and 
shares strong sequence similarity with a family of hormone-receptor proteins. We have now 
cloned the rat homolog of the mas oncogene, determined its DNA sequence, and examined its 
expression in various rat tissues. A comparison of the predicted sequences of the rat and 

25 human mas proteins shows that they are highly conserved, except in their hydrophilic amino- 
terminal domains. Our examination of the expression of mas, determined by RNA-protection* 
studies, indicates that high levels of mas RNA transcripts are present in the hippocampus and 
cerebral cortex of the brain, but not in other neural regions or in other tissues. This pattern of 
expression and the similarity of mas protein to known receptor proteins suggest that mas 

30 encodes a receptor that is involved in the normal neurophysiology and/or development of 
specific neural tissues. 

The rhodopsin-like GPCRs themselves represent a widespread protein family that 
includes hormone, neurotransmitter and light receptors, all of which transduce extracellular 
signals through interaction with guanine nucleotide-binding (G) proteins. Although their 

35 activating ligands vary widely in structure and character, the amino acid sequences of the 
receptors are very similar and are believed to adopt a common structural framework 
comprising 7 transmembrane (TM) helices 

The disclosed NOV45 nucleic acid of the invention encoding a Mas Proto-Oncogene- 
like protein includes the nucleic acid whose sequence is provided in Table 45A or a fragment 
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thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 45A while still encoding a protein 
that maintains its Mas Proto-Oncogene -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 40 percent of the bases may be so changed. 

The disclosed NOV45 protein of the invention includes the Mas Proto-Oncogene-like 
protein whose sequence is provided in Table 45B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 45B while still encoding a protein that maintains its Mas Proto-Oncogene-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 32 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Mas Proto-Oncogene -like protein 
(NOV45) is a member of a "Mas Proto-Oncogene family". Therefore, the NOV45 nucleic 
acids and proteins identified here may be useful in potential therapeutic applications 
implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV45 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome , Alzheimer's 
disease, Stroke, Tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's 
disease, Cerebral palsy, Epilepsy,Lesch-Nyhan syndrome, Multiple sclerosis,Ataxia- 

395 



WO 02/068649 PCT/US02/02785 



telangiectasia, Leukodystrophies,Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, and/or other diseases and pathologies. 

NOV45 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV45 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV45 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
1 0 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV46 

NOV46 includes three novel Mas Proto-Oncogene -like proteins disclosed below. The 
disclosed sequences have been named NOV46a and NOV46b. 
15 NOV46a 

A disclosed NOV46a nucleic acid of 997 nucleotides (also referred to as CG56696-01) 
encoding a Mas Proto-Oncogene-like protein is shown in Table 46A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 12-14 and ending with a 
TGA codon at nucleotides 978-980. The start and stop codons are shown in bold in Table 
20 46A, and the 5 5 and 3' untranslated regions, if any, are underlined. 

Table 46A. NOV46a nucleotide sequence (SEQ ID NO:171). 

GGTTTCTGAGCATGGATCCAACCATCTCAACCnTGGACACAGAAC^ 

CTCTTTGCTACAAGC^GACCTTGAGCCTCACG 

ACGCAGTTGTGCTCTGGCTCCTGGGCTGCCGCATGCGCAGGAA 

CCGCAGCAGACTTCCTCTTCCTCAGCGGCCGCCTTATATA . 

CCATCTCTAAAATCCTCTATCCTGTGATGATGTTTTCCTACTTTGCAGGCCTGAGCTTTCTGAGTGCCGTGA 

GCACCGAGCGCTGCCTGTCCGTCCTGTGGCCCATCTGGTACCGCT 

TGGTGTGTGTCCTGCTCTGGGCCCTGTCCCTGCTGCGGAGCATCCTGGAGTGGATGTTATGTGGCTTCCTGT 

TCAGTGGTGCTGATTCTGCTTGGTGTCAAACATCAGATTTCATCACAGTCGCGTGGCTGATTTTTTTATGTG 

TGGTTCTCTGTGGGTCCAGCCTGGTCCTGCTGATCAGGATTCTCTGTGGATCCCGGAAGATACCGCT 

GGCTGTACGTGACCATCCTGCTCAC^GTACTGGTCTTC 

TCCTATTTTTATGGATCCACGTGGACAGGGAAGTCT^ 

CCGCTCTTAACAGCAGTGCCAACCCCATCATTTACTTCTTCGTGGGCTCCTTTAGGCAGCGTCAAAATAGG 

AGAACCTGAAGCTGGTTCTCCAGAGGGCTCTGCAGGACGCGTCTGAGGTGGATGAAGGTGGAGGGC^GCTTC 

CTGAGGAAATCCTGGAGCTGTCGGGAAGCAGATTGGAGCAGTGAGGAAGAGCCTCTGCCCT 



In a search of public sequence databases, the NOV46a nucleic acid sequence, located 
on chromosome 7, has 430 of 705 bases (60%) identical to a gb:GENBANK- 
ID:MMU249895|acc:AJ249895.1 mRNA from Mus musculus (mas proto-oncogene and Igf2r 
25 gene for insulin-like growth factor type 2 and L4 lps and Au76 pseudogenes) (E = 9.3c" 22 ). 
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The disclosed NO V46a polypeptide (SEQ ID NO: 1 72) encoded by SEQ ID NO: 1 7 1 
has 322 amino acid residues and is presented in Table 46B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46a has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46a may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV46a is between positions 45 and 46: 
TGN-AV. 



Table 46B. Encoded NOV46a protein sequence (SEQ ID NO:172). 



MDPTI STLDTELTP INGTEETLCYKQTLSLTVLTCIVSLVGLTGNAVVLWIiLGCRMRBNAFS I YILNLAAAD 
FLFIiS GRL I YSLLS FIS I PHTI S KI L YP VMMFSYFAGLS FLS AVS TERCLSVLWP I WYRCHRPTHLS AWCV 
LLWALSIJjRSILEVMLCGFLFSGADSAWCQTSDF^ 

TILLTVLVFLLCGLP FGI QFFLFLWI HVDRE VLFCHVHLVS I FLS ALNS S ANP 1 1 YFFVGS FRQRQNRQNLK 
LVLQRALQDASEVDEGGGQLPEEILELSGSRLEQ 



10 

A search of sequence databases reveals that the NOV46a amino acid sequence has 110 
of 275 amino acid residues (40%) identical to, and 175 of 275 amino acid residues (63%) 
similar to, the 324 amino acid residue ptnr:SWISSPROT-ACC:P12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = l^e" 45 ). 

15 NOV46b 

In the present invention, the target sequence identified previously, NOV46a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 

20 examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 

25 related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, . 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

30 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 

' uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
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redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
5 were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV46b. This differs from the previously 
identified sequence (NOV46a) in having 2 amino acid changes. 

A disclosed NOV46b nucleic acid of 964 nucleotides (also referred to as CG56696-02) 
encoding a Mas-Related G Protein-Coupled Receptor -like protein is shown in Table 46C. An 
10 open reading frame was identified beginning with a ACC initiation codon at nucleotides 3-5 
and ending with a TGA codon at nucleotides 960-962. The start and stop codons are shown in 
bold in Table 46C, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon is not a traditional initiation codon, NOV46b could be a partial reading frame 
extending further in the 5 5 direction. 



15 



Table 46C. NOV46b nucleotide sequence (SEQ ID NO:173). 



CAACCATCTCAAC CTTGGACACAGAAC TGACAC CAATCAACGG AAC TG AG GAGACT CTTTG CT AC AAG CAGA 
CCTTGAGCCTCACGGTGCTGACGTGC^TCGTTTCCCTTGTCGGGCrGACAGGAAACGCGGTTGTGCTCTGGC 
TCCTGGGCTGCCGCATGCGCAGGAACGCCTTCTCCATCTACATCCrCAACTTGGCCGCAGCAGACTTCCTOT 
TCCTCAGCGGCCGCCTTATATATTCCCTGTTAAGCTTCATCAGTATCCCCCATACC7VTCTCTAAAATCCTCT 
ATCCTGTGATGATGTTTTCCTACTTTGCAGGCCTG 

GGGCCCTGTCCCTGCTGCGGAGCATCCTGGAGTGGATGTTATGTGGCTTTCTGTTCAGTGGTGCTGATTCTG 
CTTGGTGTCAAACATCAGATTTCATCACAGTCGCGTGGCTGATTTTTTTATGTC 

GCCTGGTCCTGCTGATCAGGATTCTCTGTGGATCCCGGAAGATACCGCTGACCAGGCTGTACGTGACCATCC 
CGCTCACAGTACTGGTCTTCCTCCTCTGTGGCCTGCCCTTTGGCATTCAGTTTTTC 

ACGTGGACAGGGAAGTCTTATTTTGTCATGTTCATCTAGTTTCTATTTT CCTGTCCG CTCTTAACAGCAGTG 
CCAACCCCATCATTTACTTCTTCGTGGGCTCCTTTAGGCAGCGTCAAAATAGGCAGAACCTGAAGCTGGTTC 
TCCAGAGGGCTCTGCAGGACGCGTCTGAGGTGGATGAAGGTGGAGGGCAGCTTCCTGAGGAAATCCTGGAGC 
TGTCGGGAAGCAGATTGGAGCAGTGAGG 



In a search of public sequence databases, the NOV46b nucleic acid sequence, located 
on chromosome 1 1, has 494 of 800 bases (61%) identical to a gb:GENBANK- 
ID:AF295365|acc:AF295365.1 mRNA from Mus musculus (G-protein coupled receptor 
GPR90 mRNA, complete cds) (E = 1 .2e~ 28 ). 

20 The disclosed NOV46b polypeptide (SEQ ID NO: 1 74) encoded by SEQ ID NO: 1 73 

has 319 amino acid residues and is presented in Table 46D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46b has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46b may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 

25 reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
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certainty of 0.3000. The most likely cleavage site for NOV46b is between positions 42 and 43: 
TGN-AV. 



Table 46D. Encoded NOV46b protein sequence (SEQ ID NO:174). 

TISTLDTELTPINGTEETLCYKQTLSLTVLTCIVSLVGLTGNAVVLWLLGCRMRRNAFSIYILNLAAADFLF 
LSGRLIYSLLSFISI PHTISKILYPVMMFSYFAGLSSLSAVSTERCLSVLWPIWYRCHRPTHLSAWCVLLW 
ALSLLRSILEWMLCGFLFSGADSAWCQTSDFITVAWLIFLCVVLCGSSLVLLIRILCGSRKIPLTRLYVTIP 
LT VLVFLLCGLPFGI QFFLFLWI HVDRE VLFCHVHLVS I FLS ALNS SANP 1 1 YFFVGS FRQRQNRQNLKLVIi 
QRALQDASEVDEGGGQLPEEILELSGSRLEQ ; 

A search of sequence databases reveals that the NOV46b amino acid sequence has 1 10 
of 275 amino acid residues (40%) identical to, and 174 of 275 amino acid residues (63%) 
similar to, the 324 amino acid residue ptnr:SWISSPROT-ACC:P 12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = 6.8e^ 5 ). 

NOV46c 

A disclosed NOV46c nucleic acid of 1 030 nucleotides (also referred to as CG56698- 
01) encoding a Mas Proto-Oncogene-like protein is shown in Table 46E. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 17-19 and ending 
with a TGA codon at nucleotides 1007-1009. The start and stop codons are shown in bold in 
Table 46E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 46E. NOV46c nucleotide sequence (SEQ ED NO:175). 

ACT AGGGTTT CTGAG C ATG AAT CCAAC CAT CC C AG CCTTGG AT ACAG AAATTGCACCAATTAGTGAT ACAG A 
GGAGACCCATCCTCATCGTTGTGGCATGGAGGTCCTGGTCCTCATAGTGCTGATCCTC^TCATTGACCTGGT 
CGGGCTGGCAGGAAATGCAGTCATGCTCTGGCTCCTGGGCTTCTGCATGCACAGTAACACCTTCTCTCT 
CAT C CT CAAC CTGGCCAGGGC T GACTT CCT CTG CACCTGCTTC C AG ATT AT AACATT CATT AATTT <jtjl 1 CAG 
TGACnTTGTTAGTTCTCTCTCCATCCATTTCTCT 

AGGCCTGAGCATGCTGAGCAC(^TCAGCACCGAGCACCGCCTGTCCGTCCTGTGGCCCATCTGGTACTGCTG 
CCACTGCCCCAC^CACCTGTCAGCGGTCATGTGTGTCCTGCTCrrGGGCCCrrGTCCCTGTTGCAGAGCATCCT 
GGAGTGGATGTTCTGTAGCTTC CTGTTTAGTGATGTTGACT CTGATAATTGGTGTCAAATATTAGATTTCCT 
CACTGCTGTGTGGCTGATTTTTTTATCTGTGGTTCTCTGTGGGTTCACCCTGGTCCTGCTTGTCAGGATCAT ■ 
ATGTGGATCCCAGAAGATGCCGCTGACCAGGCTGTATGTGACCATCCTGCTCAGAGGGCTGGTCTTCCTCTT 
CTGCAGCCTGCCCCTCAGCATTCAGGGATTCCTATTATACT 
TGTTGTTCGTTTAATTTCCATTTTCCTGTCTGCTCITAAC^ 
GGGCTCCTTTAGGCAGCrrTC^AAACAGGAAGACCCTCAAGCIGGTTCTCC^ 

T GAGGTGGATGAAGGTGGAGGGCAG CTTC CTGAGG AAACC CTGAAGCTGT CGGGAAGCAGAT TGGGGCCATG 
AGGAAGAGCCTCTGCCCTGTTA 

15 ^~ " ~ ~ ~ ~ ■ ~ . 

In a search of public sequence databases, the NOV46c nucleic acid sequence, located 
on chromosome 7, has 381 of 621 bases (61%) identical to a gb:GENBANK- 
ID:RATMAS|acc:J03823.1 mRNA from Rattus norvegicus (Rat mas oncogene, complete cds) 
(E = 9.3e 22 ). 

20 The disclosed NOV46D polypeptide (SEQ ID NO: 176) encoded by SEQ ID NO: 175 

has 330 amino acid residues and is presented in Table 46F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46c has a signal peptide and 
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is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46c may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0J000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV46c is between positions 65 and 66: 
5 TFS-LY. 



Table 46R Encoded NOV46c protein sequence (SEQ ID NO:176). 

MNPTI PALDTEIAP I SDTEETHPHRCGMEVLVLI VLILI IDLVGLAGNAVMLWLLGFCMHSNTFSLYILNLA 
RADFLCTCFQIITFINFFSDFVSSLSIHFSRFVTTVLF 

LSAVMCVLLWALSLLQSIIiEWMFCSFLFSDVDSDlMCQILDFLTAVWLIFLSVVLCGFTLVLLVRIICGSQK 
MPLTRL YVTI LLTGL VFL FCS LPLS I QGFLL YWI EKDLDDLP CWRL I S I FLS ALNS S ANP 1 1 YFFMGS FRQ 
LQNRKTLKLVLQRALQDMLEVDEGGGQLPEETLKLSGSRLGP 



A search of sequence databases reveals that the NOV46c amino acid sequence has 1 06 

of 279 amino acid residues (37%) identical to, and 166 of 279 amino acid residues (59%) 

f 

1 0 similar to, the 324 amino acid residue ptnnS WISSPROT-ACC:P 1 2526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = 1 .6c" 45 ). 

NOV46c is predicted to be expressed in at least teratocarcinoma cell. This information 
was derived by determining the tissue sources of the sequences that were included in the 
invention including but not limited to SeqCalling sources, Public EST sources, Literature 
15 sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in hippocampus and brain 
because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:RATMAS|acc:J03823.1) a closely related Rat mas oncogene, complete cds homolog. 

NOV46d 

20 A disclosed NOV46d nucleic acid of 1005 nucleotides (also referred to as CG56702- 

01) encoding a Mas Proto-Oncogene-like protein is shown in Table 46G. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 17-19 and ending 
with a TGA codon at nucleotides 986-988. The start and stop codons are shown in bold in 
Table 46G, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 46G. NOV46d nucleotide sequence (SEQ ID NO:177). 

ACTACCGTTTCTGAGCA TCGATCCAAGCAACCCAGCCTTGGATC 

GGAGACTCCTTGCTACAAGCAGACCCTGAGCCTCATGGGGCTGACGTGCATCATTTCCCTTGTCACGCTGAC 

AGGAAACGCGGTTGTGCTCTGGCTCCTGGGCTTCCGCATGCGCAGGAA 

CCTGGCTGCGGC^GACTTCCTCTTCCTCAGCGGCCACGT^ 

TCATCCCAT CTCCAAAATCCTCATTCCTGTGATGACCT TTCTATACTTTAC AGGCCTGAGCTTTCTGAGTGC 

CAT^GC^CCGAGCGCTGCCTGTGCGTCCTGTGGCCCATCTGGTACCGCTGCCTCCTCCCCACACACCTGTC 

AGCGGTCGTGTGTGTCTTGCTTTGGGCCCTGTCCCTACTGCGGAGCATCCTGGAGGGAATGTTCTGTGACTT 

CCTGTTTAGTGATGCTGATTCH'ATTTGGTGTCAACCATC 

ATGTGTGGTTCTCTGTGGGTCCAGCCTGGTCCTGCTGATTAGGA 

GACCGGGCTGTACGTGACGATCCTGCTCAGAGTGCTAGTCTTCCT^ 

GTGGGCTCTGTCTACTGGGATCCACCTGGATTTGGAAGTCATTTTCTGTCATGTCCATCTAGTTTCCATTTT 
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CCTGTCCCCTCTAAACGGTAGTGCCAACCCCGTCATTTACTTCTTCGTGGGCTCCTTTAGGCAGCGTCAAAA 
TAGGCAGAACCTGAAGCTGGTTCTCCAGAGGGCTCTGCAGGACATGCCTGAGGTGAAGGTGGAAGGTGGAGG 
GCGGCTTCCTGAGGGAACCCTGGAGCTGTCGGGAAGCAGATTCGGGCAGTG AGGAAGAACCTCTGCCCT 



In a search of public sequence databases, the NOV46d nucleic acid sequence, located 
on chromosome 7, has 379 of 632 bases (59%) identical to a gb:GENBANK- 
ID:RATMAS|acc:J03 823.1 mRNA from Rattus norvegicus (Rat mas oncogene, complete cds) 
5 (E-4.7e 14 ). 

The disclosed NOV46d polypeptide (SEQ ID NO:178) encoded by SEQ ID NO:177 
has 323 amino acid residues and is presented in Table 46H using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results, predict that NOV46C has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
10 NOV46d may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV46d is between positions 46 and 47: 
GNA-VV. 



Table 46H. Encoded NOV46d protein sequence (SEQ ID NO:178). 



MDPSNPALDAELT P INRTEET PC YKQTLS LMGLTCI I SL VTLTGNAWLWLLGFRMRRNAVS I Y I LNLAAAD 
FLFLSGHVIHSASIJjINICHPISKILIPVMTFL^ 

LLWALSLLRSILEGMFCDFLFSDADSIWCQPSDFITVVWLIFLCTIA^CGSSLVLLIRILCGSWKMPLTGLYV 
TILLT VIiVFLLRSLPFGI RWALSTGI HLDLEVI FCHVHLVS I FLSPLNGS ANPVI YFFVG S FRQRQNRQNLK 
LVLQRALQDMPEVKVEGGGRLPEGTLELSGSRFGQ 



15 

A search of sequence databases reveals that the NOV46d amino acid sequence has 1 07 
of 275 amino acid residues (38%) identical to, and 167 of 275 amino acid residues (60%) 
similar to, the 324 amino acid residue ptnr:SWISSPROT-ACC:P12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = l.Se" 40 ). 
20 In addition, NOV46d is predicted to be expressed in hippocampus and cerebral cortex 

of the brain because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:RATMAS|acc:J03823.1) a closely related Rat mas oncogene, complete cds homolog. 

NOV46a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 461. 

25. 
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Table 461. BLAST results for NOV46a 



Gene Index/ 
Identifier 


Protein/ O reran ism 


Tinner t*H 

(aa) 


xucuL.1 uy 
(%) 


CUo J. U. J.Vcb 

(%) 




gi | 17472340 |ref|XP 

061650.1) 

(XM_061650) 


similar to MrgXl 
G protein- coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


1589 


322/322 
(100%) 


322/322 
(100%) 


e-172 


gi | 15546062 | gb | AAK9 
1804. l| (AY042213) 


MrgXl G protein- 
coupled receptor 
[Homo sapiens] 


322 


322/322 
(100%) 


322/322 
(100%) 


e-158 


gi| 16876453 |ref |NP 
473372. l| 
(NM 054031) 


G protein- coupled 
receptor MRGX3 
[Homo sapiens] 


322 


269/322 
(83%) 


285/322 
(87%) 


e-131 


gi 1 17461239 |ref|XP 

062249.1) 

(XM_062249) 


similar to MrgX3 
G protein- coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


322 


268/322 
(83%) 


285/322 
(88%) 


e-131 


gi | 16876455 |ref|NP 
473373.1) 
(NM 054032) 


G protein- coupled 
receptor MRGX4 
[Homo sapiens] 


322 


255/320 
(79%) • 


275/320 
(85%) | 


e-119 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 46J. In the ClustalW alignment of the NOV46 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (Le., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 46J. ClustalW Analysis of NOV46 

1) Novel N0V46a (SEQ ID NO: 172) 

2) Novel NOV46b (SEQ ID N0:174) 

3) Novel N0V46C (SEQ ID NO: 176) 

4) Novel N0V46d (SEQ ID NO:178) 

15 5) gi 1 17472340 | ref |XP_061650 . 1 1 (XM_061650) similar to MrgXl G protein- coupled 
receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 53 2) 

6) gi|l5546062|gb|AAK91804.l| (AY042213) MrgXl G protein- coupled receptor [Homo 
sapiens] (SEQ ID NO: 533) 

7) gi 1 16876453 | ref |NP_473372.l| (NM_05403l) G protein- coupled receptor MRGX3 [Homo 
2AJ sapiens] (SEQ ID N0:534) 

8) gi| 17461239 I ref |XP_062249.1 1 (XM_062249) similar to MrgX3 G protein- coupled 
receptor (H. sapiens) [Homo sapiens] (SEQ ID NO:535) 

9) gi 1 16876455 | ref |NP_473373 . 1 1 (NM_054032) G protein- coupled receptor MRGX4 [Homo 
sapiens] (SEQ ID NO:536) 

25 

10 20 30 40 50 60 

, ::::!:::±:::!::::!:::±:::!::::!::::!:;::!;;:;!::::!::;:J , 

NOV46b 1 _._ % 

30 NOV46C 1 1 

NOV46d l x 

gi 1 17472340 1 1 MTQTTLHSHLPADPCTGNHRAVQYFDKHQASLQLLQAPPDTRLPFIiSVDPSNPALDAELT 60 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



gi 
gi 
gi 
gi 



15546062 
168764S3 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV4 6a 
NOV46b 
NOV46C 
N0V46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
N0V46b 
NOV46c 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
KOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46c 



1 1 

1 1 

1 1 

1 1 

70 80 90 100 110 120 

....|....|....|....|....|....|....|...-|.....|....|....|....| 

1 --- 1 

1 1 

1 1 

1 " — 1 

61 PINRTEETPCYKQTLSLMGLTCI I SLVTLTGNAVVLWLLGFRMRRNAVSI YILNLAAADF 120 

1 " 1 

1 " 1 

1 1 

1 - -- - 1 

130 140 150 160 170 180 

,...|....|....|....|....|....|....|....|....|....|....|....| 

1 - -- - "-- 1 

1 - 1 

1 1 

1 1 

121 LFLSGHVIHS AS LLINI CHP I SKI LI PVMTFL YFTGLS FLS AMSTERCLCVLWP I CLVLL 180 

1 1 

1 " 1 

1 - " " 1 

1 1 

190 200 210 220 230 240 

....|....|....|....|....|....|...-|....|....|....|-...|...-| 

1 - " 1 

1 1 

1 1 

1 1 

181 IRILCGSWKMPLTGLYWILLTVLWLLRSLPFGIRWALSTGIHLDIiEVIFCHVHLVS I F 240 

1 1 

1 1 

1 1 

1 - - 1 

250 260 270 280 290 300 

....|....|....|....|....|....|....|....|....|....|..-.|...-| 

1 --- -- 1 

1 - " 1 

1 -- - 1 

1 1 

24 1 LSPLNGSANPVIYFFVGSFRQRQNRQ^KLVLQRALQDMPEVKVEGGFLREPWSCREADS 3 00 

1 - 1 

1 1 

1 1 

1 1 

310 320 330 340 350 360 

....|....|....|.,..|....|....|....|....|....|....|....|....| 

1 1 

1 - - - -- 1 

1 - 1 

1 - 1 

301 GTWAEYRRRNLNAHAHHIGPNFSETLQDENTRKESRDRADLIPECASPELSNTEESSEV 360 

1- -- - 1 

1 - 1 

1 - 1 

1 -- 1 

370 380 390 400 410 420 

....|....|-...|....|..'..|....|....|....|...-|...-|....|.-..| 

1 - 1 

1 - 1 

1 - 1 
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NOV46d 



gi 


17472340 


361 


gi 


15546062 


1 


gi 


16876453 


1 


gi 


17461239 


1 


gi 


16876455 


1 


NOV46a 


1 


NOV46b 


1 


NOV46C 


1 


NOV46d 


1 


gi 


17472340 


421 


gi 


15546062 


1 


gi 


16876453 


1 


gi 


17461239 


1 


gi 


16876455 


1 



TES QKGP P PLAQGKS S PLDKREKDKGKYQELS S S I VSY G FC Y K I VS VFP PRDTRLGFLS V 420 

1 

x 

x 

" 1 



430 



440 



450 



460 



470 



480 
•-I 



1 

1 

1 

1 

DPTI PALGTELTPINGREETPCYKQTLSLTGLTCI VSLVGMTGNAWLWLLGFRMRRNAF 480 
----- -- - l 

; i 

_ - i 

" - r- 1 



490 



NOV46a 
NOV46b 
NOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 | 
15546062 j 
16876453 j 
17461239 | 
16876455) 



■I-... I. 



500 
..|.. 



510 
..|.. 



520 



530 
..|.. 



540 
••I 



1 

- 1 

" 1 

2 

81 S I YI FNLSMADFL FLRSHI I RFPLSLINI LHP IFKI LS P VMM FS YLAS LS FLS AM STERC 54 0 

_ - - 1 

x 

-- 1 

__ i 



NOV46a 
NOV46b 
NOV46c 
NOV46d 



gi 


17472340 


gi 


15546062 


gi 


16876453 


gi 


17461239 


gi 


16876455 


NOV46a 


NOV46b 


NOV46C 


NOV46d 


gi 


17472340 


gi 


15546062 


gi 


16876453 


gi 


17461239 


gi 


16876455 



1 
1 
1 
1 

541 

1 

1 

1 

1 



550 
|....|.. 



560 



570 
..|.. 



580 



590 



600 



± 

_ 1 

x 

x 

LLVLLVRILCGSRKMPLTRLQCQNRQNLKLVLQRALQDTTEVNEGGRWLPEETLELSGSR 600 

1 

r 

1 

_ i 



610 
..|.. 



620 



630 



640 



650 



660 



|....|....| 



601 FGQSVHCLWEEEWRGCGSliMSCLLSKVNDITSSWFHIKEEHAGHPGVFSRKVTRliGFLS 



660 



670 



680 



NOV46a 
NOV46b 
NOV46C 
NOV46d 
gi) 17472340 | 
gi | 15546062 | 
gi 1 16876453 j 
gi | 17461239 | 
gi j 16676455 j 




I 



ldteltpingteet 
ldteltpingteet 

ld^eltpi^teet 
ldteltpingteet 
ldteltpingteet 
lEteltpingEeet 
lsteltpingBeet 



690 



L 



700 



I 



710 



I, 



720 ' 
.-I 



L--- 
3PHP 



o 



cykqtlsltvltcivslvgltgnawlwllgcrmr 
c ykqt lsltvltcivslvgltgnawlwllgcrmr, 
cgJ^jLELgvLEJIi SJlvgl^gnavJJlwllgRSi^ 
cykqtlsl^ltciBslvJJltgnawlwllgSrmr 
cykqtlsltvltcivslvgltgnawlwllgcrmr 
cykqtlsltvltcivslvgltgnawlwllgcrmr 
cykqtlsBtHltcivslvSltgnawlwllgcrmr 



CYKQTLSO 



Jltgnawlwllgcrmr 



T^TPINGgEET^HCYSiQTLsBTVLTClBSLVGLTGNAVVLWLLGBRMR 



57 

54 

60 

57 

717 

57 

57 

57 

57 



NOV46a 



58 



J_ 



73 0 



1 



740 



tSIYILNLAAADFLFLSGSil 



750 



760 
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NOV46b 
NOV46C 
NOV46d 



NOV46a 
NOV4 6b 
NOV46C 
NOV46d 



55 
61 
58 



gi 


17472340 


/ID 


gi 


15546062 


58 


gi 


16876453 


fl 


gi 


17461239 


58 


gi 


16876455 


58 


NOV46a 


114 


NOV46b 


111 


N0V46C 


121 


NOV46d 


114 


gi 


17472340 


774 


gi 


15546062 


114 


gi 


16876453 


114 


gi 


17461239 


114 


gi 


16876455 


114 



NOV46a 


173 


NOV46b 


170 


N0V46C 


181 


NOV46d 


173 


gi 


17472340 


833 


gi 


15546062 


173 


gi 


16876453 


173 


gi 


17461239 


173 


gi 


16876455 


173 



233 
230 
241 
233 



gi 


17472340 


893 


gi 


15546062 


233 


gi 


16876453 


233 


gi 


17461239 


233 


gi 


16876455 


233 


NOV46a 


293 


NOV46b 


290 


NOV46C 


301 


NOV46d 


293 


gi 


17472340 


953 


gi 


15546062 


293 


gi 


16876453 


293 


gi 


17461239 


293 


gi 


16876455 


293 




850 



860 
..I.- 



870 



880 



890 



900 



CQTSDFITVAWLIFLCWLCGSSLVLLQRILCGSRKg 

cqtsdfitvawliflcvvlcgsslvllHrilcgsrkH; 

CQMDI^T^WLIFLgWLCGggLVLLSRigCGSgKB 

cq^dfitvbwliflcvvlcgsslvllBrilcgsSkS 
cqtsdfitvawliflcwlcgsslvllmrilcgsrkg 

CQTSDFITVAWL IFLCVVLCGS SLVLLMRILCGSRKm 

cgtsdfit^wlwflcvvlcgsslvllsrilcgsrks 
cotsdfit^wl^lcwlcgsslvllBrilcgsrkS. 
cStsdfiBvawliflcvvlcBsslvllSrilcgsrkS" 



Z VVLCffiS SLVLLgRILCGSRP 



pltrlyvtilltvlvfllcglpf 
pltrlyvtilltvlvfllcglpf 

pltglyvtilltvlvfll^lpf 
pltrlyvtilltvlvfllcglpf 

PLTRLYVTILLTVLVFLLCGLPF 
PLTRLYVTILLTVLVFLLCGLPF 
PLTRLYVTILLTVLVFLLCGLPF' 
PLTRLYVTILLTVLVFLLCGLPF 



232 
229 
240 
232 
892 
232 
232 
232 
232 




920 



I. 



930 
..I.. 



940 



I 



950 



960 



IHgDgEVLFCHVHLVSIFLSALNSSANPIIYFFVGSFRQRQNRQNLKLVLQ 
IHgD^EVLFCHVHLVS IFLSALNSSANPII YFFVGSFRQRQNRQNLKLVLQ 
I iffi D S3S L S c B v S Ij i3 s 1 FL s ALNS s MP 1 I YFF.jGS FRQJJqNRJjIJl KLVLQ 
I^dReVuFCHVHLVS I FLS^LN^S ANPjjl YF3?VGS FRQRQNRQNLKLVLQ 



ih|d|evlfchvhlvsiflsalnssanpiiyffvgsfrqrqnrqnlklvlq 
ihSd^vlfchvhlvsiflsalnssanpiiyffvgsfrqrqnrqnlklvlq 
ihgd^vlfchvhlvs iflsalnssanpi i yfe^vgs frqrqnrqnlklvlq 
ifJjdSJvlfchvklvsiflsalnssanpiiyffvgsfrqrqnrqnlklvlq 



VLQCHVgLVg 



3L SgLNSS ANP I IYFFVGS FRQRQNRQNLKLVLQ 



292 
289 
300 
292 
952 
292 
292 
292 
292 




1000 



1010 



1020 



-- -- 322 

- 319 

330 

323 

gGTRLGFLSMDPTIPVLGTELTPINGTEET 1011 

322 

322 

322 

322 



1030 1040 1050 1060 1070 1080 

....|....|....|....|....|,...|....|....|....|....|....|....| 

NOV46a 322 322 

NOV46b 319 319 

NOV46C 330 330 

NOV46d 323 323 

gi 17472340 | 1012 PC YNQTL S FTVLTC I VSL VM*TGNAVVLWLI^FRMCRNAVS I YI LNLVAANFLLLSSHI I 1071 

gi 15546062 | 322 - - 322 

gi 16876453) 322 322 

gi 17461239 | 322 322 

gi 16876455| 322 322 



1090 



1100 



1110 



1120 



1130 



1140 
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NOV46a 
NOV46b 
NOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46c 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46c 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46C 
NOV45d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46C 
NOV46d 
gi 1 17472340 
gi) 15546062 
gi j 16876453 
gi j 17461239 
gij 16876455 



NOV46a 
NOV46b 
N0V46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



....|....|....|....|....|....|....|....|....|....|....|....| 

322 322 

319 - 319 

330 - 330 

323 323 

1072 HPCYTS S ITPEYSECHEHQALPVNPVAHLPGPSGQDPLWI PEDAADQAVHDHLLTVLVFL 1131 

322 --- 322 

322 - - - 322 

322 322 

322 322 

1150 1160 1170 1180 1190 1200 

....|....]....|...-|....|....|....|...,|....|....|....|....| 

322 <- 322 

319 — 319 

330 -- - 330 

323 -r - 323 

1132 LCGLPIGIQWALFSRIHMDWEVLYSHVHLPS I FLSSLNS SANP I IYFFMGFVRQHQNWQN 1191 

322 322 

322 - 322 

322 - --- -- — 322 

322 r 322 

1210 1220 1230. 1240 1250 1260 

....|....|....|...-|....|....|....|....|....|....|....|....| 

322 - 322 

319 -"- - 319 

330 - 330 

323 — - 323 

1192 LKLVLQRDLQDTPEVDEEQNNVLRYI ILYIMYTLDTMSLHKNSLGNWFYGKS SAVICLQN 1251 

322 r - 322 

322 - -- -- 322 

322 - 322 

322 --- - - 322 

1270 1280 1290 1300 1310 1320 

322 322 

319 . 319 

330 — - - — 330 

323 - - 323 

1252 GDMIALVSSKDiraGPFPLRTDIGGTGLLFIAVTAEYRlUlNLNAHAHHIGPIVLKPCRMQT 1311 

322 - " --- 322 

322 - - - 322 

322 - — - 322 

322 — - 322 

1330 1340 1350 1360 1370 1380 

..-.|...-|..-.|....|....|....|....|....|....|....|....|....| 

322 322 

319 - 319 

330 — - - 330 

323 - - - --- 323 

1312 QGRI^KTQWISFLSVYVEISPKLHNTKGSS'EOTESQKGSTSLARGSTSSTLDRIUIRKDAQ 1371 

322 --- - 322 

322 - - r 322 

322 — - 322 

322 - - 322 

1390 1400 1410 1420 1430 1440 

.-..|....|....|....|....|....|....|....|....|....|....|....| 

322 322 

319 — " - - 319 

330 - - - - 330 

323 - — - - 323 

1372 QSHIEPHFKGTLVVNLRGTRLGFLSMNPTIPALDTEIAPISDTEETHPHRCGMEVLVLIV 1431 

322 - - 322 

322 322 

322 — - 322 

322 322 
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10 



15 



20 



25 



30 



35 



NOV46a 
NOV46b 
NOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



NOV46a 
NOV46b 
NOV46C 
NOV46d 
gi [17472340 
gi | 15546062 
gi I 16876453 
gi | 17461239 
gi j 16876455 



NOV46a 
NOV46b 
NOV46C 
NOV46d 



gi 
gi 
gi 
gi 
gi 



17472340 
15546062 
16876453 
17461239 
16876455 



1450 1460 1470 1480 1490 1500 

322 - 322 

319 - 319 

330 --- 330 

323 323 

1432 LILIIDLVGIAGNAVMLWLLGFCMHSOTFSLYI 1491 

322 - 322 

322 322 

322 322 

322 --- 322 

1510 1520 1530 1540 1550 1560 

....|....|....|....|....|....]..,.|....|....|....|....|....| 

322 - 322 

319 319 

330 - 330 

323 -- 323 

1492 SLSIHFS RFVTTVLFSAC ITGLSMLSTI STEHRLSVLWPICS ANP 1 1 YFFMGSFRQLQNR 1551 

322 - --- 322 

322 322 

322 - 322 

322 322 

1570 1580 1590 

....|....|....|....|....|....|....|... 

322 - 322 

319 319 

330 ~ --- 330 

323 323 

1552 KTLKLVLQRALQDMLEVDEGGGQLPEETLKLSGSRLGP 1589 

322 - - 322 

322 322 

322 - - 322 

322 -- 322 



Table 46K lists the domain description from DOMAIN analysis results against 
40 NOV46. This indicates that the NOV46 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 46K Domain Analysis of NOV46 

gnl |Pfam|pfam00001, 7tm_l, 7 transmembrane receptor {rhodopsin 

family). (SEQ ID NO:810) 

CD-Length = 254 residues, 37.8% aligned 

Score = 51.2 bits (121), Expect = 9e-08 



GNAWLWLLGORMR- RNAFS I YI LNLAAADFLFLSGRLI YSLLSFIS IPHTISKIL 98 

|| + | + ++ + |. + | ++ |||| || I]) ++ | + + | ++ 

GNLLVILVILRTKKIiRTPTNIFLLNIA^ 60 

N0V53 : 99 YPVMMFSYFAGLSFLSAVSTERCLSVLWPIWYRCHR 134 
+ + + + | + |+| + | + | | +++ | + || | 

Sbjct: 61 GALFWNGYAS ILLLTAI S IDRYLAI VHPLRYRR IR 96 



The Mas Proto Oncogene belongs to the family of G-Protein Coupled Receptors.G- 
protein-coupled receptors (GPCRs) constitute a vast protein family that encompasses a wide 
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range of functions (including various autocrine, paracrine and endocrine processes). They 
show considerable diversity at the sequence level, on the basis of which they can be separated 
into distinct groups. The currently known clan members include the rhodopsin-like GPCRs, 
the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the 
5 metabotropic glutamate receptor family. 

The rhodopsin-like GPCRs themselves represent a widespread protein family that 
includes hormone, neurotransmitter and light receptors, all of which transduce extracellular 
signals through interaction with guanine nucleotide-binding (G) proteins. Although their 
activating ligands vary widely in structure and character, the amino acid sequences of the 

1 0 receptors are very similar and are believed to adopt a common structural framework 

comprising 7 transmembrane (TM) helices The human mas oncogene was originally detected 
by its ability to transform NIH 3T3 cells. We previously showed that the protein encoded by 
this gene is unique among cellular oncogene products in that it has seven hydrophobic 
potential transmembrane domains and shares strong sequence similarity with a family of 

15 hormone-receptor proteins. We have now cloned the rat homolog of the mas oncogene, 
determined its DNA sequence, and examined its expression in various rat tissues. A 
comparison of the predicted sequences of the rat and human mas proteins shows that they are 
highly conserved, except in their hydrophilic amino-terminal domains. Our examination of the 
expression of mas, determined by RNA-protection studies, indicates that high levels of mas 

20 RNA transcripts are present in the hippocampus and cerebral cortex of the brain, but not in 
other neural regions or in other tissues. This pattern of expression and the similarity of mas 
protein to known receptor proteins suggest that mas encodes a receptor that is involved in the 
normal neurophysiology and/or development of specific neural tissues. 

The disclosed NOV46 nucleic acid of the invention encoding a Mas Proto-Oncogene- 

25 like protein includes the nucleic acid whose sequence is provided in Table 46A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 46A while still encoding a protein 
that maintains its Mas Proto-Oncogene -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 

30 are complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
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derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 40 percent of the bases may be so changed. 
5 The disclosed NOV46 protein of the invention includes the Mas Proto-Oncogene-like 

protein whose sequence is provided in Table 46B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 46B while still encoding a protein that maintains its Mas Proto-Oncogene-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 

10 variant protein, up to about 21 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Mas Proto-Oncogene -like protein 
(NOV46) is a member of a "Mas Proto-Oncogene family". Therefore, the NOV46 nucleic 

15 acids and proteins identified here may be useful in potential therapeutic applications 

implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

20 delivery/gene ablation); research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV46 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome , Alzheimer's 
disease, Stroke, Tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, 

25 Cerebral palsy, Epilepsy,Lesch-Nyhan syndrome, Multiple sclerosis,Ataxia- 

telangiectasia,Leukodystrophies,Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, and/or other diseases and pathologies. 

NOV46 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV46 substances for use in 

30 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV46 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
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understanding^ pathology of the disease and development of new drug targets for various 
disorders. 

NOV47 

A disclosed NOV47 nucleic acid of 523 nucleotides (also referred to as AF152363) 
5 encoding a Peptidyl-Prolyl Cis-Trans Isomerase -like protein is shown in Table 47A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 1 7-1 9 and 
ending with a TAA codon at nucleotides 509-5 1 1 . The start and stop codons are shown in 
bold in Table 47A, and the 5* and 3' untranslated regions, if any, are underlined. 

Table 47A. NOV47 nucleotide sequence (SEQ ID NO:179). 

CCCCGTATTACCAGCTA TGGTCAACCCCACTGTTTTCTTCGAC^ 
CGTCTCCTTCGAGCTGTTTGCAGACAAGCTTCCAAAGACAGC 

AAAAGGATTTGATTATGAGGGTTACTGCTTTCACAGAATTATTCCAGGGTTTGTATGTCAGGGTGGTGACTT 
C^CATGCCATAATGGCACTGGTAGCAAGTCCATCTACAGGGAGAAATTTGATGACGAGAACTTCATCCTC 
GCATAC AGGTCCTGGCATCCTGTCCATGGCAAATGCTGGAC CCAACGCAAATGGTTC CCAGTTTTT CATGTG 
CCCTGCC^^GACCAAGTGGTTGGATGGCAAGCAAGTGGTCTTT 

GGAGGCCATGGAGCGCTTTGTGTTCAGGAATGGCAAGACTAGCAAGAAGGTCACTATTGCTGACTGTGGACA 
G CTCTAATAAGTTT GACTT 

10 In a search of public sequence databases, the NOV47 nucleic acid sequence, located on 

chromosome 3, has 523 of 523 bases (100%) identical to a gb:GENBANK- 
ID:AF152363|acc:AF152363.1 mRNA from Homo sapiens (constitutive fragile region FRA3B 
sequence) (E = 5.7e~ lu ). 

The disclosed NOV47 polypeptide (SEQ ID NO: 1 80) encoded by SEQ ID NO: 1 79 has 

15 164 amino acid residues and is presented in Table 47B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV47 has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.6400. Alternatively, 
NOV47 may also localize to the plasma membrane with a certainty of 0.6000, to the 
cytoplasm with a certainty of 0.4500, or to the mitochondrial matrix space with a certainty of 

20 0.1000.' 



Table 47B. Encoded NOV47 protein sequence (SEQ ID NO:180). 

MVNPTVFFDI AVNSEPLGCVS FELFADKLPKTAENFHALSTGEKGFDYEGY CFHRI I PGFVCQGGDFTCHNG 
TGSKSIYREKFDDENFILKOTGPGILSMANAGPNANGSQFFMCPAKTO 

FVFRNGKTSKKVTIADCGQL 

A search of sequence databases reveals that the NOV47 amino acid sequence has 141 
of 164 amino acid residues (85%) identical to, and 151 of 164 amino acid residues (92%) 
25 similar to, the 165 amino acid residue ptnr:pir-id:CSHUA protein from human (peptidylprolyl 
isomerase (EC 5.2.1.8) A) (E = 5.6e' 75 ). 
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NOV47 is predicted to be expressed in small intestine because of the expression 
pattern of (GENB ANK-ID: gb:GENBANK-ID:E02765|acc: E02765.1) a closely related Sus 
scrofa peptidyl-prolyl cis-trans isomerase A sequence homolog. 

NOV47 also has homology to the amino acid sequences shown in the BLASTP data 
5 listed in Table 47C. 



Table 47C. BLAST results for NOV47 






Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 17440554 |ref |XP_ 
067503. 1| 
(XM_067503) 


similar to 
PEPTIDYL-PROLYL 
CIS -TRANS 
ISOMERASE 
A(PPIASE) 
(ROTAMASE) 
(CYCLOPHILIN A) 
(CYCLOSPORIN A- 
BINDING PROTEIN) 
(H. sapiens) 
[Homo sapiens] 


164 


164/164 
(100%) 


164/164 
(100%) 


3e-91 


gi| 12804335 |gb|AAH0 

3026.l|AAH03026 

(BC003026) 


Unknown (protein 
for 

IMAGE:2823490) 
[Homo sapiens] 


174 


141/164 
(85%) 


151/164 
(91%) 


3e-76 


gi|4033689|sp|P0437 
4 | CYPHJBOVIN 


PEPTIDYL- PROLYL 
CIS -TRANS 
ISOMERASE A 
(PPIASE) 

(ROTAMASE) (CYCLOP 
HILIN A) 
(CYCLOSPORIN A- 
BINDING PROTEIN) 


164 


141/164 
(85%) 


151/164 
(91%) 


le-75 


gi| 10863927 |ref |NP_ 

066953.1] 

(NM_021130) 


pept idylprolyl 
isomerase A 
(cyclophilin A) 
[Homo sapiens] 


165 


141/164 
(85%) 


151/164 
(91%) 


2e-75 


gi| 68401 |pir| |CSBOA 
B 


pept idylprolyl 
isomerase (EC 
5.2.1.8) A - 
bovine 


163 


140/163 
(85%) 


150/163 
(91%) 


5e-75 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 47D. In the ClustalW alignment of the NOV47 protein, as 
1 0 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that maybe required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 47D. ClustalW Analysis of NOV47 

1) Novel NOV47 (SEQ ID NO:180) 

2) gi 1 17440554 | ref |XP_067503 . 1 | (XM_067503) similar to PEPTIDYL- PROLYL CIS-TRANS 
ISOMERASE A(PPIASE) (ROTAMASE) (CYCLOPHILIN A) (CYCLOSPORIN A- BINDING PROTEIN) (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 537) 

3) gi|l2804335|gb|AAH03026.l|AAH03026 (BC003026) Unknown (protein for 
IMAGE: 2823490) [Homo sapiens] (SEQ ID NO:538) 

4) gi| 4033689 |sp|P04374|CYPH_BOVIN PEPTIDYL- PROLYL CIS-TRANS ISOMERASE A (PPIASE) 
(ROTAMASE) (CYCLOPHILIN A) (CYCLOSPORIN A- BINDING PROTEIN) (SEQ ID N0:539) 

5) gi | 10863927 | ref |NP_066953 . 1 | (NM_021130) peptidylprolyl isomerase A (cyclophilin 
A) [HOJDO sapiens] (SEQ ID NO: 540) 

6) gi|6840l|pir) |CSB0AB peptidylprolyl isomerase (EC 5.2.1.8) A - bovine (SEQ ID 
NO:541) 



NOV47 


1 


gi 


17440554| 


1 


gi 


12804335) 


1 


gi 


4033689) 


1 


gi 


10863927) 


1 


gi 


68401) 


1 




NOV47 


52 


gi 


17440554) 


52 


gi 


12804335) 


61 


gi 


4033689| 


52 


gi 


10863927) 


52 


gi 


68401) 


51 



70 
.1.. 



80 



90 



100 



110 



120 



CFHRIIPGFjSCQGGDFTffiHNGTGgKSIVgEKFDDENFILKHTGPGILSMANAGPNgNGSQ 
CFHRI I PGF|CQGGDFt5hNGTG§KS I YgEKFDDENFI LKHTGPGI LSMANAGPNgNGSC 

cfhri i pgfmcqggdftrhngtggksi ygekf^denfilkhtgpgilsmanagpntngsg 
cfhri ipgfmcqggdftrhngtggksiygekfddenfilkhtgpgilsmanagpntngsq 
cfhriipgfmcqggdftrhngtggksiygekf|denfilkhtgpgilsmanagpntngsq 
cfhriipgfmcqggdftrhngtggksiygekfddenfilkhtgpgilsmanagpntngsq 



111 
111 
120 
111 
111 
110 



NOV47 


112 


gi 


17440554| 


112 


gi 


12804335 | 


121 


gi 


4033689| 


112 


gi 


10863927 | 


112 


gi 


68401) 


111 



130 
..I.. 



140 



150 



160 



170 



F F;sjCHaKT j»j W LD G KSBw FGpj V K E GMjj^I VE AM E R F jjy RNG KTS KKgjjT I ADCGQLB 
FFgCgAKl^WLDGK§WFGgVKEGM^I VEAMERfJhRNGKTS KKgjT I ADCGQlH 

ffictaktewldgkhwfgkvkegmniveamerfgsrngktskkitiadcgqlB 
ffictaktewldgkhwfgkvkegmniveamerfgsrngktskkitiadcgqgi 
f f i ctaktewldgkhwfgkvkegmni ve ame rfgs rngkts kki t i adcgqlg 
ff i ctaktewldgkhwfgkvkegmni veamerfg3rngkts kkiti adcgqpb 



Table 47E lists the domain descriptions from DOMAIN analysis results against 
NOV47, This indicates that the NOV47 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 47E Domain Analysis of NOV47 

gnl|Pfam|pfam00160, pro_isomerase, Cyclophilin type peptidyl-prolyl 
cis-trans isomerase (SEQ ID NO: 839) 
CD-Length » 162 residues, 100.0% aligned 
Score m 219 bits (558) , Expect = le-58 



NOV47: 5 TVFFDIAVNSEPLGCVSFELFADKLPKTAENFHALSTGEKGFD - YEGYCFHRI IPGFVCQ 63 

i+iii + +iii + i i i i i +iii iii ii iiiiii i+i i i i*i i i+ i 

Sbjct: 1 KVY FDI T IGGKPLGRI VFELFGDWPKTVENFRALCTGEKGFGFYKGSTFHRVI PNFMI Q 60 
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NOV47 * 64 GGDFTCHNGTGS KS I YREKFDDENFILKHTGPGILSMANAGPNANGSQFFMCPAKTKWLD 123 

lilll 1 1 1 1 1 1 1 1 III 1 1 1 1 III II minimi mm* MM 

Sbjct : 61 GGDFTRGNGTGGKS I YGE KFKDENFNLKHDRP GTLSMANAGPNTNGSQF FITT VATPWIJD 120 
5 N0V47: 124 GKQWFGRVKEGMDIVEAMERFVFRNG- KTSKKVTIADCGQL 164 

ii 1 1 1 1- 1 + i +i- ii i mini 

Sb j ct : 121 GKHVVFGKVVEGMDVVDKIENVGTDSGDVPSKDVKIADCGQL 1 62 

Cyclophilin A (peptidyl-prolyl cis-trans isomerase A) is the major high-affinity 

10 binding protein in vertebrates for the immunosuppressive drug cyclosporin A (CSA). Because 
of its dramatic effects on decreasing morbidity and increasing survival rates in human 
transplants, the molecular mechanism of immunosuppression by cyclosporin A has been a 
matter of much interest. Cyclophilin A is a member of the immunophilin class of proteins that 
all possess peptidyl-prolyl cis-trans isomerase (PPIase) activity and, therefore, are believed to 

15 be involved in protein folding and/or intracellular protein transport. PPIase accelerates protein 
folding by catalyzing the cis-trans isomerization of proline imidic peptide bonds in 
oligopeptides. It is probable that CSA mediates some of its effects via an inhibitory action on 
PPIase. Cyclophilin is a cytosolic protein that belongs to a family of isozymes, including 
cyclophilins B and C, PPIase, and natural killer cell cyclophilin-related protein. The sequences 

20 of the different forms of cyclophilin-type PPIases are well conserved. Additional interest in 
cyclophilin A stems from studies performed by Luban et al. (1993), who showed that 
cyclophilin A binds to the gag protein of human immunodeficiency virus type 1 (HIV-1). This 
interaction can be inhibited by the immunosuppressant cyclosporin A and also by 
nonimmunosuppressive, cyclophilin A-binding cyclosporin A derivatives, which were also 

25 shown to exhibit potent anti-HIV-1 activity. Thus, cyclophilin A may have ah essential 
function in HIV-1 replication. 

The disclosed NOV47 nucleic acid of the invention encoding a Peptidyl-Prolyl Cis- 
Trans Isomerase-like protein includes the nucleic acid whose sequence is provided in Table 
47A or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

30 whose bases may be changed from the corresponding base shown in Table 47A while still 
encoding a protein that maintains its Peptidyl-Prolyl Cis-Trans Isomerase -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 

35 invention additionally includes nucleic acids or nucleic acid fragments, or complements 

thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
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chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 percent of the bases may be so 
changed. 

5 The disclosed NOV47 protein of the invention includes the Peptidyl-Prolyl Cis-Trans 

Isomerase-Iike protein whose sequence is provided in Table 47B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 47B while still encoding a protein that maintains its Peptidyl-Prolyl 
Cis-Trans Isomerase-like activities and physiological functions, or a functional fragment 
1 0 thereof. In the mutant or variant protein, up to about 1 5 percent of the residues may be so 
changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Peptidyl-Prolyl Cis-Trans 
1 5 Isomerase -like protein (NOV47) is a member of a "Peptidyl-Prolyl Cis-Trans Isomerase 
family". Therefore, the NOV47 nucleic acids and proteins identified here may be useful in 
potential therapeutic applications implicated in (but not limited to) various pathologies and 
disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
20 (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 

marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 
and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The NOV47 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in inflammatory bowel disease, diverticular disease, and/or 
25 other diseases and pathologies. 

NOV47 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV47 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
30 NOVX Antibodies" section below. The disclosed NOV47 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 



414 



WO 02/068649 



PCT/US02/02785 



NOV48 

NOV48 includes three novel Phospholipase C Delta-4-like proteins disclosed below. , 
The disclosed sequences have been named NOV48a and NOV48b. 
NOV48a 

5 A disclosed NOV48a nucleic acid of 3238 nucleotides (also referred to as CG56743- 

01) encoding a Phospholipase C Delta-4-like protein is shown in Table 48 A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 370-372 and 
v ending with a TGA codon at nucleotides 2626-2628. The start and stop codons are shown in 
bold in Table 48A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 48A. NOV48a nucleotide sequence (SEQ ID NO: 181). 

TTCTGATCTTTATGTGATTTGAGGCAGGTTTCTAAACCTATCTAAAGTGTC^ 

AG AAGC AAAAATCAGCTACAGACTAT CTTCAAGATTCACCC AG AGC C CT TT G CT CTT C CTTGCT C CTTT AGG 
TGATCTGGTGCCAGCTGGTGGAACAGTGGGTGATGGCGTC^ 

GCAGGGGAAGAGGCCTTAGTGTACAGCTCAGGGAAGGGAAGGAGGTTGGACCCCTGTTCCAGAGCTCTCCCT 

GGGCCTGCTACCCrCTCTGCTGGCTACCTAACCCCTGCTTTTCCTGACCTAGAGCTGACCACTGATCA 

TTGCTGCTGA TGCAGGAAGGCATGCCGATGCGCAAGGTGAGGTCCAAAAGCTGGAAGAAGCTAAGATACTO 

AGACTTCAGAATGACGGCATGAC^GTCTGGCATGCACGGCAGGCCAGGGGCAGTGCCAAGCCCAGCGTCTCA 

ATCTCrGATGTGGAGAC^TACGTAATGGCCATGATTCCGAGTTGCTGCGTAGCCTGGCAGAGGAGCTCCCC 

CTGGAG CAGGG CTTCACCATTGTCTTCCATGGCCGCCGCTC CAACCTGGACCTGATGGCCAACAGTGTTGAG 

GAGGCC CAGATATGGATGCGAGGGCTCCAGCTGTTGGTGGATCTTGTCACCAGCATGGACCAT CAGGAGCGC 

CTGGAC CAGTATCTGAG CGATTGGTTTCAACGTGGAGACAAAAAT CAGG AT GGT AAGATGAGTTTC C AAGAA 

GTT CAG CGGTT ATTGCAC CTAATGAATGTGGAAATGGAC CAAG AAT ATGCC TT CAGT CTCT T GCAGG CAG CA 

GACACGTCCCAGTCTGGAACCCTGGAAGGAGAAGAATTCGTACAGTTOTATAAGGCATTGACTAAACGTC 

GAGGTGCAGGAACTGTTTGAAAGTTTTTCAGCTGATGGGCAGAAGCTGACTCTGCTGGAATTT^ 

CT C CAAGAGGAGCAGAAGG AGAGAGACTG CACCT CTGAGCTTG CT C TGG AAC T CATTGACCG CTATGAAC CT 

TCAGACAGTGGTAAGCTGCGGCATGTGCTGAGTATGGATGGCTTCCTC^GCTACCTCrGCTCTAAGGATGGA 

GACATCTTCAACCCAGCCTGCCTCCCCATCTATCAGGATATGACTCAACCCCTGAACCACTACT^ 

TCTTCTCATAACACCTACCTAGTGGGGGACCAGCTTTGTGGCCAGAGCAGCGTCGAGGGATATATACGGGCC 

CTGAAGCGGGGGTGCCGCTGCX3TGGAGGTGGATGTATGGGATGGACCTAGCGGGGAACCTGTCGTTTACCAC 

GGACACACCCTGACCTCCCGCATCCTGTTCAAAGATGTCGTGGC 

TCAGACTACCCAGTCATCTTGTCCCrGGAGACCCACTGCAGCTGGGAGCA 

CTGACTGAGATCCTGGGGGAGCAGCTGCTGAGCACCACCTTGGATGGGGTGCTGCCCACTC^GCT 
CCTGAGGAGCTT CGGAGGAAGAT CCTGGTG AAGGGGAAGAAGT TAACACT TGAGGAAGAC CTGGAAT ATGAG 
G AAG AGGAAG C AG AACC TG AGTTGGAAGAGT CAGAATTGG C G CTGGAGTCC CAGTTTG AGACTGAGC CTGAG 
C C C CAG GAG C AG AACCTTCAGAATAAGGAC AAAAAGAAGGTAAG CCAGCTT CTCCAGAAATCCA^ 
TTGTGTCCAGCCCrrCTCTTCCCTGGTTATCTACTTG^^ 
GAGCACTACCACTTCTACGAGATATCATCTTTCTCTGAAACCAAGGCCAAGC^ 

AATGAGTTTGTGC^GCAC^TACTTGGCAGTTAAGCCGTGTGTATCCCAGCGGCCTGAGGACAGACTCTTCC 
AACTACTACAACCCCCAGGAACTCTGGAATGCAGGCTGCC^GATGGTGGCC^^ 
CTTGAAATGGACATCTGTGATGGGCATTTCCGCCAGAATGGCGGCTGTGGCTATGTGCT 
CTGCGTGATATCCAGAGTTCTTTCCACCCTGAGAAGCCCATCAGCCCTTTCAAA 

CAGGTGATC^GCGGTCAGCAACTCCCCAAAGTGGACAAGACCAAAGAGGGGTCCATTGTGGATCCACTGG 
AAAGTGGAGATCTTTGGCGTTCGTCTAGACACAGGACGGCAGGAGACCAAC^^ 
AATCCATACTGGGGGCAGACACTATGTTTCCGGGTGCTGGTGCCTGAACTTGCCATGCTGCGTTT^ 
ATGGAT T ATGACTGGAAAT CCCGAAATGACTTT AT TGGT CAGT ACACCCTG CCTTGGAC CTG CATGCAACAA 
GGTTACCGCCACATTCACCTGCTGTCCAAAGATGKyCATC^ 

TGCATCCAGGAAGGCCTGGAGGGGGATGAGTCCTGAGGTGGGCATTTCACGGGAAGGGTTGGTGTGCTGGCT 

TTAGACGGGGAGAAACATCTGGAAGGATGCTCGAGAGAACAAATGGAGGTGGTGAAAATCAAGCTTTG^ 

GTGCATTCCTAGGCAC^AAATTACCTGATTCTTCCTAACAAGCAATCTC 

CTCTTTTCTTCCCTTCCTTTGTTTTCATAAGCCTTTGGTATCTTTCCTCCC 

ACTGGAGTTCCCTTCTTCCTCTTGCTGTAGGCTCAATCCCATACCGACATCTACAACT 

ACTCTGTGTGAAGGCAGGTTGCAACTAGAAATTCAGAGGGGCTTGGAATAGAGAAACCTAAAGAAGCATCAT 

CCCCTCCATCCCCAACTTCCTCAAAGCCCAAAGCC^GGGAAGGATAAATCAAGGCTCAAGGCTTCC 

AAAG AT TAG GGAAAGAGACTTGACC C CAGGACTGTACT ACGACTCTTAAGAGAACAC TG CACAGCACTCAAA 

GTCCCCCACTGGACTGCTTCCTCCTTAGCCCCACTGGTATAAATACZATCTCTCTCCAATTTGGCTTCAAA 
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In a search of public sequence databases, the NOV48a nucleic acid sequence, located 
on chromosome 3, has 1279 of 1285 bases (99%) identical to a gb:GENBANK- 
ID:AK023083|acc:AK023083.1 mRNA from Homo sapiens (cDNA FLJ13021 fis, clone 
NT2RP3000742, weakly similar to l-Phosphatidylinositol-4,5-Bisphosphate 
5 Phosphodiesterase Delta 1 (EC 3.1.4.1 1)) (E = 0.0). 

The disclosed NOV48a polypeptide (SEQ ID NO: 1 82) encoded by SEQ ID NO: 1 8 1 
has 752 amino acid residues and is presented in Table 48B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV48a has no signal peptide 
and is likely to be localized to the cytoplasm with a certainty of 0.4500. Alternatively, 
10 NOV48a may also localize to the microbody (peroxisome) with a certainty of 0. 1265, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1 000. 



Table 48B. Encoded NOV48a protein sequence (SEQ ID NO:182). 



MQEGMPMRKVRSKSWKKLRYFRLQNDGMTVWHAR^ 

GFTIVFHGRRSNLDLMANSVEEAQIWMR^ 

LLHLMNVEMDQF¥AFSLLQAAOT 

EQKERDCTSELALEXjIDRYEPSDSGKLIUIVLSMDGFIjSYLCSKDGDIFNPACL 

NTYLVGDQI^GQSSVEGYIRALKRGCRCVEVDVWDGPS 

PVILSLETHCSWEQQQTMARmTEILGEQLLSTTI^ 

AEPELEESELALESQFETEPEPQEQNLQNKDKKKVSQ 

HFYEISSFSETKAKI^IKEAGirei^QHNTW^ 

DI CDGHFRQNGGCGYVLKPDFLRDIQSSFHPEKPI SPFKAQTLLNQVISGQQLPKVDKTKEGS IVDPLVKVQ 

IFGVRLDT^QETOYVE^GFNPYWGQTLCFRVLTO 

HIHLLSKDGISLRPASIFVYICIQEGLEGDES 



15 A search of sequence databases reveals that the NOV48a amino acid sequence has 61 9 

of 752 amino acid residues (82%) identical to, and 675 of 752 amino acid residues (89%) 
similar to, the 764 amino acid residue ptnr:pir-id:S141 13 protein from bovine (1- 
phosphatidyIinositoI-4,5-bisphosphate phosphodiesterase (EC 3.1.4.1 1) delta-2) (E = 0.0). 
NOV48a is predicted to be expressed in at least Amygdala, Bone Marrow, Brain, 

20 Epidermis, Heart, Hypothalamus, Lung, Mammary gland/Breast, Pituitary Gland, Placenta, 
Retina, Skeletal Muscle, Small Intestine, Stomach. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. 

25 NOV48b 

In the present invention, the target sequence identified previously, NOV48a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
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downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 

5 silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 

10 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

1 5 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated.manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV48b. This differs from the previously 

20 identified sequence (NOV48a) in having a deletion of 6 amino acids in one region and one 
amino acid at another region. 

A disclosed NOV48b nucleic acid of 2341 nucleotides (also referred to as CG56743- 
02) encoding a Phospholipase C Delta-4-like protein is shown in Table 48C. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 55-57 and ending 

25 with a TGA codon at nucleotides 2278-2280. The start and stop codons are shown in bold in 
Table 48C, and the 5 5 and 3' untranslated regions, if any, are underlined. 

Table 48C NOV48b nucleotide sequence (SEQ ID NO:183). 

TTTCCTGACCTAGAGCTGACCACTGATCAGGGCTTGCTGCTGGTGCAGGAAGGCATGCCGATGCGCAAGGTG 
AGGTCCAAAAG CTGGAAGAAGCTAAG AT ACTT CAG ATTT CAGAATGACGG C ATGACAGT CTGGCATG CACGG 
CAGGCCAGGGGCAGTGCCAAGCCCAGCOTCTCAA 

GAGTTGCTGCGTAGCCTGGCAGAGGAGCTCCCCCTGGAGC^GGGCTTCACCATTGTCTTCCATGGCCGCCGC 

TCC^CCTGGACCTGATGGCCAACAGTGTTGAGGGGGCCCAGATATGGATGCGAGGGCTCCAGCTGTTGGTG 

GATCTTGTCACC^GCATGGACCATCAGGAGCGCCTGGACCAATGGCTGAGCGATTGGTTTCAACGTGGAGA^ 

AAAAATCAGGATGGTAAGATGAGTT TCC AAGAAGTTCAGCG GTT ATTGCAC CT AATGAATGTGGAAATGG AC 

CAAGAATATGCCTTCAGTCTTTTTCAGGCAGCAGACACGTCCCAGTCTGGAACCCTGGAAGGAGAAGAATTC 

GTACAGTTCTATAAGGCATTGACTAAACGTGCTGAGGTGCAGGAACrrGTTTGAAAGTTrrTCAGCTGATGGG 

CAGAAGCTGACTCTGCTGGAATTTTTGGATTTCCTCCAAGAGGAGCAGAAGGAGAGAGACTGCACCTCTGAG 

CTTGCTCTGGAACTCATTGACCGCTATGAACCTTCAGACAGTGGCAAACTGCGGCA^ 

GGCTOCCTC^GCTACCTCTGCTCTAAGGATGGAGACATCTTCAACCCAGCCTG 

ATGACTC^CCCCTGAACCACTACTTCATCTGCTCTC 

GGC CAGAGCAG CGT CGAGGGAT ATATACGGG CCCTGAAGCGGGGGTGCCGCTGCGTGGAGGTGGATGT ATGG 
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GATGGACCTAGCGGGGAACCTGTCGTTTACCACGGACACACCCCGACCTCCCGCATCCTGTTCAAAGATGTC 
GTGGCCACAGTAGCACAGTATGCCITCCAGACATCAGACTACCCAGTCATCTTGTCCCTGGAGACCCACTGC 
AGCTGGGAGCAGCAGCAGACCATGGCCCGTCATCTGACTGAGATCCTGGGGGAGCAGCTGCTGAGCACCACC 
TTGGATGGGGTGCTGCCCACTCAGCTGCCCTCGCCTGAGGAGCTTCGGAGGAAGATCCTGGTGAAGGGGAAG 
AAGTT AACACTTGAGGAAG AC CTGG AATATGAGGAAGAGGAAG CAG AAC CTGAGTTGGAAGAGTCAG AAT TG 
GCGCTGGAGTCCCAGTTTGAGACTGAGCCTGAGCCCCAGGAGCAGAACCTTCAGAATAAGGACAAAAAGAAG 
AAATCCAAGCCCATCTTGTGTCCAGCCCTCTCTTCCCTGGTTATCTACTTGAAGTCTGTCTCATTCCGCAGC 
TTCACACAT TC AAAGG AG CACT ACCACTT CTACG AGAT ATCAT CTTTCT CTGAAACCAAGGC CAAGCG CCTC 
ATCAAGGAGGCTGGCAATGAGTTTGTGCAGCACAATACTTGGCAGTTAAGCCGTGTGTATCCCAGCGGCCTG 
AGGACAGACTCTTCCAACTACAACCCCCAGGAACTCTGGAATGCAGGCTGCCGGATGGTGGCCATGAATATG 
CAGACTGC^GGGCTTGAAATGGACATCTGTGATGGGCATTTCCGCCAGAATGGCGGCTGTGGCTATGTGCTG 
AAGCCAGACTTCCTGCGTGATATCCAGAGTTCTTTC 

ACTCTCTTAATCCAGGTGATCAGCGGTCAGCAACTCCCCAAAGTGGACAAGACCAAAGAGGGGTCCATTGTG 
GAT C C^CTGGTGAAAGTGCAGAT CTTTGGCGTT CGTCT AG ACACAG CAC GG CAGGAG ACCAACTATGTGG AG 
AACAATGGTTTTAATCCATACTGGGGGCAGACACTATGTTTCCGGGTGCTGGTGCCTGAACTTGCCATGCTG 
CGTTTTGTGGTAATGGATTATGACTGGAAATCCCGAAATGACTTTATTGGTCAGTACACCCTGCCTTGGACC 
TGCATGCAACAAGGTTACCGCCACATTCACCTGCTGTCCAAAGATGGCATCAGCCTCCGCCCAGCTTCCATC 
TTTGTGTATATCTGC^TCC^GGAAGGCCrGGAGGGGGGTGAGTCCTG AGGTGGGCATTTC^CGGGAAGGG^ 
GGTGTGCTGGCTTTAGACGGGGAGAAACATCTGGAAG 



10 



In a search of public sequence databases, the NOV48b nucleic acid sequence, located 
on chromosome 2, has 1069 of 1075 bases (99%) identical to a gb:GENBANK- 
ID:AK023083|acc:AK023083.1 mRNA from Homo sapiens (cDNA FLJ13021 fis, clone 
NT2RP3000742, weakly similar to l-Phosphatidylinositol-4,5-Bisphosphate 
Phosphodiesterase Delta 1 (EC 3.1.4.1 1)) (E = 0.0). 

The disclosed NOV48b polypeptide (SEQ ID NO:184) encoded by SEQ ID NO:183 ' 
has 741 amino acid residues and is presented in Table 48D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV48b has no signal peptide 
and is likely to be localized to the mitochondrial matrix space with a certainty of 0.6523. 
Alternatively, NOV48b may also localize to the mitochondrial inner membrane with a 
certainty of 0.3462, to the mitochondrial intermembrane space with a certainty of 0.3462, or to 
the mitochondrial outer membrane with a certainty of 0.3462. 
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Table 48D. Encoded NOV48b protein sequence (SEQ ID NO:184). 



MPMRKVRSKS WKKLRYFRFQNDGMTVWHARQ ARGS AKPS FS I SDVET I RNGHDSELLRSLAEELPLEQGFTI 
VFHGRRSNIiDLMANSVEGAQIWMRGLQLLVDLV^ 
MNVEMDQEYAFSLFQAADTSQSGTLEGEEFVQFYKALTKRAE^ 
RDCTSELALELIDRYEPSDSGKLRHVLSMTCFLSYLCSKD^ 

VGDQLCGQSSVEGYIRALKRGCRCVEVDVWDGPSGEPVVYHGHTPTSRILFKDVVATVAQYAFQTSDYPVIL 
SLETHCSWEQQQTMARHLTEILGEQLLSTTLDGVLPTO 

LEESELALESQFETEPEPQEQmQNKDKKKKSKPILCPALSSLVIYLKSVSFRSFTHSKEHYHFYEISSFSE 

TKAKRLIKEAGNEFVQHNTWQLSRVYPSGLRTDSSNYNPQELWNAGCRMVAMNMQTA 

GOSYVLKPDFLRDIQSSFHPEKPISPFKAQTLL^^ 

ETNYVENNGFNPYWGQTLCFRVLVPELAMLRFVV^ 

LRPAS I FVYI CI QEGLEGGES 



A search of sequence databases reveals that the NOV48b amino acid sequence has 736 
of 741 amino acid residues (99%) identical to, and 737 of 741 amino acid residues (99%) 
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similar to, the 762 amino acid residue ptnr:TREMBLNEW-ACC:AAH06355 protein from 
Homo sapiens (Human) (Unknown (Protein For MGC:12837)) (E = 0.0). 

NOV48b is predicted to be expressed in at least Heart, Stomach, Small Intestine, Bone 
Marrow, Skeletal Muscle, Brain, Hypothalamus, Pituitary Gland. .The sequence is predicted to 

5 have the expression pattern of (GENBANK-ID: gb.GENBANK- 

ID:AK023083|acc:AK023083.1) a closely related Homo sapiens cDNA FLJ13021 fis, clone 
NT2RP3000742, weakly similar to l-PhosphatidylinositoI-4,5-Bisphosphate 
Phosphodiesterase Delta 1 (EC 3 . 1 .4. 1 1) homolog. 

NOV48a also has homology to the amino acid sequences shown in the BLASTP data 

10 listed in Table 48E. 



Table 48E. BLAST results for NOV48a 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 1424934 0 |ref|NP 
116115. l| 
(NM_032726) 


hypothetical 
protein MGC12837 
[Homo sapiens] 


762 


736/752 
(97%) 


738/752 
(97%) 


0.0 


gi | 108854 |pir| |S141 
13 


1- 

phosphatidylinosi 
tol-4,5- 
bisphosphate 
phosphodiesterase 
(EC3.1.4.11) 
delta- 2 - bovine 


764 


613/757 
(80%) 


671/757 
(87%) 


0.0 


gi 1 18093100 |ref |NP_ 

542419.1] 

(NM 080688) 


phospholipase C, 
delta 4 [Rattus 
norvegicus] 


772 


550/755 
(72%) 


631/755 
(82%) 


0.0 


gi | 1304189 |dbj | BftAO 
9046. 1| (D50455) 


phodpholipase C 
delta4 [Rattus 
norvegicus] 


771 


548/756 
(72%) 


629/756 
(82%) 


0.0 


gi | 12855950 | dbj | BAB 
30513. l| (AK016945) 


data source :MGD, 
source 

key :MGI: 107469, ev 
idence : ISS-phosph 
olipase C, delta 
4 -putative [Mus 
mus cuius] 


447 


335/430 
(77%) 


375/430 
(86%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 48F. In the ClustalW alignment of the NO V48 protein, as 
15 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 48F. ClustalW Analysis of NOV48 

1) Novel NOV48a (SEQ ID NO: 182) 

2) Novel NOV48b (SEQ ID NO: 184) . 

3) gi 1 14249340 1 ref |NP_116115 . 1 | (NM_032726) hypothetical protein MGC12837 [Homo 
sapiens] (SEQ ID NO: 542) 

4) gi| 108854 |pir| [S14113 l-phosphatidylinositol-4 , 5-bisphosphate phosphodiesterase 
(EC3.1.4.11) delta-2 - bovine (SEQ ID NO: 543) 

5) gi| 18093100 1 ref |NP_542419.1 1 (NMJ)80688) phospholipase C, delta .4 [Rattus 
norvegicus] (SEQ ID NO: 544) 

6) gi 1 1304189 1 dbj |baa09046.i| (D50455) phodpholipase C delta4 [Rattus norvegicus] 
(SEQ ID NO: 545) 

7) gi | 12855950 | dbj |BAB305l3.l| (AK016945) data source :MGD, source 

key:MGI:107469,evidence;ISS-phospholipase C, delta 4-putative [Mis musculus] (SEQ ID 
NO:546) 



NOV48a 
NOV48b 

gi 
gi 
gi 
gi 
gi 



14249340 | 
108854 | 
18093100 | 
1304189) 
12855950 | 



10 

|....|.. 



20 



30 



40 
. I . . 



I 



MRKVR£ 
MRKVR 



MRKVR^KSWKKLRYFRLQgDGMTVWBgRQ 
KSWKKLRYFRgQ|DGMTVl«JHgRQ 
gQDLLLMQgGigMRKVR|KSWKKLRYFRLQ|DGMTVWEgRQ : 
QDLLLMQgGj2LMRKVR|KSWKKLR5?RLQSDGMTVl^KgRQ 
jQDLLLMQ|G2SMRKVRjjKSWKKLRYFRLQ|DGMTVWKSRg 
QDLLLMQgG JBUrK VR j| KS WKKL RY FRLQgDGM T VW K§r5 
gQDLLLMQiajGfflMRKVR^KSWKKLRYFRLQlDGMTVWK 





NOV48a 


44 


NOV48b 


40 


gi 


14249340) 


61 


gi 


108854) 


61 


gi 


18093100) 


61 


gi 


1304189) 


61 


gi 


12855950 | 


61 



NOV48a 


104 


NOV48b 


100 


gi 


14249340) 


121 


gi 


108854) 


121 


gi 


18093100) 


121 


gi 


1304189) 


121 


gi 


12855950) 


121 



NOV48a 


164 


NOV48b 


160 


gi 


14249340) 


181 


gi 


108854 | 


181 


gi 


18093100) 


181 


gi 


1304189) 


181 


gi 


12855950| 


181 



NOV48a 
NOV48b 



gi 
gi 
gi 
gi 
gi 



14249340) 

108854) 

18093100) 

1304189) 

12855950) 




224 f 
220 
241 
241 
241 
241 
241 



JL 



250 



260 
..J.. 



270 
..1.. 



280 
..I.. 



290 

. . I . . 



SgLAIjELIDRYEPSMj gGjjLjgHVLS j'DGFLSYLCSKDGj? 
^SjgLAIiELIDRYEPSjj gGjjLgHVLsSDGFLSYLCSKDGg 
JiSgjLALELIDRYEPSjj JGgLgHVLsSDGFLSYLCSKDGS 
SgLALELIDRYEPSg gGjJLg^nijsSDGFL^YLCSKDGS 

sSlalISlidryepsi *" "" - 



IFNPgCLPIYQDMTQPLgjHY 
IFNPgCLPIYQDMTQPLgHY 

ifnpBclpiyqdmtqpmhy 



API 



5 ^jG*LaHVLs"pGFL5YLCSKDGj 



I FNPpCiJjPijYQDMTQPLSHY 
IFNPgCIjPIYQDMTQPLgHY 
I ENpJcL P I YQDMTQPLgH Yg 

ifn^clpiyqdmtqplIhy^ 



300 

1 

R 283 
| 279 
i 300 

I 300 

300 
300 
300 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV48a 
NOV48b 



gi 
gi 
gj- 
gi 
gi 



14249340 | 
108854 | 
18093100 | 
1304189| 
128559501 




330 



KRGCRCVEVDj 
KRGCRCVEVDj 
KRGCRCVEVDj 
KRGCRCVEVD 
KRGCRCVEVD 1 
KRGCRCVEVDj 
KRGCRCVEVD 



360 




370 



380 



390 



400 



410 



N0V48a 


344 


NOV48b 


340 


gi 


14249340) 


361 


gi 


108854 | 


361 


gi 


18093100) 


361 


gi 


1304189] 


361 


gi 


12855950 | 


361 



420 
•I 

'TQ 402 
TQ 398 
|PTQ 419 
PPTQ 419 
|IDI 419 
-IDI 419 
iS 419 



NOV48a 


403 I 






NOV48b 


399 p 








gi 


14249340] 
108854) 


420 [ 






a E 








gi 


420 1 






□ D 


gi 


18093100) 


420 p 








gi 


1304189) 


420 J* 






9 - 


gi 


12855950) 


420 1 









NOV48a 
NOV48b 

gi 
gi 
gi 
gi 
gi 



14249340 | 
108854) ' 
18093100) 
1304189 | 
12855950) 




NOV48a 


519 


R 


NOV48b 


509 


R 


gi 


14249340) 


530 


R 


gi 


108854) 


527 


S 


gi 


18093100 | 


540 


N 


gi 


1304189) 


539 


N 


gi 


12855950) 


447 





550 



560 



I KjE AGNE FVQH 



QLERVYPSGLRTDSSN 



570 580 

t 



590 



600 



.iSeagnefvqhn^ql JRVYPSGLRTDSSN 
jI gEAGSEFVQHNfflQL gRVYPSGLRTDSSN 
j 1 8 E AGNE F VQHMgjQL JjRV Y P S GLRTD S S N 

li§eagnefvqhn™ol"rvypsglrtdssn 



RVY PS GLRTD SSI 




610 



620 



630 



640 650 



660 



HOV48a 
NOV48b 



gi 
gi 
gi 
gi 
gi 



14249340| 
108854 | 
18093100) 
1304189| 
128559501 



qvisgqslpkvd 
qvisgqslpkvd 
qvisgqslpkvd 
qvisgqslpkvd 
qvisgqSlpkvd 
qvisgqSlpkvd 



633 
622 
643 
645 
653 
652 
447 



NOV48a 
NOV48b 

gi 
gi 
gi 
gi 
gi 



14249340) 
108854 | 
18093100) 
1304189) 
12855950) 
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10 



NOV48a 


694 M 




DW 


NOV48b 


683 M 






gi 


14249340 | 


704 M 




DW 


gi 


108854) 


706 K 




DW] 


gi 


18093100| 


714 K 




SR r 


gi 


1304189) 


713 K 




SR' 


gi 


12855950) 
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Tables 48G-N lists the domain descriptions from DOMAIN analysis results against 
NOV48. This indicates that the NOV48 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 48G Domain Analysis of NOV48 

gnl | Smart | smart 00 14 8, PLCXc, Phospholipase C, catalytic domain (part); 
domain X; Phosphoinositide- specific phospholipases C. These enzymes 
contain 2 regions (X and Y) which together form a TIM barrel- like 
structure containing the active site residues. Phospholipase C enzymes 
(PI -PLC) act as signal transducers that generate two second 
messengers, inositol- 1,4, 5-trisphosphate and diacylglycerol . The 
bacterial enzyme appears to be a homologue of the mammalian PLCs . (SEQ 
ID NO: 840) 

CD-Length = 145 residues, 100.0% aligned 
Score = 202 bits (514), Expect « 6e-53 



20 



NOV48 : 273 QDMTQPLNHYFI CSSHNT YLVGDQLCGQS S VEGY I RALKRGC RCVEVDVWDGP SGEP WY 

111**11*1111 lllllll I II 1*1 I I I I I 1*1 I I 111111*1 Mil 1111*1 
Sbj ct : 1 QDMSKPLSHYFINSSHNTYLTGKQLWGESSVEGYIQALKHGCRCVELDCWDGPDGEPVIY 



332 



60 



25 



NOV48: 
Sbjct: 
N0V48: 
Sbjct: 



333 



61 



HGHTLTSRILFKDVVATVAQYAFQTSDYPViLSLETHCSWEQQQTMARHLTEIIiGEQLLS 

illl I I *|* * **|| II MINIM III + || || + IN* I * 

HGHTFTLPIKLSEVLEAIKKFAFVTSPYPVILSLENHCSPDQQAKMAQMFKEIFGDLLYT 



392 



120 



393 TTLDGVLPTQLPSPEELRRK1LVKGK 418 

I 11111*1* 111*111 
121 PPTTSSL-EYLPSPEQLKGKILLKGK 145 



30 



35 



N0V48: 
Sbj Ct : 
NOV4 8: 
Sbj Ct: 
NOV48: 



Table 48H Domain Analysis of NOV48 

gnl|Pfam.|pfam00388, PI-PLC-X, Phosphatidylinositol- specific 
phospholipase C, X domain. This associates with pfara00387 to form a 
single structural unit. (SEQ ID NO: 841) 
CD-Length = 145 residues, 100.0% aligned 
Score « 192 bits (489) , Expect = 4e-50 



274 DMTQPLbraYFICSSHNTYLVGDQLCGQSSVEGYIRALKRGCRC^VDVWDGPSGEPVVYH 

M* II*IMI lllllll I II 1*1 111*1 111111*1 Illl 11**11 
1 JMS I PLSHYFI SSSHNTYLTGKQLWGKSQVES YRQQLDHGCRCVELDCWDGPDDEPI I YH 



333 



60 



334 GHTLTSRILFKDVVATVAQYAFQTSDYPVILSLETHCSWEQQQTMARHLTE 393 

I I I I 111+ + + 1*11 11*11111 II* *ll* II** II I* II* 
6 1 GGTFTLEIKLKDVLEAIKDFLFKTSPYPIILSLENHCNSDQ^ 12 0 



394 TLDGVLPTQLPSPEELRRKILVKGKK 419 
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II I Kill ++I+ 1 1 l + l II 

Sbjct: 121 PLDS - LTTKLPSLKDLKGKILLKNKK 145 



Table 481 Domain Analysis of NOV48 

gnl | Smart | smart 00 14 9, PLCYc, Phospholipase C, catalytic domain (part) ; 
domain Y; Phosphoinosi tide-specific phospholipases C. These enzymes 
contain 2 regions (X and Y) which together form a TIM barrel -like 
structure containing the active site residues. Phospholipase C enzymes 
(PI- PLC) act as signal transducers that generate two second 
messengers, inositol- 1,4 , 5- trisphosphate and diacylglycerol . The 
bacterial enzyme appears to be a homologue of the mammalian PLCs . 
(SEQ ID NO: 842) 

CD-Length = 117 residues, 100.0% aligned 
Score = 182 bits (462) , Expect « 6e-47 



N0V4 8 : 482 LSSLVIYLKSVSFRSFTHSKEHYHFYEISSFSETKAKRLIKEAGNEFVQHOTWQLSRWP 

II II I I IMI ++I + + + n ++ i I ■ I I ■ I I 

Sbjct: 1 LSELVSYCAPVKFRSFELAEEKNPFYEMSSFSETKAKKLLEKAPTDFVRYNQRQLSRVYP 



541- 



60 



10 NOV48: 542 S GLRTDS SNYYNPQELWNAGCQMVAMNMQTAGLEMDI CDGHFRQNGGCGYVLKPDFLR 599 

I I lllll III II IIIIIM II I.* I II llllllllllllll 

Sbjct: 61 KGTRVDSSNY-NPQVFWNHGCQMVALNFQTPDKAMQLNQGMFRANGGCGYVLKPDFLR 117 



Table 48 J Domain Analysis of NOV48 

gnl|Pfam|pfamO0387, PI-PLC-Y, Phosphatidylinositol-specif ic 
phospholipase C, Y domain. This associates with pfam00388 to form a 
single structural unit. (SEQ ID NO: 843) 
CD-Length = 118 residues, 99.2% aligned 
sfcore = 163 bits (412) , Expect = 4e-4l 



15 NOV48: 482 LSSLVIYLKSVSPRSFTHSKEHYHFYEISSFSETKAKRLIKEAGNEFVQHNTWQLSRVYP 541 
IMI |*+|4 |||| + | IMIIII III + I + M+ IM^II lllllll 
Sb j c t : 2 LSNLVNYIQS IKFRSFSLPTEKNTSYEMSSFSERKAKQLLKESPIEFVKHNKRQLSRVYP 6 1 

o 

20 NOV48: 542 SGLRTDSSNYYNPQELWNAGCQMVAMNMQTAGLEMDIODGHFRQNGGCGYVIiKPDFliR 599 

I I llll * II IIIIIIIIM 11+ I lllllll IMII III 

Sbjct: 62 KGTRFDSSN-FMPQPFWNAGCQ^AI^FQTSDLPMQINLQ4FEYNGGSGYLLKPPFLR 118 



Table 48K Domain Analysis of NOV48 



gnl|Pfam|pfam00168, C2, C2 domain. (SEQ ID NO:844) 
CD-Length = 88 residues, 95.5% aligned 
Score = 88.2 bits (217), Expect = 2e-18 



25 NOV48: 623 VI SGQQLPKVDKTKEGSIVDPLVKVQI FGVRLDTARQETNYVENNGFNPYWGQTLCFR - V 681 

III ♦ 1 1 i+i + II III + 1 II ♦ +1 I* II M I I 

Sbjct: 5 VI SARNLPKMDMN GLSDPYVKVDLDGDPKDTKKFKTKTVKKT-LNPVWNETFVFEKV 60 



NOV48: 682 LVPELAMLRFWMDYDWKSRNDFIGQYT 709 

30 +| + || HI | | I IMIIII I 

Sbjct: 61 PLPDLASLRFAVYDEDRFSRDDFIGQVT 88 
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Table 48L Domain Analysis of NOV48 

gnl [Smart |smart00239, C2, Protein kinase C conserved region 2 (CalB) 
Ca2+ -binding motif present in phospholipases, protein kinases C, and 
synaptotamins (among others) . Some do not appear to contain Ca2+- 
binding sites. Particular C2s appear to bind phospholipids, inositol 
polyphosphates, and intracellular proteins. Unusual occurrence in 
perforin. Synaptotagmin and PLC C2s are permuted in sequence with 
respect to N- and C- terminal beta strands. SMART detects C2 domains 
using one or both of two profiles. (SEQ ID NO: 845) 
CD-Length = 101 residues, 100.0% aligned 
Score = 83.6 bits (205), Expect = 4e-17 



N0V48: 6-18 TLLNQVISGQQLPKVDKTKEGS I VDPLVKVQI FGVRLDTARQETNYVENNGFNPYWGQTL 677 

II ++II + II II" II III - I + + I 1+1 || | +| 

Sb j ct : 1 TLTVKIISARNLPPKDKG- - -GKSDPYVKVSLDGDPREKKK- - TKWKNTL -NPVWNETF 54 

NOV48: 678 CFRVLVPELAMLRFVVMDYDWKSRNDFIGQYTLPWTCMQQGYRHIHL 724 

I I 111+ I III II + MII+ M + + III I 
Sbjct: 55 EFEVPPPELSELEIEVYDKDRFSRDDFIGRVTIPLSDLLLGGRHEKL 101 



10 



15 



Table 48M Domain Analysis of NOV48 

gnl|Smart|smart00233, PH, Pleckstrin homology domain.; Domain commonly 
found in eukaryotic signalling proteins. The domain family possesses 
multiple functions including the abilities to bind inositol 
phosphates, and various proteins. PH domains have been found to 
possess inserted domains (such as in PLC gamma, syntrophins) and to be 
inserted within other domains. Mutations in Brutons tyrosine kinase 
(Btk) within its PH domain cause X- linked agammaglobulinaemia (XLA) in 
patients. Point mutations cluster into the positively charged end of 
the molecule around the predicted binding site for 
phqsphatidylinositol lipids. (SEQ ID NO: 846) 
CD-Length « 104 residues, 87.5% aligned 
Score = 47.8 bits (112), Expect ■ 2e-06 



NOV48: 9 KVRSKSWKKLRYFRLQNDGMTVWHARQARGSAKPSVS IS - DVETIRNGHDSELLRSLAEE 67 

Mill III I I + + +++ + Ml || | + |. || + 
Sb}ct: 12 SGGKKSWKK-RYFVLFNGVLLYYKSKKKKSSSKPKGSIPLSGCTVREAPDSD --S 63 

NOV48: 68 LPLEQGFTIVFHGRRSNLDLMANSVEEAQIWMRGLQLLVD 107 

+11! I++ I I I Ml + 1 + 1+ + 
Sb^ct: 64 DKKKNCFEIVTPDRKT-LLLQAESEEERKEWVEALRKAIA 102 



Table 48N Domain Analysis of NOV48 

gnl|Pfam|pfara00169, PH, PH domain. PH. stands for pleckstrin homology 
(SEQ ID NO: 847) 

CD-Length = 100 residues, 88.0% aligned 
Score o 45.1 bits (105), Expect = le-05 



N0V48: 9 KVRSKSWKKLRYFRLQNDGMTVWHARQARGSAKPSVSISDVETIR 68 

1+ I III III 1 II + + ++ I 1+ +1 ♦ | 

Sbjct: 12 TVKKKRWKK - RYFFLFNDVL I YYKDKKKS YE PKGS I PLSGCS VEDVPDSEF 61 

4 

N0V48: 69 PLEQGFTIVFHGRRSNLDLMANSVEEAQIWMRGLQLLV 106 

I ♦ + I I I II I I++ +1 ♦ 

Sbjct: 62 KRPNCFQLRSRDGKETFILQAESEEERQDWIKAIQSAI 99 
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Phosphatidylinositol-specific phospholipase C (EC 3.1.4.1 1), an eukaryotic 
intracellular enzyme, plays an important role in signal transduction processes. It catalyzes the 
hydrolysis of l-phosphatidyl-D-myo-inositol-3,4,5-triphosphate into the second messenger 
molecules diacylglycerol and inositol- 1,4,5 -triphosphate. This catalytic process is tightly 

5 regulated by reversible phosphorylation and binding of regulatory proteins. In mammals, there 
are at least 6 different isoforms of PI-PLC, they differ in their domain structure, their 
regulation, and their tissue distribution. Lower eukaryotes also possess multiple isoforms of 
PI-PLC. All eukaryotic PI-PLCs contain two regions of homology, sometimes referred to as 
'X-box' and T-box'. The order of these two regions is always the same (NH2-X-Y-COOH), 

1 0 but the spacing is variable. In most isoforms, the distance between these two regions is only 
50-100 residues but in the gamma isoforms one PH domain, two SH2 domains, and one SH3 
domain are inserted between the two PLC-specific domains. The two conserved regions have 
been shown to be important for the catalytic activity. At the C-terminal of the Y-box, there is a 
C2 domain possibly involved in Ca-dependent membrane attachment. Phosphoinositide- 

1 5 specific phospholipase C (PLC) mediates the cellular actions of a variety of hormones, 

neurotransmitters and growth factors. Agonist-dependent activation of PLC causes hydrolysis 
of membrane phosphatidylinositol 4,5-bisphosphate (PIP2), generating the second messengers 
inositol 1,4,5-trisphosphate (IP3) and diacylglycerol (DAG). IP3 binds specific intracellular 
receptors to trigger Ca2+ mobilisation, while DAG mediates activation of a family of protein 

20 kinase C isozymes. Based on molecular size, immunoreactivity and amino acid sequence, 

several subtypes have been classified. In PLC-beta subtypes, X and Y domains are separated 
by a stretch of 70-120 amino acids rich in Ser, Thr and acidic residues. Their C-terminus is 
rich in basic residues. In PLC-gammas, there is an insert of more than 400 residues containing 
an SH3 and two SH2 domains. PLCs show little similarity in the 300-residue N-terminal 

25 region preceding the X-domain. 

The disclosed NOV48 nucleic acid of the invention encoding a Phospholipase C Delta- 
4-like protein includes the nucleic acid whose sequence is provided in Table 48A, 48C or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 48A or 48C while still 

30 encoding a protein that maintains its Phospholipase C Delta-4 -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
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. include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
5 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 1 percent of the bases may be so changed. 

The disclosed NOV48 protein of the invention includes the Phospholipase C Deltas- 
like protein whose sequence is provided in Table 48B or 48D. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
10 residue shown in Table 48B or 48D while still encoding a protein that maintains its 

Phospholipase C Delta-4-like activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant protein, up to about 28 percent of the residues may be so 
changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 

1 5 (Fabh, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Phospholipase C Delta-4 -like 
protein (NOV48) is a member of a "Phospholipase C Delta-4 family". Therefore, the NOV48 
nucleic acids and proteins identified here may be useful in potential therapeutic applications ■ 
implicated in (but not limited to) various pathologies and disorders as indicated below. The 

20 potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

25 The NOV48 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in Cardiomyopathy, Atherosclerosis,Hypertension, 
Congenital heart defects, Aortic stenosis Atrial septal defect (ASD) Atrioventricular (A-V) 
canal defect, Ductus arteriosus , Pulmonary stenosis , Subaortic stenosis, Ventricular septal 
defect (VSD), valve diseases/Tuberous sclerosis, Scleroderma, Obesity/Transplantation, 

30 Osteoporosis,Hypercalceimia, Arthritis, Ankylosing spondylitis, Scoliosis, Von Hippel-Lindau 
(VHL) syndrome , Alzheimer's disease, Stroke, Tuberous sclerosis, hypercalcemia, 
Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, 
Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, 
Anxiety, Pain, Neuroprotection, and/or other diseases and pathologies. 
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NOV48 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV48 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

5 NOVX Antibodies 5 ' section below. The disclosed NOV48 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

10 NOV49 

A disclosed NOV49 nucleic acid of 1588 nucleotides (also referred to as CG56739-01) 
encoding a Leukotriene-B4 Omega- Hydroxylase -like protein is shown in Table 49 A. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 2-4 
and ending with a TGA codon at nucleotides 1577-1579. The start and stop codons are shown 

15 in bold in Table 49 A, and the 5 5 and 3' untranslated regions, if any, are underlined. 



Table 49A. NOV49 nucleotide sequence (SEQ ID NO:185). 

GATGTTCGCTGCTGAGCCTGTCCTGGCTGGGCCTCGGGCCGG 

GGTCGGGGCCTCCTGGCTCCTGGCCCGTGTCCTGGCCTGGACGTACGCCTTCTATGACAACTGCCACCGCCT 
CCAGTGTTTCCAGCAGCCTCCAAAACGGAACTGCTTTTTAGGTCACCTG 

GGACATGAGGCTGATGGAGGATCTGGGCCACTACTTCCGTGATGTCCAACrrCTGGTGGCTTGGGTCTTTCTA 
CCCTGTCCTGCATCTCGTCCACCCTACGTTCACTGCCCCrrGTGCTCCAGGCTTCAGCTGCTGTTGCACTCAA 
GGATATGAGTrrCTATGGCTTCCTGAAGCCCTGGCTGGGTCCTGATGGGCTCCTGATTAGTGCCGGTGAC^ 
GTGGAGATGGCACCGCCACCTGCTCACACCTGCCTTCCACTTCAAAATCCTGAAGCCCTATGTGAAGATTTT 
CAATGAGAGCACGAACATCATGCACGCC^AATGGCAACGCCTGGCCTTGGAGGGCAGTGTCCGTCTGGAAAT 
GT1TGAGCACATCAGCCTCATGACCTTGGACAGTCTC 

GGAGAAGCCCAGCGAATATATTGATGCCATCTTGGAGCTCAGTGCCCTCAGTCTGAAACGGCACCAGCACAT 
CTT CC TGCTCACGG ACTTCTTGTACTTCCTCACTCCCAATGGGCGACXSCTTCr 

GCAC^^CTTCACAGATGCTGTCATCCAGGAGCGGCGTCGCACCCTCACTAGCCAGGGTGTCGATGACrTCCT 
GCAGGCCAAGGCCAAGTCCAAGACTTTGGACTTCATTGACGTGCT 

GAAGT T GT CAGATGAGAACATAAGAG CGGAGGCTG ACAC CTTCATGTCTGGGGG CCATGACACCT CGGCCAG 
TGGTCTCTCCTC3GGTCCTGTACAACCTCGCGAGGTACCCAGAATACCAGGAGCACTGCCGACAGGAGGTGCA 
AGAGCTCCTGAAGAACGGTGATCCTAAAGAGATTGAATGGGATGACCTGGCCCAGTTGCCCTTCCTGACCAT 
GTGCCTGAAGGAGAGCCTGCGGCTGCATTCCCCAGTCTCCAGGATCCACCGCTGCTGCCCCCAGGACGGGGT 
GCTCCCGGATGGCCGGGTCATCCCCAAAGGTAACACTTGCACCATCAGCATCTTTGGGATCCATCACAACCC 
TTCAGTCTGGCCGGACCCGGAGGTGTATGACCCCTTTCGCTTCGACCCAGAAAATCTCCAGAAGACATCACC 
T CT GG CTTT T ATT C CC TTCT CAGCAGTGCC CAGGAACT GCAT CG GC CAGACGTT CGC CATGGCTG AGATG AA 
GGTGGTCCTGGCGCTCACGCTGCTGCGCTTCCGCGTCCTGCCGGACCACGCGGAGCCCCGCAGGAAGCTGGA 
GCTGATCGTGCGCGCGGAGGATGGACTTTGGCTACGGGTGGAGCCCCTGAGCGCGGATCTGCAGTGACCCAC 
CACT 



In a search of public sequence databases, the NOV49 nucleic acid sequence, located on 

chromosome 19, has 1320 of 1584 bases (83%) identical to a gb:GENBANK- 

ID:HUMLB4OH|acc:D26480.1 mRNA from Homo sapiens (Human mRNA for leukotriene 

20 B4 omega-hydroxylase, complete cds) (E = 9.7c" 237 ). 

The disclosed NOV49 polypeptide (SEQ ID NO: 1 86) encoded by SEQ ID NO: 1 85 has 

525 amino acid residues and is presented in Table 49B using the one-letter amino acid code. 
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Signal P, Psort and/or Hydropathy results predict that NOV49 has a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.8200. Alternatively, NOV49 may 
also localize to the lysosome (lumen) with a certainty of 0.4520, to the microbody 
(peroxisome) with a certainty of 0.161 1, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.1000. The most likely cleavage sit for NOV49 is between positions 36 and 37: 
VLA-WT. 



Table 49B. Encoded NOV49 protein sequence (SEQ ID NO:186). 

MSLLSLSWLGLGPVAASPWLL^ 

DMRLMEDLGHYFRDVQIiWWLGS FYPVLHLVHPTFTAPVLQASAAVALKDMSFYGFLKPWLGPDGIjLI SAGDK 
WRWHRHUjTPAFHFKILKPYVKI FNESTNIMHAKWQRIiAIiEGSVRLEMFEHI SLMTLDSLQKCIFSFDSNCQ 
EKPSEYIDAILELSALSLKI^QHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRRTL 
QAKAKSKTLDFIDVLLLAKDENGKKLSDENIRAEAOT 

ELLKNGDPKEI EVTODLAQLPFLTMCIiKESLRIjHSPVSRIHRCCPQDGVLPDGRVI PKGNTCTI S I FGIHHNP 
SVWPDPEVYDPFRFDPENLQKTSPLAFI PFSAVPRNCI GQTFAMAEMKVVLALTLLRFRVLPDHAEPKRKIiE 
LIVRAEDGLWLRVEPLSADLQ 

A search of sequence databases reveals that the NOV49 amino acid sequence has 397 
of 521 amino acid residues (76%) identical to, and 444 of 521 amino acid residues (85%) 
similar to, the 520 amino acid residue ptnr:SWISSPROT-ACC:Q08477 protein from Homo 
sapiens (Human) (CYTOCHROME P450 4F3 (EC 1.14.13.30) (CYPIVF3) (Leukotriene-B4 
Omega- Hydroxylase) (Leukotriene-B4 20-Monooxygenase) (Cytochrome P450- LTB- 
Omega))(E = 4.3e 21 *). 

NOV49 is predicted to be expressed in at least Prostate. This information was derived 
by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

In addition, the sequence is predicted to be expressed in Bone Marrow, Peripheral 
Blood, Brain, Colon, Coronary Artery, Hippocampus, Kidney, Kidney Cortex, Liver, Lymph 
node, Pituitary Gland, and Prostate because of the expression pattern of (GENBANK-ID: 
gb:GENBANK-ID:HUMLB4OH|acc:D26480. 1 ) a closely related Human mRNA for 
leukotriene B4 omega-hydroxylase, complete cds homolog. 

NOV49 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 49C. 
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Table 49C. BLAST results for NOV49 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 17456512 J rex |«xp__ 
065069. l| 
(XM_065069) 


similar to 
CYTOCHROME P450 
4F6 (CYPIVF6) (H. 
sapiens) [Homo 
sapiens] 




452/495 
(91%) 


455/495 
(91%) 


0.0 


gi | 14767705 | ref |XP_ 
029072. l| 
(XM_029072) 


cytochrome P450, 
subfamily IVF, 
polypeptide 3 
[Homo sapiens] 


520 


399/521 
(76%) 


446/521 
(85%) 


0.0 


gi 1 4503241 1 ref |NP_0 
00887. 1| 
(NM_000B96) 


cytochrome P450, 
subfamily IVF, 
poxypepc xae 
3;leukotriene B4 
omega 

hydroxylase; 
leukotriene-B4 
2 0 -monooxygenase ; 
cytochrome P450- 
LTB - omega [Homo 
sapiens] 


520 


397/521 
(76%) 


444/521 
(85%) 


0.0 


gi| 2997737 |gb|AAC08 
589. l| (AF054821) 


cytochrome P-450 
[Homo sapiens] 


520 


395/521 
(75%) 


440/521 
(83%) 


0.0 


gi| 1706095 |sp|P5187 
l|CPF6 RAT 


CYTOCHROME P450 
4F6 (CYPIVF6) 


537 


392/521 
(75%), 


443/521 
(84%). 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 49D. In the ClustalW alignment of the NOV49 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (£&, regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 49D. ClustalW Analysis of NOV49 

1) Novel N0V49 (SEQ ID NO:186) 

2) gi 1 17456512 | ref |XP_065069.l| (XM_065069) similar to CYTOCHROME P450 4F6 
( CYPIVF6 ) (H. sapiens) [Homo sapiens] (SEQ ID NO: 547) 

3) gi 1 14767705 1 ref |XP_029072 . 1 1 (XM_029072) cytochrome P45Q, subfamily IVF, 
polypeptide 3 [Homo sapiens] (SEQ ID NO: 548) 

4) gi 1 4503241 1 ref |NP_000887 .1 1 (NM_000896) cytochrome P450, subfamily IVF, 
polypeptide 3 ; leukotriene B4 omega hydroxylase; leukotriene-B4 20-monooxygenase; 
cytochrome P4 50 -LTB- omega [Homo sapiens] (SEQ ID NO: 549) 

5) gi| 2997737 |gb|AAC08589.l| (AF054821) cytochrome P-450 [Homo sapiens] (SEQ ID 
NO:550) 

6) gi 1 1706095 | sp | P51871 1 CPF6_RAT CYTOCHROME P450 4F6 (CYPIVF6) (SEQ ID NO:551) 



10 



20 



' 30 



40 



50 



60 



NOV49 

gi 1 17456512 1 
gi | 14767705 | 



1 gsifl^ggwg^Gfj^^ffi 




gg^^^ggo^gj^Scg 60 


|tf SQLSLSg LGLjjfpi^AAS PWL 


LLLLVGAS WLLARI LAWTY TF 


YDNCCRLRCFPQPPKRNWFB32 
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gi 
gi 



450324l| 
2997737 j 
1706095 I 



NOV49 

gi | 17456512 | 
gi 1 14767705 | 
gi I 4503241 
gi | 2997737 
gi | 1706095 



NOV49 



gi 
gi 
gi 
gi 
gi 



17456512 

14767705 

4503241 

2997737 

1706095 



NOV49 



gi 
gi 
gi 
gi 
gi 



17456512 

14767705 

4503241 

2997737 

1706095 



NOV49 



gi 
gi 
gi 
gi 
gi 



17456512 | 
14767705 | 
4503241 
2997737 
1706095 




250 

. - I . . 



260 
. . I . . 



270 
..I.. 



280 
. .1 .. 



290 
..I.. 



300 
- . I 



241 5 
171 I 
240 I 
240 I 
240 I 
240 J 



LMDFL YttLTP^GgRFERAC gg VHWFTDAVIQERRRTlHs QG VDDFL QAKAKSKTLDFI 
LYgDFLYYLTPDGSRFRRACgVHgFTDAVIQERRRTLgSQGVDDFLQAKAKSKTLDFI 

IjYmdflyyltpdgsr frrac^ vhSftdJv i qerrrtlH s qg vdd fl q akak s KT LDF I 
lyRdflyyltpdgSrfrracSvhIftdaviqerrrtl^qgvddflqakaksktldfi 

LYgDFLYYLTgDGflRFRjfoC^VH"^^ 



300 
230 
299 
299 
299 
299 



NOV49 




301 


DVLLL 


gi 


17456512) 


231 


DVLLL 


gi 


14767705 


300 


DVLLL 


gi 


4503241 




300 


DVLLL 


gi 


2997737 




300 


DVLLL 


gi 


1706095 




300 


DVLLL 



310 
..I.. 



320 



gKDEgGKI 

|kde|gki 
SkdeSgki 
SkdeSgk* 



330 

Ssg 



340 



350 



360 
.J 



•l^gGHDT^ASGLSWLYijLAj^PEYQEgCRQEVQ 

^^ghdttasglswlySlaSBpeyoeBcroevo 

4F@GHDTTASGLSWVLYgLAgHPEYQERCRQEVQ 
GHDTTASGLSOTLYiIiLAj^HPEYOERCROEVO 



/iJIT.AitIhp 



EYQERCRQE 



360 
290 
359 
359 
359 
359 



370 



380 



390 



400 



410 



420 



NOV49 



gi 
gi 
gi 
gi 
gi 



17456512 

14767705 

4503241 

2997737 

1706095 



NOV49 


421 


gi 


17456512 | 


351 


gi 


14767705) 


420 


gi 


4503241) 


420 


gi 


2997737 | 


420 


gi 


1706095) 


420 




490 



NOV49 



467 



500 



510 



520 



530 



540 



MCIGQfflFAiV[AEMKWL2LTLLRFRVLPDHEEPRRK| 



430 



504 
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10 



15 



gi 
gi 
gi 
gi 



17456512 | 
14767705 j 
4503241 | 
2997737) 
1706095 



NOV49 



gi 
gi 
gi 
gi 
gi 



17456512 

14767705 

4503241 

2997737 

1706095 



411 usKKbUAGMGVLGTVRVPTPSP* 

466 - ■ 

466 - 

466 

466 




505 
471 
504 
504 
504 
504 



550 



560 



570 



LfSRAEiTGLWLRVEPLS! 

l|§rae2glwlrve pls 1 
l|lraegglwlrve pls 
lIlraegglwlrvepls 
l|lraegg|wlrvepls 



LQ 525 

RDLQ - - 491 

520 

- 520 

--- 520 

TVTSQLPWDIiLAHPPTS 537 



20 



Table 49F lists the domain description from DOMAIN analysis results against NOV49. 
This indicates that the NOV49 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 49F Domain Analysis of NOV49 

gnl|Pfam|pfam00067, p450. Cytochrome P450. Cytochrome P450s are 
involved in the oxidative degradation of various compounds. 
Particularly well known for their role in the degradation of 
environmental toxins and mutagens. Structure is mostly alpha, and 
binds a heme cof actor. (SEQ ID NO: 848) 
CD- Length = 445 residues, 98.9% aligned 
Score = 308 bits (790) , Expect « 4e-85 



25 



30 



35 



40 



45 



50 



NOV49 : 


55 


Sbjct: 


4 


NOV49: 


115 


SbjCt: 


61 


NOV49: 


175 


Sbjct: 


116 


NOV49: 


234 


SbjCt: 


176 


NOV49: 


294 


SbjCt: 


220 


NOV49: 


353 


Sbjct: 


280 


NOV49: 


412 


Sbjct: 


337 


NOV49: 


472 


Sbjct: 


397 



PKRNCFLGHLSLWGNEEDMRLME^ 114 
| +| + | + + + +| + I +11 11+ + I II 

PPPLPLIGNLLQLGRG- - PIHSLTELRKKYGPVFTIiYLGP - RPVVWTGPEAVKEVLIDK 6 0 



AAVALKDMSFYGFLKPWLGPDGLLISAGDKWRW 

I I MM l + l II +11 I III I I + I 1 + 

GEEFAGRGDFPVF- - PWLG- YGILFSNGPRWRQLRRLLTLRF- FGMGKRS- KLEERIQEE 



+ M | +++ | ++ |+ + +| + ++ II + II I 

ARDLV^RLRKEQGSPIDITELLAPAPLNVICSLLFGVRFDYEDPEFLKLIDKLNELFFLV 



LKRHQHI FLLTDFLYFLTPNGRRFCRACDI VHNFTDAVI QERRRTLTSQGVDDFLQAKAK 
+| + 1+ +1 + ++ I +I+IM M 



-LLDFFRYLPGSHRKAFKAAKDLKDYLDKLIEERRETLE- 



174 



115 



175 



293 



P 219 



SKTLDFIDVLLL - AKDENGKKLSDENIRAEADTF^GGHDTS ASGLSWVLYNLARYPE YQ 352 

M+l 11+ II I I +I+M ++I + I MNI N++II- I 

GDPRDFLDSLLI EAKREGGSELTDEELKATVLDLLFAGTDTTS STLSWALYLLAKHPEVQ 279 
EHCRQEVQELLKNGDPKE IEWDDLAQLPFLTMCLKESLRLHSPV- SRIHRCCPQDGVLPD 411 

|+i+ |++ | + +|| I +|+| +II+MII I +1 + l + I 



GRVIPKGNTCTISIFGIHHNPSVTO>DPEVYDPFRFDPENLQCT^ 

I + |||| ++++ + | + | i + i + H +|| || M + ll + ll I IIIM 

GYLIPKGTLVIVTCLYSLHRDPKVFPNPEEFDPERFLDENGKFOT 



471 



396 



+ | |+ + II I II + I 

ERLARMELFLFLATLLQRFELELVPPGDI PLTPKPLGLPSKPPLYQL 



443 
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Leukotrienes are a group of bioactive compounds that play important roles in such 
processes as inflammation. Kikuta et al. (1993) (J. Biol. Chem. 268: 9376-9380) isolated a 
cDNA for the human leukotriene B4 omega-hydroxylase (LTB4H), an enzyme which 
catalyzes the omega-hydroxylation of leukotriene B4. Their cDNA encoded a 520-amino acid 
5 protein with a predicted molecular weight of 59,805 Da. The deduced amino acid sequence 
contains a cysteine in the conserved heme-binding domain near the C-terminus, which is a 
characteristic feature of the cytochrome P450 superfamily; the protein shares 31 to 44% 
similarity with CYP4A, CYP4B, and CYP4C. Kikuta et al. (1993) (J. Biol. Chem. 268: 9376- 
9380) detected transcript from the LTB4H gene in polymorphonuclear leukocytes and 

10 leukocytes. Kikuta et al. (1998) (DNA Cell Biol. 17: 221-230) determined that the CYP4F3 
gene contains 13 exons and spans approximately 22.2 kb. By fluorescence in situ 
hybridization, they mapped the CYP4F3 gene to 19pl3.2. The cytochrome P450 enzymes 
usually act as terminal oxidases in multicomponent electron transfer chains, called P450- 
containing monooxygenase systems. P450-containing monooxygenase systems primarily fall 

15 into two major classes: bacterial/mitochondrial (type I), and microsomal (type II). All P450 
enzymes can be categorised into two main groups, the so-called B- and E-classes: P450 
proteins of prokaryotic 3-component systems and fungal P450nor (CYP55) belong to the B- 
class; all other known P450 proteins from distinct systems are of the E-class. This family 
contains a number of subtypes of both B and E classes. 

20 The disclosed NOV49 nucleic acid of the invention encoding a Leukotriene-B4 

Omega- Hydroxylase-like protein includes the nucleic acid whose sequence is provided in 
Table 49A or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 49A while 
still encoding a protein that maintains its Leukotriene-B4 Omega- Hydroxylase -like activities 

25 and physiological functions, or a fragment of such a nucleic acid. The invention further 

includes nucleic acids whose sequences are complementary to those just described, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 

30 of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
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variant nucleic acids, and their complements, up to about 1 7 percent of the bases may be so 
changed. 

The disclosed NOV49 protein of the invention includes the Leukotriene-B4 Omega- 
Hydroxy lase-Iike protein whose sequence is provided in Table 49B. The invention also 
5 includes a mutant or variant protein any of whose residues may be changed from the 

corresponding residue shown in Table 49B while still encoding a protein that maintains its 
Leukotriene-B4 Omega- Hydroxylase-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 25 percent of the 
residues may be so changed. 

10 The invention further encompasses antibodies and antibody fragments, such as F a b or 

(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Leukotriene-B4 Omega- 
Hydroxylase -like protein (NOV49) is a member of a "Leukotriene-B4 Omega- Hydroxylase 
family". Therefore, the NOV49 nucleic acids and proteins identified here may be useful in 

15 potential therapeutic applications implicated in (but not limited to) various pathologies and 
disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 
marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 

20 and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The NOV49 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Atherosclerosis, Aneurysm, Hypertension, 
Fibromuscular dysplasia, Stroke, Scleroderma,Obesity, Transplantation, Myocardial 
infarction, Embolism, Cardiovascular disorders, Bypass surgery, Osteoporosis, 

25 Hypercalcemia, Arthritis, Ankylosing spondylitis, Scoliosis, Von Hippel-Lindau (VHL) 
syndrome , Alzheimer's disease, Stroke, Tuberous sclerosis, hypercalceimia, Parkinson's 
disease, Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple 
sclerosis, Ataxia-telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, 
Pain, Neuroprotection, Diabetes, Autoimmune disease, Renal artery stenosis, Interstitial 

30 nephritis, Glomerulonephritis, Polycystic kidney disease, Systemic lupus erythematosus, 
Renal tubular acidosis, IgA nephropathy, Hypercalceimia, Lesch-Nyhan syndrome, and/or 
other diseases and pathologies. 

NOV49 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV49 substances for use in 
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therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV49 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
5 assay systems for functional analysis of various human disorders, which will help in 

understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV50 

NOV50 includes three novel Protein Arginine N-Methyltransferase 2-like proteins 
10 disclosed below. The disclosed sequences have been named NOVSOa and NOV50b. 
NOV50a 

A disclosed NOVSOa nucleic acid of 1 196 nucleotides (also referred to as CG56771- 
01) encoding a Protein Arginine N-Methyltransferase 2-like protein is shown in Table 50A. 
An open reading frame was identified beginning with a ATG initiation codon at nucleotides 
15 13-15 and ending with a TGA codon at nucleotides 1068-1070. The start and stop codons are 
shown in bold in Table 50A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 50A. NOVSOa nucleotide sequence (SEQ ID NO:187). 

AGAGCGGCC^GA TGTCGCAGCCCAAGAAAAGAAAGCTTGAGTCGGGGGGCGGCGCCGAAGGAGGGGAG^A 
ACTGAAGAGGAAGATGGCGCGGAGCGGGAGGCGGCCCTGGAGCGACCCCGGAGGACTAAGCGGGAACGGGAC 
CAGCTGTACTACGAGTGCTACTCGGACGTTTCGGTCCACGAGGAGATGATCGCGGACCGOGTCCGCACCGAT 
GCCTACCGCCTGGGTATCCTTCGGAACTGGGCAGCACTGCGAGGCAAGACGGTACTGGACGTGGGCGCGGGC 
ACCGGCATTCTGAGCATCTTCTGTGCCCAGGCCGGGGCCCGGCGCGTGTACGCGGTAGAGGCCAGCGCCATC 
TGGCAACAGGCCCX3GGAGGTGGTGCGGTTCAACGGGCTGGAGGACCGGGTGCACGTCCTGCCGGGACCAGTG 
GAGACTGTAGAGTTGCCGGAACAGGTGGATGCCATCGTGAGCGAGTGGATGGGCTACGGACTCCTGCACGAG 
TCCATGCTGAGCTCCGTCCTCCACGCGCGAACCAAGTGGCTGAAGGAGGGCGGTCTTCTCCTGCCGGCCTCC 
GCCGAGCTCTTCATAGCCCCCATCaGCGACCAGATGCTGGAATGGCGCCTGGGCrTCTGGAGCCAGGTGAAG 
CAGCACTATGGTGTGGACATGAGCTGCCTGGAGGGCTTCGCCACGCGCTGTCTCATGGGCTAGAGCTCTCCC 
GCGCCGGCTTGGAGCAGGAGCTGGAGGCCGGAGTGGGCGGGCGCTTCCGCTGCAGCTGCTATGGCTCGGCGC 
CCATGCATGGCTTTGCCATCTGGTTCCAGGTGACCTTCCCTGGAGGGGAGTCGGAGAAACCCCTGGTGCTGT 
CCACCTCGCCTTTTCACCCGGCCACTCACT 

AGCAAGACACGGACGTTTCAGGAGAGATCACGCTGCTGCCCTCCCGGGACAACCCCCGTCGCCTGCGCGTGC 
TG C TGCGCT ACAAAGTGGG AGAC CAGG AGG AG AAGACCAAAGACTTTG C CATGG AGGACTGAGCGTTGCCTT 
TTCTCCCAGCTACCTCCCAAAGCAGCCTGACCTGCGTGGGAGAGGCGCCACTCGGAGATCGTTGTGCAGGGA 
TTGTCCGG CGAGGACGTGCTGGCCCGGCCGCAGCGCTTTGCTCA 



In a search of public sequence databases, the NOV50a nucleic acid sequence, located 
on chromosome 19, has 681 of 719 bases (94%) identical to a gb:GENBANK- 
20 ID:AK001421|acc:AK001421.1 mRNA from Homo sapiens (cDNA FLJ10559 fis, clone 

NT2RP20026 1 8, weakly similar to Protein Arginine N-Methyltransferase 2 (EC 2. 1 . 1 .-)) (E = 
2.7e 136 ). 

The disclosed NOVSOa polypeptide (SEQ ID NO: 188) encoded by SEQ ID NO: 187 
has 375 amino acid residues and is presented in Table 50B using the one-letter amino acid 
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code. Signal P, Psort and/or Hydropathy results predict that NOVSOa has no signal peptide 
and is likely to be localized to the nucleus with a certainty of 0.7000. Alternatively, NOV50a 
may also localize to the microbody (peroxisome) with a certainty of 0.2641, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
5 certainty of 0.1000. 



Table SOB. Encoded NOVSOa protein sequence (SEQ ID NO:188). 

MS Q PKKRKLE SGGGAEGGEGTEEEDGAEREAALERPRRTKRERDQLY YEC YSDVS VHEEM I ADRVRTDAYRL 
GI LRNWAALRGKTVLDVGAGTGI LS I FCAQAGARRVYAVEAS AIWQQ AREVVRFNGLEDRVHVLPGPVETVE 
LPEQVDAI VSEWMGY GLLHESMLS S VLHARTKWIiKEGGLLLPAS AEL FI API SDQMLEWRLGFWS QVKQHYG 
VDMSCLEGFATRCLMGHSEIWQGLSGEDVIARPQRFAQLELSRAGLEQELEAGVGGRFRCSCYGST^MHGF 
AIWFQVTFPGGESEKPLVLSTSPFHPATHWKQALLYIiNEPVQVEQDTOVSGEITLLPSRDNPR 
VGDQEEKTKDFAMED . - : 

A search of sequence databases reveals that the NOVSOa amino acid sequence has 3 16 
of 316 amino acid residues (100%) identical to, and 316 of 316 amino acid residues (100%) 
10 similar to, the 3 16 amino acid residue ptnr:SPTREMBL-ACC:Q9NVR8 protein from Homo 
sapiens (Human) (CDNA FLJ10559 FIS, Clone NT2RP2002618, Weakly Similar To Protein 
Arginine N-Methyltransferase 2 (EC 2.1.1.-)) (E = Lie 169 ). 

NOV50a is predicted to be expressed in at least lung, bronchus, kidney. This 
information was derived by determining the tissue sources of the sequences that were included 
1 5 in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 
NOVSOb 

In the present invention, the target sequence identified previously, NOVSOa, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 

20 starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 

25 silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 

30 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 

435 



WO 02/068649 PCT/US02/02785 



lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
5 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOVSOb. This differs from the previously 

10 identified sequence (NOV50a) at aminoacid position 1 5 A->G. 

A disclosed NOVSOb nucleic acid of 1 165 nucleotides (also referred to as CG56771- 
02) encoding a Protein Arginine N-Methyltransferase 2-like protein is shown in Table 50C. 
An open reading frame was identified beginning with a ATG initiation codon at nucleotides 4- 
6 and ending with a TGA codon at nucleotides 1 129-1 131. The start and stop codons are 

15 shown in bold in Table 50C, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 50C. NOVSOb nucleotide sequence (SEQ ID NO:189). 

AAG ATQTCG CAGCC CAAGAAAAGAAAG CTTGAGT CGGGGGGCGGCG G CG AAGG AGGGG Affl^GAArTG A A Pi AC; 
GAAGATGGCGCGGAGCGGGAGGCGGCCCTGGAGCGACCCCGGAGGACTAAGCGGGAACC^ 
TACGAGTGCTACTCGGACGTTTCGGTCC^CGAGGAGATG^ 
CTGGGTATCCTTCGGAACTGGGCAGCACTGCGAGGCAAGACG 

CTGAGCATCTTCTGTGCCCAGGCCGGGGCCCGGCGCGTGTACGCGGTAGAGGCCAGCGCCATCTGGCAACA 

GCCCGGGAGGTGGTGCGGTTCAACGGGCTGGAGGACCGGG 

GAGTTGCCGGAACAGGTGGATGCCATCGTGAGCGAGTGGAT 

AGCTCCGTCCTCCACGCGQ3AACCAAGTGGCTGAAGGAGGGOT 

TTCATAGCCCCCATCAGCGACCAGATGCTG 

GGTGTGGACATGAGCTGCCTGGAGGGCTTCGCCACGCGCT^ 

GGATTGTCCGGCGAGGACGTGCTGGCCCGGCCGCAGCGCTTTGCTCAGCTAGAGCTCTCCCGCGC 
GAGCAGGAGCTGGAGGCCGGAGTGGGCGGGCGCTTCCGCT 

TTTGCCATCTGGTTCCAGGTGACCTTCCCTGGAGGGGAGTCGGAGAAACCCCTGGTGOT 
TTTCACCCGGCCACTCACTGGAAACAGGCGCTCCT 

GACGTTTCAGGAGAGATCACGCTGCTGCCCTCCCGGGACAACCCCCGTCGCCTGCGCGTGCTGCTGC 
AAAGTGGGAGAC CAGGAGG AGAAG AC CAAAGACT TTG C CATGG AGGACTGAGCGTTG C CTTTTCC C C CAG CT 
ACCTCCCAAAGCA 



In a search of public sequence databases, the NOVSOb nucleic acid sequence, located 
on chromosome 19, has 1090 of 1091 bases (99%) identical to a gb:GENBANK- 
ID:AK001421|acc:AK001421.1 mRNA from Homo sapiens (cDNA FLJ10559 fis, clone 
20 NT2RP20026 1 8, weakly similar to Protein Arginine N-Methyltransferase 2 (EC 2. 1 . 1 .-)) (E = 
2.2e- 240 ). 

The disclosed NOV50b polypeptide (SEQ ID NO: 1 90) encoded by SEQ ID NO: 1 89 
has 375 amino acid residues and is presented in Table SOD using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOVSOb has no signal peptide 
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and is likely to be localized to the nucleus with a certainty of 0 J000. Alternatively, NOVSOb 
may also localize to the microbody (peroxisome) with a certainty of 0.2766, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1 000. 

5 



Table 50D. Encoded NOV50b protein sequence (SEQ ID NO:190). 

llSQPKKRKLESGGGGE 

GI LRNWAALRGKT VLDVGAGTG I LS I FCAQAGARRVYAVEAS AI WQQARE VWFNGLEDRVHVLPGPVETVE 
LPEQVDAIVSEWMGYGLLHESMLSSVIiHARTKVfo 

VDMSCLEGFATRCLMGHSEIWQGLSGEDVLARPQRFAQLEXSI^GLEQELEAGVGGRFRCSCTGSAPMHGF 
AIWFQWFPGGESEKPLVLSTSPFHPATHWKQALIjYLNEPVQVEQDTDVSGEITLLPSRDNPRRLRVLLRYK 
VGPQEEKTKDFAMED - 



A search of sequence databases reveals that the NOV50b amino acid sequence has 316 
of 316 amino acid residues (100%) identical to, and 316 of 316 amino acid residues (100%) 
similar to, the 316 amino acid residue ptnr:TREMBLNE W- ACC : A AH02729 protein from 
10 Homo sapiens (Human) (Hypothetical 35.2 Kda Protein) (E = 1 .8e" 169 ). 

NOVSOb is predicted to be expressed in at least lung, bronchus, kidney. . 

NOV50a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 50E 



Table 50E. BLAST results for NOV50a 



Gene Index/ 
Identifier 


Protein/ organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 15822652 | gb | AAK8 
5733. l| (AY043278) 


arginine 

methyl transferase 
6 [Homo sapiens] 


375 


374/375 
(99%) 


374/375 
(99%) 


0.0 


gi | 8922515 | ref |NP_0 
60607. l| 
<NM_018137) 


hypothetical 
protein FLJ10559 
[Homo sapiens] 


316 


316/316 
(100%) 


316/316 
(100%) 


0.0 


gi |S293956 |dbj|BAB0 
1859. l| (AP000383) 


protein arginine 
N- 

methyltransf erase 
-like 

protein [Arabidops 
is thaliana] 


399 


148/317 
(46%) 


193/317 
(60%) 


5e-66 


gi 1 15231011) ref |NP_ 

18B637.l| 

(NC_003074) 


arginine 

methyl transferase 
, putative 
[Arabidopsis 
thaliana] 


409 


143/310 
(46%) 


185/310 
(59%) 


le-65 


gi 1 15233606 | ref |NP_ 
194680. l| 
(NC_003075) 


arginine 

methyl transferase 
(paml ) 

[Arabidopsis 


390 


135/365 
(36%) 


201/365 
(54%) 


4e-58 



15 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 50F. In the ClustalW alignment of the NOV50 protein, as 
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well as all other CI ustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (Le. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
5 function. 



Table 50F. ClustalW Analysis of NOV50 



10 



15 



20 



(AY043278) arginine methyl transferase 6 [Homo 
M_018137) hypothetical protein FLJ10559 [Homo 



1) Novel NOVSOa (SEQ ID NO: 188) 

2) Novel NOVSOb (SEQ ID NO: 190) 

3) gi | 15822652 |gb|AAK85733 .1 | 
sapiens] (SEQ ID NO: 552) 

4 ) gi | 8922515 | ref |NP_060607 . 1 | 
sapiens] (SEQ ID NO: 553) 

5) gi|9293956|dbj |BAB01859.1| (AP000383) protein arginine N-me thy 1 transferase -like 
protein [Arabidopsis thaliana] (SEQ ID NO:554) 

6) gi 1 15231011 1 ref |NP_188637 . 1 1 (NC_003074) arginine tnethyltransf erase, putative 
[Arabidopsis thaliana] {SEQ ID NO: 555) 

7) gi 1 15233606 | ref |NP_194680 . 1 1 (NCJ)03075) arginine methyltransf erase (paml) 
[Arabidopsis (SEQ ID NO: 55 6) 



25 



30 



35 



40 



45 



50 



55 



60 



NOVSOa 
NOVSOb 



9i 

gi 
gi 
gi 
gi 



15822652 | 
8922515 | 
9293956) 
15231011 | 
15233606| 



NOVSOa 


28 


NOVSOb 


28 


gi 


1S822652| 


28 


gi 


8922515 | 


1 


gi 


9293956 j 


61 


gi 


15231011 | 


61 


gi 


15233606) 


52 



NOVSOa 


85 


NOVSOb 


85 


gi 


15822652] 


85 


gi 


8922515 | 


26 


gi 


9293956) 


121 


gi 


15231011) 


121 


gi 


15233606 | 


110 



NOVSOa 
NOVSOb 



gi 
gi 
gi 
gi 
gi 



15822652 | 

8922515) 

9293956) 

15231011) 

15233606) 




1 
1 
1 
1 

1 MQSGGDFSNGFHGDHHRELELEDKQGPSLSSFG|A|^SHAGARDP< 

1 MQSGGDFSNGFHGDHHRELELEDKQGPSLSSFGgA^§SHAGARDP: 

1 MTKNSNHDENEFI SFEPNQNTKIR- FEDADgDEVAEGSGVi 




LVSgQL 60 
VSgQL 60 
-PQDESMF 51 




190 



I 



200 



210 



220 



145 
145 
145 
86 

181 SED 
181 sb 

170 B3tp 



sRvdSiSsewmgySllS^ 
bHvdBiSsewmgySl^ 
E SvDg I fi SEWMGY i L ^ 

E B^3 I 8 SEWMGY i ,LL S ESML i s 



EgVDffl J m SEWMGyBllSeSMlSs 

e§vdSi m sewmgyBllBesmlSs 



ajVDjjlfj SEWMGYgLLgESMLgSVjgARggWLiq 
3VbSlBsEWMGYBLL§BSML§SV^^^WLSl 




240 



-pjML 
3ML 



202 
202 
202 
143 



DRYS 239 
PDRYS 239 
IgYKg 229 
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250 



260 



270 



280 



290 



300 



10 



15 



20 



25 



30 



35 



40 



NOVSOa 
NOV50b 



gi 
gi 
gi 
gi 



15822652| 
8522515 | 
9293956] 
15231011| 
15233606 




IRFAQ&ELE 259 

(RFAQLEL:--- 259 

IRFAQPX:--- 259 

(RFAQLELg 200 

" 285 

. (CKTIK 294 

iQNQ^DSRl^KT{©lS--- 277 



330 



340 



350 



360 



NOV50a 


259 


NOV50b 


259 


gi 


15822652 | 


259 


gi 


8922515) 


200 


gi 


9293956) 


285 


gi 


15231011| 


295 


gi 


15233606) 


277 




370 



380 



390 



400 



410 



1SPSGE 354 



420 



NOVSOa 


296 


NOVSOb 


296 


gi 


15822652 | 


296 


gi 


8922515) 


237 


gi 


9293956) 


319 


gi 


15231011 | 


355 


gi 


15233606) 


311 



311 BrMCHKLLG- 




430 



NOVSOa 


353 




NOVSOb 


353 


&RV3 


gi 


15822652) 


353 


j&RVJ 


gi 


8922515) 


294 




gi 


9293956) 


379 




gi 


15231011) 


392 




gi 


15233606) 


366 





440 

••I I 

HCDFAMED-- 375 

rjfcDFAMED-- 375 

rjKDFAMED-- 375 

rj^DFAMED- - 316 

SSAGRSFVKjSSVMR 399 

SIISNSASAYIlgKN 409 

SiNGgHCSlSRTQHYKMR 390 
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Table 49G lists the domain description from DOMAIN analysis results against 
NOV50. This indicates that the NOV50 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 50G Domain Analysis of NOV50 

gnl |Pfam|pfam0l209, Ubiejraethyltran, ubiE/COQ5 methyltransf erase 

family. (SEQ ID NO: 84 9) 

CD-Length » 237 residues, 63.3% aligned 

Score =35.0 bits (79), Expect = 0.008 



50 



55 



NOV50: 40 



Sbjct: 5 



NOV50: 91 



Sbjct: 60 



NOV50: 



148 



KRERDQLYYECYSDVS VHEEMXADRVRTDAYRLGILRNW AALRGKTVLDVG 9 0 

| | ++ | + ++ | + +++ | || | | (I! Ill 

KEEKEQKVHHVFASVAKKYDLM NDVMSFGIHRLWKDHFTMKLMGPKRGKKFLDVA 59 



AGTGILSI - FCAQAGAR- RVYAVEASA- IWQQAREVVRFNGLEDRVHVLPGPVETVELPE 

III ++ II +| ++ + + + ++ +| + | | Ml 

GGTGDIAFRLLRHAGTSGKVVVLDINENMIiKVGKKRAE -EEGKIRIEWLCANAE- -ELPF 

QVDAI VSE WMGYGLLHESMLS S VLHARTKWLKEGGLLL 185 
.+ + + +|+ + + || + || || |+ 
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Sbjct: 117 EDNTFDLVTI SFGIRNFTDYLKVLREAFRVLKPGGQLV 154 

Methyl transfer from S-Adenosyl-L-methionine (SAM) to either nitrogen, oxygen or 
5 carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and 
mammals. The reaction is catalyzed by methyltransferases (MTases) and modifies DNA, 
RNA, proteins and small molecules like catechol. The catalytic domain of SAM-MTases is of 
the alphafteta type with a central mixed beta-sheet around which several alpha-helices are 
arranged. Topologically it can be divided into two halves. The first half, formed by betal- 

10 alphaA-beta2-alphaB-beta3-alphaC, is mainly responsible for SAM binding. The second half, 
beta4-alphaD-beta5-alphaE-beta6-beta7, is primarily responsible for catalysis. According to 
the sequential order of these two sites, the SAM-MTases can be divided into three families 
Protein arginine methylation has been implicated in signal transduction, nuclear transport and 
transcription regulation. Protein arginine methyltransferases (PRMTs) mediate the AdoMet- 

1 5 dependent methylation of many proteins, including many RNA binding proteins involved in 
various aspects of RNA processing and/or transport. 

The bulk of methylated arginine residues in eukaryotic cells are found in 
heterogeneous nuclear ribonucleoproteins (hnRNPs), RNA-binding proteins that play essential 
roles in the metabolism of nuclear pre-mRNA. Lin et al. (1 996) identified a rat cDNA 

20 encoding PRMT1 (protein-arginine N-methyltransferase 1 ; EC 2. 1 . 1 .23). Recombinant 
PRMT1 methylated histones and hnRNPAl (164017) in vitro. By using a yeast 2-hybrid 
screen to identify proteins that interact with the intracytoplasmic domain of the interferon- 
alphafteta receptor-1 (IFNAR1; 107450), Abramovich et ah (1997) identified a human cDNA 
encoding a protein that was nearly identical to PRMT1. The deduced 361-amino acid protein 

25 was designated IR1B4 for 'interferon receptor-1 -bound protein 4. 1 Epitope-tagged IR1B4 
bound the IFNAR1 intracytoplasmic domain in vitro. Antibodies against IFNAR1 
coimmunoprecipitated a methyltransferase activity from human cell extracts. An antisense 
oligonucleotide strongly reduced methyltransferase activity in human cells, and caused them 
to become more resistant to growth inhibition by interferon. Abramovich et al. (1997) 

30 concluded that protein methylation, like phosphorylation, may be an important signaling 

mechanism for certain cytokine receptors. Scott et al. (1998) identified HRMT1L2 transcripts 
with variable 5-prime ends that encode 3 protein variants with different N-terminal regions. 
Rat PRMT1 and HRMT1L2 variant 2 (v.2) share 95% sequence identity, but diverge at their N 
termini. The amino acid sequences of HRMT1L2 and HRMT1L1 (601961) are 27% identical. 

35 Recombinant protein methylated human hnRNPAl and a yeast hnRNP in vitro. The 

HRMT1L2 gene complemented mutations in the yeast hnRNP methyltransferase gene HMT1. 
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Northern blot analysis revealed that HRMT1L2 is expressed as a predominant 1.4-kb mRNA 
in various adult and fetal tissues. Additional larger and smaller bands were observed in some 
tissues. 

The disclosed NOV50 nucleic acid of the invention encoding a Protein Arginine N- 

5 Methyltransferase 2-like protein includes the nucleic acid whose sequence is provided in Table 
50A, 50C, or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 50A or 50C 
while still encoding a protein that maintains its Protein Arginine N-Methyltransferase 2 -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 

10 further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

15 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 6 percent of the 
bases may be so changed. 

20 The disclosed NOV50 protein of the invention includes the Protein Arginine N- 

Methyltransferase 2-like protein whose sequence is provided in Table 50B or SOD. The 
invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residue shown in Table 50B or 50D while still encoding a protein that 
maintains its Protein Arginine N-Methyltransferase 2-like activities and physiological 

25 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 63 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Protein Arginine N- 

30 Methyltransferase 2 -like protein (NOV50) is a member of a "Protein Arginine N- 

Methyltransferase 2 family". Therefore, the NOV50 nucleic acids and proteins identified here 
may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
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antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

5 The NOV50 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in systemic lupus erythematosus , autoimmune disease, 
asthma, emphysema, scleroderma, allergy, ards, diabetes, autoimmune disease, renal artery 
stenosis, interstitial nephritis, glomerulonephritis, polycystic kidney disease, renal tubular 
acidosis, IGA nephropathy, hypercalcemia, Lesch-Nyhan syndrome, and/or other diseases and 
10 pathologies. 

NOV50 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV50 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

1 5 NOVX Antibodies" section below. The disclosed NOV50 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

20 NOV51 

A disclosed NOV51 nucleic acid of 984 nucleotides (also referred to as CG56759-01) 
encoding a Olfactory Receptor -like protein is shown in Table 51 A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 9-1 1 and ending with a 
TAA codon at nucleotides 954-956. The start and stop codons are shown in bold in Table 

25 51 A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 51A. NOV51 nucleotide sequence (SEQ ID NO:191). 

~CTGGCCTAATGAATC 
CTTGTGAGTGGACAATTCAGATCTTCCTCTTCTCA 

ATGGAGCCATTGCTTTTGCCCTGTGGTGTGACCGGCGACTTCACACTCCCATGTACATGTTCCTGGGAGA^ 
TCTCCTTTTTAGAGATATGGTATGTCTTTTCTACAGTTCC 
CAAACATCTCCTTTGCTGGATGTTTTCTCCAGTTTTATTTCTTCTTCT 
TTTTGACTGTGATGGCCTTTGATCAGTACCTTGCTATCTGCC^ 

GGCATCTCTGTGCCAAACTGGTC^TACTGTGCTGGGTTTGTGGATTTCTGTGGTTCCTGATCCCCATTGTT 
TCATCTCTCAGATGCCCTTCTGTGGCCCAAACATTATTC 

CATTGG ATT GTGTTT CTGCCC CAAGAAT CCAACTGTTTO TGGTA 

ACTTCCTCTTTATTATTGGATCCTATACTATTGTCCTGAAAGCTC 

GACATAAGGCCITCTCTACCrGTGGGTCTCATTTGG 

TGTATGTGAGCCC^GGACTCGGACATTCTACGGGGATGCAGAAAATTGTAACTTTGTTCTATGCTATGGTGA 
CCCCACTCTTCAATCCCCTTATCTATAGCCTCCAGAATAAGGAGATAAAGGCAGCCCTGAGGAAAGTTCTOT 
GGAGT T C CAACATAATCTAAGG C AT ATT AG ATT ATTC CTCCATGAT CA 



442 



WO 02/068649 



PCT/US02/02785 



In a search of public sequence databases, the NOV51 nucleic acid sequence, located on 
chromosome 22, has 967 of 984 bases (98%) identical to a gb:GENBANK- 
ID:AP000534|acc:AP000534.1 mRNA from Homo sapiens (genomic DNA, chromosome 
22ql 1 .2, Cat Eye Syndrome region, clone:c23H5) (E = 2.9e" 208 ). 

5 The disclosed NOV5 1 polypeptide (SEQ ID NO: 1 92) encoded by SEQ ID NO: 1 9 1 has 

315 amino acid residues and is presented in Table 5 IB using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV51 has a signal peptide and is 
likely to be localized to the plasma mambrane with a certainty of 0.6000. Alternatively, 
NOV51 may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 

10 reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV51 is between positions 40 and 41 : 
IYA-LT. 



Table 51B. Encoded NOV51 protein sequence (SEQ ID NO:192). 

MNVSE PNS S FAFVNEF ILQGFSCEWT I QI FLFSL FTT I YALTITGNGAI AFALWCDRRLHTPMYMFLGDFS F 
LEIWYVFSTVPK>1LVNFLSEKTNISFAGCFLQFYFFF 

CAKLVILCWVCGFLWFLI PI VLI SQMPFCGPNI IDHVVCDPGPLFALDCVS APRI QLFCYTLNSLVIFGNFL 
FIIGSYTIVLKAVLGTPSSTGRHKAFSTCGSHLAWS^ 

FNPLI YSLQNKE IKAALRKVLGSSNI I 

15 A search of sequence databases reveals that the NOV51 amino acid sequence has 191 

of 314 amino acid residues (60%) identical to, and 226 of 314 amino acid residues (71%) 
similar to, the 324 amino acid residue ptnr:SPTREMBL-ACC:Q9WU86 protein from Mus 
musculus (Mouse) (Odorant Receptor S 1 ) (E = 7.0e" 100 ). 

NOV51 is predicted to be expressed in at least Apical microvilli of the retinal pigment 

20 epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 

25 cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 

30 were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 
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NOV51 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 51C 



Table 51 C. BLAST results for NOV51 


Gene Index/ 

Identifier 


S ^VJLdLll/ y Gill JL- Olll 


(aa) 


Trfpnfc 1 t* v 
x ucn l. j. u y 

(%) 


CUD i LXVCO 
(%) 


Expect 


gi 1 15293807 |gb|AAK9 
5096. ll (AF399611) 


olfactory 

J. WW w Wl J» l*JVi/4liWr 

sapiens) 


217 


205/217 


208/217 


4e-99 


gi | 9938010 |ref |NP_0 

64684.1) 

(NM_020288) 


odorant receptor 
SI gene [Mus 
musculus] 


324 


191/314 
(60%) 


226/314 
(71%) 


5e-90 


gi | 17476501 |ref|XP 
063251. 1| 
(XMJ>63251) 


similar to 
OLFACTORY 
RECEPTOR- LIKE 
PROTEIN F6 (H. 
sapiens) [Homo 
sapiens] 


1056 


134/293 
(45%) 


181/293 
(61%) 


2e-66 


gi 1 15293805 |gb | AAK9 
5095. l| (AF399610) 


olfactory- 
receptor [Homo 
sapiens] 


217 


142/217 
(65%) 


163/217 
(74%) 


2e-65 


gi | 17476700 |ref|XP 

063315.1) 

(XM 063315) 


similar to 
odorant receptor 
SI gene (H. 
sapiens) [Homo 
sapiens] 


195 


131/189 
(69%) 


156/189 
(82%) 


2e-64 



5 The homology between these and other sequences is shown graphically in the 

ClustaTW analysis shown in Table 51D. In the ClustalW alignment of the NOV51 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
10 can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 51D. ClustalW Analysis of NOV51 

1) Novel N0V51 (SEQ ID NO: 192) 
15 2) gi|l5293807|gb|AAK95096.l| (AF399611) olfactory receptor [Homo sapiens] (SEQ ID 
N0:557) 

3) gi|9938010|ref |NP_064684.l| (NM_020288) odorant receptor SI gene [Mus musculus] 
(SEQ ID NO: 558) 

4) gi 1 17476501 1 ref |XP_063251.l| (XMJ)63251) similar to OLFACTORY RECEPTOR- LIKE 
20 PROTEIN F6 (H. sapiens) [Homo sapiens] (SEQ ID NO: 559) 

5) gi|l5293805|gb|AAK95095.l| (AF399610) olfactory receptor [Homo sapiens] (SEQ ID 
N0:560) 

6) gi 1 17476700 1 ref |XP_063315 .1 | (XM_063315) similar to odorant receptor SI gene (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 561) 

25 

10 20 30 40 50 60 

| | | .... | | | | | | | | | 

N0V51 1 -- 1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



gi 
gi 
gi 
gi 



15293807) 
9938010 | 
17476S01) 
15293805) 
17476700 



NOV51 



gi 
gi 
gi 
gi 
gi 



15293807 I 

9938010) 

17476501 

15293805 

17476700 



NOV51 



gi 
gi 
gi 
gi 
gi 



15293807) 

9938010) 

17476501) 

15293805) 

17476700 



N0V51 



gi 
gi 
gi 
gi 
gi 



15293807 | 

9938010) 

17476501 

15293805 

17476700 



NOV51 

gi | 15293807 | 
gi) 9938010 | 
gi j 17476501 | 
gi 1 15293805 | 
gi | 17476700 | 



NOV51 



gi 
gi 
gi 
gi 
gi 



15293807 
9938010) 
17476501 
15293805 
17476700 



NOV51 



gi 
gi 
gi 
gi 
gi 



15293807 
9938010) 
17476501 
15293805 
17476700 



NOV51 

gi 15293807) 
gi 9938010) 
gi 17476501 
gi 15293805 
gi 17476700 



1 1 

1 1 

1 MPVLLPVHFSAKCPLLLLCDPANPPSEPLPSQGCFIFIHRVLLDLSTAGESGNTAGFICD 60 

1 1 

1 --- - 1 

70 80 90 100 110 120 

| | | | | ) | | | | | | 

1 1 

1 - 1 

1 1 

61 QALLTSPVREDGAENGLGFHQPVELHICGDAVGFVGMGQRRKPMSVPWSHPKISEKCASD 120 

1 - — - -- 1 

1 - - -- 1 

130 140 150 160 170 180 

....|....| | | | | | | |....|....| | 

1 - - 1 

1 — 1 

1 - - - - 1 

121 TWCTDATYHREHSKPSGPWEHGPLKPFEDWVPALPYPLWPQELLHCGSQSGDCMCLLLLE 180 

1 1 

1 - 1 

190 200 210 220 230 240 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 1 

1 - - -- 1 

1 - - 1 

181 SSRRSPPTLPIPLTFPRLCQSFPLLTASGKEPSCGFTSALRRLYGCGAAERPQSPVTPKT 240 

1 1 

1 1 

250 260 270 280 290 300 

.•..|.. |.. v |....|....|....|....|....|. ..|....| 

i i 

i - - — i 

i i 

241 ETSEQGPKDPPIHLAHPSDRALSPSCFIiSLRAVILTCKNRDAQVEEGHRREPPVLDCGYQ 300 

1 --- - --- — 1 

1 --- --- 1 

310 320 330 340 350 360 

|....| | | | | | | )....) ....) | 

1 — - 1 

1 - - - 1 

1 1 

3 01 RSGTRGNHTRRICSTLRGSRIEAWVAAATLQRGPYFRKQQPLGKDSWSVAEDWIEAFMLA 360 

1 1 

1 -- 1 

370 380 390 400 410 420 

....|....| |....| |....| |....| |....|....| | 

1 - - - - - 1 

1 - - 1 

1 - - 1 

361 FGTOVLWDASMALEAQRDPSSNDTKGKDQLTKIU)QRNPQNFAIiLQKSAASDWNSQPVCRR 420 

1 1 

1 - 1 

430 440 450 460 470 480 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 - - 1 

1 - --- 1 

1 1 

421 GYLTCASASI^EISSPHFPVHLNAPKCHWGLSSSPVERWMLRERKAVTDESSSSWMVAIR 480 

1 1 

1 ' 1 



70 



490 



500 



510 



520 



530 



540 
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NOV51 



gi 


15293807 


1 1 


gi 


9938010) 


1 


gi 


17476501 


481 


gi 


15293805 


1 


gi 


17476700 


1 


NOV51 


1 


gi 


152938071 1 


gi 


9938010) 


1 


gi 


17476501 


541 


gi 


15293805 


1 


gi 


17476700 


1 • 


NOV51 


1 


gi 


15293807 


1 


gi 


9938010) 


2 


gi 


17476501 


601 


gi 


15293805 


1 


gi 


17476700 


1 


NOV51 


56 


gi 


15293807 


1 


gi 


9938010) 


62 


gi 


17476501 


661 


gi 


15293805 


1 


gi 


17476700 


1 


NOV51 


116 


gi 


15293807 


45 


gi 


9938010) 


122 


gi 


17476501! 


721 


gi 


15293805| 


45 


gi 


17476700 


1 



NOV51 


175 


gi 


15293807 


104 


gi 


9938010) 


181 


gi 


17476501 


781 


gi 


15293805 


104 


gi 


17476700 


54 



NOV51 


235 


gi 


15293807) 


164 


gi 


9938010) 


241 


gi 


17476501) 


841 


gi 


15293805) 


164 


gi 


17476700| 


114 



NOV51 


295 


gi 


15293807 


217 


gi 


9938010) 


301 


gi 


17476501 


901 


gi 


15293805 


217 


gi 


17476700 


174 



550 
).... |....|, 



560 



570 



580 



590 
|....|.. 



600 



-- 1 
1 

-M 1 



541 PNGPWERGPLKPSGDWiyrCLHyLLWPQELFHCRSQTEDYTVTWFDVVDRQMQKYSQSPFL 600 

- - 1 

_ 1 

610 620 630 640 650 660 

MNVSEPNSSFAFVNEFILQGFSCEWTIQIFLFSLFTTIYALTITGNGAIAFALWC 55 

1 

SLFPQRNLDAMNRSAAHVTEFVLLGFPGSWKIQI FLFVLFLVFYVLTLLGNGAI ICAVRC 61 



670 

....|....|....| 
DRRLHTPMYMFLGDFSl 

DSRLHTPMYFLLGNFS 



680 



690 




710 



720 



700 

•L 



ISFgGCFLQFYFFFSLGggE 
liiJjIS FgGCFLQFYFFFSLGggE 

I s fSgcflqf yfffslgSJe 

I S F^cJJJqF YF^F^IiG^E 

isfSgcflqfyfffslgS!e 



115 

44 

121 

720 

44 

1 



730 



740 



750 



760 



770 



780 




910 



920 



930 



940 



950 



|....|.-..|. 



-SSNII- 



).... |....|. 



960 
.-I 



315 

— - 217 

GMRIVKNM 324 

IFRKGCDFAFERCNSACNCRKGSLTTTTKSATLRCGAGAKARAGARL 960 

— 217 

GLTVSQN 195 
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10 



15 



NOV51 



gi 

gi 
gi 
gi 
gi 



15293807 
9938010) 
17476501 
15293805 
17476700 



NOV51 



gi 
gi 
gi 
gi 
gi 



15293807 
9938010) 
17476501 
15293805 
17476700 



970 980 990 1000 1010 1020 

....|..-..|....|....|....|....|....|....|....|....|....|-...| 

315 - 315 

217 - " 217 

324 — 324 

961 HPAAGSPI®SRKVNVRVQKDPRRSVPKVETFISGSGPSCVGQCTGRVCILKGTRTISGGL 1020 

217 217 

195 _ — — 195 

1030 1040 1050 

....|....|....|....|....|....|. — |. 

315 — " 315 

217 217 

324 - 324 

1021 WLEDPRKTRTTDFTHRKIKVTAGLAGEKVEPTLPRC 1056 

217 217 

195 195 



20 



Table 51E lists the domain descriptions from DOMAIN analysis results against 
NOV51 . This indicates that the N0V51 sequence has properties similar to those of other 
proteins known to contain this domain. 



25 



30 



35 



40 



45 



50 



Table 51E Domain Analysis of NOV51 

gnl|Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 

Score w 105 bits (262) , Expect = 4e-24 



NOV51 : 


45 


Sbjct: 


1 


NOV51: 


105 


Sbjct: 


61 


NOV51: 


165 


Sb j Ct : 


121 


NOV51: 


225 


Sbjct: 


181 


NOV51: 


281 


Sbjct: 


241 



GNGAIAFALWC^RRLHTPMYMFIiGDFSFLEI^ 104 

II + + ++I I I + 1 I + + ++ ++ + | | + III 
GNLLVILVILRTKKLRTPTNI FLLNIAVADIiLFLLTLPPWALYYLVGGDWVFGDALCKLV 60 



I I + INI ++ 1*1-1 1 1 III M M II I I I* 

GALFVVNGYASI LLLTAIS IDRYIiAIVHPLRYRRIRTPRI^AKVLI LLVWVliALLLSLPPL 120 



++ 



+++ 



+ IM+II +1 +1+ + + 

181 RSQRSLKRRSS SERKAAKMLLVVVWFVLCWLP YHI VLLLDSLCLLS I WRVLPTALLI Th 240 



Ml 



G-Protein Coupled Receptor (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
classified into four major subfamilies. These receptors share a seven transmembrane domain 
structure with many neurotransmitter and hormone receptors. They are likely to be involved in 
the recognition and transduction of various signals mediated by G-Proteins, hence their name 
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G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and belong to 
four gene subfamilies, displaying great sequence variability. These genes are dominantly 
expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large family of GPCRs in 
5 a number of species. As members of the GPCR family, these receptors share a seven 

transmembrane domain structure with many neurotransmitter and hormone receptors, arid are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs they can be expressed in a variety of tissues where they are thought to be 
involved in recognition and transmission of a variety of signals. The human OR genes are 

10 typically intron-less and belong to four different gene subfamilies, displaying great sequence 
variability. These genes are dominantly expressed in olfactory epithelium. 

The disclosed NOV51 nucleic acid of the invention encoding a Olfactory Receptor-like 
protein includes the nucleic acid whose sequence is provided in Table 5 1 A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

1 5 be changed from the corresponding base shown in Table 5 1 A while still encoding a protein 

that maintains its Olfactory Receptor -like activities and physiological functions, or a fragment 
of such a nucleic acid. The invention further includes nucleic acids whose sequences are 
complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 

20 nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

25 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 2 percent of the bases may be so changed. 

The disclosed NOV51 protein of the invention includes the Olfactory Receptor-like 
protein whose sequence is provided in Table 5 IB. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

30 in Table 5 IB while still encoding a protein that maintains its Olfactory Receptor-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 54 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
i (Fab)2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this Olfactory Receptor -like protein 
(NOV51) is a member of a "Olfactory Receptor family". Therefore, the NOV51 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 
5 therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

10 The NOV5 1 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCI1 and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

15 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

20 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

25 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome and/or other pathologies and disorders of the like.. The 
polypeptides can be used as immunogens to produce antibodies specific for the invention, and 
as vaccines. They can also be used to screen for potential agonist and antagonist compounds. 
For example, a cDNA encoding the OR -like protein may be useful in gene therapy, and the 

30 OR-like protein may be useful when administered to a subject in need thereof. By way of 
nonlimiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from bacterial, fungal, protozoal and viral infections 
(particularly infections caused by HTV-l or HIV-2), pain, cancer (including but not limited to 
Neoplasm; adenocarcinoma; lymphoma; prostate cancer; uterus cancer), anorexia, bulimia, 
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asthma, Parkinson's disease, acute heart failure, hypotension, hypertension, urinaiy retention, 
osteoporosis, Crohn's disease; multiple sclerosis; and Treatment of Albright Hereditary 
- Osteodystrophy, angina pectoris, myocardial infarction, ulcers, asthma, allergies, benign 
prostatic hypertrophy, and psychotic and neurological disorders, including anxiety, 
5 schizophrenia, manic depression, delirium, dementia, severe mental retardation and 

dyskinesias, such as Huntington's disease or Gilles de la Tourette syndrome, and/or other 
diseases and pathologies. 

NOV51 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV51 substances for use in 
10 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV51 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
1 5 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV52 

A disclosed NOV52 nucleic acid of 3828 nucleotides (also referred to as CG56731-01) 
encoding a H326-like protein is shown in Table 52A. An open reading frame was identified 
20 beginning with a ATG initiation codon at nucleotides 177-179 and ending with a TAA codon 
at nucleotides 1968-1970. The start and stop codons are shown in bold in Table 52A, and the 
5' and 3' untranslated regions, if any, are underlined. 



Table 52A. NOV52 nucleotide sequence (SEQ ID NO:193). 

CTCTTAGCGCTCAGGTCTTTTCCITCCGCCC& 

GTCCTTCAGTG AAT CT ACAG AG CCT ATTTC CT C AGGAG CCT CAGCCTGGTC CTT ACTTCAGTGATAAAAG GA 

GGAAAGGCTGG CTACAG CAAACATCAT TCAAGATG TCCAGCAAAGGGAG CAGCACAG ATGGCAGAACAGACT 

TAG CT AATGG AAGC CTGTC TAG CAGTC CAG AGGAG ATGT CTGG AG C TGAAG AGGGGAGGGAGACATCCT CAG 

GCATTGAAGTGGAGGCCTCAGACCTGAGTTTGAGCTTGACTGGGGATGATGGTGGCCCCAACCG 

CAGAAAGTCGAGGCACAGACACAGAGAGCTCAGGTGAAGATAAGGACTCTGACAGCATGGAGGACACTGGTC 

ATTACTCCATTAATGATGAAAATCGAGTCC^TGACCGCTC^GAGGAAGAGGAAGAGGAGGAAGAAGAGGAGG 

AAGAAGAGC^CCTCGGCGCCGTGTACAGCGCAAGCGGGCTAACCGTGACCAGGACTCATCAGATGATGAGC 

GGGCCCTAGAGGACTGGGTGTCCTCAGAAACATCAGCTCTACCCCGACCTCGCTGGCAAGCCCTCCCTGCCC 

TTCGGGAGCGGGAGCTGGGTTCAAGTGCCCGCTTTGTCTATGAGGCCTGTGGGGCAAGAGTCTTTGTGCA^ 

GTTTCCGCerGCAGCATGGGCTTGAGGGeCATACTGGTTGTGTCAATACCCTGCACTTTAA 

CCTGGCTGGCCAGTGGCAGCGATGACCTGAAGGTGGTGGTGTGGGATTGGGTACGGCGGCAGCCAGTACTGG 

ACTTTGAGAGTGGCCACAAAAGTAATGTGTTCCAGGCCAAGTTTCTTCCTAACAGTGGTGATTCT 

CCATGTGTGCCCGTGACGGGCAGGTTCGAGTAGC&GAACT 

GTGTGGCCCAGCACAAGGGAGCGTCCCACAAGTTGGCACTGGAACCAGACTCTCCCTGTACGTTCTTATCTG 
CAGGTGAAGATGCAGTTGTTTTCACCATTGACCTGAGACA^ 

AAGAGAAAGAGAAGAAAGTGGGGCTGTATACGATCTATGTGAATCCTGCCAATACCCACCAGTTTGCA 

GTGGACGAGATCAGTTTGTAAGGATTTATGACCAGAGGAAAATTGATGAGAATGAGAACAATGGAGTACTC^ 

AGAAGTTCTGTCCTCATCACCTGGTGAACAGTGAGTCCAAAGCAAACATCACCTGTCTTGTGTACAGCCACG 

ACGGCACAGAGCTCCTGGCCAGTTACAATGATGAAGACATTTACCTCTTCAACTCCTCTCACAGTG 

CCCAGTATGTTAAGAGATACAAGGGCCACAGAAATAATGCC^CACn'AAAAGGCGTCAATTTCrrATGGCCC^ 

AGAGTGAGTTTGTGGTGAGCGGTAGTGACTGTGGGCACATCTTCCTCTGGG 
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TTCAGTTCATGGAGGGGGACAAGGGAGGCGTGGTAAACTGTCTTGAGCCCCACCCTCACCTGCCTGTGCTGG 
CAACCAGTGGCCTAGACCATGATGTGAAGATCTGGGCACCC^CAGCTGAAGCTTCCACTGAGCTGACAGGGT 
T AAAAG ATGTG ATT AAG AAGAACAAGCGGGAGC GGGATG AAG ATAG CTTGC AC CAAACTGAC CTGTT TG AT A 
GTCACATGCTGTGGTTCCTTATGCATCACCTGAGACAGAGACGCCATCACCGGC^ 

1!TGGGGCCACAGACGCGGACTCTGATGAGTCTCCCAGCT CCTCAGACAC ATCGGACGAGGAGGAGGG CCCTG 
ACCGGGTGCAGTGCATGCCATCTTG AGGCCTCATACCTAGGTGGGGCAGGCTGGGGCTGCCAACCTGATCCT 
GCCTGGGCAACCCTTTCCTGTCCCAGGCCCTACATTCAG^^ 

AAAGAAAGACATCCCAGGAGAAGGACAAACCAGAGGAGTGAACCAACAAAGAGTACCTAGGAATGGGAGTTG 

AGCCTGGAATGGGCTCCATGGAGAGGTGCATAGGACTCGGCAGAAATGGCCTCTCCCCAAAGCCTCTTTTTC 

AGAGGAGAGGGAAGCCTATTTGTTAACTGGTTTGGGATAGGGAATGGGGTTTCTTTTTCT 

GTTTCTTGGGCTGGGGGAGGGGTGGGGGGAACAACTGGCTATTCAGTACCAAGGGGCCAGAGTGGAGGGTAG 

GAGTGC(^CTCTCTCTTTGGTTTAGGTTTTTGACCTTTTCTTC 

CTCCCNI^ACCCCACAACCCCATCCCAGAATCCTATTTTCCTGGGAAGTCCT 

CACTCTTCACTTTCCTTTCCACCTTATTCATTCTCrTGTACTTACCACAGTATTTTGCACTTGATTACATATC 
CTTC^TCTCTTCTCTTCATCCCATC^CCCCCTAAATAGGTCAGGTGAGGGAGGCTGGGAAGAGGTGGGAG^ 
GGGGAGAAGTGAAGGAAGATAGGAAGGATATTACCTCTTCTGTTATTTTTTTAAGAAACATTGTTTGGTGGC 
AGCAATCTCCCTGTCCCTATCACTGTTAGAGGCCTAATTTTATATCTATAT^ATATATTAAAAAGCAAGTCAA 
ACTTGGATGTATCAAGGTAAAATTATTGTCAAAGTTTAAATAC^ 

CTTAAGAGTGAACAAGAGTAATGGTGTGGAAGTGACACCTGGGGTCAGTTTACCTCTGTGTATGGTCACTAG 
AGATTGGGACTTACCCTTTAGGTTTTAGGAGGCTTGAGAATGGAAGGATCCTCATTTCTGCCCTTCCTGGTT 
CCCTGCTTTGGTGTAGGGGTTGGGAAAAACAGGAAATTCCTCTCAGCT 

TTAAGT CTTGTAGGGGGTTCCAAGGATGGCT CT TCTAACCAG AGG CTGG CCTGTCTT TAAAACTTAACTACT 
TTAGGGTGGTGCCACCACTGCAGACTATTGTGGTACTTTGTGACAGAAGACATGTACACACACACCACACAC 
ATACATACACACTCTCTCACTCTGTCTCTCTTACCTTTAGCTGCTTGATCATTAA 
CAGTTCCCTTCTTTATAGAAGAGTGAAGGGAAAGACTTCCTGGGTTTGACT 

ATAT TTTAGGATTGAGGAATAAAGTCATTAATC TAAGGAACTG ATTACAGTGGCTGG AG CTTGGG CACTTGT 

CTTATCACTGGTCACTGAGTCTGAAAGTCCGAGNTGAATTCTTGCCCTTAAGTGCTTTTGCT 

TGCCCCCAGTTCCACAAGATCCAACC^GAATTCTGTATCCTGGCAACAGTCAGATTCTTCTAAATCAGCCA 

GCAAGAGGGNAAAGAGTGAGAGATGGTATTCCCAGATCATTCTTCCTCCTGCCCCTTTCCCAGCAGCTCTAG 

ACGAGATGTTGGCTGCTGTACTTACTCCCTGAGGTAGGGAATG 

TGCTGGTGGGGTGATAGGGTGGGCTAAAAACCATGCACTCTGGAATTTGTTGTATTTTCTCCCAGTAAAGCT 
TTTCTTCTCCCG 



In a search of public sequence databases, the NOV52 nucleic acid sequence, located on 
chromosome 22, has 3818 of 3828 bases (99%) identical to a gb.GENBANK- 
ID:HSU06631|acc:U0663Ll mRNA from Homo sapiens (Human (H326) mRNA, complete 
cds)(E = 0.0). 

The disclosed NOV52 polypeptide (SEQ ID NO:194) encoded by SEQ ID NO:!93 has 
597 amino acid residues and is presented in Table 52B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV52 has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.3000. Alternatively, 
NOV52 may also localize to the nucleus with a certainty of 0.3000, to the mitochondrial 
matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a certainty of 0,1000. 



Table 52B. Encoded NOV52 protein sequence (SEQ ID NO:194). 

TlSSKGSSTOGRTDLANGSL 

GEDKDSDSMEDTGHYS INDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDS SDDERALEDWVS SET 

SALPRPRWQALPALRERELGSSARFVYEACGARVFVQ 

VVVWDWVRRQPVLDFESGHKSNVFQAKFLPNSGDS^ 

LALEPDSPCTFLSAGEDAWFTIDLRQDRPASKLVV^ 

QRKIDENEIWGVLKKFCPHHLVNSESKANITCLVYSHDG 

NNATVKGVNFYGPKSEFOTSGSDCGHIFLWEKSSCQIIQ 

WAPTAEASTELTGIJCDVIKKNKRERDEDSLH^ 

PSSSDTSDEEEGPDRVQCMPS 
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A search of sequence databases reveals that the NOV52 amino acid sequence has 588 
of 597 amino acid residues (98%) identical to, and 589 of 597 amino acid residues (98%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q12839 protein from Homo 
5 sapiens (Human) (H326) (E = 0.0). 

NOV52 is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 

1 0 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Amnion, 
Appendix, Bone, Bronchus, Brown adipose, Cervix, Chorionic Villus, Colon, Coronary 
Artery, Dermis, Epidermis, Foreskin, Hair Follicles, Hypothalamus, Kidney Cortex, Liver, 
Lung, Lung Pleura, Lymph node, Lymphoid tissue, Muscle, Ovary, Oviduct/Uterine 
Tube/Fallopian tube, Parathyroid Gland, Parotid Salivary glands, Peripheral Blood, 

15 Respiratory Bronchiole, Retina, Right Cerebellum, Skin, Synovium/Synovial membrane, 
Temporal Lobe, Thymus, Tonsils, Umbilical Vein, Urinary Bladder, Vein, Vulva, Whole 
Organism. 

This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
20 sources, Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in plasma cells (myeloma) 
because of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:HSU06631|acc: 
U06631 .1) a closely related Human (H326) mRNA, complete cds homolog. 

NOV52 also has homology to the amino acid sequences shown in the BLASTP data 
25 listed in Table 52C 



Table 52C. BLAST results for NOV52 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 13636682 |ref |XP_ 
010501. 2| 
(XM_01050l) 


similar to H326 
(H. sapiens) 
[Homo sapiens] 


597 


597/597 
(100%) 


597/597 
(100%) 


0.0 


gi | 7657148 |ref|NP 0 
56541. l| 
(NM_015726) 


H326 [Homo 
sapiens] 


597 


588/597 
(98%) 


589/597 
(98%) 


0.0 


gi | 17485807 |ref|XP 
066683. l| 
<XM 066683) 


similar to H326 
(H. sapiens) 
[Homo sapiens] 


779 


401/603 
(66%) 


463/603 
(76%) 


0.0 
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gi| 17485821 _ 

066690.1) 

(XM 066690) 


similar to H326 
(H. sapiens) 
[Homo sapiens] 


577 


387/601 
(64%) 


449/601 
(74%) 


0.0 


gi | 6679281 | ref | NPJ) 
32847.1) 
(NM 008821) 


plasmacytoma 
expressed 
transcript 2 [Mus 
musculus] 


747 


308/469 
(65%) 


383/469 
(80%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 52D. In the ClustalW alignment of the NOV52 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



Table 52D. ClustalW Analysis of NOV52 



1) Novel N0V52 (SEQ ID NO: 194) 

2) gi| 13636682 | ref |XP_010501 .2 | (XM_010501) similar to H326 (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 562) 

3) gi | 7657148 | ref |NP_056541 . 1 1 (NMJ)15726) H326 [Homo sapiens] 

4) gi | 17485807 | ref |XP_066683 . 1 | (XM_066683) similar to H326 (H. 
sapiens] (SEQ ID NO: 564) 

5) gi 1 17485821 1 ref |XP_066690 .1 1 (XMJJ66690) similar to H326 (H. 
sapiens] (SEQ ID NO: 565) 

6) gi 1 6679281 1 ref |NP_032847 .1 | (NM_008821) plasmacytoma expressed transcript 2 [Mus 
musculus] (SEQ ID NO: 566) 



(SEQ ID NO: 563) 
sapiens) {Homo 

sapiens) [Homo 



10 



20 



30 



25 



30 



35 



40 



45 



50 



N0V52 



gi 
gi 
gi 
gi 



13636682) 
7657148 | 
17485807) 
17485821 j 
6679281) 



40 

•I I 



50 



60 



1 
1 
1 
1 
1 
1 



NOV52 


1 


gi 


13636682) 


1 


gi 


7657148) 


1 


gi 


17485807) 


61 


gi 


17485821) 


1 


gi 


6679281) 


51 



1 

1 

— 1 

MANSKNPVE IE I MVFS STLNMPDVGDNKPLS I KQVNDI SRQSTMNSVHLLNLLLCFATAL 60 

1 

- - MSSHES YTNAAETPENIS ILSCliGETSGALVDTKTI SDI KTMDPRVSLTP 50 



70 80 

.).... !....).. 



90 



100 

.|....|....|. 



110 



120 



1 

--- 1 

- 1 

61 ' KHHGLGDLQAACTLRQRVDVSLIGEQHPTRI PVI IGYKSEEQLPVLDKTKFLVLANQAFF 120 

1 

51 SSDVTGTEDSSVLTPQSTDVN SVDSYQGYEGDDDDEEDDEDDKDGDSNLPSL 102 



130 
1 I I 



140 



150 



160 
I I I 



170 



180 



N0V52 


1 


gi 


13636682) 


1 


gi 


7657148) 


1 


gi 


17485807 | 


121 


gi 


17485821) 


1 


gi 


6679281) 


103 



1 
1 
1 



190 



200 



210 



220 



230 



-EWEGQ 150 
240 
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NOV52 

gi|l3636682| 
gi| 7657148 | 
gi 1 17485807 | 
gi j 17485821 j 
gi|'667928l| 



1 

1 

1 

181 QANFLPRAFRGLAFAVPDLLQQTSF1 
1 

151 GGGSLFYPYELEAGEWEAQNVQNLFHRYBLEEGgWEAQWQg 





NOV52 


35 T 


ss 




gi 


13636682 | 


35 T 


ss 




gi 


7657148 | 


35 T 


ss 




gi 


17485807 | 


241 A 


ss 




gi 


17485821| 


35 A 


ss 




gi 


6679281] 


211 EVQGFFQIT 




NOV52 

gi | 13636682 
gi 1 7657148 | 
gi j 17485807 
gi 1 17485821 
gi j 6679281 1 



310 



320 



330 



■|....|. 



8 2 g^GHYSINDENRVHpRSEEEEEEEEEEEEEQggRRVQ] 
8 2 TOTGHYSimENRVHDRSEEEEEEEEEEEEEoBpRVORK: 
8 2 ™TGHYS INDENRVHDRSEEEEEEEEEEEEEQggGACTAQi 

294 ^ FEHFLMSGS 

85 GEG LFGYPLgj^MCPRi 

271 r 



340 



350 



360 




370 



380 



390 



400 



410 



420 



NOV52 


142 


gi 


13636682) 


142 


gi 


7657148 | 


142 


gi 


17485807 | 


325 


gi 


17485821) 


122 


gi 


6679281 | 


302 



NOV52 


202 


gi 


13636682 | 


202 


gi 


7657148) 


202 


gi 


17485807) 


385 


gi 


17485821) 


182 


gi 


6679281) 


362 



NOV52 

gi | 13636682 | 
gi|7657148| 
gi 1 17485807 | 
gi j 17485821 | 
gif 6679281 | 



262 
262 
262 
445 
242 
422 




NOV52 



gi 
gi 
gi 
gi 
gi 



13636682) 
7657148 | 
17485807) 
17485821 | 
6679281) 



550 



I. 



560 



570 



I. 



580 



I . 



590 



I. 



600 
..I 



322 
322 
322 
505 
302 
482 



LWTKE^KKVGLYTlgVNPANTgQFAVGGgDQFVRIYDQRKIDENENNGVLKKFCPHHL 

lwtke||kkvglytiRvnpantoq^ 

lwtke^kkvglytiRvnpantBqfavggSdqfvriydqrkidenenngvlkkfcphhl 



^WTgE^KKVGLYTIf 
|WT§ES'RKVGL.YTl| 
L??V0KaS3K-KVGLYTH? 



^PANTOQFAVGGKDQFVRIYDQRKIDggENNGVLKKFgPHHL 
STjPAKiflQFAVGGgDQFVRIYDQRglDl^ENNGVLKKFgPHHL 

jqfavggsdqfIiSriydqrkidenEnwgvlkkfcphhl 



381 
381 
381 
564 
361 
541 



NOV52 


382 


gi 


13636682 | 


382 


gi 


7657148) 


382 


gi 


17485807) 


565 


gi 


17485821) 


362 


gi 


6679281) 


542 



610 



620 



630 



640 
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gi 
gi 
gi 
gi 
gi 



13636682 | 
7657148 | 
17485807) 
17485821] 
66792811 



700 
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710 720 



NOV52 


502 


gi 


13636682) 


502 


gi 


7657148 | 


502 


gi 


17485807 | 


685 


gi 


17485821) 


482 


gi 


6679281) 


662 




NOV52 


562 


gi 


13636682) 


562 


gi 


7657148) 


562 


gi 


17485807 | 


745 


gi 


17485821) 


542 


gi 


6679281| 


722 




800 



810 



SDES jSSSDTSgEEEGgDRVQCMPS 

sdes ssssdtsBeeegHdrvqcmps 

SDES j SSSDTSgEEEGgDRVQCMPS 
SDES : sSsjgTslEEEBSDRVQ'CMPS 

"|des : s§sdtsIeeeg§drvqc0ps 



597 
597 
597 
779 
577 
747 



Tables 52E-F list the domain descriptions from DOMAIN analysis results against 
NOV52. This indicates that the NOV52 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 52E Domain Analysis of NOV52 

gnl| Smart |smartO032O, WD40, WD40 repeats; Note that these repeats are 
permuted with respect to the structural repeats (blades) of the beta 
propeller domain. (SEQ ID NO: 850) 
CD-Length = 40 residues, 82.5% aligned 
Score = 43.1 bits (100) , Expect = 4e-05 



N0V52: 189 LEGHTGCVNTLHFNQRGTWLASGSDDLKVVVWD 221 

i+i 1 1 1 i ++ 1+ i i inn i + +ii 

Sbjct: 8 LKGHTGPVTSVAFSPDGKLIiASGSDDGTIKLWD 40 



Table 52F Domain Analysis of NOV52 

gnl|Pfam|pfam00400, WD40, WD domain, G-beta repeat. (SEQ ID NO: 851) 
CD-Length = 39 residues, 97.4% aligned 
Score = 43.1 bits (100), Expect = 4e-05 



N0V52: 184 RliQHGLEGHTGCVlTOjHFNQRGTWLASGSDDLKVVVWD 221 

+i i mi i ++ 1+ i iiinii i +ii 

Sbjct: 2 KLLRTLSGHTGSVTSVAFSPDGKLLASGSDDGTVKIWD 39 
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Beta-transducin (G-beta) is one of the three subunits (alpha, beta, and gamma) of the 
. guanine nucleotide-binding proteins (G proteins) which act as intermediaries in the 
transduction of signals generated by transmembrane receptors. The alpha subunit binds to and 
hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but they seem to 
5 be required for the replacement of GDP by GTP as well as for membrane anchoring and 
receptor recognition. 

In higher eukaryotes G-beta exists as a small multigene family of highly conserved 
proteins of about 340 amino acid residues. Structurally G-beta consists of eight tandem repeats 
of about 40 residues, each containing a central Trp-Asp motif (this type of repeat is also called 

10 a WD-40 repeat). Such a repetitive segment has been shown to exist in a number of other 

proteins, including G-beta-Iike peptides, yeast STE4, MSI1, CDC4, CDC20, MAK1 1, PRP4, 
PWP1 and TUP1, slime-mould AAC3 and coronin, and Drosophila Groucho protein. The 
number of repeats within these proteins varies between 5 (PRP4, TUP1, and Groucho) and 8 
(G-beta, STE4, MSI1, AAC3, CDC4, PWP1, etc.). In G-beta and G-beta like proteins, the 

15 repeats span the entire length of the sequence, while in other proteins, they make up the N- 
terminal, the central or the C-terminal section. 

The protein of this invention contains 7 WD-40 repeats. Although the function of this 
H326-like protein is not precisely known, it has potential importance in the intracellular 
transduction of signals, similarly to other WD-40 repeat-containing proteins. 

20 The disclosed NOV52 nucleic acid of the invention encoding a H326-like protein 

includes the nucleic acid whose sequence is provided in Table 52A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 52A while still encoding a protein that maintains 
its H326 -like activities and physiological functions, or a fragment of such a nucleic acid. The 

25 invention further includes nucleic acids whose sequences are complementary to those just 

described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

30 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1 percent of the 
bases may be so changed. 
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The disclosed NOV52 protein of the invention includes the H326-like protein whose 
sequence is provided in Table 52B. The invention also includes a mutant or variant protein 
any of whose residues may be changed from the corresponding residue shown in Table 52B 
while still encoding a protein that maintains its H326-like activities and physiological 

5 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 36 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this H326 -like protein (NOV52) is a 

10 member of a "H326 family". Therefore, the NOV52 nucleic acids and proteins identified here 
may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 

15 prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 

regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

The NOV52 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in intracellular transduction of signals and any relevant 
20 diseases that may result from dysregulation of signal transduction, and/or other diseases and 
pathologies. 

NOV52 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV52 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

25 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV52 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

30 disorders. 

NOV53 

A disclosed NOV53 nucleic acid of 1233 nucleotides (also referred to as CG56745-01) 
encoding a uracil phosphoribosyltransferase-like protein is shown in Table 53A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 142-144 
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and ending with a TAA codon at nucleotides 1 069-1 071 . The start and stop codons are shown 
in bold in Table 53 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 53A. NOV53 nucleotide sequence (SEQ ID NO:195). 



CTAGGGGTGAAAGGACAGCCAGGGTTAGATGTTCTGAGGAGGCGGGAGCAACCGAGAGAGCACGTGAGCATC 

TGTCCTTTCTACCCGTTCCTCTTTATCTTTAGTGTTCAGTAGCAGCGGGGATAGCCCGGGGCCCGGTGT ATG 

GCCACGGAGTTACAGTGTCCGGACTCCATGCCCTGTCAC^CC^GCAAGTAAACTCTGCCTCAACCCCAAGT 

CCCGAGCAGCTGCGACCTGGCGATCTGATCCTGGACCACGCAGGGGGAA 

ATTCTCCTCACGGGGTACGCCCATTCTAGCCTGC 

CTCAACTCAGAGGGCAACAGTGGTAGTGGTGACAGTAGC^ 

GAGGACTGCGAACTCTCCCGGCAGATCGGGGC^CAGCTTAAGCTGCTGCCTATGAATGATCAGATACGGGAG 

CTACAGACCATCATCCGGGACAAGACAGCCAGTAGAGGTGACTTCATGTTTTCTGCGGATCGTTTGA 

CTTGTTGTGGAAGAGGGATTGAATCAGCTGCCATATAAAGAATGCATGGTGACCACTC 

TATGAAGGAGTGAAATTTGAGAAGGGAAATTGTGGGGTCAGCATAATGAGAAGCGGTGAGG(^U^TGGAAC^ 

GGTTTACGAGACTGCTGTCGATCCATACGAATTGGAAAGATCCTGATTCAGAGTGATGAGGAGACACAAAGA 

GCC AAAGTATATTATGC CAAATTC C CC C CAGACATTT AC CGGAGAAAAGTC CTTCTGATGT ATC CAATT CTC 

AGCACTGGAAATACTGTAATTGAAGCTGTAAAGGTTCTTATAGAACATGGA 

CTACTC AGT CTGTTCTC CACTC CTCATGGTG CCAAATC AAT CATTC AGG AGTTTCCAGAGAT CACAATTTTA 
ACTACTGAAGTTCATCCTG TTG CACCTACACATTT TGG ACAG AAATACTT TGG AACAGACTAAGTTATTTAA 
GTAAAATAATTGTCTTATGTAATATTACAATCATGTTTTGATTTTCTATTTGTTTTACTGATTCAC 
GTGGCAGAGACAAATGT GTT ACAATGCTTTTT AGTTTTGGAAGTGGGT AT ATTTGAGCTT ATAT CTCACTTA 
GTTATTTGT 



10 



In a search of public sequence databases, the NOV53 nucleic acid sequence, located on 
the X chromosome, has 312 of 544 bases (57%) identical to a gb:GENBANK-ID: 
YSCFURlAJacc: M36485.1 mRNA from Saccharomyces cerevisiae (S.cerevisiae uracil 
phosphoribosyltransferase (FUR1) gene, complete cds). 

The disclosed NOV53 polypeptide (SEQ ID NO: 196) encoded by SEQ ID NO: 195 has 
309 amino acid residues and is presented in Table 53B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV53 has no signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.3000. Alternatively, NOV53 may also 
localize to the mitochondrial matrix space with a certainty of 0.1000, or to the lysosome 
(lumen) with a certainty of 0.1000. 



15 



20 



Table 53B. Encoded NOV53 protein sequence (SEQ ID NO:196). 



MATELQCPDSMPCHNQQTOSASTPSPEQLRPGDLILDHAGGN^ 
SLNSEGNSGSGDSSSYDAPAGNSFLEDCELSRQIGAQLKI^ 

RLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNCGVS IMRSGEAMEQGLRDCCRS IRIGKILIQSDEETQ 
RAKVYYAKFPPDIYRRKVLLMYPILSTGNTVIEAVKVL^ 

LTTEVHPVAPTHFGQKYFGTD 



A search of sequence databases reveals that the NOV53 amino acid sequence has 588 
of 597 amino acid residues (98%) identical to, and 138 of 209 amino acid residues (66%) 
identical to, and 165 of 209 amino acid residues (78%) similar to, the 261 amino acid residue 
ptnr:SPTREMBL-ACC:Q9VRQl protein from Drosophila melanogaster (Fruit fly) (CG5537 
Protein) (E = 8.5e- 72 ). 
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NOV53 is predicted to be expressed in at least Bone Marrow, Brain, Bladder, Eye, 
Cervix, Kidney, Liver, Lymph node, Prostate, Small Intestine, Umbilical Vein. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, and/or 
5 RACE sources. 

NOV53 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 53C 



Table 53C. BLAST results for NOV53 



Gene index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17486179 | ref | XP_ 
060041. l| 
(XM__060041) 


similar to 
Unknown (protein 
for MGC:23937) 
(H. sapiens) 
[Homo sapiens] 


309 


309/309 
(100%) 


309/309 
(100%) 


e-172 


gi| 14388454 |dbj |BAB 
60766.1) (AB063019) 


hypothetical 
protein [Macaca 
fascicularis] 


309 


300/309 
(97%) 


302/309 
(97%) 


e-167 


gi| 13874465 |dbj | BAB 
46861. l| (AB060829) 


hypothetical 
protein [Macaca 
fascicularis] 


309 


299/309 
(96%) 


302/309 
(96%) 


e-166 


gi| 14388519 |dbj | BAB 
60785. l| (AB063065) 


hypothetical 
protein [Macaca 
fascicularis] 


309 


298/309 
(96%) 


301/309 
(96%) 


e-166 


gi | 8217490 | emb | CAB9 
2761.1 | (AL137013) 


DA311P8.3 
(probable uracil 
phosphor ibosyltra 
nf erase) [Homo 
sapiens] 


166 


166/166 
(100%) 


166/166 
(100%) 


3e-94 



1 0 The homology between these and other sequences is shown graphically in the 

ClustalW analysis shown in Table 53D. In the ClustalW alignment of the NOV53 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 

1 5 can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 53D. ClustalW Analysis of NOV53 

1) Novel NOV53 (SEQ ID NO:196) 
20 2) gi 1 17486179 | ref |XP_060041 . 1 1 (XM_060041) similar to Unknown (protein for 
MGC:23937) (H. sapiens)' [Homo sapiens] (SEQ ID N0:567) 

3) gi 1 14388454 |dbj |BAB60766.l| (AB063019) hypothetical protein [Macaca 
fascicularis] (SEQ ID NO: 56 8) 

4) gi|l3874465|dbj |BAB46861.l| (AB060829) hypothetical protein [Macaca 
25 fascicularis] (SEQ ID NO:569) 
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5) gi| 14388519 |dbj |babgo785.i| (AB0G30S5) hypothetical protein (Macaca 
fascicularis] (SEQ ID NO:570) 

6) gi|8217490|emb|CAB92761.l| (AL137013) bA311P8.3 (probable uracil 
phosphoribosyltranf erase) [Homo sapiens] (SEQ ID NO: 5 71) 
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NOV53 


1 


gi 


17486179 


1 


gi 


14388454 


1 


gi 


13874465 


1 


gi 


14388519 


1 


gi 


8217490 | 


1 


NOV53 


61 


gi 


17486179 


61 


gi 


14388454 


61 


gi 


13874465 


61 


gi 


14388519 


61 


gi 


8217490) 


1 


NOV53 


121 


gi 


17486179 


121 


gi 


14388454 


121 


gi 


13874465 


121 


gi 


14388519 


121 


gi 


8217490 | 


1 


NOV53 


181 


gi 


17486179 


181 


gi 


14388454 


161 


gi 


13874465 


181 


gi 


14388519 


181 


gi 


8217490 | 


38 


NOV53 


241 


gi 


17486179 


241 


gi 


14388454 


241 


gi 


13874465 


241 


gi 


14388519 


241 


gi 


8217490) 


98 


NOV53 


301 


gi 


17486179 


301 


gi 


14388454 


301 


gi 


13874465 


301 


gi 


14388519 


301 


gi 


8217490 | 


158 
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matelqcpdsmpchnqqvnsastpspeqlrpgdjj 
matelqcpdsmpchnqqvnsastpspeqlrpgd" 
matelqcpdsmpchnqqvnsastpspeqlrpgdE 
matelqcpdsmpchnqqvnsastpspeqlrpgdr 
matelqcpdsmpchnqqvnsastpspeqlrpgd" 



ILDHAGGNRASRAKVtjLLTGgH |SL 
ILDHAGGNRASRAKVMLLTGggK gSL 

ildhaggnrasrakvKlltoWh Ss L 
ildhaggnrasrakvBlltgHJh Ssl 

ILDHAGGNRASRAKvBlLTgSRh *SL 



60 
60 
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90 



100 



110 



120 



paeld|gacBgsslns E 
paeldIgacSgsslnse 
paeld|gacJ5g s slns e 
paeldBgacegsslnse 
paeld"gac5gsslnse 


j^sgsgdsssydapagnsflgdcelsrqigaqlkllpmndqir 
ensgsgdsssydapagnsflgdcelsrqigaqlkllpmndqir 
snsgsgdsssydapagnsflSdcelsrqigaqlkllpmndqir 
jnsgsgdsssydapagnsflSdcelsrqigaqlkllpmndqir 
snsgsgdsssydapagnsflSdcSlsrqigKqlkllpmndqir 





120 
120 
120 
120 
120 
1 



130 



I 



140 



J 



150 



160 



I 



170 



180 
. . I 



elqtiirdktasrgdfm?sadrlirlweeglnqlpykecm\t?tptgykyegvkfekgnc 
elqtiirdktasrgdfmfsadrlirlweeglnqlpykecmvttptgykyegvkfekgnc 
elqtiirdktasrgdfmfsadrlirlweeglnqlpykecmvttptgykyegvkfekgnc 
elqtQirdktasrgdfmfsadrlirlweeglnqlpykecmvttptgykyegvkfekgnc 
slqtiirdktasrgdfm fsadrl irlweeglnqlpykecmvttptgykyegvkfekgnc 

irlweeglnqlpykecmvttptgykyegvkfekgnc 



190 



200 
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210 

. . I . . 



220 



230 

. . I . . 



240 

..I 



gvsimrsgeameqglrdccrsirigxiliqsdeetqrakvyyakfppdiyrrkvllmypi 
gvsimrsgeameqglrdccrsirigkiliqsdeetqrakvyyakfppdiyrrkvllmypi 
gvsimrsgeameqglrdccrsirigkiliqsdeetqrakvyyakfppdiyrrkvllmypi 
gvsimrsgeameqglrdccrsirigkiliqsdeetqrakvyyakfppdiyrrkvllmypi 
gvsimrsgeameqglrdccrsirigkiliqsdeetqrakvyyakfppdiyrrkvllmypi 
gvsimrsgeameqglrdccrsirigkiliqsdeetqrakvyyakfppdiyrrkvllmypi 
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lstgntvieavkvliehgvqpsviillslfstphgaksiiqefpeitilttevhpvapth 
lstgntvieavkvliehgvqpsviillslfstphgaksiiqefpeitilttevhpvapth 
lstgntvieavkvliehgvqpsviillslfstphgaksiiqefpeitilttevhpvapth 
lstgntvieavkvliehgvqpsviillslfstphgaksiiqefpeitilttevhpvapth 
lstgntvieavkvliehgvqpsviillslfstphgaksiiqefpeitilttevhpvapth| 
lstgntvieavkvliehgvqpsviillslfstphgaksiiqefpeitilttevhpvapth! 




60 



Table 53E lists the domain descriptions from DOMAIN analysis results against 
NOV53. This indicates that the NOV53 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 53E Domain Analysis of NOV53 

gnl|Pfam|pfam00156, Pribosyltran, Phosphoribosyl transferase domain. 
This family includes a range of diverse phosphoribosyl transferase 
enzymes. This family includes: Adenine phosphoribosyl transferase 
EC : 2 . 4 • 2 . 7 , Hypoxanthine- guanine -xanthine phosphoribosyl transferase, 
Hypoxanthine phosphoribosyl transferase £0:2.4.2.8, Ribose -phosphate 
pyrophosphokinase i EC : 2 . 7 . 6 . 1 , Amidophosphoribosyl transferase 
EC: 2. 4. 2. 14, Orotate phosphoribosyl transferase EC: 2. 4. 2. 10, Uracil 
phosphoribosyl transferase EC : 2 . 4 . 2 . 9 , Xanthine -guanine 
phosphoribosyltransferase EC:2.4.2.22. (SEQ 10 NO:852) 
CD-Length = 153 residues, 43.1% aligned 
Score « 35.0 bits (79), Expect - 0.006 



NOV53 : 226 PPDIYRRKVLLMyPILSTGNTV^EAVTCVLIEHGVQPSVIILLSLFSTPHGAKSIIQEFPE 285 

|+ ++H++ ++ || |+ | ++| | | + + +| + + + || 

Sb j Ct : 87 VGDVGGKRVliIVDDVIDTGGTIRAAAELLKEAG 14 6 

5 - ... 

NOV53: 286 ITILTT 291 
+++ 

Sbjct: 147 PSLIVL 152 

1 0 The gene . of invention is a novel uracil phosphoribosyltransferase (UPR1>like gene. 

UPRT catalyzes the formation of uridine 5-monophosphate in the pyrimidine salvage pathway 
from uracil and 5-phospho-alpha-D-ribose 1 -diphosphate. The Saccharomyces cerevisiae 
FUR1 gene encodes UPRT (Kern et a!., Gene 88:149-157(1990)). Mutations in the FUR1 gene 
have been correlated to resistance to 5-fluorouracil, a common chemotherapeutic agent (Kern 

15 et ah, Curr Genet 1991 May;19(5):333-7). 

The novel gene belongs to a family of phosphoribosyl transferases, as evidenced by the 
presence of a characteristic domain. It is anticipated that this gene plays a role in the 
pyrimidine salvage pathway and that it influences the growth or growth restriction of various 
tissues, and especially of tumor cells. 

20 The disclosed NOV53 nucleic acid of the invention encoding a Uracil 

Phosphoribosyltransferase-like protein includes the nucleic acid whose sequence is provided 
in Table 53A or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 53 A 
while still encoding a protein that maintains its Uracil Phosphoribosyltransferase -like 

25 activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

30 include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
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in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 43 percent of the 
bases may be so changed. 
5 The disclosed NOV53 protein of the invention includes the Uracil 

Phosphoribosyltransferase-like protein whose sequence is provided in Table 53B. The 
invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residue shown in Table 53B while still encoding a protein that 
maintains its Uracil Phosphoribosyltransferase-like activities and physiological functions, or a 

10 functional fragment thereof. In the mutant or variant protein, up to about 4 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Uracil Phosphoribosyltransferase- 

1 5 like protein (NOV53) is a member of a "Uracil Phosphoribosyltransferase family". Therefore, 
the NOV53 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

20 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV53 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated Von Hippel-Lindau (VHL) syndrome, Alzheimer f s disease, 

25 stroke, tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, cerebral 
palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neurodegeneration, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, autoimmune disease, 
allergies, immunodeficiencies, transplantation, graft versus host disease, fertility disorders, 

30 anemia, bleeding disorders, scleroderma, cystitis, incontinence, diabetes, renal artery stenosis, 
interstitial nephritis, glomerulonephritis, polycystic kidney disease, systemic lupus 
erythematosus, renal tubular acidosis, IgA nephropathy, hypercalcemia, cirrhosis, 
inflammatory bowel disease, diverticular disease, lymphedema, cancer, trauma, tissue 
degeneration, bacterial/viral/parasitic infections, and/or other diseases and pathologies. 
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NOV53 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV53 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV53 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in , 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
10 NOV54 

NOV54 includes two novel protein phosphatase 2C -like proteins disclosed below. 
The disclosed sequences have been named NOV54a and NOV54b. 

NOV54a 

A disclosed NOV54a nucleic acid of 2185 nucleotides (also referred to as CG56773- 
15 01) encoding a protein phosphatase 2C-Iike protein is shown in Table 54A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TGA codon at nucleotides 1402-1404. The start and stop codons are shown in bold in 
Table 54A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 54A. NOV54a nucleotide sequence (SEQ ID NO:197). 

ATQTCCGCCXSGCTGGTTCCGGCGCCGCTTCCrGCCTGGGGAGCCGCTCCCCGCGCCGCGGCCGCCTGGGCCG 
CATGCCAGCCCCGTGCCCTACCGACGGCCCCGCTTCCTTCGCGGCTCCAGCTCCAGCCCCGGGGCGGCCGAC 
GCCTCGCGCCGCCCAGACTCCCGGCCCGTGCGCAGCCCCGCACGAGGACGCACGCTACCCTGGAATGCAGGC 
TACGCCGAGATTATCAATGCAGAGAAATCTGAATTC^TGAGGATC^ 

CGGAGATGTGAGTTTGGGGCTGAAGAAGAGTGGCTGACCCTGTGCCCAGAGGAGTTCCTGACAGGCCATTAC 

TGGGCACTGTTCGATGGGCACGGCGGTCCTGCAGCAGCCATCTTGGCTGCCAACACCCTGCACTCCTGCTO 

CGCCGGCAGCTGGAGGCCGTGGTGGAAGGCTTGGTGGCCACTCAGCCCCCCATGCACCTCAATGGCCGCTGC 

ATCTGCCCCAGTGACCCTCAGTTTGTGGAGGAAAAGGGCATCAGGGCAGAAGACTTGGTGATCGGGGCATTG 

GAGAGTGCCTTTCAGGAATGTGATGAGGTGATCGGGCGGGAGCTGGAGGCCTCAGGCC^GATGGGCGGCTGC 

ACAGCCCTGGTGGCTGTGT CCCTGCAGGGAAAGCTGTACATGGCCAATGCTGGGGATAG CAGGGCCATCTTG 

GTGCGGAGAGATGAGATACGGC CACTGAGCTTCGAGTTCACCCCAGAGACTGAGCGGCAG CGGATCC AGCAG 

CTGGCCTTTGTCTATCCTGAGCTTCTGGCTGGTGAGTTCACCCGACTGGAGTTCCCTCGGCGGCTGAAGGGG 

GATGACTTGGGACAGAAGGTTTTGTTCAGGGAT CAC CACATGAGTGGCTGGAGCTACAAACGTGTGGAGAAA 

TCGGATCTCAAGTACCCACTGATCCATGGACAGGGTAGGCAGGCTCGGTTACTAGGAACACTGGCTGTCTCC 

CGGGGCCTGGGAGACCATCAGCTCAGAGTCCTGGACACAAACATCC?^^ 

CCACAGGTG ACTGTGCTGGATGTGG ACCAG CTGGAGCT ACAGGAGGATGATGTGGTTGT CATGG CAACTG AT 

GGACTCTGGGATGTACTGTCCAACGAGC^GGTGGCATGGCTGGTGCGGAGCTTCCrrCCCT 

GACC CACAC AGCTAT CTGCAGG ATGGT CTT CACAGGTT CT C AAAGCTGG C CC AG ATG CTG AT ACACAGCACA 

CAGGGAAAGGAAGACAGTCTCACAGAGGAAGGGCAGGTGTCCTACG 

TTGC^CAGTCAGGGCCAAGAGAGCAGTGACCACTG&GGATTCAGACACT^ 

CCGGGTGTGGTOTGGGC^TCCCTCCAGTGTGACCAAGAGCAAATCCTGCCTGCCCTATCCCTAGCC^ 
CAGTGCTCTCACTATCCACCTCAACACACATCCATCTCAAGAGGAAC^^ 

AAGTGTATGGAGAGCCCAGCCCACCAGGTCCTGCCTTTTGCGGTGATAACCTTCTCTGGCAGAGTGACTTTA 
C AACTT AACTAGGAAAC CCATGTGAGGCTCCT CAG ACAGGATCTTGAAC AG C CC AAAGT ATCATT CT CAG AT 
AGGGGCACC CAAGCTAAGGG TATTAGCCAAAGATG CCAGGATGGGTAGCTAGCCCATGTTTAGATCC AGGT C 
TCCAATTCATGGTTATCAGGGCATGTGTTCAACAACCCCC71AAGTCCACGCAGGT 

GGG CAGC CT CATGT CTGCT AAAACAGC CAT CTTCAAGACAG C C CCTGAAAAGAGACC AGTTC AGGT C CTG CC 
CTGCTGTTCTTTG CTGG AGATGAGG AACAGGTGCTGGGGC T AAAGT TTGGGGTAGAG CAC AAGGG AC AAGAG 
GAACTCTTGGAGTTGGCTGGGTGAGAGGGCTCTCCATTTGCTACCTGTAGTAGCCTGCCTCTT 
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C C CT ATT ATTAAAT GTTTT CT ACAG 



In a search of public sequence databases, the NOV54a nucleic acid sequence, located 
on the p2 1.1 region of chromosome 3, has 592 of 928 bases (63%) identical to a 
gb:GENBANK-ID:AK023315|acc:AK023315.1 mRNA from Homo sapiens (cDNA 
5 FLJ13253 fis, clone OVARC1000751) (E = 4.5e -41 ). 

A disclosed NOV54a polypeptide (SEQ ID NO: 198) encoded by SEQ ID NO: 197 has 
467 amino acid residues and is presented in Table 54B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV54a has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.3941. Alternatively, 
10 NOV54a may also localize to the nucleus with a certainty of 0.3000, to the lysosome (lumen) 
with a certainty of 0.1558, or to the mitochondrial matrix space with a certainty of 0.1000. 



Table 54B. Encoded NOV54a protein sequence (SEQ ID NO:198). 

MS AGWFRRRFLPGEPLPAPRPPGPHAS PVPYRRPRFLRGSSS SPGAADASRRPDSRPVRSPARGRTLPWNAG 
YAEI INAEKSEFNEDQAACGKLCIRRCEFGAEEEWLTLC PEEFLTGHYWALFDGHGGPAAAILAANTLHSCL 
RRQLEAVVEGLVATQPPMHLNGRCICPSDPQFVEEKGI 

TALVAVSLiQGKLYMANAGDSRAI LVRRDE IRPLS FE FTPETE RQRI QQLAFVYPELLAGEFTRLEFPRRLKG 

DDI^QKVLFRDHHMSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDra 

PQVTVLDVDQLELQEDDVVVMATDGLWD^ 

QGKEDSLTEEGQVS YDDVSVFVI PLHSQGQESSDH 

A search of sequence databases reveals that the NOV54a amino acid sequence has 32 
15 of 77 amino acid residues (4 1 %) identical to, and 48 of 77 amino acid residues (62%) similar 
to, the 413 amino acid residue ptnr:SPTREMBL-ACC:Q9M3Vl protein from Fagus sylvatica 
(Beechnut) (Protein Phpsphatase 2C (PP2C) (EC 3.1.3.16)) (E = 9.2e ,6 ). 

NOV54a is predicted to be expressed in at least bone marrow, lymphoid tissue, tonsils, 
brain, colon, uterus, endometrium, placenta, mammary gland/breast, prostate, testis, foreskin, 
20 heart, kidney, lung, spleen, peripheral blood, pituitary gland, retina, and pooled germ cell 

tumors. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in ovarian carcinoma because of 
25 the expression pattern of (GENBANK-ID: gb:GENBANK-ro:AK023315|acc:AK023315.1) a 
closely related Homo sapiens cDNA FLJ13253 fis, clone OVARC1000751 homolog. 
NOV54b 

A disclosed NOV54b nucleic acid of 1930 nucleotides (also referred to as CG56773- 

02) encoding a protein phosphatase 2C-like protein is shown in Table 54C. An open reading 
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frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TGA codon at nucleotides 1 147-1 149. The start and stop codons are shown in bold in 
Table 54C, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 54C NOV54b nucleotide sequence (SEQ ID NO:199). 

ATGTCCGCCGGCTGGTTCCGGCGCCGCTTCCTGCCTGGGGAGCCGCTCCCCGCGCCGCGGCCGCCTGGGCCG 
CATGCCAGCCCCGTGCCCTACCGACGGCCCCGCTTCCTTCGCGGCTCCAGCTCCAGCCCCGGGGCGGCCGAC 
GCCTCGCGCCGCCCAGACTCCCGGCCCGTGCGCAGCCCCGCACGAGGACGCACGCTACCCTGGAATGCAGGC 
TACGCCGAGATTATCAATGCAGAGAAATCTGAATTCAATGAGGATCAAGCCGCCTGTGGGAAGCTGTGCATC 
CGGAGATGTGAGTTTGGGGCTGAAGAAGAGTGGCTGACCCTGTGCCCAGAGGAGGATGAGGTGATCGGGCGG 
GAGCTGGAGGCCTCAGGCCAGATGGGCGGCTGCACAGCCCTGGTGGCTGTGTCCCTGCAGGGAAAGCTGTAC 
ATGGCCAATGCTGGGGATAGCAGGGCCATCTTGGTGCGGAGA^ 

ACCCCAGAGACTGAGCGGCAGCGGATCCAGCAGCTGGCCTTTGTCTATCCTGAGCTTCrGGCTGG 

ACCCGACTGGAGTTCCCTCGGCGGCTGAAGGGGGATGACTTGGGACAGAAGGTTTTGTTCAGGGATCACCAC 

ATGAGTGGCTGGAGCTACAAACGTGTGGAGAAATCGGATCTCAAGTACCCACTGATCCATGGACAGGGTAGG 

CAGGCTCGGTTACTAGGAACACTGGCTGTCTCCCGGGGCCTGG^ 

AACATCCAGCTCAAGCCCTTCTTGCTCTCTGTGCC^CA 

CAGGAGGATGATGTGGTTGTCATGGCT^CTGATGGACTCTGGGATGTACTGTCCAACGAGO^GGTGGCATGG 
CTGGTGCGGAGCTTCCTCCCTGGGAACCAAGAGGACCCAC^ 

TCAAAGCTGGCCCAGATGCTGATACACAGCACACAGGGAAAGGAAGACAGTCTCACAG^ 
TCCTACGATGACGTCTCTGTGTTCGTGATTCCCTTGCACAGTCAGGGCCAAGAGAGCAGTGACCACTGAGGA 
TTCAGACACTGTATCCCAGAACTGCTCTAGTGCCCGGGTGTC 
CAAATCCTGCCTGCCCTATCCCTAGCCACCGCCCAGTGCTCTCAC 

AGAGGAACATTTATACCAGGCAGTCAGAGCTGGAAGTGTATGGAGAGCCCAGCCCACCAGGTCCTGCCTTOT 
GCGGTGATAACCTT CTCTGGCAGAGTGACTTTACAACTTAACTAGGAAACC CATGTGAGGCTCCTC^ 
GATCTTGAACAGCCCAAAGTATCATTCTCAGATAGC^ 

GATGGGTAGCTAGCCC^TGTTTAGATCCAGGTCTCCAATTCATGGTTATCAGGGaVTGTGTTCAACAACCCC 

CAAAGTCC^CGCAGGTGGCTTGTAGAAACCTTTGGGCAGCCTCATGTCTGCTAAAACAGCCATCTTCAAGAC 

AGCCCCTGAAAAGAGACCAGTTCAGGTCCTGCCCTGCTGTTCTTTGCTGGAGATGAGGAACAGGTGCTGGGG 

CT AAAGTT TGGGGTAGAG CACAAGGGACAAGAGGAACTC TTGGAGTTGG CTGGGTGAGAGGGCTCT^ 

GCTACCTGTAGTAGCCTGCCTCTTAACTGGTTGCTTCTCCCTAGlTCC^GCCCTGCCCTGGTCrGATGCCCC 

AACACTGCCCTTGCTTTGTTTTCCCTGTCACCTCCCTATTATTAAATGTTTTCTACAG 



5 In a search of public sequence databases, the NOV54b nucleic acid sequence, located 

on chromosome 3, has 446 of 660 bases (67%) identical to a gbrGENBANK- 
ID:BC01 1 803|acc:BC01 1803.1 mRNA from Homo sapiens {Homo sapiens, Similar to RIKEN 
cDNA 2310008J22 gene, clone MGC: 19531 IMAGE:4336762, mRNA, complete cds) (E = 
2.0e" 56 ). 

10 The disclosed NQV54b polypeptide (SEQ ID NO:200) encoded by SEQ ID NO:199 

has 382 amino acid residues and is presented in Table 54D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV54b has no signal peptide 
and is likely to be localized to the microbody (peroxisome) with a certainty of 0.4037. 
Alternatively, NOV54b may also localize to the nucleus with a certainty of 0.3000, to the 

1 5 lysosome (lumen) with a certainty of 0. 1 000, or to the mitochondrial matrix space with a 
certainty of 0.1000. 



Table 54D. Encoded NOV54b protein sequence (SEQ ID NO:200). 

MSAGWFRRRFLPGEPLPAPRPPGPHAS PVPYRRPRFLRGS5S SPGAADASRRPDSRPVRSPARGRTLPWNAG 
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YAEIINAEKSEFNEDQAACGKLCIRRCEFGAEEEWLTLCPEEDEVIGRELEASGQMGGCTALVAVSLQGKLY 
MANAGDSRAILTORDEIRPLSFEFTPETERQRIQQLAFVYPEL^ 

MSGWSYKRVEKSDLKYPLIHGQGRQARLLGTlAVSRGLGDHQLRVIJ)TNIQLKPFIiLSVPQVTVI t DVDQLEL 
QEDD WVMATDGLWDVLSNEQ VAWLVRS FLPGNQEDPHS YLQDGLHR FS KLAQML I HS TQGKEDSLTEEGQV 
S YDDVSVFVI PLHSQGQE S SDH 

A search of sequence databases reveals that the NOV54b amino acid sequence has 231 
of270 amino acid residues (85%) identical to, and 244 of 270 amino acid residues (90%) 
similar to, the 453 amino acid residue ptnr:SPTREMBL-ACC:Q9CSD6 protein from Mus 
5 musculus (Mouse) (2810423O19RK Protein) (E = 1.6e 167 ). 

NOV54b is predicted to be expressed in at least the following tissues: bone marrow, 
lymphoid tissue, tonsils, brain, colon, uterus, endometrium, placenta, mammary gland/breast, 
prostate, testis, foreskin, heart, kidney, lung, spleen, peripheral blood, pituitary gland, retina, 
and pooled germ cell tumors. 
1 0 NOV54 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 54E 



Table 54E. BLAST results for NOV54 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 12850332 |dbj | BAB 
28679.1) (AK013149) 


Protein 
phosphatase 2C 
containing 
protein-datasourc 
e:Pfam f source 
key:PF00481, 
evidence : ISS-puta 
tive [Mus 
musculus] 


453 


400/459 
(87%) 


423/459 
(92%) 


0.0 


gi 1 16552416 |dbj| BAB 
71302. l| (AK056894) 


unnamed protein 
product [Homo 
sapiens] 


270 


251/252 
(99%) 


251/252 
(99%) 


e-143 


gi | 17462396 |ref|XP 

059571.1) 

(XMJ359571) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


247 


247/255 
(96%) 


247/255 
(96%) 


e-137 


gi| 12856386 |dbj |BAB 
30649. l| (AK017245) 


Protein 
phosphatase 2C 
containing 
protein-datasourc 
e:Pfam, source 
key:PF00481, 
evidence : ISS-puta 
tive [Mus 
musculus] 


254 


222/261 
(85%) 


234/261 
(89%) 


e-12i 


gi 1 17455719 |ref|XP 

051093.2] 

(XM 051093) 


KIAA1157 protein 
[Homo sapiens] 


514 


213/480 
(44%) 


288/480 
(59%) 


6e-98 



The homology between these and other sequences is shown graphically in the 
15 ClustalW analysis shown in Table 54F. In the ClustalW alignment of the NOV54 protein, as 
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well as all other Clustal W analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 54F. ClustalW Analysis of NOV54 



1) Novel N0V54a (SEQ ID NO: 198) 

2) Novel NOV54b {SEQ ID NO:200) 

3) gi| 12850332 |dbj|BAB28679.l| (AK013149) Protein phosphatase 2C containing 
protein-datasource:Pf am, source key:PF00481, evidence : I SS -putative [Mus musculus] 
(SEQ ID NO: 572) 

4) gi| 16552416 |dbj|BAB71302.l| (AK056894) unnamed protein product [Homo sapiens) 
(SEQ ID NO: 573) 

5) gi | 17462395 | ref |XP_059571 . 1 | (XMJ)5957l) similar to putative (H. sapiens) [Homo 
sapiens) (SEQ ID NO: 574) 

6) gi|l2B56386|dbj |BAB30649.l[ (AK017245) Protein phosphatase 2C containing 
protein-da tasource : Pf am, source key:PF0048l, evidence :ISS-putative [Mus musculus] 
(SEQ ID NO: 575) 

7) gi | 17455719 | ref |XP_051093 .2 | (XM_051093) KIAA1157 protein [Homo sapiens) (SEQ ID 
N0:576) 



NOV54a 


1 


NOV54b 


1 


gi 


12850332 


1 


gi 


16552416 


1 


gi 


17462396 


1. 


gi 


12856386 


1 


gi 


17455719 


1 



10 20 30 40 50 60 

MS AGWFRRRFL PG E PL PAP RP PG PHAS P V P YRRPRFLRGS S SS PGAADAS 5 0 

MS AGWFRRRFL PGE PL PAPRPPGPHASPV P YRRPRFLRGS S S S PGAADAS 5 0 

VLPGGPLPEPRPAGPRSSPV PYHRPRFLRGSGSS PGATDAS 4 1 

-- 1 

- - 1 

1 

MLTRVKS AVANFMGGI MAGSSGSEHGGGS CGGSDLPLRFPYGRPEFLG - - -LSQDEVECS 57 



NOV54a 


51 


NOV54b 


51 


gi 


12850332 | 


42 


gi 


16552416] 


1 


gi 


17462396) 


1 


gi 


12856386) 


1 


gi 


17455719) 


58 



70 



80 



90 



100 



110 



120 



|....|. 



NEDQAACGKLCIR 97 

RRPDSRPVRSPARGRTLPWNAGYAEIINAEKSEFNEDQAACGKLCIR 97 

RRPDARPVRSPARGRTLPWNAGYAEVINAEKSEFNEDQAACGKLCIR 88 

1 

1 

- 1 

ADH I ARP I LILKETRRLP WATG YAEVINAGKSTHNEDQASCEVLT VKKKAG AVTSTPNRN 117 



NOV54a 


97 


NOV54b 


97 


gi 


12850332 


B8 


gi 


16552416 


1 


gi 


17462396 


1 


gi 


12656386 


1 


gi 


17455719 


118 



13 0 



140 



150 



160 



170 



180 



RCEFG- -AEEEWLTLCPEEFLTGHYWALFDGHGGPAAAILAANTLHSCLRRQLEAV 151 

RCEFG- -AEEEWLTLCPEE - - 114 

RCEFGI EEHQEWLTVCPEEFLTGHYWALFDGHGGP AAAI LAANTLHS CLRRQLEAV 14 4 

- 1 

1 

- - 1 



NOV54a 
N0V54b 



gi 
gi 
gi 
gi 



12850332 
16552416 
17462396 
12856386 



190 200 
....|....|-...|. ...).. 

152 VEGLV — ATQPPMHL 

114 - -- 

145 VEGMI - - APQPPMHL- - 

1 MHL 

1 

! 



210 
.|....|.. 
-NGRCIC-- 



220 



---SGRCVC- 
NGRCIC- 



230 240 
.|....|....|....| 

- PSDP- -QFVEEKGIRA 184 
- 114 

- PSDP - -QFVEEKGIQA 177 

- PSDP- -QFVEEKGIRA 23 

1 

1 
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gi 1 17455719 | 178 VDILKNSAVLPPTCLGEEPENTPANSRTLTRAASLRGGVGAPGSPSTPPTRFFTEKKIPH 237 



NOV54a 


NOV54b 


gi 


12850332 


gi 


16552416 


gi 


17462396 


gi 


12856386 


gi 


17455719 



NOV54a 
NOV54b 



gi 
gi 
gi 
gi 
gi 



12850332| 
16552416) 
17462396 | 
12856386) 
174557191 



NOV54a 
NOV54b 



gi 
gi 
gi 
gi 
gi 



12850332 
16552416 
17462396 
12856386 
17455719 



250 
I 

185 EDLVT GALE S AFQEt 
114 

178 EDLVIGALENAFQE* 
24 EDLVIGALESAFQE* 
1 
1 

238 eclvigalesaf: 



NOV54a 


3 05 


NOV54b 


220 


gi 


| 12850332 


| 298 


gi 


16552416 


1 144 


gi| 17462396) 93 


gi 


1 12856386 


99 


gi| 17455719 


358 


NOV54a 


365 


NOV54b 


280 


gi 


12850332 


358 


gi 


16552416 


204 


gi 


17462396 


153 


gi 


12856386 


159 


gi 


17455719 


418 




430 



440 



450 



460 



470 



480 



VLDVgQLgQEDDVVVMATDGLWDVLSNEQVAraLVRSFLPGNQi».D. 
VLDVgQLfflQEDDVVVMATIXSLWDVLSNEQVABLVRSFLPGNQSp 
VLDvBoL ^ OEnfoVVVMATIXSLWDV^ 

VLDVfflQLgQEDDVVVMATDGLWDVIjSNEQVABLVRSFLPGNQgDr 
VLD VgQLgQEDDVWMATDiSLWDVLSNE QVAfflLVRS FLPGNQ»5d[ 

vld v^ql^qe ^vvvma,tdglwdvlsneqv aB.lvrs fl Bgnq° dii 



PHSYLQDG] 
PHSYLQDGL 
p 




490 



500 




Tables 54G-H list the domain descriptions from DOMAIN analysis results against 
NOV54. This indicates that the NOV54 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 54G Domain Analysis of NO V54 

gnl | Smart | smart 003 32, PP2Cc, Serine/ threonine phosphatases, family 2C, 
catalytic domain; The protein architecture and deduced catalytic 
mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B 
family of protein Ser/Thr phosphatases, with which PP2C shares no 
sequence similarity. (SEQ ID NO: 853) 
CD-Length = 260 residues, 73.5% aligned 
Score = 114 bits (286), Expect = 9e-27 



10 



15 



20 



NOV54 : 


118 


Sbjct: 


40 


NOV54 : 


178 


Sbjct: 


76 


NOV54 : 


237 


Sbjct: 


130 


NOV54 : 


297 


Sbjct: 


155 


NOV54 : 


357 


Sbjct: 


188 



GHYWALFDGHGGPAAAILAANTLHSCLRRQLEAVVEGLVATQ^^ 177 

I ++ +11 II II II +1 I + l + 

GGFFGVFDGHGGSEAAKFLSKNLPEILAEELIKDKD - 75 

EEKGIRAEDLVIGALESAFQECDEVIGRELEASG - QMGGCTALVAVSLQGKLYMANAGDS 236 

I II II II I 111+ I I 11+11+ lll+ll III 
EDEDVEDALRKAFLRTDEEILEELESLEDQRSGTTAVVALIRGNKLYVANVGDS 12 9 

RAILVRRDEIRPLSFEFTPETERQRI QQLAFVYPELIiAGEFTRLEFPRRLKGDDLGQKVL 296 
11+11+ 1+ + I I +1 

RAVLCRNGKAVQLTEDHKPSNEDER 154 

FRDHHMSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLKPF 356 
+1+ ++ I 1+ I ll+ll III I 11+ 



-ERIREA- 



- GGFVSNGRVNGVLALS RALGDFFL - 



-KPY 187 



LLSVPQVTVLITTOQLEXiQHDDVVVMATDGLVroVLSNEQVAWLVRS FL 

+++ I + + +I + IIIIMIM++I +11 I 

VIAEPDVTWELTEKDDFLI LASDGLWDVLSNQEWDI VRKHL 



403 



230 



Table 54H Domain Analysis of NO V54a 

gnl |pfam|pfam00481, PP2C, Protein phosphatase 2C. Protein phosphatase 
2C is a Mn++ or Mg++ dependent protein S/threonine phosphatase. (SEQ 
ID NO: 854) 

CD-Length « 252 residues, 77.0% aligned 
Score = 93.2 bits J230), Expect = 3e-20 





NOV54 : 


119 


25 


Sbjct : 


35 




NOV54 : 


179 


30 


Sbjct: 


70 




NOV54 : 


238 




Sbjct : 


123 


35 


NOV54 : 


295 




Sbjct: 


155 


40 


NOV54 : 


■355 




Sbjct: 


184 



119 HYWALFDGHGGPAAAIIiAANTLHSCLRRQLEAVVEGLVATQPPMHLNGRCICPSDPQFVE 17 8 

++I+IIIIII II I I + I + 11+ 
35 GFFAVFDGHGGSQAAKYAGKHLETKLALR KSFPEL-- 69 



i + n+ +i i ii + + i II+M+ iii+ii mi 

- DDLENALKE S FLE S TDEELRS S AANTDLDS GS T AVVAL IRGNKLYVANVGDS R 122 



l+l I 1+ 1+ + I I 11+11+ 

AVLCRNGNAIKQLTEDHKPSNEDERRRIEGAG - 



+ + I II 



1+ +++MIIIIIII+++I . +111 I 



154 



VLFRDHHMSGWSYKRVEKSDLKyPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLK 354 

+ II + I Mill II +1 + 

- GFVSRNGR- - - VNGVLAVSRAFGDFELK PGVL 183 



228 



Protein phosphorylation plays a key role in the regulation of cellular functions through 
the activation or inhibition of enzymes involved in various biochemical pathways. Kinases and 
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phosphatases that determine the phosphorylation state of an enzyme (and its activity) are 
frequently regulated through the action of hormones and growth factors (1). Four distinct 
subfamilies of serine/threonine protein phosphatases have been identified in mammals: PP1, 
PP2A, PP2B and PP2C (2). The PP2C subfamily contains structurally diverse protein 
5 phosphatases with a wide range of functions in cellular signal transduction; however, the exact 
physiological role of most PP2C enzymes is still unclear. 

The protein described in this invention contains protein phosphatase 2C domains and is 
therefore likely to play a role in signal transduction and cellular proliferation. The protein is 
also homologous, but not identical, to the rat petrin protein, a PP2C subfamily member that 

10 has been shown to modulate neurite growth inhibition and may therefore be useful in the 
treatment of nerve damage resulting from traumatic injury, stroke or CNS degenerative 
disorders (3). The PP2C-like gene described in this invention is also expressed in the brain and 
may therefore have similar functions in the CNS. However, it is also expressed in a number of 
other tissues and based on its expression pattern may contribute to additional human diseases, 

15 such as cancer, inflammation/autoimmune diseases, and metabolic disorders. The PP2C-Iike 
gene maps to human chromosome 3p2 1.1. 

Protein phosphatase 2C domain is found in protein phosphatase 2C, as well as other 
proteins e.g. adeylate cyclase. Protein phosphatase 2C (PP2C) is one of the four major classes 
of mammalian serine/threonine specific protein phosphatases. PP2C is a monomelic enzyme 

20 of about 42 Kd that shows broad substrate specificity and is dependent on divalent cations 

(mainly manganese and magnesium) for its activity. Its exact physiological role is still unclear. 
Three isozymes are currently known in mammals: PP2C-alpha,.-beta and -gamma. In yeast, 
there are at least four PP2C homologs: phosphatase PTC1 that has weak tyrosine phosphatase 
activity in addition to its activity on serines, phosphatases PTC2 and PTC3, and hypothetical 

25 protein YBR125c. Isozymes of PP2C are also known from Arabidopsis thaliana (ABI1, 
PPH1), Caenorhabditis elegans (FEM-2, F42G9.1, T23F1 1.1), Leishmania chagasi and 
Paramecium tetraurelia. In Arabidopsis thaliana, the kinase associated protein phosphatase 
(KAPP) is an enzyme that dephosphorylates the Ser/Thr receptor-like kinase RLK5 and which 
contains a C-terminal PP2C domain. 

30 PP2C does not seem to be evolutionary related to the main family of serine/ threonine 

phosphatases: PP1, PP2A and PP2B. However, it is significantly similar to the catalytic 
subunit of pyruvate dehydrogenase phosphatase (PDPC), which catalyzes dephosphorylation 
and concomitant reactivation of the alpha subunit of the El component of the pyruvate 
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dehydrogenase complex. PDPC is a mitochondrial enzyme and, like PP2C, is magnesium- 
dependent. 

The disclosed NOV54 nucleic acid of the invention encoding a Protein phosphatase 
2C-Iike protein includes the nucleic acid whose sequence is provided in Table 54A, 54C or a 
5 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 54A or 54C while still 
encoding a protein that maintains its Protein phosphatase 2C -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 

10 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 

15 stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV54 protein of the invention includes the Protein phosphatase 2C- 
like protein whose sequence is provided in Table 54B or 54D. The invention also includes a 

20 mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 54B or 54D while still encoding a protein that maintains its Protein 
phosphatase 2C-like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 56 percent of the residues may be so changed. 
The invention further encompasses antibodies and antibody fragments, such as F a b or 

25 (F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Protein phosphatase 2C-like protein 
(NOV54) is a member of a "Protein phosphatase 2C family". Therefore, the NOV54 nucleic 
acids and proteins identified here may be useful in potential therapeutic applications 
implicated in (but not limited to) various pathologies and disorders as indicated below. The 

30 potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 
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The NOV54 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated cancer, trauma, bacterial and viral infections, in vitro and 
in vivo regeneration, fertility, endometriosis, hypogonadism, cardiomyopathy, atherosclerosis, 
hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 
5 atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 
autoimmune disease, allergies, immunodeficiencies, transplantation, graft versus host disease 
(GVHD), lymphaedema, anemia, Alzheimer's disease, stroke, hypercalcemia, Parkinson's 

10 disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple 
sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, 
pain, neurodegeneration, systemic lupus erythematosus, asthma, emphysema, allergy, ARDS, 
diabetes, renal artery stenosis, interstitial nephritis, glomerulonephritis, polycystic kidney 
disease, renal tubular acidosis, IgA nephropathy, hypercalcemia, Von Hippel-Lindau (VHL) 

15 syndrome, endocrine dysfunctions, growth and reproductive disorders, tonsillitis, 

Hirschsprung's disease, Crohn's Disease, appendicitis, and/or other diseases and pathologies. 

NOV54 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV54 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

20 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV54 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

25 disorders. 

NOV55 

A disclosed NOV55 nucleic acid of 1500 nucleotides (also referred to as CG56806-01) 
encoding a Heparan Sulfate 6-Sulfotransferase 3-Iike protein is shown in Table 55A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 74-76 and 
30 ending with a TGA codon at nucleotides 1490-1492. The start and stop codons are shown in 
bold in Table 55A, and the 5' and 3 5 untranslated regions, if any, are underlined. 

Table 55A. NOV55 nucleotide sequence (SEQ ID NO:201). 

CTTCCGAGCGGGCGCCCGTCCGCCCTGCCGCCGCCGCCGCCGCCGCTTCGCCTGCCGGCCTGAGAGCGQGAC 
CATGGATGAAAGGTTCAACAAGTGGCTGCTC 

GTACGTGTCCCCCTCCTGCACCAGCTCCZTGCACCAACTTCGGGGAGCAGCCCCGCGCGGGGGAGGCCGGCCC 
GCCCGCCGTCCCGGGTCCCGCCCGCCGGGCTCAGGCGCCGCCGGAGGAGTGGGAGCAGAGGAGGCCCCAGTT 
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GCCCCCGCCGCCCCGGGGGCCCCCCGAGGGACCTCGGGGGGCCGCGGCGCCGGAGGAGGAGGACGAGGAGCC 
CGGAGACCC CCGGGAGGGGGAGGAAGAGGAGGAGG AAGACG AGCCGGACCC CGAGGC C CCGGAAAACGGCTC 
CCTGCC CCGATTCGTGCCGCGCTTCAACTTCAGCCTGAAGGAC CTGACC CC5CTTCGTGGATTTCAACATCAA 
AGGGCGCGACGTGATCGTGTTCCTCCACATCCAGAAGACGGGGGGCACCACTTTCGGCCGGCACCTGGTGAA 
GAACAT CCGGCTGGAGCAGCCTTGTAGCTG CAAAGCGGGTCAG AAGAAGTGCACCTGCCACCGGC CTGGCAA 
GAAGGAGACGTGGCTCTTCTCCCGCTTCTCCACCGGCTGGAGCTGCGGGCTGCACGCCGACTGGACGGAGCT 
CACCAACTGCGTGC CGGCCATCATGGAGAAGAAGGACTGTCCCCGCAAC CACAG CCACACCAGGAATTTCTA 
TTACATCACAATGTTACGGGATCCAGTGTCACGTTACCTGAGCGAGTGGAAACATGTCC^GAGAGGGGCCA^ 
TTGGAAAACCTCTCTTCATATGTGTGATGGAAGAAGCCCCACCCCAGATGAGCTGCCTACCTGCTACCCTGG 
GGATGACTGGTCTGGGGTCAGCTTGCGGGAGTTTATGGATTGC^CCTACAACCTGGCTAACAATCGCCAGGT 
GCGCATGCTGGCTGACCTCAGCCTGGTGGGCTGCTATAACTTGACTTTCATGAACGAGAGTGAAAGAAACAC 
CAT CCTGTTG C AGAGTG CAAAG AACAAC CTGAAG AACATGG CC TTC TTTGG G CT CACTGAGT TC CAGAGG AA 
GACACAGTTTCTCTTTGAGAGAACATTCAACCTCAAGTTCATCTCCCCCTTCACACAGTTCAACATCACGCG 
GGCTTCTAACGTGGAGATCAACGAGGGTGCCCGCCAACGCATTGAGGATCTAAACTTCCTGGACATGCAGCT 
TTACGAGTATGCAAAAGATCTCTTCCAGCAGCGCTACCACCACACCAAGCAGCTAGAGCACCAGAGGGACCG 
CCAGAAGCGGCGGGAGGAG CGGAGG CTGCAGCG AGAGCACAGGGAC CAC CAGTGGCC CAAAG AAGATGGGG C 
TGCAGAAGGGACTOTCACCGAGGACTAC^CAGCCAGGTGGTGAGATGGTGACCTCCT^ 



In a search of public sequence databases, the NOV55 nucleic acid sequence, located on 
chromosome 13, has 1329 of 1492 bases (89%) identical to a gb.GENBANK- 
ID:AB024567|acc:AB024567.1 mRNA from Mus musculus (mRNA for heparan sulfate 6- 

5 sulfotransferase 3, complete cds) (E = 7. 1 e" 263 ). 

A disclosed NOV55 polypeptide (SEQ ID NO:202) encoded by SEQ ID NO:201 has 
944 amino acid residues and is presented in Table 55B using the one-letter amino acid code. 
Signal P 3 Psort and/or Hydropathy results predict that NOV55 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6850. Alternatively, 

1 0 NOV55 may also localize to the endoplasmic reticulum (membrane) with a certainty of 

0.6400, to the Golgi body with a certainty of 0.3700, or to the microbody (peroxisome) with a 
certainty of 0.1269. The most likely cleavage site for NOV55 is between positions 28 and 29: 
VSP-SC. 



15 



Table 55B. Encoded NOV55 protein sequence (SEQ ID NO:202). 



MDERFNKWLLTPVLTLLFWIMYQYVSPS 

PPPPRGPPEGPRGAAAPEEEDEEPGDPREGEEEEEEDEPDPEAPENGSLPRFVPRFNFSLKDLTRFVDFNIK 

GRDVI VFLHI QKTGGTTFGRHLVKNIRLEQPCS CKAGQKKCT CHRP GKKETWLFSRFSTGWS CGLHADWTEL 

TNCVPAIMEKKDCPRNHSHTRNFYYITMIjRDPVSRYLSE 

DDWSGVSLRE^^CTYNLANNRQVRMI^LSLTO 

TQFLFERTFNLKFISPFTQFNITRAS1TVEINEGARQRI 

QKRREERRLQREHRDHQWPKEIXrAAEGTVTC 

SCTNFGEQPRAGEAGPPAVPGPARRAQAPPEEWEQRRPQLPPPPRGPPEGPRGAAAPEEEDEEPGDPREGEE 
EEEEDEPDPEAPENGSLPRFVPRFNFSLKDLTRFVDFNIKGRDVIVFMIQKTGGTTFGRHLVKNIRLEQPC 
SCKAGQKKCTCHRPGKKETWLFSRFSTGWSCGLHADWT 

VSRYLSEWKHVQRGATWKTSLHMCDGRSPTPDELPTCYPGDDWSGVSLREFMDCTYNIANN^ 

VGCYNLTFMNESEK^rriLLQSAKNNLKNMAFFGLTEFQRKTQFl-FERTFl^ S PFTQFNITRASNVEINE 

GARQRIEDLNFLDMQLYEYAKDLFQQRYHHTKQLEHQRDRQKRREERRLQREHRDHQWPK^ 

YNSQWRW 



A search of sequence databases reveals that the NOV55 amino acid sequence has 447 
of 472 amino acid residues (94%) identical to, and 458 of 472 amino acid residues (97%) 
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similar to, the 470 amino acid residue ptnr:SPTREMBL-ACC:Q9QYK4 protein from Mus 
musculus (Mouse) (Heparan Sulfate 6-Sulfotransferase 3) (E = 7.5e" 254 ). 

NOV55 is predicted to be expressed in at least Right Cerebellum, Oviduct/Uterine 
Tube/Fallopian tube, Amygdala, and Kidney. This information was derived by determining the 
5 tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Literature sources, and/or RACE sources. 

NOV55 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 55C 



Table 55C. BLAST results for NOV55 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7657192 |ref|NP 0 
56635.1) 
(NM_015820) 


heparan sulfate 
6-0- 

sulfotransf erase 
3 [Mus mus cuius] 


470 


440/472 
(93%) 


452/472 
(95%) 


0.0 


gi 1 16552186 | db j | BAB 
71260. 1| (AK056706) 


unnamed protein 
product [Homo 
sapiens] 


605 


241/330 
(73%) 


284/330 
(86%) 


e-144 


gi| 14042611 |dbj| BAB 
55322.1) (AK027720) 


unnamed protein 
product [Homo 
sapiens] 


459 


242/330 
(73%) 


285/330 
(86%) 


e-143 


gi 1 7657190 |ref|NP 0 
56634. 1| 
(NMJU5819) 


heparan sulfate 
6-0- 

sulf ©transferase 
2 [Mus musculus] 


506 


246/369 
(66%) 


290/369 
(77%) 


e-140 


gi | 12545389 |ref |NP_ 
004798. 2| 
(NM_004807) 


heparan sulfate 
6-0- 

sul f ©transferase 
[Homo sapiens] 


401 


238/353 
(67%) 


286/353 
(80%)' 


e-138 



10 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 55D. In the ClustalW alignment of the NOV55 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (z.e., regions that may be required to preserve structural or 
15 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 55D. ClustalW Analysis of NOV55 

20 1) Novel NOV55a (SEQ ID NO:202) 

2) gi|7657i92|ref |NP_056635.1| (NMJU5820) heparan sulfate 6 -o-sulfo transferase 3 
[Mus musculus] (SEQ ID NO: 577) 

3) gi 1 16552186 |dbj|BAB71260.1 1 (AK056706) unnamed protein product [Homo sapiens] 
(SEQ ID N0:578) 



474 



WO 02/068649 



PCT/US02/02785 



4) gi 1 14042611 1 dbj |BAB55322.l| (AK027720) unnamed protein product [Homo sapiens] 
(SEQ ID NO: 579) 

5) gij7657190lreflNP_056634.il (NM_015819) heparan sulfate 6 -O-sulfo transferase 2 
[Mus musculus] (SEQ ID NO: 580) 

6) gi | 12545389 | ref |NP_004798 . 2 ) (NM_004807) heparan sulfate 6-O-sulf otransf erase 
[Homo sapiens] (SEQ ID NO: 581) 



NOV55 


1 


gi 


7657192) 


1 


gi 


16552186) 


1 


gi 


14042611) 


1 


gi 


7657190) 


1 


gi 


12545389) 


1 



10 



|....|... .|. 



20 
■ |.. 



30 



40 



50 



60 



,|....|. 



1 

1 

MALPAC^VREFEPPRQPERGAPVRTTCPRRHSRVEAELAASRPGSVAASVRAGPPRGVSH 6 0 
- 1 

1 

1 



NOV55 


1 


gi 


7657192 | 


1 


gi 


16552186) 


61 


gi 


14042611) 


1 


gi 


7657190) 


1 


gi 


12545389) 


1 



70 



80 



90 



100 



110 



)....). 



120 



1 

- - - -- 1 

GFHTRPLLDKP RKAS S SLAG AACAPL FALLS RGRRRRMHVLRRRWDLGSLCRALLTRGL A 12 0 

_ 1 

i 

1 



130 



140 



150 



160 



170 



NOV55 


1 


gi 


7657192| 


1 


gi 


16552186) 


121 


gi 


14042611) 


1 


gi 


7657190) 


1 


gi 


12545389) 


1 




180 

••I 

CTSSC 34 

ISCTSSC 34 

ECQ- 179 

STECQ- 33 

ICQ- 33 

S--- 31 



NOV55 

gi | 7657192 | 
gi | 16552186 | 
gi 1 14042611 1 
gi 1 7657190 | 
gi 1125453891 



190 200 

....| | | | 

3 5 TNFGEQPRAGEAGPPAVPGR 
3 5 TNFGEQLRSGEARPPAVPSPj 

179 l: 

33 — 

33 

31 LG . 



210 



220 




PPEEWEQRRPQLP) 
PLDEWE-RRPQL! 

[FSS- ■ 

ss- ■ 



PDDLY- 




230 240 
...|....|....| 

GPPEGPRGAAAPEEEDE 94 

PPEGSRGVAAPEDEDE 93 

191 

45 

-- 45 

50 



250 



NOV55 


95 


gi 


7657192) 


94 


gi 


16552186 | 


191 


gi 


14042611) 


45 


gi 


7657190) 


45 


gi 


12545389) 


50 




260 
,.|.. 
>EPDPEAPENG 
IPDPEAPENG 



270 



280 



290 




IVFLHI 
IVFLHI 
IVFLHI 
IVFLHI 
IVFLHI 
IVFLHI 



NOV55 



gi 
gi 
gi 
gi 
gi 



7657192) 

16552186) 

14042611) 

7657190) 

12545389) 



310 



320 



| 



330 



QKTGGTTFGRHLV^lgLEQPCgC^GQKKCTCHRPGKjjETWLFSRFSTGWSCGLHADWT 
QKTGGTTFGRHLV^i|leQPC|C^GQKKCTCHRPGKJ2ETWLFSRFSTGWSCGLHADWT 
QKTGGTTFGRHLVj^lS^ 

qktggttfgrhlv^isleqpc|cr|gqkkctchrpgkretwlfsrfstgwscglhadwt 
qktggttfgrhlv^isleqpcgcrsgqkkctchrpgkretwlfsrfstgwscglhadwt 
qktggttfgrhlvgn^lebpc5cr§gqkkctc5rpssretwlfsrfstgwscglhadwt 



qktggttfgrhlv^isleqpcgcrggqkkctchrpgkretwlfsrfstgwscglhadwt 
qktggttfgrhlv^iSleqpcScrRgqkkctchrpgkretwlfsrfstgwscglhadwt 



LEQPCSCRraGQKKCTCQRPSSRETWLFSRFSTGWSCGLHADWT 



NOV55 


215 


gi 


7657192) 


213 


gi 


16552186) 


295 


gi 


14042611 j 


149 


gi 


7657190) 


149 


gi 


12545389) 


145 




400 



410 



420 



^RWRIFQILDGTSKDRWGSSNFNSGANSPSSTKPRSTSKS 208 
- 165 
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440 



450 



10 



15 



20 



25 



NOV55 


236 




gi 


7657192 | 


234 




Si 


16552186) 


315 




gi 


14042611 | 


169 




gi 


7657190) 


209 


G 


gi 


12545389 | 


165 





NOV55 


296 


gi 


7657192) 


294 


gi 


16552186) 


375 


gi 


14042611) 


229 


gi 


7657190) 


269 


gi 


12545389) 


225 



NOV55 

gi|7657192| 
gi j 16552186 | 
gi I 14042611 | 
gi j 7657190 | 
gi | 12545389 | 



356 
354 
435 
289 
329 
285 




30 



35 



610 



620 



KOV55 

gi 
gi 
gi 
gi 
gi 



7657192 | 
16552186) 
14042611) 
7657190 | 
12545389 | 




630 
• I - - 



640 

..|....| 



650 



660 



_ 443 

1 441 

jEGRLLQTHFQSQGQGQSQN- - PNQNQSQNPNPN 552 
.KGRLLQTHFQSQGQGQSQN- - PNQNQSQNPNPN 406 
X3RFLQTHFQSQSQGQSQSQSPGQNLSQNPNPN 448 
-- - 372 



670 



40 



45 



NOV55 



gi 
gi 
gi 
gi 
gi 



7657192) 
16552186) 
14042611 j 
7657190 | 
12545389) 



| | | . 



680 

443 EHRD HQ' 

441 EHRA HR1 

553 ANQNLTQNLMQNLT QNLSQKENRES 

407 ANQNLTQNLMQNLT pLSQKENRES 

449 PNQNLTQNLSHNLTPSSNPNSTQRENRGSf 
372 AK 



690 



700 




Heparan-sulfate 6-sulfotransferase (HS6ST) catalyzes the transfer of sulfate from 3- 
phosphoadenosine 5-phosphosulfate to position 6 of the N-sulfoglucosamine residue of 
50 heparan sulfate. 

The disclosed NO V55 nucleic acid of the invention encoding a Heparan Sulfate 6- 
Sulfotransferase 3-like protein includes the nucleic acid whose sequence is provided in Table 
55 A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any 
of whose bases may be changed from the corresponding base shown in Table 55A while still 
55 encoding a protein that maintains its Heparan Sulfate 6-.Sulfotransferase 3 -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
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invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
5 chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 1 percent of the bases may be so 
changed. 

The disclosed NOV55 protein of the invention includes the Heparan Sulfate 6- 

10 Sulfotransferase 3-like protein whose sequence is provided in Table 55B. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 55B while still encoding a protein that maintains its 
Heparan Sulfate 6-Sulfotransferase 3-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 34 percent of the 

1 5 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Heparan Sulfate 6-Sulfotransferase 
3-like protein (NOV55) is a member of a "Heparan Sulfate 6-Sulfotransferase 3 family". 

20 Therefore, the NOV55 nucleic acids and proteins identified here may be useful in potential 

therapeutic applications implicated in (but not limited to) various pathologies and disorders as 
indicated below. The potential therapeutic applications for this invention include, but are not 
limited to: protein therapeutic, small molecule drug target, antibody target (therapeutic, 
diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene 

25 therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of 
all tissues and cell types composing (but not limited to) those defined here. 

The NOV55 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Diabetes, Autoimmune disease, Renal artery stenosis, 
Interstitial nephritis, Glomerulonephritis, Polycystic kidney disease, Systemic lupus 

30 erythematosus, Renal tubular acidosis, IgA nephropathy, Hypercalceimia, Lesch-Nyhan 
syndrome, Von Hippel-Lindau (VHL) syndrome , Alzheimer's disease, Stroke, Tuberous 
sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia,Leukodystrophies, 
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Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, and/or other diseases and 
pathologies. 

NOV55 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV55 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV55 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
10 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV56 

NOV56 includes two novel N-Hydroxyarylamine Sulfotransferase-like proteins 
disclosed below. The disclosed sequences have been named NOV56a and NOV56b. 
15 NOV56a 

A disclosed NOV56a nucleic acid of 1223 nucleotides (also referred to as CG56816- 
01) encoding a N-Hydroxyarylamine Sulfotransferase-like protein is shown in Table 56A. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 2-4 
and ending with a TGA codon at nucleotides 974-976. The start and stop codons are shown in 
20 bold in Table 56A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 56A. NOV56a nucleotide sequence (SEQ ID NO:203). 

TATGGCGAAGATTGAGAAAAACGCTCCCACGATGGAAAAAAAGCCAGAACTGTTTAACATCATGGAAGTAGA 
TGGAGTC C CT ACGTTGAT ATTATCAAAAGAATGGTGGG AAAAAGTATGT AATTT CCAAG C CAAG C CTGATGA 
TCTTATTCATGCATCCATGTTGTACCTGACTTTGGCT 

AAATTACCCAAAGTCAGGTACAACATGGATGCATGAAATTTTAGACATGATTCTAAATGATGGTGATGTGGA 

GAAATGCAAAAGAGCCCAGACTCTAGATAGACACGCTTTC^ 

AGATTTGGAGTTCGTTCTTGAAATGTCCTGACGA 

ACC^TCTATCTGGAAAGAAAACTGC^GATTGTCTATGTGACCAGAAATCCCAAGGATTGCCTGGTGTCCTA 

CTACCACTTTCACAGGATGGCTTCCTTTATGCCT 

GTCCGGAAAAGTTGTTGGCGGGTCCTGGTTTGACCATATC 

GAT CCTCTACCTCTTCTACGAGGATATTAAAAAAAATC CAAAACATGAGATC CACAAGGTGTTG G AATTC TT 
GGAGAAAACTTGGTCAGGTGATGTTATAAACAAGATTGTCCAC CAT AC CT CATTTGATGTAATGAAGGATAA 
TCCCATGGCCAACCATACTGCGGTACCTGCTC&CATATTCAAT^ 
GATGCCTGGJVGACTGGAAGAACCACTTTACTGTGGCTATGAATGAGAACTTO 

GATGGCAGGGTCCAC^CTGAACTTCTGCCTGGAGATCTGAGAGGAAC^C^CAAACTAGGTGACAGAGACT 
ATGCC^CTATTTCGCCTTTTATTCTGTTGAGC^AGGAACTGTGACTGAATGTGGAGCTTATGAGCTTCAGT 
CC^TCTCCTATAGTGTGGCTAGTTTGCTATAATATTAAAACATGATTTAAAATATCAAa^ 
CAG CAAAT AAAAT AAGAGAATTAGAGACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



In a search of public sequence databases, the NOV56a nucleic acid sequence, located 
on chromosome 2, has 633 of 921 bases (68%) identical to a gb:GENBANK- 
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ID:AF033653|acc:AF033653.1 mRNA from Mus musculus (phenol sulfotransferase mRNA, 
complete cds) (E = 7.3e" 270 ). 

A disclosed NOV56a polypeptide (SEQ ID NO:204) encoded by SEQ ID NO:203 has 
324 amino acid residues and is presented in Table 56B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV56a has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. Alternatively, 
NOV56a may also localize to the mitochondrial membrane space with a certainty of 0.1000, or 
to the lysosome (lumen) with a certainty of 0.1000. 



Table 56B. Encoded NOV56a protein sequence (SEQ ID NO:204). 



MAKIEKNAPTMEKKPELFNIME^ 
NYPKSGTTWMHEILDMILNIX3DVEKCKRA 

PSIWKENCKIVYVTRNPKDCLVSYYHFHRMASFMPDPQNLEEFYEKF^ 

IL YLF YED I KKNPKHE IHKVLEFLEKTWS GD VINK I VHHTSFDVMKDNPMANHTAVPAHI FNHS I S KFMRKG 
MPGDWKNHFTVAMNENFDKHYEKKMAGSTLNFCLEI 



10 

A search of sequence databases reveals that the NOV56a amino acid sequence has 155 
of 254 amino acid residues (61%) identical to, and 196 of 254 amino acid residues (77%) 
similar to, the 304 amino acid residue ptnr:SWISSPROT-ACC:P50237 protein from Rattus 
norvegicus (Rat) (N-Hydroxyarylamine Sulfotransferase (EC 2.8.2.-) (HAST-I)) (E = 6.7c 96 ). 

15 NOV56b 

In the present invention, the target sequence identified previously, NOV56a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 

20 examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 

25 related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

30 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 

uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
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redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
5 were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV56b. This is identical to the previously 
identified sequence (NOV56a). 

A disclosed NOV56b nucleic acid of 1 167 nucleotides (also referred to as CG56816- 
02) encoding a N-Hydroxyarylamine Sulfotransferase-Iike protein is shown in Table 56C. An 
10 open reading frame was identified beginning with a ATG initiation codon at nucleotides 33-35 
and ending with a TGA codon at nucleotides 91 8-920. The start and stop codons are shown in 
bold in Table 56C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 56C. NOV56b nucleotide sequence (SEQ ID NO:205). 

TATGGCGAAGATTGAGAAAAACGCTCCCACGCATGGGAAAAGAAAGCCAGGAACTGTTTAACATCATGGAAG 

TAGATGGAGTCCCTACGTTGATATTATCAAAAGAATGGTGGGAAAAAGTA^ 

ATGATCTTATTCTGGCAACTTACCCAAAGTCAGGTACAACM 

ATGATGGTGATGTGGAGAAATGCAAAAGAGCCCAGACTCTAGATA 

CCCATAAAGAAAAACCAGATTTGGAGTTCGTTCOTGA 

CTTCAC AT CTG ATT CC ACCATCTAT CTGG AAAG AAAACTGCAAGATTGTCT ATGTGG CCAGAAATC C CAAGG 

ATTGCCTGGTGTCCTACTACCACTTTCACAGGATGGCTTCCTTTATGCCTGATCCTCAGAACTTAGA 

TTTATGAGAAATTCATGTCCGGAAAAGTTGTTGGCGGGTCCTGGTTTGACCATATGAAAGGATGGTC 

C AAAAGACATG CAC CGG ATC CT CTAC CT CTT CT ACG AGGAT ATT AAAAAAAATCCAAAACATGAGAT CCACA 

AGGTGTTGGAATTCTTGGAGAAAACTTGGTCAGGTGATGTTATAAACAAGATTGTCC^CCATACCTCATTTC 

ATGTAATGAAGGATAATCCC^TGGCCAACCATACTGCGGTACCTGCTCACATATTCAA 

AATTTATGAGGAAAGGGATGCCTGGAGACTGGAAGAACCACTTTACTGTGGCTATGAATGAGAACTTTGATA 

AG CATTATGAAAAG AAGATGGC AGGGTCCACACTGAACTTCTG CCTGGAGAT CTO AG AGGAACAACAACAAA 

CTAGGTGACAGAGACTATGCCAAOTATTTCGCCTTTTATTCTGTTGAGCAAGGAACTCTGACTGAATGTGGA 

GCTTATGAGCTTCAGTCCATCTCCTATAGTGTGGCTAGTTTGCTATAATATTAAAACATGATTTAAAATATC 

AACAAAC CAGTT AC T C CAG CAAAT AAAAT AAGAG AATT AG AGACCAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAGGG 



In a search of public sequence databases, the NOV56b nucleic acid sequence, located 
1 5 on chromosome 2, has 649 of 91 9 bases (70%) identical to a gb:GENBANK- 

ID:AF033653|acc:AF033653.1 mRNA from Mus musculus (phenol sulfotransferase mRNA, 
complete cds) (E = 3.6e" 89 ). 

The disclosed NOV56b polypeptide (SEQ ID NO:206) encoded by SEQ ID NO:205 
has 295 amino acid residues and is presented in Table 56D using the one-letter amino acid 
20 code. Signal P, Psort and/or Hydropathy results predict that NOV56b has no signal peptide 
and is likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. 
Alternatively, NOV56b may also localize to the mitochondrial membrane space with a 
certainty of 0.1000, or to, the lysosome (lumen) with a certainty of 0.1000. 
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Table 56D. Encoded NOV56b protein sequence (SEQ ID NO:206). 

MGKE S QELFNIMEVDGVPTL I LSKEWWEKVCNFQAKPDDL ILATY PKSGTTWMHE ILDMI LNDGDVEKCKRA 
QTLDRHAFLELKF PHKEKPDLEFVLEMS S PQL I KTHLP SHL I P P S I WKENC KI VYVARNPKDCLVSYYHFHR 
MAS FMPD PQNLEEFYEKFMSGKWGG SWFDHMKGWWAAKDMHRIL YLFYED I KKNPKHE I HKVLEFLEKTWS 
GDVINKIVHHTSFDVMKDNPMiU^ 

LNFCLEI _ 



A search of sequence databases reveals that the NOV56b amino acid sequence has 1 73 
of 283 amino acid residues (61%) identical to, and 220 of 283 amino acid residues (77%) 
similar to, the 304 amino acid residue ptnr:SWISSPROT-ACC:P50237 protein from Rattus 
5 norvegicus (Rat) (N-Hydroxyarylamine Sulfotransferase (EC 2.8.2.-) (HAST-I)) (E = 4.0e" 99 ). 

NOV56b is predicted to be expressed in at least brain. . 

NOV56a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 56E 



Table 56E. BLAST results for NOV56a 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 17446341 1 ref | XP__ 
065757. l| 
(XMJ)65757) 


similar to 
sulfotransferase , 
phenol preferring 
2 ; Phenol 
sulfotransferase 
lcl (H. sapiens) 
[Homo sapiens] 


304 


299/324 
(92%) 


3 01/324 
(92%) 


e-166 


gi | 13929030 | ref |NP_ 
113920.1) _. 
(NMJ>31732) 


sulfotransferase , 
phenol preferring 
2; 

Phenol sul f ot rans f 
erase lcl (Rattus 
norvegicus] 


304 


171/303 
(56%) 


218/303 
(71%) 


le-94 


gi | 9055354 | ref |NP_0 

61221.1) 

(NM 018751) 


sulfotransferase , 
phenol preferring 
2 [MUs musculus] 


304 


172/303 
(56%) 


217/303 
(70%) 


4e-94 


gi| 16304836 |emb|CAC 
95180. 1| (AJ416889) 


sulfotransferase 
1C [Gallus 
gallus] 


307 


161/306 
(52%) 


214/306 
(69%) 


4e-88 


gi 1 14731177 | ref |XP_ 

010849.3) 

(XM 010849) 


SULT1C 

sulfotransferase 
[Homo sapiens] 


302 


161/285 
(56%) 


201/285 
(70%) 


2e-85 



10 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 56F. In the ClustalW alignment of the NOV56 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (ie., regions that may be required to preserve structural or 
15 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 56F. ClustalW Analysis of NOV56 



1) Novel NOV56a (SEQ ID NO:204) 

2) Novel NOV56b (SEQ ID NO:206) 

3 ) gi | 17446341 | ref |XP_065757 . 1 | (XM_065757) 
preferring 2; Phenol sulfotransf erase lcl (H 
NO:582) 

4) gi 1 13929030 |ref |NP__ii3 920 . 1 1 (NM_031732) sulf ©transferase, 
Phenolsulf ©transferase lcl [Rattus norvegicus] (SEQ ID NO: 583) 

5) gi | 9055354 | ref |NP_061221 . 1 J 
[Mus xnusculus] (SEQ ID NO: 584 ) 

6) gi| 16304836 |erab|CAC95180.l| 
ID NO:585) 

7) gi| 14731177 | ref |XP_010849. 3 
(SEQ ID NO: 586) 



similar to sulfotransf erase, phenol 
sapiens) [Homo sapiens) (SEQ ID 

phenol preferring 2; 

IS] (SEQ ID NO: 583) 
(NM_018751) sulfotransf erase, phenol preferring 2 

(AJ416889) sulfotransf erase 1C [Gallus gallus] (SEQ 

(XM_010849) SULT1C sulfotransf erase [Hozno sapiens] 



10 



NOV56a 
NOV56b 



gi 
gi 
gi 
gi 



17446341 ] ref 
13929030 j ref 
9055354 | ref | 
16304836 | emb 
14731177 ref 



NOV56a 
NOV56b 



gi 
gi 
gi 
gi 
gi 



17446341 I ref 
13929030 j ref 
9055354 | ref | 
16304836 | emb 
14731177 j ref 



NOV56a 
NOV56b 

gi | 17446341 | ref 
gi |13929030|ref 
gi | 9055354 | ref | 
gi j 16304836 | emb 
gi j 14731177 | ref 



NOV56a 
NOV56b 

gi 
gi 
gi 
gi 
gi 



17446341 | ref 
13929030 | ref 
9055354 | ref | 
16304836 | emb 
14731177 | ref 



GKESQ- 



IgKNAPT|§SKPEL 

EL! 

:^knapt^|kpel: 
:|mkdlhSg|qdlqpet: 
ple&kdlhiSeqnmqpet 




30 

•Li3n 

'Lip: 
§iP*skmms: 

PVTKFgC^I 
PTKEOPI 




110 120 

.KF^---HK 117 

»KF J HK 88 

KFE---HK 97 
--SPL 98 
LE--PPL 98 
IJDSSPL 100 
Klg SL 96 




250 



NOV56a 




NOV56b 




gi 


17446341 


ref 


gi 


13929030 


ref 


gi 


9055354 Irefl 


gi 


16304836 


emb 


gi 


14731177 


ref 




■it 



I. 



260 

..I.. 



270 



I. 



280 



I. 



290 



I , 



300 
. . I 



K I gHHTS FD VM KjjN? MANtTixg 
KlgHHTSFDVMKSwPMANSTr 

nki Rhhtsfdvm kSnpmanStI 
ki §3ht s fd vm kSnpmanSt 



S]a g^JNKigHHTSFDVMKjgNPMflNj 

jjffi SvHnki fejTS f^ikSnpm2^ 
% V 55 K iBh5tsfpvmkBnpmani 




SP^lEjHSISraFMRKGMPGDWKNi:iF 
JP^I^HSISgFMRKGMPGDWiO^F 
HSISSFMRKGMPGDWKNgF 
I^HSISgFMRKGMPGDWKNWF 

i 'fiifiHS i sBfmrxgmpgdwknRf 
••IhsBsBfmrkg^gdwknRf 



297 
268 
277 
277 
277 
280 
275 
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10 



NOV56a 




NOV56b 




gi 


17446341 


ref 


gi 


13929030 


ref 


gi 


90553541 ref 1 


gi 


16304836 


emb 


gi 


14731177 


ref 
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Tables 56G lists the domain descriptions from DOMAIN analysis results against 
NOV56. This indicates that the NOV56 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 56G Domain Analysis of NOV56a 

gnl jPfamjpfaniO 06 85, Sulfotransfer, Sulfo transferase protein (SEQ ID 
NO: 855) 

CD-Length = 269 residues, 99.6% aligned 
Score « 260 bits (665), Expect = 7e-71 





NOV56: 


25 


20 


Sbjct: 


2 




N0V56 : 


84 




Sbjct: 


42 


25 


NOV56 : 


144 




Sbjct: 


100 


30 


N0V56: 


203 




SbjCt: 


160 




N0V56: 


263 


35 


Sb j Ct : 


220 



GVPTLILS KEWWEKVCN- FQAKPDDLIHASMLYLTLGKLPEEDHQAWLGNYPKSGTTWMH 83 

i i i 111*111* * mum* 

GFWVDKFHLEGWRKIKNCFQARPDDV LIAGYPKSGTTWLQ 4 1 

EILDMIIjOTXSDVEKCKRAQTLDWIAF^ 143 

III * I II I IK HI I* 111*1111111 I* 

EILSLHPNVGDFBPSPSDPLLFRNPWLEYPKGEDWYETLKP--MPSSPRLIKTHLPLELL 99 



1*11 



III 



+ 1 



III I 1**111+ 11*1 



I* III * **l*l 111*1**1 II I* III I * **!** 1*11 *ll 

160 DHVLGWWELRPEPQVLFLDYEDLKEDPAGEIKKIAEFLGLPLSEEELDKLLDHSSFFLMK 219 



11**1* * I lllll* 11111*11 II III **| 

220 LNPLSNYETLCLGKS KGRKS P FMRKGLVGDWKNYFTPEQNEKFDKVIKEK 269 



This protein carries out sulfation of phenols and bioactivation of n-hydroxyarylamines. 
It is responsible for the formation of N-hydroxy-2-acetylaminofluorene, a reactive metabolite 
which exhibits toxicity by binding to DNA, RNA and protein. 

40 Hepatic sulfation of heterocyclic and non-heterocyclic arylamines was studied to 

assess enzymes responsible for their metabolisms. Both 2-amino-3-methyIimidazo[4,5- 
fjquinoline (IQ)- and non-IQ-type (beta-carboline) heterocyclic amines were N-sulfated to 
form their sulfamates in cytosols of rat livers in the presence of 3-phosphoadenosine-5 f - 
phosphosulfate (PAPS). An arylsulfo-transferase, ST1A1, whose cDNA was isolated from a 

45 rat cDNA library, was expressed in COS-1 cells. The expressed enzyme catalyzed N-sulfation 
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of IQ, but not appreciably those of 2-amino-l-methyI-6-phenylimidazo[4 ? 5-b]pyridine (PhIP), 
2-amino-6-methyldipyrido[l,2-a:3',2 f -d] imidazole (Glu-P-I), and 3-amino-l-methyl-5H- 
pyrido[4,3-b] indole (Trp-P-2). N-Sulfation of heterocyclic amines except IQ was higher in 
hepatic cytosols of female rats than of male rats. These results suggest the involvement of at 
5 least plural forms of sulfotransferase on the N-sulfation. In addition, N-sulfation of IQ was 
also observed in cytosol of a human liver, suggesting that N-sulfation is one of the metabolic 
pathways of heterocyclic amines in humans as well as rats. Hepatic sulfotransferase also 
catalyzes metabolic activation of N-hydroxy derivatives of carcinogenic arylamines. Using 
anti-HAST (hydroxylarylamine sulfotransferase) antibodies and ST1 Al cDNA as screening 

10 probes, several cDNA clones were isolated from the cDNA library. A new member of 

arylsulfotransferase, ST1C1, whose cDNA shows considerable sequence similarity to ST1 Al 
cDNA, was found to catalyze O-sulfation of N-hydroxy-2-acetylaminofluorence by the cDNA 
expression in COS-1 cells. From the close similarity of ontogenic profile and sex-specific 
expression of ST1C1 and HAST, ST1C1 cDNA was shown to encode a major sulfotransferase 

15 (HAST) mediating the metabolic activation of N-hydroxyarylamines in rat livers. In addition, 
properties of PAPS-dependent N-hydroxyarylamine activation and sulfotransferase in human 
livers are also discussed. 

The disclosed NOV56 nucleic acid of the invention encoding a N-Hydroxyarylamine 
Sulfotransferase-like protein includes the nucleic acid whose sequence is provided in Table 

20 56A, 56C or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 56A or 56C 
while still encoding a protein that maintains its N-Hydroxyarylamine Sulfotransferase -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

25 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

30 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 48 percent of the 
bases may be so changed. 
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The disclosed NOV56 protein of the invention includes the N-Hydroxyarylamine 
Sulfotransferase-like protein whose sequence is provided in Table 56B or 56D. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 56B or 56D while still encoding a protein that 
5 maintains its N-Hydroxyarylamine Sulfotransferase-like activities and physiological functions, 
or a functional fragment thereof. In the mutant or variant protein, up to about 32 percent of 
the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as Fab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

1 0 The above disclosed information suggests that this N-Hydroxyarylamine 

Sulfotransferase-like protein (NOV56) is a member of a "N-Hydroxyarylamine 
Sulfotransferase family". Therefore, the NOV56 nucleic acids and proteins identified here may 
be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 

1 5 invention include, but are not limited to: protein therapeutic, small molecule drug target, 

antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here, 

20 The NOV56 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in metabolic diseases and disorders, and/or other diseases 
and pathologies. 

NOV56 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV56 substances for use in 

25 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV56 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used hi 
assay systems for functional analysis of various human disorders, which will help in 

30 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV57 

A disclosed NOV57 nucleic acid of 953 nucleotides (also referred to as CG56829-01) 
encoding a Testis Specific Serine Kinase-3-like protein is shown in Table 57A. An open 
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reading frame was identified beginning with a ATG initiation codon at nucleotides 50-52 and 
ending with a TGA codon at nucleotides 854-856. The start and stop codons are shown in 
bold in Table 57A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 57 A- NOV57 nucleotide sequence (SEQ ID NO:207). 

CAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGACAGCA TGGAGGACTTTCTGCTCTCCAA 
TGGGTACCAGCTGGGCAAG 

CCAAAGAAAAGTGGCAATTAAAGTTATAGACAAGATGGGAACTTCCTCAGAGTTTAT CCAGAGATTCCTC CC 
TOSGGAGCTCCAAATCGTCCGTACCCTGGACCAC^GAACATCATCa^GGTGTATGAGATGCTGGAGTCTGC 
CGACGGGAAAAT CTG CCTGGTG ATGGAG CT CG CTGAGGGAGGGGATGT CT TTGACTG CG TGC TGAATGGGGG 
GCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTTGAGGCCATCCGCTACTGCCATGGCTG 
TGGTGTGGCCCACCGGGACCTCAAATGTGAGAACGCCTTGTTGC^GGGCTTCAACCTGAAGCTGACTGACTT 
TGGCTTTGCOVAGGTGTT<^CCAAGT^^ 

TGCCCCCGAGGTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTCCT 
GTATGTCATGCTCTGTGCCAGCCTACCTTTTGACGACACAGACATCCCCAAGATGCTGTGGCAGCAGCAGAA 
GGGGGTGTCCTTCCCCACTCATCTGAG<^TCTCGGCC^ 

CGATATGATCCTCCGGCCTTCAATTGAAGAAGTTAGTTGGCATCCATGGCTAGCAAGCACTTGATAAAAGCA 
ATGGCAAGTGCTCTCCAATAAAGTAGGGGGAGAAAGCAAACCCAAAAACCCGCTTCT 
ATT'rrACGCTTTAAGTT : 



5 In a search of public sequence databases, the NOV57 nucleic acid sequence, located on 

chromosome 1, has 831 of 912 bases (91%) identical to a gb:GENBANK- 
ID:AF201 734|acc:AF201 734. 1 mRNA from Mus musculus (testis specific serine kinase-3 
(Tssk-3) mRNA, complete cds) (E = 2.0e" 165 ). 

A disclosed NOV57 polypeptide (SEQ ID NO:208) encoded by SEQ ID NO:207 has 
10 268 amino acid residues and is presented in Table 57B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV57 has no signal peptide and is 
likely to be localized to the cytoplasm with a certainty of 0.4500. Alternatively, NOV57 may 
also localize to the microbody (peroxisome) with a certainty of 0.1821, to the mitochondrial 
matrix space with a certainty of 0. 1 000, or to the lysosome (lumen) with a certainty of 0. 1 000. 

15 



Table 57B. Encoded NOV57 protein sequence (SEQ ED NO:208). 

MEDFLLSNGYQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQI 
VYEMLESAIX3KICLVMELAEGGDVFD 

FNLKLTDFGFAKVLPKSHRELSQTFCGSTAYAAPEVLQGI PHDSKKGDVWSMGWLYVMLCASLPFDDTDI P 
KMLWQQQKGVSFPTHLSISADCQDLLKRLLEPDMILRPSIEEVSWHPWLAST 



A search of sequence databases reveals that the NOV57 amino acid sequence has 240 
of 268 amino acid residues (89%) identical to, and 245 of 268 amino acid residues (91%) 
similar to, the 266 amino acid residue ptnnSPTREMBL-ACC:Q9JL98 protein from Mus 
20 musculus (Mouse) (Testis Specific Serine Kinase-3) (E = 6.2e" 124 ). 

NOV57 is predicted to be expressed in at least lung, testis, B-cell, brain, head and 
neck. Expression information was derived from the tissue sources of the sequences that were 



486 



WO 02/068649 



PCT/US02/02785 



included in the derivation of the sequence of CuraGen Acc. No. CG56829-0L The sequence is 
predicted to be expressed in testis because of the expression pattern of (GENBANK-ID: 
gb:GENBANK-ID:AF201734|acc:AF201734.1) a closely related Mus musculus testis specific 
serine kinase-3 (Tssk-3) mRNA, complete cds homolog. 
5 NOV57 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 57C 



Table §7C BLAST results for NOV57 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 16418343 |ref|NP 
44-3 073.1] 
(NM_052841) 


testis -specific 
serine /threonine 
kinase 22 C [Homo 
sapiens] 


268 


265/268 
(98%) 


265/268 
(98%) 


e-157 


gi|l2860201.|dbj |BAB 
31876.1 | (AK019840) 


data source : MGD , 
source 

key :MGI: 1929914, 
evidence : ISS-puta 
tive-serine/threo 
nine kinase 22 C 
(spermiogenesis 
associated) [Mus 
musculus] 


268 


259/268 
(96%) 


261/268 
(96%) 


e-153 


gi | 10946880 |ref|NP 
067454. l| 
(NM_021479) 


s er ine / threonine 
kinase 22C 
( spermiogenes is as 
sociated) ; testis 
specific serine 
kinase-3 (Tssk-3) 
[Mus musculus] 


266 


240/273 
(87%) 


245/273 
(88%) 


e-136 


gi | 16507245 |ref|NP 
443732. 1| 
(NMJ353006) 


serine /threonine 
kinase 22B 
( spermiogenesisas 
sociated) ; testis 
specific serine 
threonine kinase 
2 [Homo sapiens) 


358 


127/266 
(47%) 


192/266 
(71%) 


le-70 


gi | 14776972 |ref|xp 

033051.1) 

(XM_033051) 


serine/ threonine 
kinase 22B 
(spermiogenesis 
associated) [Homo 
sapiens] 


358 


127/266 
(47%) 


192/266 
(71%) 


le-70 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 57D. In the ClustalW alignment of the NOV57 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (Le. 9 regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and. 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 57D. ClustalW Analysis of NO V57 

1) Novel NOVS7 (SEQ ID NO:208) 

2) gi) 16418343 | ref |NP_443073 . 1 | (NM_052841) testis-specif ic serine/threonine kinase 
22C [Homo sapiens] {SEQ ID NO: 587) 

3) gi | 12860201 |dbj (BAB31876.1 | (AK019840) data SOUrce:MGD, source key :MGI: 1929914, 
evidence :ISS~putative-serine/ threonine kinase 22C (spermiogenesis associated) [Mus 
musculus] (SEQ ID NO: 588) 

4) gi | 10946880 |ref |NP_067454 . 1 | (NMJ>21479) serine/ threonine kinase 22C 
(spermiogenesisassociated) ; testis specific serine kinase-3 (Tssk-3) [Mus musculus] 
(SEQ ID NO: 589) 

5) gi 1 16507245 | ref |NP__443732 . 1 1 (NM_053006) serine./ threonine kinase 22B 
(spermiogenesisassociated) ; testis specific serine threonine kinase 2 [Homo sapiens] 
(SEQ ID NO:590) 

6) gi | 14776972) ref |XP_033051 .l| (XM_033051) serine /threonine kinase 22B 
(spermiogenesis associated) [Homo sapiens] (SEQ ID NO: 591) 



NOV57 




gi 


16418343 


ref 


gi 


12860201 


dbj 


gi 


10946880 


ref 


gi 


16507245 


ref 


gi 


14776972 


ref 


NOV57 




gi 


16418343 


ref 


gi 


12860201 


dbj 


gi 


10946880 


ref 


gi 


16507245 


ref 


gi 


14776972 


ref 



NOV57 




gi 


16418343 


Iref 


gi 


1286020l|dbj 


gi 


10946880 


|ref 


gi 


16507245 


ref 


gi 


14776972 


ref 


NOV57 




gi 


16418343 


ref 


gi 


12860201 


dbj 


gi 


10946880 


ref 


gi 


16507245 


ref 


gi 


1477 6972 


ref 


NOV57 




gi 


16418343 


ref 


gi 


12860201 


dbj 


gi 


10946880 


ref 


gi 


16507245 


ref 


gi 


14776972 


ref 




PPKPKPTS SASFKREGEGKYRAECKLD 300 
•PPKPKATS SAS FKREGEGKYRAECKLD 300 



NOV57 

gi | 16418343 | ref 



310 
|....|....|, 



320 



330 



340 



|....|....| 



350 



|....| 



268 
268 
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gi 
gi 
gi 
gi 



12860201 1 dbj 268 

10946880 jref 266 

16507245 j ref TKTGLRPDHRPDHKLGAKTQHRLLVVPENENRMEDRLAETS RAKDHHI S GAEVGKAS T 358 

14776972 | re£ TKTGLRPDHRPDHKLGAKTQHRLLVVPENENRMEDRLAETSRAKDHHISGAEVGKAST 358 



10 



Tables 57E-G lists the domain descriptions from DOMAIN analysis results against 
NOV57. This indicates that the NOV57 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 57E Domain Analysis of NOV57 

gnl | Smart | smart 00220, S_TKc, Serine/Threonine protein kinases, 
catalytic domain; Phosphotransferases. Serine or threonine- specific 
kinase subfamily. {SEQ ID NO: 856) 
CD-Length = 256 residues, 100.0% aligned 
Score = 258 bits (659), Expect = 3e-70 



15 



20 



25 



30 



N0V57 : 


10 


Sbjct: 


1 


NOV57 : 


70 


Sb j Ct : 


60 


NOV57 : 


130 


Sbjct: 


119 


NOV57 : 


189 


SbjCt: 


177 


NOV57 : 


246 


SbjCt: 


237 



YQLGKTI GEGTYS KVKEAFSKKHQRKVAI KVI DKMGT S SEF I QRFL PRELQI VRTLDHKN 6 9 

1+1 + + ii i ii + nun i + +i i 11++1++ hi i 

YELLEVLGKGAFGKVYLARDKKTGKLVAI KVI KKEKLKKKKRERI L - RE I K.I LKKLDHPN 5 9 

1 1 QVYEMLES ADGKICLVMELAEGGDVFDCVLNGGPLPESR^ 129 

1+++1++ 1 i i+ mi mini + i i i i+ n++ kin 

I VKLYDVFED - DDKLYLVMEYCEGGDLFDLLKKRGRIiS EDEARFYARQ ILS ALEYLHSQG 118 



+ iiiii ii ii ++n iii ii i i i ii i+ i inn i + 



i i+n+ii+n +i n 



ii i + ii n+ 



+ + + + 



Table 57F Domain Analysis of NOV57 

gnl|Pfam|pfam00069 f pkinase, Protein kinase domain (SEQ ID NO: 857) 
CD-Length » 256 residues, 100.0% aligned 
Score = 234 bits (596) , Expect * 6e-63 





NOV57 : 


10 


35 


Sbjct: 


1 




N0V57 : 


70 




Sbjct: 


59 


40 






NOV57: 


129 




Sbjct: 


118 


45 


N0V57 : 


188 



YQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQIVRTLDHKN 69 

1+11+ +11+11+ I IIII++ I I + ++ ll+ll+l I I I 

YELGEKLGSGAFGKVYKGKHKDTGEI VAI KILKKRSLSEK - - KKRFLREI QILRRLSHPN 58 

I IQVYEMLESADGKICLVMELAEGGDVFDCVL -NGGPLPESRAKALFRQMVEAIRYCHGC 128 

i+++ +i i + mi iin+n + n ii n + i++ +ii 

IVRLLGVFEE - DDHLYLVME YMEGGDLFDYLRRNGLLLSEKE AKKI ALQI LRGLEYIjHSR 117 



i+ inn ii ii + +i+ in i+ 1 i i 

GIVHRDLKPENILLDENGTVKIADFGLARKLESSSYEB 

SKKGDVWSMGWLYVMLCASLPFDDTDI PKMLWQQQKG 
I I 1111+11*11 +1 III I + |++ ++ 

489 



+ I lllll+l 

;TPE YMAPEVLEG - RGS 

IL- -SISADCQDLLKSI 
I + I + +11+1+ 
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Sbjct : 177 SS KVDVWS LGVI L YELLTGKLPFPGI DPLEEI1FRIKERPRLRLPLPPNCSEEL1KDLI KKC 236 



NOV57 : 24 6 LEPDMILRPSIEEVSWHPWL 265 

I I II* +1+ III 
Sbjct: 237 LNKDPEKRPTAKEILNHPWF 256 



Table 57G Domain Analysis of NOV57 



gnl | Smart | smart 0021 9, TyrKc, Tyrosine kinase, catalytic domain; 
Phosphotransferases. Tyrosine -specific kinase subfamily (SEQ ID 
NO: 858) 

CD-Length = 258 residues, 97.7% aligned 
Score - 115 bits (289) , Expect = 2e-27 



10 



15 



20 



25 



NOV57 : 12 LGKT I GEGTY SKVKEAF - — SKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQI VRTLDHK 68 

Ml *HI + +1 * + +11*1 + 1 I + I* II II +++I III 

Sbjct: 3 LGKKLGEGAFGEVYKGTL KGKGGVEVEVAVKTL - KEDASEQQ I EE FL - REARLMRKLDHP 60 

NOV57: 69 NI IQVYEMLESADGKICLVMELAEGGDVFDCVIiNGGP - - LPESRAKALFRQMVEAIRYCH 126 

|| +++ + ++ + + ||| ,|, | + | + I]) + |+ + | 
Sbjct: 61 NI VKLLGVC - TEEE PLM I VMEYMEGGDLLDYIiRKNRPKEI*SLSDLLS FALQI ARGMEYLE 119 

NOV57 : 127 GCGVAHRDLKCENALL- QGFNLKLTDFGFAK - VLPKSHRELSQTFCGSTAYAAPEVLQGI 184 

mi i i* + +i* in i* + + ++ + in i+ 

Sb j ct : 120 SKNFVHRDLAARNCLVGENKTVKIADFGLARDLYDDDYYRKKKSPRLPIRWMAPESLKDG 179 

NOV57 : 185 PHDSKKGDVWSMGWLYVML- CASLPFDDTDI PKMLWQQQKGVSFPTKLS I SADCQDLLK 243 

I I till 11 + 1 + + 1+ ++I +11 | + + ||+ 

Sbjct: 180 KFTS - KSDWSFGVLLWEIFTLGESPYPGMSNEEVLEYLKKGYRLPQPPNCPDEIYDLML 238 

NOV57: 244 RLLEPDMILRPSIEEV 259 

I 11+ 1+ 
Sbjct: 239 QCWAEDPEDRPTFSEL 254 



Eukaryotic protein kinases (1) are enzymes that belong to a very extensive family of 
30 proteins which share a conserved catalytic core common with both serine/threonine and 

tyrosine protein kinases. Protein phosphorylation is a fundamental process for the regulation of 
cellular functions. The coordinated action of both protein kinases and phosphatases controls 
the levels of phosphorylation and, hence, the activity of specific target proteins. One of the 
predominant roles of protein phosphorylation is in signal transduction, where extracellular 
35 signals are amplified and propagated by a cascade of protein phosphorylation and 

dephosphorylation events. Two of the best characterized signal transduction pathways involve 
the cAMP-dependent protein kinase and protein kinase C (PKC). Each pathway uses a 
different second-messenger molecule to activate the protein kinase, which, in turn, 
phosphorylates specific target molecules. Extensive comparisons of kinase sequences defined 
40 a common catalytic domain, ranging from 250 to 300 amino acids. This domain contains key 
amino acids conserved between kinases and are thought to play an essential role in catalysis. 
In the N-terminal extremity of the catalytic domain there is a glycine-rich stretch of residues in 
the vicinity of a lysine residue, which has been shown to be involved in ATP binding. In the 



WO 02/068649 



PCT/US02/02785 



central part of the catalytic domain there is a conserved aspartic acid residue which is 
important for the catalytic activity of the enzyme (2). 

Protein kinases and phosphatases regulate cell-cycle progression, transcription, 
translation, protein sorting and cell adhesion events that are critical to the inflammatory 

5 process. Two of the best-characterized immunosuppressants, cyclosporin and rapamycin, are 
also effective anti-inflammatory drugs. They act directly on protein phosphorylation and, as 
such, validate the concept that small-molecule modulators of phosphorylation cascades 
possess anti-inflammatory properties (3). 

Some examples of the role of serine/threonine protein kinases that are important in cell 

1 0 proliferation and disease include AKT, RAF1 and PIM1 . Dudek et al. (4) demonstrated that 
AKT is important for the survival of cerebellar neurons. Thus, the 'orphan 1 kinase moved 
center stage as a crucial regulator of life and death decisions emanating from the cell 
membrane. Holland et al. (5) transferred, in a tissue-specific manner, genes encoding 
activated forms of Ras and Akt to astrocytes and neural progenitors in mice. These authors 

1 5 found that although neither activated Ras nor Akt alone was sufficient to induce glioblastoma 
multiforme (GBM) formation, the combination of activated Ras and Akt induced high-grade 
gliomas with the histologic features of human GBMs. These tumors appeared to arise after 
gene transfer to neural progenitors, but not after transfer to differentiated astrocytes. Increased 
activity of Ras is found in many human GBMs and Akt activity is increased in most of these 

20 tumors, implying that combined activation of these 2 pathways accurately models the biology 
of this disease (5). 

Another disease that involves yet another serine/threonine kinase is Peutz-Jeghers 
syndrome (PJS) , an autosomal dominant disorder characterized by melanocyte macules of the 
lips, buccal mucosa, and digits, multiple gastrointestinal ham&rtomatous polyps, and an 

25 increased risk of various neoplasms. Jenne et al. (6) identified and characterized the 

serine/threonine kinase STK1 1 and identified mutations in PJS patients. All 5 germline 
mutations were predicted to disrupt the function of the kinase domain. They concluded that 
germline mutations in STK1 1, probably in conjunction with acquired genetic defects of the 
second allele in somatic cells according to the Knudson model, caused the manifestations of 

30 PJS. These authors commented that PJS was the first cancer susceptibility syndrome identified 
that is due to inactivating mutations in a protein kinase and found mutations in the STK1 1 
gene in 1 1 of 12 unrelated families with PJS. Ten of the 1 1 were truncating mutations. All 
were heterozygous in the germline. Su et al. (7) found that of 53 PJS patients with cancer 
reported to that time, 6(11%) were diagnosed with pancreatic adenocarcinoma. Su et al. (7) 
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presented evidence that the STK1 1 gene plays a role in the development of both sporadic and 
familial (PJS) pancreatic and biliary cancers. They found that in sporadic cancers, the STK1 1 
gene was somatically mutated in 5% of pancreatic cancers and in at least 6% of biliary cancers 
examined. In the patient with pancreatic cancer associated with PJS, there was inheritance of a 
5 mutated copy of the STK1 1 gene and somatic loss of the remaining wildtype allele. 

The novel human serine/threonine protein kinase of the invention contains a protein 
kinase domain. Therefore it is anticipated that this novel protein has a role in the regulation of 
essentially all cellular functions and could be a potentially important target for drugs. Such 
drugs may have important therapeutic applications, such as treating numerous inflammatory 
10 diseases. 

The disclosed NOV57 nucleic acid of the invention encoding a Testis Specific Serine 
Kinase-3-like protein includes the nucleic acid whose sequence is provided in Table 57A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 57A while still encoding a 

1 5 protein that maintains its Testis Specific Serine Kinase-3 -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

20 include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 

25 acids, and their complements, up to about 9 percent of the bases may be so changed. 

The disclosed NOV57 protein of the invention includes the Testis Specific Serine 
Kinase-3-like protein whose sequence is provided in Table 57B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 57B while still encoding a protein that maintains its Testis Specific 

30 Serine Kinase-3-like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 53 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this Testis Specific Serine Kinase-3-like 
protein (NOV57) is a member of a "Testis Specific Serine Kinase-3 family". Therefore, the 
NOV57 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 

5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

10 The NOV57 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in metabolic diseases and disorders, and/or other diseases 
and pathologies. 

NOV57 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV57 substances for use in 

1 5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV57 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

20 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV58 

NOV58 includes two Gap Junction Beta-5-like proteins, designated herein as NOV58a 
and NOV58b. Gap junctions are conduits that allow the direct cell-to-cell passage of small 
25 cytoplasmic molecules, including ions, metabolic intermediates, and second messengers, and 
that thereby mediate intercellular metabolic and electrical communication. Gap junction 
channels consist of connexin protein subunits, which are encoded by a multigene family. 

NOV58a 

30 The disclosed NOV58a (alternatively referred to herein as CG56315-01) includes the 

728 nucleotide sequence (SEQ ID NO:209) shown in Table 58A. A NOV58a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 28-30 and ends with a stop codon 
at nucleotides 697-699. The disclosed NOV58a maps to human chromosome 6. 
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Table 58A. NOV58a Nucleotide Sequence (SEQ ID 
NO:209) 

AGTCTTGCCTTCTTTTGAGCCTAAGTCATGAGTTGGATGTTCCTCAGAGATCTCCTGAGT 

GGAGTAAATAAATACTCCACTGGGACTGGATGGATTTGGCTGGCTGTCGTGTTTGTCTTC 

CGTTTGCTGGTCTACATGGTGGCAGCAGAGCACGTGTGGAAAGATGAGCAGAAAGAGTTT 

GAGTGCAACAGTAGACAGCCCGGTTGCAAAAATGTGTGTTTTGATGACTTCTTCCCCATT 

TCCCAAGTCAGACTTTGGGCCTTACAACTGATAATGGTCTCCACACCTTCACTTCTGGTG 

GTTTTACATGT AG CCTATCATGAGGGTAGAGAGAAAAGGCACAGAAAGAAACT CTATGTC 

AGCCCAGGTACAATGGATGGGGGCCTATGGTACGCTTATCTTATCAGCCTCATTGTTAAA 

ACTGGTTTTGAAATTGG CTT C CTTGTTTT ATTTTATAAGC T AT ATGATGGC TTT AG TGTT 

CCCTAC CTTATAAAGTGTGATTTGAAGCCTTGTCCCAACACTGTGGACTGCTTCAT CT CC 

AAACCCACTGAGAAGACGATCTTCATCCTCTTCTTGGTCATCACCTCATGCTTGTGTATT 

GTGTTGAATTTCATTGAACTGAGTTTTTTGGTTCTCAAGTGCTTTATTAAGTG 

CAAAAAT ATTT AAAAAAAC CT CAAGT CCT CAGTGTGT GAGTGCCACAGC CT CAGAT ATGT 

TGAATGTG 



A NOV58a polypeptide (SEQ ID NO:210) encoded by SEQ ID NO:209 is 223 amino 
5 acids in length and is presented using the one-letter amino acid code in Table 58B. The Psort 
profile for NOV58a predicts that this sequence has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0.6000. In alternative embodiments, a 
NOV58a polypeptide is located to the mitochondrial inner membrane with a certainty of 
0.4358, to the endoplasmic reticulum (membrane) with a certainty of 0.3000, or to the Golgi 
10 with a certainty of 0.4000. The Signal P predicts a likely cleavage site for a NOV58a peptide 
is between positions 40 and 41, at the dash in the sequence VAA-EH. 



Table 58B. NOV58a Polypeptide Sequence (SEQ ID 
NO:210) 

"mswmflr^ 

knvcfddffp i s qvrlwalql i mvs tps ll wlhvayhegrekrhrkklyvs pgtmdggl 
wyaylislivktgfeigflvlfyklydgfsvpylikcdlkpcpntvdcfi skptekti fi 
lflvitsclcivlnfielsflvlkcfikcclqkylkkpqvlsv 

NOV58b 

1 5 The disclosed NOV58b (alternatively referred to herein as CG563 1 5-02) includes the 

727 nucleotide sequence (SEQ ID NO:21 1) shown in Table 58C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 27-29 and ends with a stop codon at 
nucleotides 696-698. The disclosed NOV58b maps to human chromosome 10. 



Table 58C NOV58b Nucleotide Sequence (SEQ ED 
NO:211) 

AGTCTTGCTTCTTTTGAGCCTAAGTCATGAGTTGGATGTTCCTCAGAGATCTCCTGAGTG 
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GAGTAAATAAATACTCCACTGGGATTGGATGGATTTGGCTGGCTGTCGTGTTTGTCTTCC 
GTTTGCTGGTCTACATGGTGGCAGCAGAGCACGTGTGGAAAGATGAGCAGAAAGAGTTTG 
AGTGCAACAGTAGACAGCCCGGTTGCAAAAATGTGTGTTTTGATGACTTCTTCCCCATTT 
CCCAAGTCAGACTTTGGGCC TTACAACTGATAATGGT CTC CACACCTTCACTTCTGGTGG 
TTTTACATGTAGCCTATCATGAGGGTAGAGAGAAAAGGCACAGAAAGAAACTCTATGTCA 
GCCCAGGTACAATGGATGGGGGCCTATGGTACGCTTATCTTATCAGCCTCATTGTTAAAA 
CTGGTTTTGAAATTGGCTTCCTTGTTTTATTTTATAAGCTATATGATGGCTTTAGTGTTC 
CCTACCTTATAAAGTGTGATTTGAAGCCTTGTCCCAACACTGTGGACTGCTTCATCTCCA 
AACCCACTGAGAAGACGATCTTCATCCTCTTCTTGGTCATCACCTCATGCrTGTGTATTG 
TGTTGAATTTCATTGAACTGAGTTTTTTGGTTCTCAAGTGCTTTATTAAGTGCTGTCTCC 
AAAAATATTTAAAA7^AACCTCAAGTCCTCAGTGTGTGAGTGCCACAGCCTCAGATATGTT 
GAATGTG 



A NOV58b polypeptide (SEQ ID NO:212) encoded by SEQ ID NO:21 1 is 628 amino 
acids in length and is presented using the one-letter amino acid code in Table 58D. The Psort 
profile for NOV58b predicts that this sequence has a signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.6000. The Signal P predicts a likely 
cleavage site for a NOV58b peptide is between positions 40 and 41, i.e., at the dash in the 
sequence VAA-EH. 



Table 58D. NOV58b Polypeptide Sequence (SEQ ID 
NO:212) 

MSWMFLRDLLSGVNKYSTGIGWIWLAWFV^ 
KNVCFDDFFPISQVRLWALQLIMVSTPSLLVVLHVAYHE^ 

WYAYLISLIVKTGFEIGFLVLFyKLYDGFSVPYLIKCDLKPCPNTVDCFISKPTEKTIFI 
LFLVITSCLCIVLNFIELSFLVLKCFIKCCLQKYLKKPQVIiSV 



1 0 A BLAST analysis of NOV58 was run against the proprietary PatP GENESEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV58 had high 
homology to other proteins as shown in Table 58E. 



Table 58E. BLASTX results from PatP database for NOV58 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAY32079 Human gap junction protein beta-4 


689 


1.2e-67 


patp:AAY36i45 Human secreted protein #17 - Homo sapiens 


689 


1.2e-67 


patp:AAY36192 Human secreted protein #64 - Homo sapiens 


689 


1.2e-67 


patp:AAY70457 Human membrane channel protein-7 (MECHP-7) 


666 


3.3e-65 


patp:AAG74 00l Human colon cancer antigen protein 


657 


3 .Oe-64 



15 In a search of public sequence databases, it was found, for example, that the nucleic 

acid sequence of this invention has 240 of 336 bases (71%) identical to a gb:GENBANK- 
ID:RATCXN31 lA|acc:M76533.1 mRNA from Rattus norvegicus (Rattus norvegicus 
connexin (CXN-31 1) gene). The full amino acid sequence of the protein of the invention was ' 
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found to have 125 of 220 amino acid residues (56%) identical to, and 170 of 220 amino acid 
residues (77%) similar to, the 271 amino acid residue ptnr:SWISSNEW-ACC:Q02739 protein 
from Mus musculus (Mouse) (GAP JUNCTION BETA-5 PROTEIN (CONNEXIN 31.1) 
(CX3 1.1)). NOV58 also has homology to the other proteins shown in the BLASTP data in 
5 Table 58F. 



Table 58F. NOV58 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi| 15990851 |e 
mb|CAC93 845.1 
| (AJ414563) 


connexin25 
[Homo sapiens] 


223 


222/223 
(99) 


222/223 
(99) 


e-116 


gi 

ef 
M 


10835079 |r 
NP_005259. 
[NM 005268) 


gap junction protein, 
beta 5 (connexin 31.1) 
[Homo sapiens] 


273 


123/216 
(56) 


161/216 
(73) 


le-67 


gi 

f|i 

1 


6753996) re. 
JP 034421.1 
(NM 010291) 


gap junction membrane 
channel protein beta 5; 
connexin 31.1 
[Mus musculus] 


271 


123/214 
(57) 


166/214 
(77) 


le-67 


gi| 15029850 |g 
b|AAH11148.l| 
AAH11148 
(BC011148) 


Similar to gap junction 
membrane channel protein 
beta 5 

[Mus musculus] 


271 


123/214 
(57) 


166/214 
(77) 


7e-67 


gi (4009522 |gb 
|AAC95472.l| 
(AF099731) 


connexin 31.1 
[Homo sapiens] 


273 


122/216 
(56) 


160/216 
(73) 


7e-67 



This BLASTP data is displayed graphically in the Clustal W in Table 58G. A multiple 
sequence alignment is given, with the NOV58a and b protein being shown on line 1 and 2 in a 
10 Clustal W analysis comparing the protein of the invention with the related protein sequences 
shown in Table 58F. 



Table 58G. ClustalW Alignment of NOV58 



NOV58a 
NOV58b 



9i 

gi 
gi 
gi 
gi 



15990851] 
10835079 | 
6753996 | 
15029850| 
4009522 | 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO:210) 
N0:212) 
NO:592) 
NO:593) 
NO:594) 
NO: 595) 
NO:596) 
10 



NOV58a 


1 






NOV58b 


1 








gi 


15990851| 


1 






VIFLRD 


gi 


10835079) 


1 






SIFEG 


gi 


6753996 | 


1 






SVFEG 


gi 


15029850 | 


1 






SVFEG 


gi 


4009522 | 


1 






SIFEG 
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NOV58a 


61 




NOV58b 


61 




gi 


15990851 | 


61 


K 


gi 


10835079 j 


61 


S 


gi 


6753996| 


61 




gi 


15029850) 


61 




gi 


4009522] 


61 


s 




NOV58a 


114 


NOV58b 


114 


gi 


159908511 


114 


gi 


10835079) 


121 


gi 


6753996) 


119 


gi 


15029850 | 


119 


gi 


4009522 | 


121 



NOV58a 


174 


NOV58b 


174 


gi 


15990851) 


174 


gi 


10835079 | 


181 


gi 


6753996) 


179 


gi 


15029850 | 


179 


gi 


4009522 | 


181 




170 



ISO 



p ■ ■■ 

jPCP 


sttvd 


CFISKP 


173 


jPCP 


NTVD 


CFISKP 


173 


JPCP 


NTVD 


CFISKP 


173 


PCP 


nJJvd 


CFISKP 


180 


PCP 


NTVD 


cfi|kp 


178 


PCP 


NTVD 


CFI§KP 


178 


PCP 




CFISKP 


180 



23 0 



240 



!LQj 
MS* 
I*QKY 

^KAQAMCTGHHPHGTTSSi 
PT AHAKND PNWANS PS 
PT AHAKNDPNWANS PS 
.QAMCTGHHPHGTTSSi 




NOV58a 


218 




t3 

LS 


NOV58b 


218 


PQ# 


LS 


gi j 15990851 | 


218 




LS 


gij 10835079) 


241 




LS 


gij 6753996 | 


239 


KDF 


LS 


gij 15029850) 


239 


KDF 


LS 


gij 4009522) 


241 


dd£ 


LS 



250 260 270 

....|....|....|....|....|... 

V - 223 

V 223 

V 223 

GDLI FLGSDSHPPLL PDRPRDHVKKT I L 273 
CDLIFLGSDAHPPLLPDRPRAHVKKTIL 271 
CDL I FLGSDAHPPLLPDRPRAHVKKT IL 271 
GDLIFLGSDSHPPLLPDRPRDHVKKTIL 273 



10 



15 



The presence of identifiable domains in NOV58 was determined by searches using 
software algorithms such as PROSITE, DOMAIN, Blocks, Pfam, ProDomain, and Prints, and 
then determining the Interpro number by crossing the domain match (or numbers) using the 
Interpro website (http:www.ebi.ac.uk/ interpro). DOMAIN results for NOV58 as disclosed in 
Table 58H, were collected from the Conserved Domain Database (CDD) with Reverse 
Position Specific BLAST analyses. This BLAST analysis software samples domains found in 
the Smart and Pfam collections. For Table 58H fully conserved single residues are indicated 
by the sign (|) and "strong" semi-conserved residues are indicated by the sign (+). The 
"strong" group of conserved amino acid residues may be any one of the following groups of 
amino acids: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW. 

Table. 58H lists the domain description from DOMAIN analysis results against 
NOV58. This indicates that the NOV58 sequence has properties similar to those of other 
proteins known to contain this domain. 



497 



WO 02/068649 



PCT/US02/02785 



Table 58H. Domain Analysis of NOV5S 




gnl | Pfam|pf am00029, connexin, Connexin. SEQ ID NO: 


859 






CD-Length = 218 residues, 100.0% aligned 






Score — 


265 bits (678), Expect — 2e-72 




1NUVDO . 


l 


MSWMFLRDLLSGVNKYSTGTGWIWIAVVFVFRLLVYMVAAEHTOKDEQ 


60 






M W FL LL GVNK+ST G IWL+V+F+FR+LV VAAE VW DEQ +F CN++QPGC 




Sb j Ct : 


l 


MDWSFLGRLLEGVNKHSTAIGKIWLSVLFIFRILVLGVAAESVWGDEQSDFVCNTQQPGC 


60 


NOV58: 


61 




113 






+NVC+D FFPIS VRLW LQLI VSTPSLL + HVAY RE++ R+K LY 




Sbjct: 


61 


ENVCYDQFFPI SHVRLWVLQLI FVSTPSLLYLGHVAYRVRREEKLREKEEEHSKGLYSEE 


120 


NOV58: 


114 


G TMTCGLVnfAYLISLIVKTGFEIGFLVLFYKLYDGFSVPYLIKCDLKPC 


162 






+ GGLW+ Y+ S+I K+ FE+GFL Y LY GF++ L+ C PC 




Sbjct: 


121 


AKKRCGS EDGKVRI RGGL WWTYVFS 1 1 FKS I FEVG FLYGQ YLL Y - GFTMSPLWCSRAPC 


179 


NOV58: 


163 


POTVDCFISKPTEKTIFILFLVITSCLCIVLNFIELSFI* 201 








P+TVDCF+S+PTEKTIFI+F+++ S +C++LN EL +L 




Sbjct: 


. 180 


PHTVDCFVSRPTEKTI FI VFMLWSAI CLLLNLAELFYL 218 





Connexins are a family of integral membrane proteins that oligomerise to form 
intercellular channels that are clustered at gap junctions. These channels are specialized sites 
of cell-cell contact that allow the passage of ions, intracellular metabolites and messenger 
5 molecules (with molecular weight <l-2 kD) from the cytoplasm of one cell to its apposing 
neighbours. They are found in almost all vertebrate cell types, and somewhat similar proteins 
have been cloned from plant species. Invertebrates utilise a different family of molecules, 
innexins, that share a similar predicted secondary structure to the vertebrate connexins, but 
have no sequence identity to them. 

10 Vertebrate gap junction channels are thought to participate in diverse biological 

functions. For instance, in the heart they permit the rapid cell-cell transfer of action potentials, 
ensuring coordinated contraction of the cardiomyocytes. They are also responsible for 
neurotransmission at specialised 'electrical 1 synapses. In non-excitable tissues, such as the 
liver, they may allow metabolic cooperation between cells. In the brain, glial cells are 

1 5 extensively-coupled by gap junctions; this allows waves of intracellular Ca2+ to propagate 
through nervous tissue, and contribute to their ability to spatially-buffer local changes in 
extracellular K+ concentration. 

The connexin protein family is encoded by at least 13 genes in rodents, with many 
homologues cloned from other species. They show overlapping tissue expression patterns, 

20 most tissues expressing more than one connexin type. Their conductances, permeability to 

different molecules, phosphorylation and voltage-dependence of their gating, have been found 
to vary. Possible communication diversity is increased further by the fact that gap junctions 
may be formed by the association of different connexin isoforms from apposing cells. 
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However, in vitro studies have shown that not all possible combinations of connexins produce 
active channels. 

Hydropathy analysis predicts that all cloned connexins share a common 
transmembrane (TM) topology. Each connexin is thought to contain 4 TM domains, with two 

5 extracellular and three cytoplasmic regions. This model has been validated for several of the 
family members by in vitro biochemical analysis. Both N- and C-termini are thought to face 
the cytoplasm, and the third TM domain has an amphipathic character, suggesting that it 
contributes to the lining of the formed-channel. Amino acid sequence identity between the 
isoforms is -50-80%, with the TM domains being well conserved. Both extracellular loops 

10 contain characteristically conserved cysteine residues, which likely form intramolecular 

disulphide bonds. By contrast, the single putative intracellular loop (between TM domains 2 
and 3) and the cytoplasmic C-terminus are highly variable among the family members. Six 
connexins are thought to associate to form a hemi-channel, or connexon. Two connexons then 
interact (likely via the extracellular loops of their connexins) to form the complete gap 

15 junction channel. Two sets of nomenclature have been used to identify the connexins. The 
first, and most commonly used, classifies the connexin molecules according to molecular 
weight, such as connexin43 (abbreviated to Cx43), indicating a connexin of molecular weight 
close to 43 kD. However, studies have revealed cases where clear functional homologues 
exist across species that have quite different molecular masses; therefore, an alternative 

20 nomenclature was proposed based on evolutionary considerations, which divides the family 
into two major subclasses, alpha and beta, each with a number of members. 

Due to their ubiquity and overlapping tissue distributions, it has proved difficult to 
elucidate the functions of individual connexin isoforms. To circumvent this problem, 
particular connexin-encoding genes have been subjected to targeted-disruption in mice, and 

25 the phenotype of the resulting animals investigated. Around half the connexin isoforms have 
been investigated in this manner. Further insight into the functional roles of connexins has 
come from the discovery that a number of human diseases are caused by mutations in 
connexin genes. For instance, mutations in Cx32 give rise to a form of inherited peripheral 
neuropathy called X-linked dominant Charcot-Marie-Tooth disease. Similarly, mutations in 

30 Cx26 are responsible for both autosomal recessive and dominant forms of nonsyndromic 
deafness, a disorder characterised by hearing loss, with no apparent effects on other organ 
systems. 

The disclosed NOV58 is a connexin-like protein localized to gap junctions. Gap 
junctions were first characterized by electron microscopy as regionally specialized structures 
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on plasma membranes of contacting adherent cells. These structures were shown to consist of 
cell-to-cell channels. Proteins, called connexins, purified from fractions of enriched gap 
junctions from different tissues differ. The connexins are designated by their molecular mass. 
Another system of nomenclature divides gap junction proteins into 2 categories, alpha and 
5 beta, according to sequence similarities at the nucleotide and amino acid levels. For example, 
CX43 is designated alpha-1 gap junction protein, whereas CX32 and CX26 are called beta-1 
and beta-2 gap junction proteins, respectively. This nomenclature emphasizes that CX32 and 
CX26 are more homologous to each other than either of them is to CX43. 

Willecke et al. (1990) used rat connexin gene probes in Southern blot analysis of 
1 0 human-mouse somatic cell hybrids to map the CX26 gene to chromosome 1 3 . By means of 
somatic cell hybrids, Hsieh et al. (1991) assigned the GJB2 gene to chromosome 13 in man 
and chromosome 14 in the mouse. Haefliger et al. (1992) showed that the rat homologs of the 
CX26 and CX46 genes are tightly linked on chromosome 14. By isotopic in situ hybridization, 
Mignon et al. (1996) mapped GJB2 to 13ql l-ql2 and confirmed the assignment to mouse 

15 chromosome 14. Kelsell et al. (1997) studied a pedigree containing individuals with autosomal 
dominant deafness and identified a mutation in the CX26 gene: a 101T-C transition resulting 
in a met34-to-thr amino acid substitution. CX26 mutations resulting in premature stop codons 
were also found in 3 autosomal recessive nonsyndromic sensorineural deafness pedigrees, 
genetically linked to 13ql l-ql2, where the CX26 gene is localized. 

20 Immunohistochemical staining of human cochlear cells for CX26 demonstrated high 

levels of expression. Kelley et al. (1998) presented evidence that the 101T-C missense 
mutation identified by Kelsell et al. (1997) in individuals with autosomal dominant 
nonsyndromic deafness is not sufficient to cause hearing loss. Carrasquillo et al. (1997) 
performed linkage analysis in 2 interrelated inbred kindreds in a single Israeli-Arab village 

25 containing more than 50 individuals with nonsyndromic recessive deafness. Genetic mapping 
demonstrated that a gene located at 13ql 1 (DFNB1) segregated with the deafiiess in these 2 
kindreds. Haplotype analysis, using 8 microsatellite markers spanning 15 cM in 13ql 1, 
suggested the segregation of 2 different mutations in this extended kindred; affected 
individuals were homozygotes for either haplotype or compound heterozygotes. Carrasquillo 

30 et al. (1 997) identified 2 distinct mutations, trp77 to arg and 35deIG, in the CX26 gene, both of 
which were predicted to inactivate connexin 26. 

The recombination of marker alleles involving polymorphisms in 13ql 1, at known 
map distances from the mutations, allowed them to estimate the age of the mutations to be 3 to 
5 generations (75 to 125 years). The study demonstrated that in small populations with high 
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rates of consanguinity, as compared with large outbred populations, recessive mutations may 
have very recent origin and show allelic diversity, they pointed to the same phenomenon 
being observed for Hurler syndrome with 3 unique mutations and for metachromatic 
leukodystrophy with 5 distinct mutations, discovered among the Druze and Muslim Arab 
5 villages in Israel. In light of these findings, the authors commented that it is likely that 

homozygosity mapping studies in highly inbred communities may be compromised, as may be 
studies of mapping by linkage disequilibrium, unless the possibility of mutational diversity is 
taken into account. 

Lench et al. (1998) studied the role of CX26 mutations in singleton (sporadic) cases of 

10 nonsyndromal sensorineural deafness. Such mutations were identified in 4 of 43 U.K. and 2 of 
25 Belgian patients. Thus, about 10% of families presenting with a child sporadically affected 
with this disorder can be offered definitive mendelian recurrence risks. This was said to be the 
first genetic test available for screening such children. Kelley et al. (1 998) analyzed 58 
multiplex families each having at least 2 affected children diagnosed with autosomal recessive 

1 5 nonsyndromic deafness. Mutations in both alleles of GJB2 were observed in 20 of the 58 
families. A 30delG allele occurred in 33 of the 1 16 chromosomes, for a frequency of 0.284. 
This mutation was observed in 2 of 192 cdntrol chromosomes, for an estimated gene 
frequency of 0.01 +/- 0.007. The homozygous frequency of the 30delG allele was then 
estimated at 0.0001, or 1 in 10,000. Given that the frequency of all childhood hearing 

20 impairment is 1 in 1,000 and that half of that is genetic, the specific mutation 30delG is 
responsible for 10% of all childhood hearing loss and for 20% of all childhood hereditary 
hearing loss. Six novel mutations were also observed in the affected population. 

Murgia et al. (1999) studied 53 unrelated individuals with nonsyndromic sensorineural 
hearing impairment and carried out CX26 mutation analysis. Mutations were found in 53% of 

25 cases, in 35.3% of those in whom autosomal recessive inheritance was thought likely and in 
60% of the presumed sporadic cases. Three novel mutations were found. The hearing deficit 
varied from mild to profound even within the same family. Among patients with profound 
hearing loss, 35.5% were found to have a mutation; among those severely impaired, 20%; and 
among those moderately impaired, 33.3%. 

30 Rabionet et al. (2000) analyzed the GJB2 gene in 576 families/unrelated patients with 

recessive or sporadic deafness from Italy and Spain, 193 of them being referred as autosomal 
recessive and the other 383 as apparently sporadic. Of the 1,152 unrelated GJB2 
chromosomes, 37% had GJB2 mutations. A total of 23 different mutations were detected. 
Mutation 35delG was the most common, accounting for 82% of all GJB2 deafness alleles. It 
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represented 88% of the alleles in Italian patients and only 55% in Spanish cases. Sobe et al. 
(2000) sequenced the entire coding region of the GJB2 gene in 75 hearing-impaired children 
and adults in Israel. Was both prelingual and postlingual, with hearing loss ranging from 
moderate to profound. Almost 39% of all persons tested harbored GJB2 mutations, most of 
5 which were 35delG and 167delT. A novel mutation, involving both a deletion and an insertion, 
5 ldell2insA, was identified in a family originating from Uzbekistan. All GJB2 mutations 
were associated with prelingual hearing loss, although severity ranged from moderate to 
profound, with variability even among hearing-impaired sibs. No significant difference in 
hearing levels was found between individuals with 35delG and 167delT mutations. 

10 Antoniadi et al. (2000) screened 26 unrelated Greek patients with prelingual 

sensorineural deafness in whom syndromic forms and environmental causes of deafness had 
been excluded. They detected the 35delG mutation in 28 chromosomes (53.8%); another 3 
sequence variations accounted for 7.6% of the alleles. Wilcox et al. (2000) performed 
mutation analysis of the GJB2 gene and audiology on 106 families presenting with at least 1 

15 child with congenital hearing loss. In 74 families (80 children), the etiology was consistent 
with nonsyndromic recessive hearing loss. Six different GJB2 mutations, including 1 novel 
mutation, were identified. They found that GJB2 mutations caused a range of phenotypes from 
mild to profound hearing impairment and that loss of hearing in the high-frequency range 
(4,000 to 8,000 Hz) is a characteristic feature in children with molecularly diagnosed CX26 

20 hearing impairment. They also demonstrated that high frequency hearing loss was found in a 
group of similar size of deaf children in whom a mutation could be found in only one of the 
GJB2 alleles. In their study, the M34T mutation was associated with hearing loss only when 
present in compound heterozygous state, suggesting autosomal recessive inheritance. 
Heathcote et al. (2000) reported a missense mutation in affected members of a family with 

25 autosomal dominant deafness and palmoplantar keratoderma. Rabionet et al. (2000) reviewed 
the molecular genetics of hearing impairment due to mutations in gap junction genes encoding 
beta-connexins. Among these genes, mutations in GJB2 account for about 50% of all 
congenital cases of hearing impairment. Three mutations in GJB2 are particularly common in 
specific populations: 35delG in Caucasoids, 167delT in Ashkenazi Jews, and 235delC in 

30 Orientals. Carrier frequencies in these populations vary between 1 and 30 and 1 in 75. Over 50 
mutations have been identified in the GJB2 gene, of which some missense changes have a 
dominant-negative action in hearing impairment, with partial to full penetrance. Functional 
studies for some missense mutations in connexins 26, 30, and 32 indicate abnormal gap 
junction conductivity. Expression patterns in mouse and rat cochlea indicate that connexin 26 
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and connexin 30 are expressed in the supporting cells of the cochlea, suggesting a potential 
role in endolymph potassium recycling. 

In the Japanese population, Kudo et aL (2000) sequenced the GJB2 gene in 39 patients 
with prelingual deafness, 39 patients with postlingual progressive sensorineural hearing loss, 

5 and 63 individuals with normal hearing. GJB2 mutations were found in 5 of the 39 patients 
(12%) with prelingual deafness. The most common mutation was 235delC observed in 7 of 10 
mutant alleles. There were no cases with the 30delG allele. No GJB2 mutation was found in 
patients in the postlingual hearing loss group. Nance et al. (2000) noted that recessive 
mutations at the connexin-26 gene locus account for nearly half of all cases of genetic 

1 0 deafness in many populations. They suggested that this high frequency is only seen in 
populations with a long tradition of intermarriage among deaf people. Available data are 
consistent with the hypothesis that such marriages might well have contributed to the high 
frequency of connexin-26 deafness in the U.S., and could represent a novel mechanism for 
maintaining specific genotypes at unexpectedly high frequencies. 

15 The NOV58 disclosed in this invention is predicted to be expressed in at least the 

following tissues: brain, lung, ovary, and colon. This information was derived by determining 
the tissue sources of the sequences that were included in the invention including but not 
limited to SeqCalling sources, public EST sources, literature sources, and/or RACE sources. 
Further expression data for NOV58 is provided in Example 2. 

20 The nucleic acids and proteins of NOV58 are useful in potential therapeutic 

applications implicated in various gap junction-related pathological disorders described further 
herein. The NOV58 nucleic acid encoding the connexin-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. The novel nucleic acid of the 

25 invention encoding a Connexin-like protein includes the nucleic acid whose sequence is 

provided in Table 58A or 58C, or a fragment thereof. The invention also includes a mutant or 
variant nucleic acid any of whose bases may be changed from the corresponding base shown 
in Table 58A or 58C while still encoding a protein that maintains its connexin-like activities 
and physiological functions, or a fragment of such a nucleic acid. The invention further 

30 includes nucleic acids whose sequences are complementary to the sequence shown in Table 
58A or 58C, including nucleic acid fragments that are complementary to any of the nucleic 
acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of non-limiting example, modified bases, and nucleic acids 
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whose sugar phosphate backbones are modified or derivatized. These modifications are carried 
out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 29% of 
5 the bases may be so changed. The novel protein of the invention includes the connexin-like 
protein whose sequence is provided in Table 58B or 58D. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 58B or 58D while still encoding a protein that maintains its connexin- 
like activities and physiological functions, or a functional fragment thereof. In the mutant or 

10 variant protein, up to about 44% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

15 below. 

NOV59 

A eukaryotic translation initiation factor 5 (EIF5), disclosed herein as NOV59, 
interacts with the 40S initiation complex to promote hydrolysis of bound GTP with 

20 concomitant joining of the 60S ribosomal subunit to the 40S initiation complex. The resulting 
functional 80S ribosomal initiation complex is then active in peptidyl transfer and chain 
elongations. The disclosed NOV59 (alternatively referred to herein as CG56633-01) includes 
the 1328 nucleotide sequence (SEQ ID NO:213) shown in Table 59A. A NOV59 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 34-36 and ends with a TGA 

25 codon at nucleotides 1273-1275. The disclosed NOV59 maps to human chromosome 3. 



Table 59A. NOV59 Nucleotide Sequence (SEQ ID 
NO:213) 

CTTTC CTC ATCAC CTTAAATT CGGGTGT CTTT TATGAGTAAT CAAAAG CAGCAAAAGCCA 
A03CTATCAGGCCCAGTATTTAAAACCAGAAAAAGAGATGAAAAAGAGAGGTTTGACCCT 
ACTCAGTTTCAGGACTACGTTATTCAAGGCTTAACTCAAACTGGTACTGATTTGGAAGC^ 
GTAGCTAAGTTCCTTGATGCTTCTGGAACAAAACTTGATTACCGTCGATGTGCAGAAACA 
CTCTTTGACATTCTGGTGGGTGGTGGAATGCTGGCCCCAGGTGGTACACTGGCAGATGAC 
ATCATGCGTACAGATGTCTGCGTGTTTGCAGCCCAAGAAGACCTAGAGACCATGCAAGCA 
TTTGCTCAGGTCTTTAACAAGTTAATC^ 

GATGAAGTAAAAAGGCTGCTGGTGTTCGGAAAGGGTTTTTCAGACTCGGAGAGGAAAAAA 
CTGGCTATGTTGACTGGTGTTCTTCTGGCTAATGCATCCATTCTTAATAGCCTTTATAAT 
GAGAATTTGGTTAAAGAAGGGGTTTC^ACAGCTTTTGCTGGAAAGCTATTTAAATCAT^ 
ATA7UVTGAAAAAGATATCAATGCAGTAACTGCAAGGAAAGTCAGCATGGATAACAGCCTG 
ATGGAACTTTTT C CTG CCAAT AAG CAAAG CGTTCAACACTTCACGAAGTATTTTACTGAG 
GCAGG CCTGAAAG AGC TTTCAGAAT AT GTT CGGAATCAG CAAACCAT CAG AG CTTGT AAG 
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GAGCTGCAGAAAGAACTTCAAGAACAGATGTCCCGTGGGGATCCATTTAAGGTTATAATT 
TTATATGTCAAGGAGGAGATGAAAAAAAACAACATCCCAGAACCAGTTGTCATCGAAATA 
GTCTGGTCAAATGTAATGAGCGCTGTGGAATGGAACAAAAGAGAGGAGATTGTAGCAGAG 
CAAG C CAT CAAACACTTGAAG CAACAC AG C CCT CT AC TTG CTGC CTTT ACT ACT CAAAGT 
CAGTCTGAGCTGACCCTGTTACTGAAGATTCAGGAGTATTGCTATGACAACATTCATTTC 
ATGAAAG CCTTACGGAAAATAGTGGTGCTTTTTTATAAAG CTGTAGTCCTGAGCAAAGAG 
ACCATTTTGAAGTGGTATAAAGGTACACATGTTGCAAAGGGGAAGAGTGTTTTCCTTGAG 
CAAATGAAAAAGTTTGGAGAGTGGCTCAAAAATGCTGAAGAAGAATCTGAATCTGAAGCT 
GAAGAAGGTGACTGAATTTTGAAACTACACCCTCAGTAAAGCAAACAGGAGTTGTAGATA 
AAATGTCC 



The NOV59 polypeptide (SEQ ID NO:214) encoded by SEQ ID NO:213 is 413 amino 
acids in length and is presented using the one-letter amino acid code in Table 59B. The Psort 
profile for NOV59 predicts that this sequence has no signal peptide and is likely to be 
5 localized to the nucleus with a certainty of 0.7600. In alternative embodiments, a NOV59 
polypeptide is located to lysosomes with a certainty of 0. 1 0000. 



Table 59B. NOV59 Polypeptide Sequence (SEQ ID 
NO:214) 



MSNQKQQKPTLSGPVFKTRKRDEKERFDPTQFQDYVIQGLTETGTDLEAVAKFLDASGTK 
LDYRRCAETLFDI LVGGGMLAPGGTLA^ I RH 

YKYLEKCCEDEVKRLLVFGKGFSDSERKKLAML^ 

FAGKLFKSCIKEKDINAVTARKVSMDNSLMELFPANKQSVQHFTKYFTEAGLKELSEYVR 
NQQTIRACKELQKELQEQMSRGDPFKVI ILYVKEEMKKNNI PEPWI EIVWSNVMSAVEW 
NKREE I VAEQAI KHLKQHS PLLAAFTTQS QSELTLLLK I QE YCYDNI HFMKALRKI WLF 
YKAVVLSKETILKWYKGTHVAKGKSVFLEOMKKFGEWLKNAEEESESEAEEGD 



A BLAST analysis of NOV59 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV59 had high 
homology to other proteins as shown in Table 59C. 



Table 59C. BLASTX results from PatP database for NOV59 



Smallest 

Sum 





High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P 


(N) 




patp:AAB43883 Human cancer associated protein sequence 


1834 


5 


6e- 


189 


patp:AAW93950 Human regulatory molecule HRM-6 protein 


1403 


2 


6e- 


143 


patp:AAB92726 Human protein sequence 


1403 


2 


6e- 


143 


patp:AAM38764 Human polypeptide 


1403 


2 


.6e- 


•143 


patp.-AAM40550 Human polypeptide 


1403 


2 


,6e- 


143 



In a search of sequence databases, it was found, for example, that the nucleic acid 
1 5 sequence of this invention has 778 of 824 bases (94%) identical to a gb:GENBANK- 

E):HUMRSC419|acc:D13630.1 mRNA from Homo sapiens (Human mRNA for KIAA0005 
gene. The full amino acid sequence of the protein of the invention was found to have 372 of 
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419 amino acid residues (88%) identical to, and 385 of 419 amino acid residues (91%) similar 
to, the 419 amino acid residue ptnr:SPTREMBL-ACC:Q15394 protein from Homo sapiens 
(Human) (KIAA0005 PROTEIN). NOV59 also has homology to the other proteins shown in 
the BLASTP data in Table 59D. 



Table 59D. NOV59 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|7661850|re 
f |NP_055485.1 
| (NM_014670) 


basic leucine- zipper 
protein BZAP45 ; KIAA0005 
gene product {Homo 
sapiens) 


419 


372/419 
(88) 


385/419 
(91) 


0.0 


gi| 7661744) re 
f |NPJ)54757.1 
| (NM_014038) 


HSPC028 protein 
[Homo sapiens] 


419 


264/406 
(65) 


355/406 
(82) 


e-143 


gi|l5341786|g 
b|AAH13060.l| 
AAH13060 
(BC013060) 


HSPC028 protein 
[Mus musculus] 


419 


264/406 
(65) 


334/406 
(82) 


e-143 


gi|4426565|gb 
(AAD20436.1) 
(AF031483) 


unknown 

[Rattus norvegicus] 


419 


264/406 
(65) 


334/406 
(82) 


e-143 


gi| 11640562 |g 
b|AAG39278.l| \ 
AF110323 1 
(AF110323) 


MSTP017 
[Homo sapiens] 


419 


263/406 
(64) 


334/406 
(Bl) 


e-143 



This BLASTP data is displayed graphically in the ClustalW in Table 59E. A multiple 
sequence alignment is given, with the NOV59 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
10 Table 59D. 



Table 59E. ClustalW Alignment of NOV59 




NOV59 



7661850) 
7661744 j 
15341786] 
4426565) 
116405621 



NOV59 

gi | 7661850 

gi | 7661744 



(SEQ ID NO: 214) 
(SEQ ID NO: 597) 
(SEQ ID NO: 598) 
(SEQ ID NO: 599) 
(SEQ ID NO: 600) 
(SEQ ID NO: 601) 



NOV59 


1 


gi 


7661850) 


1 


gi 


7661744 | 


1 


gi 


15341786) 


1 


gi 


4426565) 


1 


gi 


11640562) 


1 



61 
61 

.59 
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59 
59 



11640562 | 59 



LDYRRYADTLFDILVAGSMLAPGGTRIDDGDKTKMTNHCVFSANEDHETIRNYAQVFMKL 
LDYRRYADTLFDILVAGSMLAPGGTRIDDGDKTKMTNHCVFSANEDHETIRNYAQVFNKL 
LDYRRYADTLFDlIiVAGSMLAPGGTRIDDGDKTKMTNHCVFSANEDHET IRNYAQVFNKL 



118 
118 
118 



130 



I 



140 
..I 



150 

. . I . . 



I 



160 



170 



180 



NOV59 


118 


gi 


7661850) 


118 


gi 


7661744| 


119 


gi 


15341786| 


119 


gi 


4426565) 


119 


gi 


11640562) 


119 



IR||jTC^^ 

IRRYKYLEKjpraDEQKKLLLFLKg^ 

irrykylekafedemkklllflkafseBeqtklamlsgillgngtlpatiltslftdslv 
irrykylekafedemkklllflkafse|eqtklamlsgillgngtlpatiltslftdslv 
irrykylekafedemkklllflkafse|eqtklamlsgillgngtlpatiltslftdslv 
irrykylekafedemkklllflkafseIeqtklamlsgillgmgtlpatiltslftdslv 



173 
177 
178 
178 
178 
178 



190 



200 



210 



220 



230 



240 



NOV59 . 


174 


gi 


7661850) 


178 


gi 


7661744) 


179 


gi 


15341786 | 


179 


gi 


4426565) 


179 


gi 


11640562) 


179 



NOV59 


232 


gi 


7661850) 


238 


gi 


7661744) 


239 


gi 


15341786) 


239 


gi 


4426565) 


239 


gi 


11640562 | 


239 




310 



320 



330 



340 



350 



I 



I 



J 



NOV59 


292 


gi 


7661850 | 


298 


gi 


7661744 | 


299 


gi 


15341786) 


299 


gi 


4426565) 


299 


gi 


11640562| 


299 



Mj^ewnkkeelva: 
tcAmnavewnkk 



CQ^KHLKC^PLLAgF|^QgQSELgLLg.KMQEYCYDNIHFM 
SQJ^KHLKQ^PLLAgF^QGQSELpLLwKyQEYCYDNIHFM: 

2qalkhlkqyapllavfssqgqsel|llqkvqeycydnihfmk 
s'qalkhlkqyapllavfssqgqselgjllqkvqeycydnihfmk 
sqalkhlkqyapllavfssqgqselfflllqkvqeycydnihfmk 
5Qalkhlkqyapllavfssqgqsel|llqkvqeycydnihfmk 



360 

L 



tcSmnavewnkpceelva* 
tcymnavewnkkeelva 
tc|mnavewnkkeelva 
tcBmnavewnkkeelva: 



351 
357 
358 
358 
358 
358 



370 



I 



380 



1- 



390 
. . I . . 



400 



410 



420 



I 



NOV59 


352 


gi 


7661850) 


358 


gi 


7661744) 


359 


gi 


15341786) 


359 


gi 


4426565| 


359 


gi 


11640562) 


359 



A^KIWLFYKA^LSjgEBlLKWYKjg^ 
AFQ KI WLF YKAgVLSEEgi LKW YKgjAHVAKGKS VFLgQMKKF VE WLjjjNAEEE SES EgS t 
AFQKIWLFYKADVLSE3AILKWYKEAHVAKGKSVFLDQMKKFVEWLQNAEEESESEGEE 
AFQKIVVLFYKADVLSEEAILCTJYKEAHgAKGKSVFLDQMKKFVEWLQNAEEESESEGEE 
AFQKIWLFYKADVLSEEAlLKWYKEAHgAKGKSVFLDQMKKFVEWLQNAEEESESEGEE 
AF0KIVVLFYKADVLSEEAILKWYKEAHVAKGKSVFLDQMKKFVEWLQNAEEESE5EGEE 



411 
417 
418 
418 
418 
418 



NOV59 


412 


GD 


413 


gi 


7661850) 


418 


GD 


419 


gi 


7661744) 


419 




419 


gi 


15341786 | 


419 




419 


gi 


4426565) 


419 




419 


gi 


11640562| 


419 




419 



Table 59F lists the domain description from DOMAIN analysis results against NOV59. 
This indicates that the NOV59 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 59F. Domain Analysis of NOV59 

gnl 1 Load 1 LOAD W2 , W2, conserved protein-protein interaction domain in translation 
factors like eIF2B SEQ ID NO : 860 

CD-Length = 116 residues, 96.6% aligned 
Score = 83.6 bits (205), Expect = 2e-17 

N0V59: 290 VWSNVMS AVEWNKREE IVAEQAI KHLKQHS PLLAAF TTQSQSEL TLLLKI QE YCYDNIHF 349 

V ++S + E A+K K+ PLLA + S+L LL ++E+C + 

Sbjct: 1 VALVI L SVAS I ELADNE PKEAAVKVFKKWGPLLAKYLKDEDSQLELL YALEE FCEELEEL 60 

NOV59: 350 MKALRIQ WLFYKAVVLSKETILKJTY- KGTHVAK^ 400 

+KLKI+ Y VL +E ILKWY K + +GK I*+ K F WL+ 
Sbjct; 61 LKLLAKILKYLYDEDVLEEEAI LKWYEKKSKAEEGKKKVLKSAKPFVTWLQE 112 

The NOV59 disclosed in this invention is predicted to be expressed in at least the 
following tissues: brain. This information was derived by determining the tissue sources of 
the sequences that were included in the invention including but not limited to SeqCalling 
sources, public EST sources, literature sources* and/or RACE sources. Further expression data 
for NOV59 is provided in Example 2. ; . • ' 

The nucleic acids and proteins of NOV59 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, myasthenia gravis, neuroprotection, endocrine 
dysfunctions, diabetes, obesity, growth and reproductive disorders and other diseases, 
disorders and conditions of the like. The NOV59 nucleic acid encoding the translation 
initiation factor 5-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. The novel nucleic acid of the invention encoding a translation initiation factor 
5-like protein includes the nucleic acid whose sequence is provided in Table 59A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 59A while still encoding a 
protein that maintains its translation initiation factor 5-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
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include chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

5 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 6% of the residues may be so changed. 

The novel protein of the invention includes the translation initiation factor 5-like 
protein whose sequence is provided in Table 59B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

1 0 in Table 59b while still encoding a protein that maintains its translation initiation factor 5-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 12% of the bases may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

1 5 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV60 

NOV60 includes two Lynxl -like proteins, designated herein asNOV60a and NOV60b, 
20 which differ by three amino acids and the relative length of their untranslated regions (UTR's), 

NOV60a 

The disclosed NOV60a (alternatively referred to herein as CG56894-01) includes the 
715 nucleotide sequence (SEQ ID NO:215) shown in Table 60A. A NOV60a ORF begins 
25 with a Kozak consensus ATG initiation codon at nucleotides 348-350 and ends with a stop 
codon at nucleotides 696-698. 



Table 60A. NOV60a Nucleotide Sequence (SEQ ED NO:215) 

AGCTTTGTTCrTGAGTGGGTCTGCCTCGGGGGCTTTAGAGGAGACCCCAGAGGGTGGCGATGCGGCACGG 

GTGCTGCGGGACACACAGACACGCCTACGATTAGACTCAGGCAGGCACCTACCGGCGAGCGGCCGCGGGT 

GACT C CCAGG CG CGG CGGTAC CTCACGGTGGTGAAGGTCACAGGGTTGCAG C ACT CCCAGT AGACCAGG A 

GCTCCGGGAGGCAGGGCCGGCCCCACGTCCTCTGCGCACCACCCTGAGTTGGATCCrCTGTGCGCCACCC 

CTGAGTTGGATCCAGGGCTAGCTGCTGTTGACCTCCCC 

ACGCCCCTGCTCACCCTGATCCTGGTGGTCCTCATGGGCTTAC 

TGTGTGCCTACAACGGAGAC^CTGCTTC^CCCCATGCGCrrGCCCGGCTATGGTTGCCTACTGC^TGAC 
CACGCGCACCTACTACACCCCCACCAGGATGAAGGTCAGTAAGTCCTGCGTGCCCCGCTGCTTCGAGACT 
GTGTATGATGGCTACTCC^GCACGCGTCC^CC^CCTC 
GCCriXSCCACCCCGGCCACCCCGGCCCTGGCCCCCATCCTCCTC 

CCCCCGAGGCAGACA \ 
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The NOV60a polypeptide (SEQ ID NO:216) encoded by SEQIDNO:215 is 116 
amino acids in length and is presented using the one-letter amino acid code in Table 60B. The 
Psort profile for NOV60a predicts that this sequence has a signal peptide and is likely to be 
5 exported from the cell with a certainty of 0.8200. In alternative embodiments, a NOV60a 

polypeptide is located to lysosomes with a certainty of 0. 1 000, or to the endoplasmic reticulum 
(membrane) with a certainty of 0. 1000. The Signal P predicts a likely cleavage site for a 
NOV60a peptide is between positions 34 and 35, i.e., at the dash in the sequence AQA-SD. 



Table 60B. NO V60a Polypeptide Sequence (SEQ ID 
NO:216) 

MTPIJjTLILVVMGLPLAQASDC^ 

S K S CVPRCFETVYDG YSKHASTTS CCQYDLCNGTGLATP ATPALAP I LLATLWGLL 

10 

NOV60b 

The disclosed NOV60b (alternatively referred to herein as CG56894-02) includes the 
876 nucleotide sequence (SEQ ID NO: ) shown in Table 60C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 348-350 and ends with a stop codon at 
15 nucleotides 696-698. 



Table 60C. NOV60b Nucleotide Sequence (SEQ ID 
NO:217) 



TGCGGC^CGGGTGCTGQSGGACACACAGACACGCCTACGATTAGACTCAGGCAGGCACCT 
ACCGGCGAGCGGCCGCGGGTGACTCCCAGGCGCGGCGGTACCTCACGGTGGTGAAGGTCA 
CAGGGTTGCAGCACTCCCAGTAGACCAGGAGCTCCGGGAGGCAGGGCCGGCCCCACGTCC 
TCTGCG»CCACCCTGAGTTGGATCCTCTGTGCGCCACCCCTGAGTTGGATCCAGGGCTA 
GCTGCTGTTGACCTCCCCACTCCCACGCTGCCCTCCTGCCTGCAGCCATGACGCCCCTGC 
TCACC CTGATCCTGGTGGTCCTCATGGGCTTACCTCTGGC CCAGGCCT CGGACTGCCACG 
TGTGTGCCTACAACGGAGACAACTGCTTCT^ACCCCATGCGCTGCCCGGCTATGGTTGCCT 
ACTGCATGACCACGCGCACCTACTACACCCCCACCAGGATGAAGGTCAGTAAGTCCTGCG 
TGCCCCGCTGCTTCGAGACTGTGTATGATGGCTACTCCAAGCACGCGTCCACCACCTCCT 
GCTGCCAGTACGACCTCTGCAACGGCACCGGCCTTGCCACCCCGGCCACCCTGGCCCTGG 
CCCCCATCCTCCTGGCCACCCTCTGGGGTCTCCTCTAAAGCCCCCGAGGCAGACCCACTC 
AAGAACAAAGCTCTCGAGACACACTGCTACACCCTCX3CACCCAGCTCACCCTGCCTCACC 
CTCCACACTCCCTGCGACCTCCTCAGCCATGCCCAGGGTCAGGACTGTGGGCAAGAAGAC 
ACCCGACCTCCCCCAACCACCACACGACCTCACTTC 



The NOV60b polypeptide (SEQ ID NO:218) encoded by SEQ ID NO:217 is 1 16 

amino acids in length and is presented using the one-letter amino acid code in Table 60D. The 

20 Psort profile for NOV60b predicts that this sequence is a Type la membrane protein, has a 

signal peptide, and is likely to be localized at the plasma membrane with a certainty of 0.9190. 

In alternative embodiments, a NOV60b polypeptide is located to lysosomes with a certainty of 

0.2000, or to the endoplasmic reticulum (membrane) with a certainty of 0.1000. The Signal P 
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predicts a likely cleavage site for a NOV60b peptide is between positions 20 and 21, i.e., at the 
dash in the sequence AQA-SD. 

Table 60D. NOV60b Polypeptide Sequence (SEQ ID 
NO:218) 

MTPIiLTLILVVLMGLPIAQASDCHVCAYNGDNCFNPMRCPAMV 
S KS CTPRCFETVYDGYS KHAS TTS CCQYDLCNGTG 

5 

A BLAST analysis of NOV60 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV60 had high 
homology to other proteins as shown in Table 60E. 



Table 60E. BLASTX results from PatP database for NOV60 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp:AAY0273 8 Human secreted protein encoded by gene 89 


630 


2.2e-61 


patp:AAM39828 Human polypeptide 


630 


2.2e-6l 


patp:AAM416l4 Human polypeptide 


630 


2.2e-6l 


patp:AAB6113l Human NOV3 protein 


594 


1.4e-57 


patp:AAY79325 Mouse receptor ligand Lynxl 


521 


7.7e-50 



10 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 338 of 424 bases (79%) identical to a gb:GENBANK- 
K):AF141377|acc:AF141377.1 mRNA from Mus musculus (Ly-6/neurotoxin homolog 
(Lynxl) mRNA). The full amino acid sequence of the protein of the invention was found to 
15 have 92 of 1 16 amino acid residues (79%) identical to, and 96 of 1 16 amino acid residues 
(82%) similar to, the 1 16 amino acid residue ptnr:SPTREMBL-ACC:Q9WVC2 protein from 
Mus musculus (Mouse) (LY-6/NEUROTOXIN HOMOLOG). NOV60 also has homology to 
the other proteins shown in the BLASTP data in Table 60F. 



Table 60F. NOV60 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi (7106349 | re 
f|NP 035968.1 
| (NM 011838) 


Ly 6 /neurotoxin 1 
[Mus jnusculus] 


116 


92/116 
(79%) 


96/116 
(82%) 


5e-39 
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gi|l2698684|g 
b|AAK01642.1 | 
AF321824_1 
(AF321624) 


Ly-6 neurotoxin- like 
protein Lynxl 
[Homo sapiens] 


80 


79/80 
(98%) 


79/80 
(98%) 


5e-27 


gi| 1519481 |gb 

|AAB07524.l| 
(U66837) 


E48 antigen 
[Homo sapiens] 


79 


28/72 
(38%) 


34/72 
(46%) 


0.035 


gi|l072024l|s 
p|P57Q96|PSCA 
_M0USE E48 
antigen [Homo 
sapiens] 


E48 antigen 
[Homo sapiens] 


123 


30/104 
(28%) 


40/104 
(37%) 


0.038 


gi|l2845967|d 
bj |BAB26976.1 
| (AK010485) 


PAR/Ly-6 domain 
containing protein 
[Mus musculus] 


154 


36/108 
(33%) 


47/108 
(43%) 


0.066 



This BLASTP data is displayed graphically in the ClustalW in Table 60G. A multiple 
sequence alignment is given, with the NOV60a and b protein being shown on lines 1 and 2 in 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 60F. 



NOV60a 
NOV60b 



gi 
gi 
gi 
gi 



7106349] 

12698684) 

1519481) 

10720241) 

12845967 



NOV60a 
NOV60b 



7106349) 

12698664) 

1519481) 

10720241) 

12845967) 



Table 60G. ClustalW Alignment of NOV60 



(SEQ ID NO: 216) 

(SEQ ID NO: 218) 

<SEQ ID NO: 602) 

(SEQ ID NO: 6 03). 

(SEQ ID NO: 604) 

(SEQ ID NO: 605) 

(SEQ ID NO: 606) 



NOV60a 


58 


NOV60b 


58 


gi 


7106349) 


58 


gi 


12698684) 


58 


gi 


1519481 | 


40 


gi 


10720241) 


60 


gi 


12845967) 


59 




•ATP 102 
|PATL 102 
[PVTL 102 
80 

EKL 79 

-KPP 103 
118 



130 



140 



150 
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NOV60a 
NOVSOb 



gi 
gi 
gi 
gi 
gi 



7106349| 
12698684| 
1519481 | 
10720241) 
12845967 J 



103 ALAP 
103 ALAP 
103 ALVP 
80 




116 

116 

116 

80 

79 



79 _ 

104 TTLG 

119 KEQPGKASGRRHRYi: 




,WGSSRL 123 

kNGLSALCLL 154 



Elapid snake venom neurotoxins exert their effects through high-affinity interactions 
with specific neurotransmitter receptors. The lynxl-like gene disclosed herein as NOV60, is 



10 



15 



highly expressed in the brain and contains the cysteine-rich motif characteristic of this class of 
neurotoxins. Primary sequence and gene structure analyses reveal an evolutionary relationship 
between lynxl and the Ly«6/neurotoxin gene family. Lynxl is expressed in large projection 
neurons in the hippocampus, cortex, and cerebellum. In cerebellar neurons, lynxl protein is 
localized to a specific subdomain including the soma and proximal dendrites. Lynxl binding 
to brain sections correlates with the distribution of nAChRs, and application of lynxl to 
Xenopus oocytes expressing nAChRs results in an increase in acetylcholine-evoked 
macroscopic currents. These results identify NOV60 as a protein modulator for nAChRs in 
vitro, with important implications in. the regulation of cholinergic function in vivo. 

The NOV60 disclosed in this invention is predicted to be expressed in at least the 
following tissues: brain. This information was derived by determining the tissue sources of 
the sequences that were included in the invention including but not limited to SeqCalling 
sources, public EST sources, literature sources, and/or RACE sources. Further expression data 
for NOV60 is provided in Example 2. 

The nucleic acids and proteins of NOV60 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
suffering from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous 
sclerosis, hypercalcemia, Parkinsons disease, Huntington's disease, cerebral palsy, epilepsy, 
Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, and neurodegeneration, as well as other diseases, disorders 
and conditions. The NOV60 nucleic acid encoding the lynxl-like protein of the invention, or 
fragments thereof, may further ]be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. The nucleic acid of the invention 
encoding a lynxl-like protein includes the nucleic acid whose sequence is provided in Table 
60A or 60C, or a fragment thereof. 
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The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 60A or 60C while still encoding a 
protein that maintains its Lynxl-like activities and physiological functions, or a fragment of 
such a nucleic acid. The invention further includes nucleic acids whose sequences are 
5 complementary to the sequence of Table 60A or 60C including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

10 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 21% of the bases may be so changed. 

The novel protein of the invention includes the lynxl-like protein whose sequence is 

1 5 provided in Table 60B and 60D. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 60B and 
60D while still encoding a protein that maintains its lynxl-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 21% 
of the amino acid residues may be so changed. 

20 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

25 

NOV61 

NOV61 is a homolog of the adlican gene and belongs to the superfamily of cell 
adhesion molecules. The disclosed NOV61 (alternatively referred to herein as CG56453-01) 
includes the 5925 nucleotide sequence (SEQ ID NO: ) shown in Table 61 A. A NOV61 ORF 
30 begins with a Kozak consensus ATG initiation codon at nucleotides 16-18 and ends with a 

stop codon at nucleotides 5653-5655. The disclosed NOV 70 maps to human chromosome Y. 



Table 61A. NOV61 Nucleotide Sequence (SEQ ID 
NO.-219) 

AGGCACCCCGACAAGATGCCCAAGCGCGCGCACTGGGGGGCCCTCTCTGTGGTGCTGATC 
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CXGCTTTGGGGTCATCCGCGAGTGGCGCTGGCCTGCCCTCATCCTTGTGCCTGCTACGTC 

CCC AG CGAGGTCCACTGCACGTTCCGATC CCTGGCTTCTGTGCCCGCTGGCATTGCTAAA 

CATGTGGAAAGAATCAATTTGGGGTTTGGAATTCTGAAGTGTAAAAAGGACAAAGCTTAT 

GAAGGCGGTCAGTTGTGTGCAATGTGCTTCAGTCCAAAGAAGTTGTACAAACATGAGATT 

CACAAGCTGAAGGACCTGACTTGT CTGAAGCCTTCCATAGAGT CTCCTCTGAGACAGAAC 

AGGAGC^GGAGTATTGAGGAGGAGCAAAAACAAGAAGAGAATGGTGACAGCCAGCTCATC 

CTGGAGAAAAT CCAACTTC CC CAGTGGAGCATCTCTTTGAATATGACTGATGAGCACGGG 

AACCTGGTGAACTTGGTGTGTGACATCAAGAAACCAATGGATGTGTACAAAATTCACTTG 

AACCAAACAGATCCTCCAGATATTGACATAAATGCAATGGTTGCCTTGGACTTTGAGTAT 

CCAATGACCCAGGAAAACTATGAAAATCTATGGAAATTGATAGCATACTACAGTGAAGTT 

CCCATGAAGCTACACAGAGAGCTCATGCTCAGCAAACACCCCAGAGTCAGCTACCAGTAC 

AGGCAAGATGCCGATGAAGAAGCTCTTTACTACACAGGTGTGAGAGCCCAGATTCTTGCA 

GAACCAGAATGGATCATGCAGCCATCCATAGATATCCAGCTGAACCGACCTCAGAGTACG 

GCCAAGAAGGTGCTACTTTCCTACTACAACCAGTATTCrCAAACAATAGCCACCAAAGAT 

ACAAGGCAGGCTCGGGGCAGAAGCTGGGTAATGATTGAGCCTAGTAGAGCTGTGCAAAAA 

GAT CAGACTGTCCTGG AAGGGGGT CGATGCCAGTTGAGCT GCAATGTGAAAG CTT CTGAG 

AGTCCATCTATCTTCTGGGTGCTTCCAGATGGCrCCATCCTGAAAGTGCCTGTGGATGAC 

CCAGACAG C AAGTT CT CCATT CTCAG CAGTGG CTGGC TGAGGAT C AAGTC C ATGG AGCCA 

TCTGACTCGGGCTTGTACCAGTGCATTGCTCAAGTGAGGGATGAAATGGACOGCATGGTA 

TATAGGGTACTTGTGCAGTCTCCCTCCACTCAGCCAGCCGAGAAAGACACAGTGACAATT 

GGCAAGAACCCAGGGGAGCCAGTGATGTTGCCTTGCAATGCTTTAGCTATACCCGAAGCC 

CAC CTTAG CTGGATT CTTC CAAACAGAAGGATAATTAATGATT TGG CTAACACATCACAT 

GTATACATGCTGCCAAATGGAACTCTTTCCATCCCAAAGGTCCAAGTCAGTGACAGTGGT 

TACCACAGATGTGTGGCTGTCAACCAGCATGGGGCAGACCATATCACGGTGGGAATCAC^ 

GTGACCAAGAAAGGTTCTGGCTCGCCATCCAAAAGAGGCAGATGGCCAGGTCCAAAGGCT 

CTTTCCAGAATGAGAGAAGACATCGTGGAGGATGAAGGGGTCTCAGGCACGGGAGATGAA 

GAGAACACTTCAAGGAGACTT CTACAT CCAAAGCACCAAGAGGCGTTCCTCAAAACAAAG 

GATGATGCCATCAATGGAGATAAGAAAGCCAAGAAAGGGAGAAGAAAGCTGAAACTCTGG 

AAGCATTCAGAAAAAGAACC^GAGACCAGTGTTGCAGAAGATCTCAGAGTGTTTGAATCA 

AGACGAAGGATAAACGTGGCAAACAAACAGATTAATCCGGAGCACTGGGCTGATATTTTA 

GCC AAAGT CTTTGGGAAAAATCT C CCT AC AGG CACAGAAGT AT CCC CAATT ATT AAAAC C 

ACAAGTTCTCCATTCTTGAGCCTAGTAGTCACACCACCTTTGCCTGCTGTTTCTCCCCCC 

TTGGCATCTCCAATACAGACAGCAACAAGTGCTGAAGAATCCTCAGCAGATGTACCTCTA 

CT C AG CGAAGG AAAGCACATTTTGAGT AC CATTT C CT CAG CCAGC ATGGGACTAGAACAC 

CACAACAATGGAGTTATTCTTGTTGAACCTGAAGTAACAAGC71CACCTCT 

GTTGATGAGTATT CCAAGAAGACTGAGGAGATGACTT CCACTG AAGG CGAC CTGAAGGGG 

ACTGCAGC CTCTACACTT ATATCTGAGCCTTATGAACAATCTC CT ACT CTACACACCTTA 

GAC AC AGT CTATGAAGAGC C C ACC CATGAAGAGACGG AAACAGAGGGTTGGTCTG CAGC A 

GATGTTGG ATC CT CACCAG ATCC CACATCCAGTGAGT ATGAGCTT C CATTGGTTGTTGTC 

TCCTTGGCTGAGTCTAAGCCTGTGCAATACTTTGACCCAGATTTGGAGACTAATTCACAA 

CCACATGAGGATAACATAAAAGAATACAGTTTTGCACAC CTTACT CCAACCGC CAT CATC 

TGGTTTAATGACTCTAGTACATCACTGTCATTTGAGGATTCTACTGTAGGGGAACAAGGT 

GTC CCAGG CAAAT CACAT CT ACAAGGAC CGACAGAGAACATCCAG CTT GTGAAAAGT AGT 

TTTAGCACTCAAGACACCTTATTGATTAAAAAAGGTATGAAAGAGATGTCTCAGACACTA 

CAGGGAGGAAATATGCTAGAGGGAGACCCTACACACTCCAGAAGTTCTGAGAATGAGGGC 

CAAGAGAG CAAAT CCATCACTTTACCTGACTCCACACTGGGTATAACGAGCAGTACGTCT 

CC^GTTAAGAAGCCTGCGGAAACCACAGTTGTC^CCCTGCTACACAAAGACACCACAAC^ 

GAAACAACTCCAAGGCAAAAAGTGGCTTC^TCATCCACCATGAGCACTCACCCTTCTCGA 

AGG AGACC CAATGGGAGAAAATT ACACC CT CACAAATTC C ACCAC CGGCACAAGCAAACC 

CCACCCACAACTTTTGCTC<^TTAGAGACTTTTTCTAGTCAACCAACT 

ATTAAGATTTCAAATCAAATGGAGAGTTCTCTGGTTCCTACATCTTGGGAGATTAACACA 

GTTAATACCCCCAAACAGCTGGAAATGGAGAAGAATGTAGAGCTCATATCAAAGGGAACT 

CCACGGAGAAAACACGGGAAGAGGCCAAACAAACATCGATATACCCCTTCTACAGTGAGT 

TCAAGAGCATCTGCATCCAAGCCCAGCCCTTCTCCAGAAAATAAACATAGAAACATTGTT 

ACTCCCAGTTCAGAAACTACACrTrrGCCTAGAAATGTTTCTCTGAAAACTGAGGGCGTT 

TATGATTCCTTAGATTACACGACAACCACCAGAAAAATACATTCATCTCACCATAAAGTC 

GAAGAGACACTTCCAGTCATGTATAAACCCACATCAGATC 

GTTGCCAC?^TGTTGACAAACATAAAAGTGACATTTTAGTCCCTGGTGAGTCAATTACA 
AATGTCACACAAACTTCTCGCTCCTTGGTCTCCACTATGGGAGAATTTAAGGAAGAATCC 
TCTCCTGTGGGCTTTCCAGGAATTCCAACCTGGAATCCCTCAAGGAAAGCTCAGCCT 
AGG CTACAG ACAG ACAT ACAT GTT AC CAC TTCTGG GGAAAC CC CTACAGAC CCTC CCCTT 
GTT AACGAGC TTGAGG ATGTGGATTTTAC TTCTG AGTTTTTGT CCTCTGTG ACAGT CT CC 
ACACCATTTCACCAGGAAGAAGCrGGTTTTTCCACAATTCTCTCAAGCA 
ATGGCTTCAAGTC^GGTAGAAACTACCACCCTTGGTCAAGATCATCATGAAACCACTGTG 
GCTATTCTCCACTCTGAAACTAGACCACAGAATCACATCCTTACTGCTGCCrGGATGAAG 
GAGCCAGCATCTTTGTCCCCTCCCATGATTCTCCTGTCTTTGGGACAAACCACCACCACT 
AAG C CAGAACT T CTCAGT CCAAGAACATCTCAAATATGTAAAG ATTCC AAGGAAAATGT T 
TTCTTGAATTACATGG GGAAT CCAGAAAC AGAAGCAACCC CAGTGAAAAATGAAGGAACA 
C AGCGT ATGT CAGGGC CAAATGAATTAT CAACAC CAT CTT CTGACCACGATGCAT TTAAC 
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TTGT CTACAAAGCTAGAATTGGAAAAG CAAGT ATTTGATAGTAGGAGTCTAACACGTGGC 
CCAGATAGCCACCACCAGGATGGAAGAGTTCATGCTTCTCATCAACTAACCAGAATCCCT 
GCCAAACCCATCCTACCAACAGGAACAGTGAGGCTGCCTGAAATGTCCACACAAAGCACT 
TCCAGATACTTTGTAACTTTCCAGCCACCTCATCACGGGACCAACAAACCAGAAATAAC^ 
ACATATCCTTCTAGGGCTTTGCCAGAGAGCAAACAGTTTACAACTCCAAGAGTAGCAAGT 
ACAACTCCTCTCCTATCACACATGTCCAAACCCAG CATTTCTAGTAAGTTTG CTGACCTA 
AGAACTGACCAATCCAATGGCTCCTACAAAGTGTTTGGAAATAGCAACATCCCTGAGGCA 
AGAAACTCAGTTGGAAAGCCTCTCAGTCCAAGAATTTATCATTATTCCAATGGAAGACTC 
CCTTTCTTTACCAACAGGACTCnTTCTTTTTCACAGTTGGGAGTCACCCGGAGACCCCAG 
ATACCCTCTTCTCCTGTCCCAGTAATGAGAGAGAGAAAAGTTAATCCAGGTTCCTACAAT 
AGGATATATT C CCATAGCACCTTC CAT CTGGACTTTGGCCT T C CAGCAC CT C CACTGTTG 
CACACTC CATGGAC CATGGT ATCACCC CC AACTAACT TAC AGAAT AT C CCT ATGGT CTCA 
TCCACCCAGAGTTCTGTCTCCTTTATAACATCTTCTGTCCAGTCCTCAGGAAGCATCCAC 
C^VAAGCGGCTCAAAGTTCTTTGC^IGGAGGACCGCCTGCATCCAAATTCTGGCCTCTTGGG 
GAAAAGCCCCAAATCCTCACCAAGTCCCCACAGACTGTGTCTGTCACTGCTGAAACGGAC 
GCTGTGTTCCCGTGTGAGGCAATAGGAAAACCAAAGCCTTTCGTTACTTGGACAAAAGTT 
TCCACAGGAGTTCTTATGACTCCGAATACCAGGATACAACGGTTTGAGGTTCTCAAGAAC 
GGTACCTTAGTGATAAGGAAGTTTCAAGTGCAAGATCGAGGCCAGTATATGTGCACCGCC 
AGCAACCTGTACGGCCTGGACAGGATGGTGGTCTTTCTCTGGGTCACCGTGCAGCAACCT 
CAAATCCTAGCCT CCCACTAC CAGGACGT CAC CGT CT AC CTGGGAG AC AC C ATT ACAATG 
GAGTGTCTGGCGAAAGGGACCCCAGCCCCCCAAATTTCCTGGATCTTCCGTGACAGGAGG 
GTGTGGCAAACTCTGTCCTCCGTGGAGGGCCGGATCACCCTGCACCAAAACCGGACCCTT 
TCCATCAAGGAGGCGTCCTTCTC^GACAGAGGCGTCTATAAGTGCGTGGCCAGCAACGCA 
ACCCGGGCGGACAGCGTGTCCATCCGCCTACACGTGGCGGCACTGCCCCCCATTATCCAC 
CAGGAGAAGCT GTAGAACATCTCG CTG C C C C CGGGG CTCAGCATT C ACATT CACTG CACT 
GCCAAAGCTGCGCCCCTGCCCAGCGTGCTCTGGGTGCTCGGGGATGGTACCCAAATCCGC 
CCCrCGCATTTCCTCCACCGGAACTTGTTTGTTTTCCCCAACGGGACGCTCTACATCTGC 
AACCTCGCGCCCAAGGACAGCGGGCGCTATGAGTGCGTGGCCGCCAACCTGATCGGCTCC 
GCGCGCAGTACGGTGCAGCTGAACGTGCAGCG CGCAG CAGCGAAC 



The NOV61 polypeptide (SEQ ID NO:220) encoded by SEQ ID NO:219 is 1879 
amino acids in length and is presented using the one-letter amino acid code in Table 61B. The 
Psort profile for NOV61 predicts that this sequence has a signal peptide and is likely to be 
5 localized outside the cell with a certainty of 0.4371 . In alternative embodiments, a NOV61 
polypeptide is located to lysosomes with a certainty of 0.1900, to the endoplasmic reticulum 
(membrane) with a certainty of 0.1000, or to the nucleus with a certainty of 0.1800. The 
Signal P predicts a likely cleavage site for a NOV61 peptide is between positions 26 and 27, 
le. 9 at the dash in the sequence ALA-CP. 

10 

Table 61B. NOV61 Polypeptide Sequence (SEQ ID 
NO:220) 

MPKRAHWGALSVVLILLWGHPRVALACPHPC 

NLGFGILKCKKDKAYEGGQLCAMCFSPKKLYKHEIHKLKDLTCLKPSIESPLRQNRSRSI 
EEEQKQEENGDSQLILEKIQLPQWSISLNMTDEHGNL 

PDIDINAMVALDFEYPMTQENYENLWKLIAYYSEVPMKLHRELMLSKHPRVSYQYRQDAD 

EEALYYTGVRAQILAEPEWIMQPSIDIQLNRPQSTAKKVLLSYYNQYSQTIATKDTRQAR 

GRSWVMIEPSRAVQKDQTVLEGGRCQLSCTVKASESPSIFWVLPDGSILKVPVDDPDSK^ 

S ILSSGWLRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQS PSTQPAEKDTVTIGKNPG 

EPVMLPCNALAIPEAHLS WILPNRRI INDLANTSHVYMLPNGTLS I PKVQVSDSGYHRCV 

AWQHGADHITVGITVTKKGSGSPSKRGRWPGPKALSRMREDIVEDEGVSGTGDEENTSR 

RLLHPKHQEAFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETSVAEDLRVFESRRRIN 

VANKQINPEHWADIIJ^FGKNLPTGTEVSPIIKTTSSPFLSLVVTPPLPAVSPPLASPI 

QTATSAEESSADVPLLSEGKHILSTISSASMGLEHHNNGVILVEPEVTSTPLEEWDEYS 

KKTEEMTSTEGDLKGTAASTLISEPYEQSPTLHTLDTVYEEPTHEETETEGWSAADVGSS 

PDPTSSEYELPLVWSLAESKPVQYFDPDLETNSQPHEDNIKEYSFAHLTPTAIIWFNDS 

STSLSFEDSTVGEQGVPGKSHLQGPTENI QLVKSS FSTQDTLLIKKGMKEMSQTLQGGNM 

I^EGDPTHSRSSENEGQESKSITLPDSTLGITSSTSPVKKPAETTVVTLLHKOT 

QKVASSSTMSTHPSRRRPNGRKLHPHKFHHRHKQTPPTTFAPLETFSTQPTQATDIKISN 
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QMESSLVPTSWEIOTVOTPKQLEMEKNVELISKGTPRRKHGKRPNKHRYTPSTVSSRASA 
S KPS P S PENKHRN I VT PS SETTLLPRNVS LKTEGVYDSLDYTTTTRKIHS SHHKVQDTLP 
VMYKPTSDGKEIQDDVATNVDKHKSDII.VPGESITNVTQTSRSLVSTMGEFKEESSPVGF 
PGIPTWNPSRKAQPGIUiQTDIHVTTSGETPTDPPLVNELEDVDFTSEFLSSVTVSTPFHQ 
EEAGFST ILS SI KVEMAS SQVETTTLGQDHHETTVAI LHSETRPQNHILTAAWMKEPASL 
SPPMILLSI^QTTTTKPEIJiSPRTSQICKDSKENVFLNYMGNPETEATPVKNEGTQRMSG 
PNELSTPSSDHDAFNLSTKLELEKQVFDSRSLTRGPDSHHQDGRVHASHQLTRIPAKPIL 
PTGTVRLPEMSTQSTSRYFVTFQPPHHGTNKPEITTYPSRALPESKQFTTPRVASTTPLL 
SHMSKPSISSKFADLRTDQSNGSYKVFGNSNIPEARNSVGKPLSPRIYHYSNGRLPFFTN 
RTLSFSQLGVTRRPQIPSSPVPVMRERKVNPGSYNRIYSHSTFHLDFGLPAPPLLHTPWT 
MVSPPTNLQNIPMVSSTQSSVSFITSSVQSSGSIHQSGSKFFAGGPPASKFWPLGEKPQI 
LTKSPQTVSVTAETDAVFPCEAIGKPKPFVTWTKVSTGVLMTPNTRIQRFEVLKNGTLVI 
RKFQVQDRGQYMCTASNLYGLDRMWFLWVTVQQPQILASHYQDVTVYLGDTITMECIJIK 
GTPAPQI SWI FRDRRVWQTLS SVEGRITLHQNRTLSIKEASFSDRGVYKCVASNATRADS 
VS IRLHVAALPPI IHQEKL __ 



A BLAST analysis of NOV61 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV61 had high 
homology to other proteins as shown in Table 61C. 

5 



Table 61C. BLASTX results from PatP database for NOV61 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


PW) 


patp:AAM03157 Peptide #1839 encoded by probe 


2631 


2.0e-273 


patp:AAM15395 Peptide #1829 encoded by probe 


2631 


2.0e-273 


patp:AAM27883 Peptide #1920 encoded by probe 


2631 


2.0e-273 


patp:AAM55191 Human brain expressed single exon probe 


2631 


2.0e-273 


patp:AAM67586 Human bone marrow expressed probe 


2631 


2.0e-273 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 4045 of 4330 bases (93%) identical to a gb:GENBANK- 
lD:AF245505|acc:AF245505.1 mRNA from Homo sapiens (adlican mRNA). The full amino 
10 acid sequence of the protein of the invention was found to have 1598 of 1818 amino acid 

residues (87%) identical to, and 1661 of 1818 amino acid residues (91%) similar to, the 2828 
amino acid residue ptnr:SPTREMBL-ACC:Q9NR99 protein from Homo sapiens (Human) 
(ADLICAN). NOV61 also has homology to the other proteins shown in the BLASTP data in 
Table 61D. 
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Table 61D. NOV61 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi| 14766612 |r 
ef |XP_035465. 
l| (XM_ 035465) 


hypothetical protein 
XP_035465 
[Homo sapiens] 


2828 


1590/1818 
(87) 


1654/1818 
(90) 


0.0 
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gi| 9280405 |gb 












|AAF86402.l|A 


Adlican 


2828 


1591/1818 


1654/1818 




F245505_l 


[Homo sapiens) 


(87) 


(90) 


0.0 


(AF245505) 













This BLASTP data is displayed graphically in the ClustalW in Table 61 E. A multiple 
sequence alignment is given, with the N0V61 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 61D. 



Table 61E. ClustalW Alignment of NO V61 



NOV61 

gi | 14766612 | 
gi I 9280405 | 



NOV61 

gi | 14766612 | 
gi I 9280405 | 



KOV61 

gi (14766612 | 
gi) 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi I 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi j 9280405 | 



NOV61 

gi | 14766612 | 
gi |9280405 | 



NOV61 

gi | 14766612 I 



(SEQ ID NO: 220) 
(SEQ ID NO: 607) 
(SEQ ID NO: 608) 



10 20 30 40 50 60 
1 I I 1 I I t I 1 | | | 

1 ^^^^p^^^^Sj^^^^^^^^SM^g^^^^^^^^^Sg^ 60 

70 80 90 100 110 120 

|....|_.|....|....|....|....|....|....|....|..- .|....| 

61 gg H 63 

6i SlraareHiWBB^ t * * 120 

130 140 150 160 170 180 

63 - F 64 

121 ^^^^^^^^^^^^j^^^gj^^^^^^^^^S^^ 180 

121 BBBBIE ME^ i 80 

190 200 210 220 230 240 

64 : n ag 6 s 

181 ^S^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^M 240 

181 EBSBBBiWEMBTHB^^ 240 

250 260 270 280 290 300 

| | j | | | | | | t I I 

69 S^^^^^^^^ ^^Q^M^^^S^^ ^^^^^^^^^^ ^S^S^ 128 
241 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^H 300 

241 B3gSEW9eSSWS!W!H^ , , - : ' 300 

310 320 330 340 350 360 
■ ■ . . . 1 ■ . . . | ■ . : . 1 | | | | 1 | | 

129 i^^b !& ^ ^ 198 
301 '^^^m^M^^^^^^^^mSM^^^^^^^^^^^ ^M 360 

301 360 

370 380 390 400 410 420 

| | ,1 | j | 1 | | | 1 

189 ^p^ l gygTO 248 
3 61 420 

3 6i VIW^SSSSSSSSWSSSKmKIX^ ^f;^, < ^>^^ySia^^^»^ 4 20 

430 440 450 460 470 480 

| . - -_^J - | 1 | | | | | | I 

249 
421 



VRAQILAEPEWgMQPSIDIQLNRgQSTAKKVLLSYYjJjQYSQTI^TKDTRQARGRSWVMIE 
VRAQILAEPEWVMQPSIDIQLNRRQSTAKKVI,LSYYTQYSQTISTKDTRQARGRSWVMIE 
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gi | 9280405 | 421 



NOV61 

gi | 14766612 | 
gil 9280405 I 



NOV61 

gi | 147 66612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi|9280405| 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi 9280405 | 



NOV61 

gi | 14766612 | 
gi|9280405| 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi I 9280405 | 



NOV61 

gi | 14766612 | 
gi I 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



HOV61 

gi | 14766612 | 
gi | 9280405 ) 



309 
481 
481 



369 
541 
541 



/RAOILAEPEITOIOPSIDIQLI^RQSTAKKV LLSYYTQYSQTISTKDTRQARGRSWVMIE 



480 



490 



500 



510 



520 



530 



540 



I 



PSGAVQRDQTVLEGGPCQLSCNVKASESPSIFWVLPDGSILKAPMDDPDSKFSIIjSSGWL 
PSGAVORDQTVLEGGPCQLSCIWl^SESPSIFVIVIiPDGSXLKAPWPDPBSKFSILSSG^L 



368 
540 
540 



550 



I , 



560 



570 



580 



590 



600 



RJKSMEP^DSGLYQC I AQVRDEMDRMVYRVLVQS P STQPAEKDT VT IGKNPGEgVjjjL P C 
RIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPGESVTLPCN 
RIKSMEPSDSGLYQCIAQVRDEMDR^YRVLVQSPSTQPAEKDTVTIGKNPGESVTLPCN 



610 



620 



630 



640 



650 



660 



429 
601 

601 m 



iaipeahlswilpnrriindlantshvymlpngtlsipkvqvsdsgyj|ircvavnqsjgad 
laipeahlswilpnrriiitolantshvymlpngtlsipkvqvsdsgyyrcvavnqqgad! 
lai pe ahl s wilpnrr i indlantshvymlpngtls i pkvq vs dsgy yrc vavnqqg ad 



488 
660 
660 



670 



• 1. 



660 
. . I . . 



690 



700 710 

„ \ 1 ^ ,^ 



720 



489 
661 
661 



549 
721 
721 



609 
781 
781 



669 
841 
841 



729 
901 
901 



H JjT VG I T VT KKGSGQ P S KRGRjjjjP Ggj KAL S RjgRED I VEDEG jj S GyGDEENT S RRL LH P KjjjQ 
HFTVGITVTKKGSGLPSKRGRRPGAKALSRVREDIVEDEGGSGMGDEENTSRRLLHPKDQ 
HFT VG I T VT KKG S GL P S KRGRR P GAKAL S R VRED I VEDE GG S GMGDE ENT S RRL LHP KDQ 



730 



740 



■I 



750 760 
..| | 1.. 



770 



780 



E^^KTKDDAINGDKKAKKGRRKLKLW 

EVFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETNVAEGRRVFESRRRINMANKQINP. 
EVFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETNVAEGRRVFESRRRINMANKQINP 1 



790 



800 



I 



810 



820 
..I.. 



830 



840 



Ejg^II^vSGKNLPgGTEV^ 

ERWADILAKVRGKNLPKGTEVPPLIKTTSPPSLSLEVTPPFPAjjSPPSASPVQTVTSAEE 
ERWADILAKVRGKKLPKGTEVPPLIKTTSPPSLSLEVTPPFPAVSPPSASPVQTVTSAEE 



668 
840 
840 



I 



850 
. . I . : 



I 



860 



870 



680 



890 



! 



900 



S S ADVPLLgE^H yLgT I S S AS MGLEHjjjJjNG VI L VE PE VTSTP LEE WDjjjgS jgKTEEjfljTS 

ssadvpllgeeehvlgtissasmglehnhngvilvepevtstpleeWddlsekteeits 
ssadvpllgeeehvlgtissasmglehnhngvilvepevtstpleewddlsekteeits 



910 



920 

.J.. 



930 



940 



950 



960 



TEGDLKGTAAgTLISEPYEgSPTLHTLDTVYEigPTHEETgTEGWSAADVGSSPgPTSSEY 
TEGDLKGTAAPTLISEPYEPSPTLHTLDTVYEKPTHEETATEGV5SAADVGSSPEPTSSEY 
TEGDLKGTAAPTLiSEPYEPSPTLHTLDTVYEKPTHEETATEGWSAADVGSSPEPTSSEY 



788 
960 
960 



970 



980 



990 



1000 



1010 



1020 




1030 1040 1050 1060 1070 



1080 



I , 



I 



849 

1021 

1021 



909 

1081 

1081 



STgGSgGVPGgSHLQGgTi^lgLVKSSgSTQDTLLIKKGMKEMSQTLQGGNMLEGDPTHS 
STIGEPGVPGQSHLQGLTDNIHLVKSSLSTQDT f LLIKKGMKEMSQTLQGGNMLEGDPTHS 
STIGEPGVPGQSHLQGLTDI^IHLVKSSLSTQPTLLIKKGMKEMSQTLQGGNMLEGDPTHS 



908 

1080 

1080 



1090 



1100 
..I... 



1110 



1120 



1130 
.J. . > 



1140 



rsse^egqesksitlpdstlglussuspvkkpaettvgtllgkdtttottqprqkvagss 
r's s e segqes ks itlpds tlgi ms s ms p vkkpaett vg'f lltjkbt.ttgtttprqkvaps s 
rssesegqesksitlpdstlgimssnspvkkpae^tvgtlldkdtttBtttprqicvapss 



967 

1140 

1140 



1150 1160 1170 1180 



1190 



1200 
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1027 
1201 
1201 



1087 
1261 
1261 



1147 
1321 
1321 



1267 
1441 
1441 



NOV61 

gi | 14766612 | 
gi |9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi 1 9280405 | 



NOV61 

gi | 14766612 | 
gi 1 9280405 | 



NOV61 

gi | 14766612 | 
gi j 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi 1 14766612-] 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi | 92804051 



NOV61 

gi | 14766612 | 
gi | 9280405 I 



NOV51 

gi 114766612 | 




1026 
1200 
1200 



1210 
..I... 



1220 



I. 



1230 
..I... 



1240 



I. 



I . 



1250 
..I... 



1260 



I 



JPTgjWjsgOTVNTPKQLEMEKNgEj^SK^^ 
VPTAWVDJJTVNTPKQLEMEKNAEPTSKGTPRRKHGKRPNKHRYTPSTVSSRASGSKPSPS 
VPTAWVDNT VNTPKQLEME KNAEPTS KGTPRRKHGKRPNKHRYTPSTVS SRASG S KP S P S 



1270 



1280 



1290 



1300 



1310 
..|... 



I 



1320 



I 



PENICHRNIVTPSSETgLLPR^VSLKTEGQYDSLDYgTTTRKIgSSi^KVQfgTLPVlJiYKPT 
PENKHRNIVTPSSETILLPRTVSIiKTEGPYDSLDYMTTTRKIYSSYPKVQETLPVTYKPT 
PENKHRNIVTPSSETILLPRTVSLKTEGPYDSLDYMTTTRKIYSSYPKVQETLPVTYKPT 



1146 
1320 
1320 



1330 1340 

I — i — i. :..[.., 



1350 



I 



1360 



I. 



1370 
I ... 



1380 



I 



SDGKEIgDDVATNVDKHKSDILVgGESITNgSgTSRSLVSTMGEFKEESSPVGFPGjjPTW 
SDGKEIKDDVATNVDKHKSDILVTGESITNAIPTSRSLVSTMGEFKEESSPVGFPGTPTW 
SDGKEIKDPVATNVDKHK5DILVTGESITNAIPTSRSLVSTMGEFKEESSPVGFPGTPTW 1 



1390 
I-. ■■[.■■ 



1400 1410 1420 

..|....|....|....|....|... 



PSRjaAQPGRLQTDljgVTTSGEaaTDPPL^ELEDVDFTSEFLSSgTVSTPFHOEEAGjj.q 
NPSRTAQPGRLQTglPVTTSGENLTDPPLLKELEDVDFTSEFIiSSLTVSTPFHQEEAGSS 
NPSRTAQPGRLQTDIPVTTSGENLTDPPLLKELEDVDFTSEFLSSLTVSTPFHQEEAGSS 



1430 1440 
. I....I | 

1266 
1440 
1440 



1450 



1460 



I. 



1470 



1480 



1490 
..I... 



1500 



I 



TgLSSIKVE^SSQflETTTLgQDHj^ 

TTLSSIKVEVASSQAETTTLDQDHLETTVAILLSETRPQNHTPTAARMKEPASSSPSTIL 
TTLSSIKVEVASSQAETTTLDQDHLETTVAILLSETRPQNHTPTAARMKEPA5SSPSTIL 



1510 



1520 



1530 



I. 



I. 



1540 
..I... 



1327 X 
1501 
1501 



1550 



|S LGQTTTT KPgJLjgS PRJJS Q jjjggD S KEOTFLNYsJiGNPETEAT P VgNEGTQgMSGPNEL ST 
MS LGQTTTTKPALPS PR I SQAS RDS KENVFLNYVGNPETE ATPVNNEGTQKMSG PNEL S T 
^SLGQTTTTKPALPSPRISQASRDSKENVFLNYVGNPETEATPVNNEGTQHMSGPNELST 



1560 
.1 

1386 
1560 
1560 



1570 



1387 
1561 
1561 



1447 
1621 
1621 



1506 
1681 
1681 



1580 

■ . I . . ■ 



I. 



1590 



1600 
..I... 



1610 



I. 



PSSDgDAFNLSTKLELEKQVFgSRSLgRGPDSggQDGRVHASHQLTRyPAKPILPTgTVR 
PSSDgDAFNLSTKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTATVR 
PSSD^DAFNLSTKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTATVR 



1620 
J 

1446 
1620 
1620 



1630 



1640 



1650 



1660 
..|... 



I 



1670 
..|... 



LPEMSTQSaSRYFVTgQgPjgHgTNKPEITTYPSi^PEgKQFTTPRESTTBPLiEHMSK 
LPEMSTQSASRYFVTSQSPRHWTNKPEITTYPSGALPENKQFTTPRLSSTTIPLPLHMSK 
LPEMSTQSASRYFVTSQSPRHWTNKPEITTYPSGALPENKQFTTPRLSSTTIPLPLHMSK 



1680 
I. ...I 

1505 
1680 
1680 



1690 



1700 



1710 



1720 
..I... 



1730 



PSIjgSKF^DjJRTDQj^GggKVFGN]^ 

PSIPSKFTDRRTDQFNGYSKVFGNSSINIPEARNPVGKPPSPRIPHYSNGRLPFFTNKTLSFj 
PSIPSKFTDRRTDQFNGYSKVFGMNNIPEARIMPVGKPPSPRIPHYSNGRLPFFTNKTLSF 



1740 
.| 

1565 
1740 
1740 



1750 



1566 S 
1741 
1741 



1760 



1770 



1780 

..I... 



I 



1790 
..I... 



I. 



1800 



QLGVTRRPQIPgSPgPViyiRERKViaPGSYNRlSSHSTFHLDFGgPAPPLLHTPiST^SP 
PQLGVTRRPQIPTSPAPVMRERKVIPGSYNRIHSHSTFHLDFGPPAPPLLHTPQTTGSPS 
PQLGVTRRPQIPTSPAPVMRERKVIPG5YNRIHSHSTFHLDFGPPAPPLLHTPQTTGSPS 



P 1625 
1800 
1800 



1810 



1820 



1630 

..I.. . 



1840 

. . I . . . 



1850 



1860 



T^QNIPfWSSTQSS^SFITSSVQSSGSaHQSgSKFFAGGPPASKFWgLGEKPQILTKSP 
im^IPHVSS^Q'S S I S FITSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSP 
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gi | 9280405 | 



NOV61 

gi 1 14766612 | 
gi j 9280405 | 



NOV61 

gi 1 14766612 | 
gi | 9280405 1 



NOV61 

gi | 14766612 | 
gij 9280405 | 



N0V61 

gi | 14766612 | 
gij 9280405 ) 



N0V61 

gi | 14766612 | 
gij 9280405 | 



N0V61 

gi|l4766612| 
gi | 9280405 ) 



N0V61 

gi | 14766612 | 
gi | 9280405 | 



N0V61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gij 9280405 | 



N0V61 

gi | 14766612 | 
gij 9280405 | 



N0V61 

gi | 14766612 | 
gij 9280405 | 



1801 ^^^g^j^^^^^^^^^^^^^^^j^^^^^^^^^^^^^ I860 

1870 1880 1890 1900 1910 1920 

I I I | | | | 1 | | | | 

1686 S^ ^^SlAB^^Bl^'I^ ^^g^ ^^B^ ^^^^^^^^^^ PgFSa 1 7 45 
1861 ^^^^^^^^^^^^S^^^^^^^^^^^^^S^^^^^^m 1920 

1930 1940 1950 1960 1970 1980 

• [ 1 [ | , . . . | [ | | | 1 [ | 

1746 1805 
1921 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^B 1980 
1921 wSSBBBSB^^ 1980 

1990 2000 2010 2020 2030 2040 

1806 lll §l[BRg^ ^ 1865 

1981 ^^^S^^^^^^^^^^^m^^^^^^^^M^^^^^^^^^ 2040 
2050 2060 2070 2080 2090 2100 

' : ' - L" ^ ' l 1 — 1 — 1 — { — ' — 1 — ' — 1 — 1 — 1 

1866 ^ ^Slgg^ ^ g- 1879 

2041 B^^^fe|^^^^^^^^^^^^^^^^^^^^^%^^^^^^g 2100 

2110 2120 2130 2140 2150 2160 
1879 - - - 1879 

2101 3H5BcHWH5SBS5gB ^ ffia, »;?^f5^S 8BM 2160 

2170 2180 2190 2200 2210 2220 

.-.|-...|....|-...|-..-|-...|.-..|....|..-.,|....|....|....| 

1879 ------ - 1879 

2161 Batfflggwll^^ t ^jejA^I 2220 

2161 BSWSffigECSfiHi^ ^ 2220 

2230 2240 2250 2260 2270 2280 

-•...|....|-...|-....|....|....|....|....|....|....|....|....| 

1879 - 1879 

2221 S^^^^^^^^^^^^ES^^^^^^^^^^^^^^^^^^^^B 2280 

2221 BBraSifflSiSSBgaS^^ 2280 

2290 2300 2310 2320 2330 2340 

....i-....u...|....i....i~..i I .... I .... I .... I — I — I 

1879 -~ - — — - 1879 

2281 ^^ ^^TTP^j? 'SY^aMfeWA^ 2340 

2281 E^^^^^^^^^^S^^^^^^^^^Sy^^P 2340 

2350 2360 2370 2380 2390 2400 

....|..--|-...|....|....|....|....|....|....|....|....|....| 
1879 --------- 1879 

2341 BHESSffiSSBSS^^^^^^^^^^^^^^SS^ffi^SSES® 2400 

2341 vSS^^^^^S^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 2400 

2410 2420 2430 2440 2450 2460 

....|....|.-..|....|....|....|....|....|....|....|....|....| 

1879 — — — 1879 

24 01 gjyjjjjjgjjj^^ 2460 

2401 SBffiBraSBBHBSl^ > < *** • *r» ^"f^i^ 24 6 0 

2470 2480 2490 2500 2510 2520 
1879 — -- 1879 

2461 pMaam^ Z\T7?™ -t;". ^^z^^t^^^^^ma 2 s2o 

2461 tr*W»I«K^rte»a M ar4Tjiw^ 2520 
2530 2540 2550 2560 2570 2580 
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NOV61 

gi ) 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi | 9280405 | 



NOV61 

gi | 14766612 | 
gi j 9280405 | 



NOV61 

gi | 14766612 | 
gi j 9280405 | 



N0V61 

gi | 14766612 | 
gi j 9280405 | 



N0V61 

gi | 14766612 | 
gi j 9280405 | 



| | | | | | | | | | | | 

1879 — - 1879 

2521 ^^^^^S^^^^M^SS^i^^^^^^^^^^^^^^^^^^S 2580 

2521 K^ 'y iliK v^^'. 5 Bm3EmS^S^^SB5^mg53 2580 

2590 2600 2610 2620 2630 2640 

....|....|....|....|....|....|....|....|....|....|....|....| 

1879 - 1879 

2581 ^^^^S^^^^^^^^^^^^^^^^^^M^^^^^^^^^ 2640 

2581 ISBBiBSBBffi^ i^'rsyF ^BBISGSSSBBSGBXSEKISS^ 2640 

2650 2660 2670 2680 2690 2700 

....|...-|....|....|....|....|..-|-....|....|....|....|....| 

1879 - 1879 

2641 ra^^^B^^^^^gEg^^^^^^^^ragggggS^g^^graBra^ 2700 
2641 W^SS^S^^^^^ ^SSml^^^^^^S^S^^^^^^^ 2700 

2710 2720 2730 2740 2750 2760 

1879 -r - 1879 

2701 f^S^^^^^^^^^^^^^^^^^^^^^^^S^^^Sj^B 2760 
2701 ^^^^^^^p^^^p^^ppj^^^^^^^^j^^.^^^^^^g 2760 

2770 2780 2790 2800 2810 2820 

....|....| v ..|.,..|....|...;|....|....|....|....|....|....| 

1879 ^ 1879 

2761 Sp^^^g^g|g^^^gj^2gg^^|^gggM^^5OT^^gOTj^^ 2820 

2761 ^^^^^^^^^^^^S^^^^^E^^So^^^S^^^^^^^^S 2820 

1879 - 1879 

2821 gjggm 2828 
2821 Ef^lli 2828 



Table 61F lists the domain description from DOMAIN analysis results against NOV6L 
This indicates that the NOV61 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 61F. Domain Analysis of NOV61 



gnlj Smart 1 smart00409 , IG, Immunoglobulin SEQ ID NO; 861 



CD-Length =* 86 residues, . 100.0% aligned 
Score - 62.8 bits (151), Expect = 2e-10 



NOV61: 1685 PQTVSVTAETDAVFPCEAIGKPKPFVTWTKVST^ 1744 

P +V+V CEA GPP VTW K GIH-+R NTLI 

Sbjct: 1 PPSVTVKEGESVTLSCEASGNPPPTVTWYKQG-GKIiI^ 59 



NOV61: 1745 VQDRGQYMCTASNLYGLDRMWFLWVT 1771 

+D G Y C A+N G L V 

Sbjct; 60 PEDSGTYTCAATNSSGSASSGTTLTVL 86 



The gene of invention is a close homolog of the adlican gene and belongs to the 
superfamily of cell adhesion molecules. Cell adhesion molecules mediate key aspects of 
development, differentiation, cellular plasticity and physiological function in a variety of 
tissues. In addition, they are central to a number of disease processes such as cancer. Adlican 
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is a protein that has been described to be elevated in patients with osteoarthritis. Sequence 
analysis indicates that it is likely to be a secreted protein. A rat gene named mechanical stress- 
induced protein has been patented and shows significant similarity to adlican. This protein is 
elevated in osteoblasts subjected to mechanical stress and has been suggested to be effective in 
5 the prognosis, diagnosis or treatment of osteoarthritis. Since this family of proteins seems to 
have involvement in osteoarthritis, it follows that the protein of invention may share that 
characteristic. 

The disclosed NOV61 of invention has two significant attributes - it is truncated 
relative to its homolog, adlican, and secondly, it maps to the Y chromosome. The first attribute 

10 is significant in that it is possible that the truncated adlican -like protein may play a dominant - 
negative role in the function of adlican . It is therefore possible that it may be constitute an 
effective treatment for osteoarthritis. The chromosomal localization is relevant because it is 
known that osteoarthritis has a higher frequency in women. It is possible, therefore, that the 
truncated protein may be nullifying the effect of adlican, if any, in males. 

1 5 The NOV61 disclosed in this invention is predicted to be expressed in at least the 

following tissues: muscle, lymph. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NO V61 is provided in Example 2. 

20 The nucleic acids and proteins of NOV61 are useful in potential therapeutic 

applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
suffering from: osteoarthitis, asthma, allergy, muscular dystrophy, Lesch-Nyhan syndrome, 
myasthenia gravis, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 

25 autoimmune disease, allergies, immunodeficiencies, transplantation, graft versus host disease 
(GVHD), lymphaedema, cancer, tissue degeneration as well as other diseases, disorders and 
conditions.. The NOV61 nucleic acid encoding the adlican-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. The novel nucleic acid of the 

30 invention encoding a adlican-like protein includes the nucleic acid whose sequence is provided 
in Table 61 A, or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 61A 
while still encoding a protein that maintains its adlican-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
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whose sequences are complementary to the sequence of Table 61 A, including nucleic acid 
fragments that are complementary to any of the nucleic acids just described The invention 
additionally includes nucleic acids or nucleic acid fragments, or complements thereto, whose 
structures include chemical modifications. Such modifications include, by way of non-limiting 
5 example, modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 7% of the bases may be so changed. 

10 The novel protein of the invention includes the adlicari-like protein whose sequence is 

provided in Table 61B. The invention also includes a mutant or variant protein any of whose 
residues maybe changed from the corresponding residue shown in Table 61B while still 
encoding a protein that maintains its adlican-like activities and physiological functions, or a 
functional fragment thereof, hi the mutant or variant protein, up to about 13% of the amino 

1 5 acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

20 below. 

NOV62 

NOV62 has homology to neuropsin, an extracellular matrix serine protease. The 
disclosed NOV62 (alternatively referred to herein as CG56781-01) includes the 834 nucleotide 
sequence (SEQ ID NO:221) shown in Table 62A. A NOV62 ORF begins with a Kozak 
25 consensus ATG initiation codon at nucleotides 3 1-33 and ends with a TGA codon at 
nucleotides 808-810. The disclosed NOV62 maps to human chromosone 19. 



Table 62A. NOV62 Nucleotide Sequence (SEQ ID 
NO:221) 

AGTCTTGCCTTCTTTTGAGCCTAAGTCATGAGTTGGATGTTCCTCAGAGATCTCCTGAGT 

CGTTTGCTGGTCTACATGGTGGCAGCAGAGCACGTC^ 

GAGTGCAACAGTAGACAGCCCGGTTGCAAAAATGTGTGTTTT 

TCCCAACTCAGACTTOGGGCCTTACAACTGATAATGGTCT 

GTTTTACATGTAGCCTATCATGAGGGTAGAGAGAAAAGGCACAGAAAGAAACTCTATGTC 
AGCCCAGGTACAATGGATGGGGGCCTATGGTACGCTT 
ACTGGTTTTGAAATTGGCTTCCITGTITT 
CCCTACCTTATAAAGTGTGATTTGAAGCCCT^ 
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AAACCCACTGAGAAGACGATCTTCATCCTCTTCTTGGTCATCACCTCATGCTTGTGTATT 
GTGTTGAATTTCATTGAACTGAGTTTTTTGGTTCTCAAGTGCTTTATTAAGTGCTGTCTC 
CAAAAATATTTAAAAAAACCTCAAGTCCTCAGTGTGTGAGTGCCACAGCCTCAGATATGT 
TGAATGTG 



The NOV62 polypeptide (SEQ ID NO:222) encoded by SEQ ID NO:221 is 259 amino 
acids in length and is presented using the one-letter amino acid code in Table 62B. The Psort 
profile for NOV62 predicts that this sequence has a signal peptide and is likely to be localized 
5 to the endoplasmic reticulum (membrane) at the plasma membrane with a certainty of 0.5500. 
In alternative embodiments, a NOV62 polypeptide is located to iysosomes with a certainty of 
0.2353, or to the endoplasmic reticulum (lumen) with a certainty of 0.1000, The Signal P 
predicts a likely cleavage site for a NOV62 peptide is between positions 28 and 29, /.e., at the 
dash in the sequence TRA-QG. 

10 



Table 62B. NOV62 Polypeptide Sequence (SEQ D> 
NO:222) 

MGRPPPCAIQPWILLLLFMGAWAGVTRAQGSRSRKGQASKPHSQPWQAALFQGERLICGG 
VLVGDRWVLTAAHCKKQKYSVRLGDHSLQSRDQPEQEIQVAQSIQHPCYNNSNPEDHSHD 
IMLIRLQNSANLGDKVKPVQLANLCPKVGQKCI I SGWGTVTSPQENFPNTLNCAEVKI YS 
QNKCERAYPGKITEGMVCAGS SNGADTCQGDS GGPLVCEGTLAG I VSGGS EP VFRPRRPA 
VYTNVFDYLEWIESTMERN 



A BLAST analysis of NOV62 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV62 had high 
15 homology to other proteins as shown in Table 62C. 



Table 62C. BLASTX results from PatP database for NOV62 






Smallest 






Sum 


Sequences producing High-scoring Segment Pairs: 


High 
Score 


Probability 
P(N) 


patp:AAWi0694 Human recombinant neuropsin 
patp:AAWi2393 Mouse neuropsin protein 
patp:AAB2l3li Human neuropsin - Homo sapiens, 275 aa. 
patp:AAY41744 Human PR0322 protein sequence 
patp:AAY32852 Human serine protease protein sequence 


1247 
1247 
972 
965 
965 


8.9e-127 

8.9e-l27 

1.2e-97 

6.8e-97 

6.8e-97 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 747 of 805 bases (92%) identical to a gb:GENBANK- 
20 ID:E12348|acc:E12348.1 mRNA from Mus sp. The full amino acid sequence of the protein of 
the invention was found to have 227 of 257 amino acid residues (88%) identical to, and 238 of 
257 amino acid residues (92%) similar to, the 260 amino acid residue ptnnSWISSNEW- 
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ACC:Q61955 protein from Mus musculus (Mouse) (NEUROPSIN PRECURSOR (EC 3.4.21 .) 
(NP)). NOV62 also has homology to the other proteins shown in the BLASTP data in Table 
62D. 



Table 62D. NOV62 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

<%> 


Positive 

(%) 


Expect 


gi|6679487|re 
f|NP 032966.1 
| (NM_008940) 


protease, serine, 19 
(neuropsin) ; Brain 
Serine protease 1 
[Mus musculus) 


260 


227/257 
(88) 


238/257 
(92) 


e-131 


gi|6093538|sp 
|O88780|NRPN_ 
RAT 


NEUROPSIN PRECURSOR (NP) 
(BRAIN SERINE PROTEASE 
1) 


260 


216/258 
(83) 


232/258 
(89) 


e«126 


gi| 4699764 |pd 
b|lNPM|A 


Chain A, Neuropsin, A 
Serine Protease 
Expressed In The Limbic 
System Of Mouse Brain 


225 


197/223 
(88) 


206/223 
(92) 


e-112 


gi| 6005844 |re 
f |NP_009127.1 
| (NM_007196) 


kallikrein 8 
(neuropsin/ovasin) / 
protease, serine, 19 
(neuropsin/ovasin) 
[Homo sapiens] 


260 


172/253 
(67) 


207/253 
(80) 


8e-99 


gi 

ef 
1| 


16162680 |r 
XP_057595. 
;XM_057595) 


hypothetical protein 

XP_057595 

[Homo sapiens] 


260 


169/253 
(66) 1 


203/253 
(79) | 


3e-96 K 



This BLASTP data is displayed graphically in the ClustalW in Table 62E. A multiple 
sequence alignment is given, with the NOV62 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 62D. 



Table 62E. ClustalW Alignment of NOV62 



NOV62 


(SEQ ID NO r 222) 


gi 


6679487] 


(SEQ ID NO 


609) 


gi 


6093536) 


(SEQ ID NO 


.610) 


gi 


4699764) 


(SEQ ID NO 


611) 


Si 


6005844) 


(SEQ ID NO 


612) 


gi 


16162680) 


(SEQ ID NO 


613) 
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NOV62 

gi| 6679487 | 
gi|6093538| 
gi (4699764 | 
gi|6005844| 
gi j 16162680 | 



NOV62 



6679487 
6093538 
4699764 



61 
61 
61 
29 




70 



80 



90 
.1-. 



100 



110 

..I.. 



120 
..I 



VLVGDRVTOiTAAHCKKgxySVRLGDHSLQgRDSPEQEIQVAQSIQHPCYNgSNPEDHSED 
VL VGDRWVL T AAHC KKg K Y S VRLGDHS L Q§ RDSPEQE I Q VAQ S I QHP CYnSsNPEDH S HD 

vlvgdrwvltaahckkgkysvrlgdhslq|rd|peqeiqva2siqhpc[^snp 
vlvgdrwvltaahckkBkysvrlgdhslqIrdspeqeiqvaqsiqhpcynSsnpedhskd 
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gi |6005844 | 
gi] 16162680 | 



NOV62 



gi 
gi 
gi 
gi 
gi 



667*487 
6093538 
4659764 
6005844 
16162680) 



61 
61 



NOV62 


121 


gi 


6679487 


121 


gi 


6093538 


121 


gi 


4699764 


89 


gi 


6005844 


121 


gi 


16162680 | 121 




190 



200 



210 



220 



230 



NOV62 


181 


gi 


6679487 


181 


gi 


6093538 


181 


gi 


4699764 


149 


gi 


6005844 


181 


gi 


16162680 | 181 



241 
241 
241 
209 
241 
241 



240 




Table 62F lists the domain description from DOMAIN analysis results against NOV62. 
This indicates that the NOV62 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 62F. Domain Analysis of NOV62 



gnl 1 Smart I smart 0 0 02 0 , TrypJSPc, Trypsin -like serine protease; Many of these 

are synthesised as inactive precursor zymogens that are cleaved during 

limited proteolysis to generate their active forms. A few, however, are 

active as single chain molecules, and others are inactive due to 

substitutions of the catalytic triad residues. SEQ ID NO: 812 

CD-Length =23 0 residues, 98.3% aligned 
Score « 237 bits (604), Expect « 7e-64 



N0V62 : 


36 


Sb j ct : 


5 


NOV62 : 


91 


Sbjct: 


65 


N0V62 : 


149 


Sbj Ct: 


121 


NOVS2: 


206 



GQASKPHSQPWQAALF - QGERLICGGVLVGDRWVLTAAHC KKQKYSVRLGDHSLQS 90 

G + S PWQ +L +G R CGG L+ RWVLTAAHC VRLG HLS 

GSEANIGSFPWQVSLQYRGGRHFCGGSLISPRWVLTAAHCVYGSAPSSIRVRLGSHDLSS 64 

RDQPEQEIQVAQSIQHPCY^SOTEDHSHDIMLIRLQNSAmGDKVKPVQLA- -NLCPKV 14 8 

+ q ++V++ I HP YN P + +DI L D V+P+ L 

GE - ETQTVKVS KVT VHPNYN- - - PSTYDNDIALLKLSEPVTLSDTVRPICLPSSGYNVPA 12 0 



G C +SGWG + 



+ P+TL 



VIS C RAY G IT+ M+CAG 



NOV62 : 206 DTCQGDSGGPLVCE - - -GTIAGIVSGGSEPVFRPRRPAVYTNVFDYLEWI 252 
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D CQGDSGGPLVC L GIVS GS RP +P VYT V YL+WI 
Sbjct: 181 DACQGDS GGPL VCNDPRWVL VG TVS WGS YG CARPNKPG VYTR VS S YLDWI 230 

Neuropsin appears to act as a regulatory molecule in the early phase of LTP via its 
proteolytic function on extracellular matrix rather than affecting NMDA receptor-mediated 
calcium increase. The behavioral and electrographical abnormalities associated with seizures 
5 in epileptic (kindled) mice correspond with those of human epilepsy . In kindled mice, 
neuropsin was markedly increased in the hippocampus and cerebral cortices.A single 
intraventricular injection of monoclonal antibodies specific to neuropsin reduced or eliminated 
the epileptic pattern noted on electroencephalograms and, as a result markedly inhibited the 
progression of kindling. 

10 Therefore, neuropsin appears to be a key protein controlling pathogenic events in the 

hippocampus, and thus neuropsin inhibitors might be useful for treatment of epilepsy. 
Neuropsin has two isoforms, which have been reported to be involved in hippocampal 
plasticity. The amino acid sequences of the two types of human neuropsin were identical, 
except that type 2 carried an insert of 45 amino acids at the C-terminus of the leader sequence. 

15 The essential three amino acids in the active site triad, His, Asp, and Ser, and the single 
putative N-glycosylation site were conserved in human and mouse neuropsin. Sequence 
analysis of the 946 bp genomic DNA spanning the region encoding the insertion sequence 
revealed that two isoforms were generated in human brain by alternative splicing. However, 
the mouse genomic sequence did not conserve the 3 1 acceptor consensus sequence at the 

20 corresponding position, suggesting that type 2 neuropsin was a species-specific splice variant. 
When the open reading frames of human neuropsin were expressed in insect cells, both types 
of neuropsin were detected in the conditioned media by western blot analysis using anti- 
human neuropsin serum. 

Northern blot hybridization and reverse transcription-polymerase chain reaction 

25 showed predominant expression of type 1 neuropsin in pancreas. Xype 2 neuropsin was 
preferentially expressed in human adult brain and hippocampus, although both types were 
expressed in fetal brain and placenta in comparable amounts. Dot blot hybridization showed 
that neuropsin was expressed in various regions of adult brain, including the hippocampus and 
cerebral cortex, and also in various fetal tissues. These results suggest that human type 2 

30 neuropsin may be important to the adult brain plasticity, although both types may be necessary 
for the development of the nervous system. 

The disclosed NOV62 is predicted to be expressed in at least the following tissues: 
brain. This information was derived by determining the tissue sources of the sequences that 
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were included in the invention including but not limited to SeqCalling sources, public EST 
sources, literature sources, and/or RACE sources. Further expression data for NOV62 is 
provided in Example 2. 

The nucleic acids and proteins of NOV62 are useful in potential therapeutic 
5 applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 

10 disorders, addiction, anxiety, pain, neuroprotection, osteoarthritis, and other diseases, 

disorders and conditions of the like. The NOV62 nucleic acid encoding the neuropsin-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. The 
novel nucleic acid of the invention encoding a neuropsin precursor-like protein includes the 

15 nucleic acid whose sequence is provided in Table 62A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 62A while still encoding a protein that maintains its 
neuropsin precursor-like activities and physiological functions, or a fragment of such a nucleic 
acid. The invention further includes nucleic acids whose sequences are complementary to 

20 those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 8% of the residues 
may be so changed. 

30 The novel protein of the invention includes the neuropsin precursor-like protein whose 

sequence is provided in Table 62B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 62B while 
still encoding a protein that maintains its neuropsin precursor-like activities and physiological 
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functions, or a functional fragment thereof. In the mutant or variant protein, up to about 12% 
of the bases may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV63 

NOV63 has homology to the WNT family of proteins. The Wnt gene family consists 
of at least 15 structurally related genes that encode secreted extracellular signaling factors. 
WNT proteins function in a range of critical developmental processes in both vertebrates and 
invertebrates and are implicated in regulation of cell growth and differentiation in certain adult 
mammalian tissues, including the mammary gland. The disclosed NOV63 (alternatively 
referred to herein as CG56054-02) includes the 1 128 nucleotide sequence (SEQ ID NO: ) 
shown in Table 63 A, A NOV63 ORF begins with a Kozak consensus ATG initiation codon at 
nucleotides 31-33 and ends with a stop codon at nucleotides 1 102-1 104. The disclosed 
NOV63 maps to human chromosome 1. 



Table 63A. NOV63 Nucleotide Sequence (SEQ ID 
NO:223) 

TCCTCCCCGCAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGAC 
CCCAACGGCCGACCGAGGCGACGC^GCCGC^ 

GCAGAAGCACTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCa3CAAGTACrCGGG 
CCTCATTGTCAACAAGGCGCCCCACGACT^ 

GCCCCACTCCCACCGCCTCTACTACCTGGGAATGCCATATGGCAGCCGAGAG 
CCTCTACTCTGAGATTCCCAAGAAGGTCCGGAAAGAGGCTCTGCTGCTCCTGTCCTGGAA 
GCAGATGCTGGATCATTTCCAGGCCACGCCCCACC^ 
GCTGCTGAGGGAGCGGAAACGCCTGGGGGTCTTCGG 

CGAGAGTGGCCTCTTCCTCTTCCAGGCCAGCAACAGCCTCTTCCACTGCCGCGACGGCTC 
CAAGAACGGCTTCATGGTGTCCCCTATGAAACCXjCT 

gccccggatggaccccaaaatctgccctgccgaccctgc 

cagcx^cctgtgggtggccaac^tcgagacaggcgaggagcggcggctgaccttctgcca 

ccaaggtttatccaatgtcctggatgaccccaagtctgcgggtgtggccacctt03tcat 

acaggaagagttcgaccgcttcactgggtactggtggtgccccacagcctcct 

ttcagagggcctcaagacgctgcgaatcctgtatgaggaagtcgatgagtccgaggtgga 

ggtcattcacgtcccctctcctgcgctagaagaaaggaagacggactcgtatcggtaccc 

CAGGACAGGTAGC^GAATCCCAAGATTGCCTTGAAACTGGCT 

CC^GGGCAAGATCGTCTCGACCCAGGAGAAGGAGCTGGTGCAGCCCTTCAGCTCGCTGTT 

CCCGAAGGTGGAGTACATCGCCAGGGCCGGGTGGACCCGGGATGGCAAATACGCCTGGGC 

CATGTTCCTGGACCGGCCCCAGCAGTGGCTCCAGCTCGTCCTCCTCCCCCCGGCCCTGTT 

CATCCCGAGCACAGAGAATGAGGAGCAGaXSCTAGCCTCTGCCAGAGCTGTCCCCAGGAA 

TGTCCAGC^GTATGTGGTGTAO^GGAGGTC^CCAACGTCTGGATCAATGTTCATGACAT 

CTTCTATCCCTTCCCCCAATCAGAGGGAGAGGACGAGCTCTGCTTTCTCCGCG^ 

ATGCAAGACCGGCTTCTGCCATTTGTACAAAGTCACCGCCGTTTTAAAATCCCAGGGCT 

CGATTGGAGTGAGCCCTTCAGCCCCGGGGAAGATGAATTTAAGTGCCCCATTAAGGAAGA 

GATTGCTCTGACCAGCGGTGAATGGGAGGTTTTGGCGAGGCAra 

CAATGAGGAGACCAAGCTGGTGTACTTCCAGGGCACCAAGGA 
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CCTCTACGTGGTCAGCTATGAGGCGGCCGGCGAGATCGTACGCCTCACCACGCCCGGCTT 
CTCCCATAGCTGCTCCATGAGCCAGAACTTCGACATGTTCGTCAGCCACTACAGCAGCGT 
GAGCACGCCGCCCTGCGTGCACGTCTACAAGCTGAGCGGCCCCGACGACGACCCCCTGCA 
CAAGCAGCCC CGCTTCTGGGCTAGCATGATGGAGGCAGCCAGCTGCC CCCCGGATT ATGT 
TCCTCCAGAGATCTTCCATTTC CACACGCGCT CGGATGTGCGGCT CTACGGCATGATCTA 
CAAGCCCCACGCCTTGCAGCCAGAGAAGAAGCACCCCACCGTCCTCTTTGTATATGGAGG 
CCCCCAGGTGCAGCTGGTGAATAACTCCTTCAAAGGCATCAAGTACTTGCGGCTCAACAC 
ACTGGCCTCCCTGGGCTACGCCGTGGTTGTGATTGACGGCAGGGGCTCCTGTCAGCGAGG 
GCTTCGK3TTCGAAGGGGCCCTGAAAAACCAAATGGGCCAGGTGGAGATCGAGGACCAGGT 
GGAGGGCCTGCAGTTCGTGGCCGAGAAGTATGGCTTCATCGACCTGAGCCGAGTTGCCAT 
CCATGGCTGGTCCTACGGGGGCTTCCTCTCGCTCATGGGGCTAATCCACAAGCCCCAGGT 
GTTCAAGGTGGCCATCGCGGGTGCCCCGGTCACCGTCTGGATGGCCTACGACACAGGGTA 
CACTGAGCGCTACATGGACGTCCCTGAGAACAACCAGCACGGCTATGAGGCGGGTTCCGT 
GGCCCTGCACGTGGAGAAGCTGCCCAATGAACCCAACCGCTTGCTTATCCTCCACGGCTT 
CCTGGACGAAAACGTGCACTTTTTCCACACAAACTTCCTCGTCTCCCAACTGATCCGAGC 
AGGGAAACCTTACCAGCTCCAGATCTACCC CAACGAGAGACACAGTATTCGCTGCC CCGA 
GTCGGGCGAGCACTATGAAGTCACGTTGCTGCACTTTCTACAGGAATACCTCTGAGCCTG 
CCCACCGGGAGCCGCCACAT 



The NOV63 polypeptide (SEQ ID NO:224) encoded by SEQ ID NO:223 is 357 amino 
acids in length and is presented using the one-letter amino acid code in Table 63B. The Psort 
profile for NOV63 predicts that this sequence has a signal peptide and is likely to be exported 
5 from the cell with a certainty of 0.3700. In alternative embodiments, a NOV63 polypeptide is 
located to lysosomes with a certainty of 0.1000, or to the endoplasmic reticulum (membrane) 
with a certainty of 0.1000. The Signal P predicts a likely cleavage site for a NOV63 peptide is 
between positions 18 and 19, /.<?., at the dash in the sequence ALG-SY. 



Table 63B. NOV63 Polypeptide Sequence (SEQ ID 
• N0224) 

MATTGTPTADRGDAAATDDPAARFQVQKHSWDGLRS I IHGSRKYSGLIVNKAPHDFQFVQ 
KTDESGPHSHRLYYLGMP YGSRENSLL YSE I PKKVRKE ALLLLSWKQMLDHFQATPHHGV 
YSREEEIiLRERKRLGVFGITSYDFHSESGLFLFQASNSLFHCRDGGKNGFMVS PMKPLEI 
KTQCSGPRMDPKI CPADPAFFSFINNSDLWVANI ETGEERRLTFCHQGLSNVLDDPKSAG 
VATFVIQEEFDRFTGYWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPALEERKT 
DSYRYPRTGSKNPKIALKLAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRD 
GKYAWAMFLDRPQQWLQLVLLPPALFIPSTEOTEQRLASARAVPRNVQPYVVYEEVTmW 
INVHDIFYPFPQSEGEDELCFLRANECKTGFCmYKVTAVLKSQGYDWSEPFSPGEDEFK 
C P I KEEI ALTSGEWEVLARHGSKI WVNEETKLVYFQGTKDT P LEHHL YWS YE7AGEI VR 
LTTPGFSHSCSMSQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWASMMEAAS 
CPPDYVPPEIFHFHTRSDVRLYGMIYKPHALQPEKKHPTVLFVYGGPQVQLV3NNSFKGIK 
YLRIiNTLASLGYAVVVIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEGLQFVAEKYGFID 
LSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTERYMDVPENNQHG 
YEAGSVALHVEKLPNEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGKPYQLQIYPNERH 
SIRCPESGE HYEVTLLHFLQEYL 

10 



A BLAST analysis of NOV63 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV63 had high 
homology to other proteins as shown in Table 63C. 

15 



Table 63C. BLASTX results from PatP database for NOV63 



Smallest 
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Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAY8i693 Human Wnt-6 protein sequence - Homo sapiens 


411 


1.0e-61 


patp:AAB49769 Amyloid-beta protein agglutination regulator 


411 


1.0e-61 


patp:AAB88439 Human membrane or secretory protein - 


411 


1.0e-61 


patp:AAB19786 Human Wnt-1 protein involved in kidney 


630 


2.2e-61 


patp:AAY94319 Murine Wnt-lOA protein - Afus musculus, 417 aa. 


398 


2.7e-61 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 633 of 795 bases (79%) identical to a gb:GENBANK- 
ID:AF031 168|acc:AF031 168.1 mRNA from Gallus gallus (Gallus gallus Wnt-14 protein 
5 (Wnt-14) mRNA). The full amino acid sequence of the protein of the invention was found to 
have 287 of 349 amino acid residues (82%) identical to, and 319 of 349 amino acid residues 
(91%) similar to, the 354 amino acid residue ptnr:SWISSPROT-ACC:O42280 protein from 
Gallus gallus (Chicken) (WNT-14 PROTEIN PRECURSOR). NOV63 also has homology to 
the other proteins shown in the BLASTP data in Table 63D. 



Table 63D. NOV63 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi| 15082261 |r 
ef|NP 003386. 
l| (NM_003395) 


wingless -type MMTV 
integration site family, 
member 14 
{Homo sapiens] 


365 


339/356 
(95) 


343/356 
(96) 


0.0 


gi)3915306|sp 
|O42280|WN14_ 
CHICK 


WNT-14 PROTEIN PRECURSOR 


354 


283/333 
(84) 


310/333 
(92) 


e-156 


gi|l6303264|d 
bj [BAB70499.1 
| (AB063483) 


WNT14B 

[Homo sapiens] 


357 


215/352 
(61) 


263/352 
(74) 


e-107 


gi| 17017976 |r 
ef |NP_003387. 
l| (NM_003396) 


wingless -type MMTV 
integration site family, 
member 15 precursor 
[Homo sapiens] 


357 


215/352 
(61) 


263/354 
(74) 


e-107 


gi|l8181917|d 
bj |BAB83866.1 
| (AB073819) 


Wntl4b 

[Mus musculus) 


359 


216/354 
(61) 


264/354 
(74) 


e-106 



This BLASTP data is displayed graphically in the ClustalW in Table 63E. A multiple 
sequence alignment is given, with the disclosed NOV63 protein being shown on line 1 in a 
ClustalW analysis comparing the protein of the invention with the related protein sequences 
15 shown in Table 63D. 



Table 63E. ClustalW Alignment of NOV63 

NOV63 ! (SEQ ID NO:224) " 
gi|l508226l| (SEQ ID N0.-614) 
gil3915306| (SEQ ID NQ;615) 
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gi 
gi 



16303264 
17017976 
18181917 



NOV63 



15082261) 

3915306) 

16303264 

17017976 

18181917 



NOV63 



15082261 
3915306) 
16303264 
17017976 
18181917 



NOV63 



15082261) 

3915306) 

16303264 

17017976 

18181917 



NOV63 



15082261) 

3915306) 

16303264) 

17017976) 

18181917 



NOV63 

gi | 15082261 
gi|3915306| 
gi | 16303264 
gi 1 17017976 
gi j 18181917 



NOV63 



15082261 
3915306) 
16303264 
17017976 
18181917 



NOV63 



15082261 
3915306) 
16303264 
17017976 
18181917 



(SEQ ID NO: 616) 
(SEQ ID NO: 617) 
{SEQ ID NO: 618) 




250 



260 




290 




ajGGSDp: 

iGGSDPL 
KSIPGHSDQI 
iTKG] 

ft 



350 



I. 



"360 



CRCQVgWCCYVECgQC 
cfficQVgWCCYVECgQC 
CKCQvSwCCYVEcSqc 

cScqvHwccyvecSqc 
cRcqvSwccyvecSqc 
cScqvSwccyvecSqc* 



347 
355 
344 
347 
347 
349 



533 



WO 02/068649 



PCT/US02/02785 



Table 63F lists the domain description from DOMAIN analysis results against NOV63. 
This indicates that the NOV63 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 63F. Domain Analysis of NOV63 


gnl|Pfam|pfam00110, wnt, writ familv. SEO ID NO:862 








CD- Length = 313 residues, 99.7% aligned 






Score = 


£oo Dits { f*5)t itxpect — yt-/o 




NOV63 : 


51 


CDRLK-LERKQRRMOlRDPGVAETLVEAVSMSALECQFQFRFERWNCTLEGRyRASL 


106 






C L L +QR++CRR+P V ++ E ++ ECQ QFR RWNC+ R R 




Sbjct: 


2 


CRSLPGI*SPRQRQLCRRNPDVMASVSEGAQLAIQECQHQFRGRRWNCSTLDRLRVVFGKV 


61 


NOV63 : 


107 


LKRGFKETAFLYAISSAGLTHALAKACSAGRMERCTra 


165 






LK+G +ETAF+YAISSAG+ HA+ +ACS G +E C CD + +WQWGGC DN++ 




Sbjct: 


.62 


LKKGTRETAFVYAISSAGVAHAVTRACSEGELESCGCDYKKGPGGPQGSWQWGGCSDNVE 


121 


NOV63 : 


166 


YS SKFVKEFL- GRRSSKDLRARVDFHNNLVGVKVI KAGVETTCKCHGVSGS CTVRTCWRQ 


224 






+ +F +EF+ R +D R+ ++ HNN G K +K+ + CKCHGVSGSC+++TCW 




SbjCt: 


122 


FGIRFSRE FVDARERERDARS LMNLHNNEAGRKAVKSHMRRECKCHGVSGSC SMKTCWLS 


181 


NOV63 : 


225 


LAPFHE VGKHLKHKYETALKV- GS TTNEAAGEAGAI S PPRGRASGAGGSDPL PRTPELVH 


283 






L F VG LK KY+ A++V + G A + R SD LV+ 




Sbjct: 


182 


LPDFRAVGDALKDKYDGAI RVEPNKRGMGQGSAPRLVAKNPRFKPPTRSD LVY 


234 


NOV63 : 


284 


LDDSPSFCL- -AGRFSPGTAGRRC HREKNCES I CCGRGHNTQSRWTRPCQCQVRW 


337 


Sbjct: 




L+DSP +C S GT GR C CE +CCGRG+NTQ T C C+ W 




235 


LEDSPDYCERDRSTGSI^TQGRVCNKTSKGLDGCELLCCGRGYNTQQVERTEKCNCKFHW 


294 


NOV63 : 


338 


CCYVECRQCTQREEVYTCK 356 








CCYV+C +C + EV+TCK 




Sbjct: 


295 


CCYVKCEECQEWEVHTCK 313 





WnM (previously known as int-1) is a proto-oncogene induced by the integration of 
the mouse mammary tumor virus. It is thought to play a role in intercellular communication 
and seems to be a signalling molecule important in the development of the central nervous 
system (CNS). The sequence of wnM is highly conserved in mammals, fish, and amphibians. 

1 0 WnM is a member of a large family of related proteins that are all thought to be 

developmental regulators. These proteins are known as wnt-2 (also known as irp), wnt-3, up to 
wnt-15. At least four members of this family are present in Drosophila. One of them, wingless 
(wg), is implicated in segmentation polarity. All these proteins share the following features 
characteristics of secretory proteins, a signal peptide, several potential N-glycosylation sites 

15 and 22 conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins 
seem to adhere to the plasma membrane of the secreting cells and are therefore likely to signal 
over only few cell diameters. 

The NOV63 disclosed in this invention is predicted to be expressed in at least the 
following tissues: brain This information was derived by determining the tissue sources of the 
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sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV63 is provided in Example 2. 

The nucleic acids and proteins of NOV63 are useful in potential therapeutic 
5 applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
1 0 disorders, addiction, anxiety, pain, neuroprotection, osteoarthritis, and other diseases, 
disorders and conditions of the like. The NOV63 nucleic acid encoding the WNT-14 
precursor-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. The novel nucleic acid of the invention encoding a WNT-14 precursor-like 

1 5 protein includes the nucleic acid whose sequence is provided in Table 63A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 63A while still encoding a protein 
that maintains its WNT-14 precursor-like activities and physiological functions, or a fragment 
of such a nucleic acid. The invention further includes nucleic acids whose sequences are 

20 complementary to the sequence indicated in Table 63A, including nucleic acid fragments that 
are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 21% of the bases 
may be so changed. 

30 The novel protein of the invention includes the WNT-14 precursor-like protein whose 

sequence is provided in Table 63B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 63B while 
still encoding a protein that maintains its WNT-14 precursor-like activities and physiological 
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functions, or a functional fragment thereof. In the mutant or variant protein, up to about 1 8% 
of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV64 

NOV64 has homology to dipeptidyl peptidase. The disclosed NOV64 (alternatively 
referred to herein as CG56884-01) includes the 2660 nucleotide sequence (SEQ ID NO:225) 
shown in Table 64A. A NOV64 ORF begins with a Kozak consensus ATG initiation codon at 
nucleotides 44-46 and ends with a stop codon at nucleotides 2633-2635. The disclosed 
NOV64 maps to human chromosome 17. 



Table 64A. NOV64 Nucleotide Sequence (SEQ ID 
NO:225) 

TCCTCCCCGCAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGAC 

CCCAACGGCCGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTCCAGGT 

GCAGAAGCACTCGTGGGACGGGCTCCGGAGC^TCATCCAaSGCAGCCGCAAGTACTCGGG 

CCTCATTGTCAACAAGGCGCCCCACGACTTCCAGTTTGTGCAGAAGACGGATGAGTCTGG 

GCCCCACTCCCACCGCCTCTACTACCTGGGAATGCCATATGGCAGCCGAGAGAACTCCCT 

CCTCTACTCTGAGATTCCCAAGAAGGTCCGGAAAGAGGCT 

GCAGATGCTGGATCATTTCCAGGCCACGCCCCACCATGGGGT 

GCTGCTGAGGGAGCGGAAACGCCTGGGGGTCTTCGGC^TCACCTCCTACGACTTCCACAG 
CGAGAGTGGCCTCTTCCTCTTCCAGGCCAGCAACAGCCT 
CAAGAACGGCTTCATGGTGTCCCCTATGAAACCGCTGG^ 
GCCCCGGATGGACCCCAAAATCTGCCCTGCCGACCCTGCCTT 

CCAAGGTTTATCCAATGTCCTGGATGACCCCAAGTCTGCX^ 

ACAGGAAGAGTTCGACCGCTTCACTGGGTACTGGTGGTGCCCCACAGCCTCCTGGGAAGG 

TTC^GAGGGCCTCAAGACGCTGCGAATCCTGTATGAGGAAGTCGATGAGTCCGAGGTGGA 

GGTCATTCACGTCCCCTCTCCTGCGCTAGAAGAAA 

CAGGACAGGTAGCAAGAATCCCAAGATTGCCTTGAAACTTC 

CCAGGGCAAGATCGTCTCGACCCAGGAGAAGGAGCT^ 

CCCGAAGGTGGAGTACATCGCCAGGGCCGGGTGGACCCGGGATGGCAAATACGCCTGGGC 
CATGTTCCTGGACCGGCCCCAGCAGTGGCTCCAGCTCGTCCTCCTCCCCCCGGCCCTGTT 
CATCCCGAGCACAGAGAATGAGGAGCAGCGGCTAGCCTCTGCCAGAGCTGTCCCCAGGAA 
TGTCCAGCCGTATGTGGTGTACGAGGAGGTCACCAACGTCTGGATCAATGTTCATGACAT 
CTTCTATCCCTTCCCCCAATC^GAGGGAGAGGACGAGCTCTGCTTTCTCCGCGCCAATGA 
ATGCAAGACCXX3CTTCTGCCATTTGTACAAAGTCACCGCCGTTT 

CGATTGGAGTGAGCCCTTCAGCCCCGGGGAAGATGAATTTAAGTGCCCCATTAAGGAAGA 

GATTGCTCTGACCAGCGGTGAATGGGAGGTTTTGGCGAGGCACGGCTCCA^ 

CAATGAGGAGACCAAGCTGGTGTACTTCCAGGXX^ 

CCTCTACGTGGTCAGCTATGAGGCGGCCGGCGAGATCGTACGCCT(^CCACGCCCGGCTO 
CTCCCATAGCTGCTCCATGAGCCAGAACTTCGACATGTTCGTCAGCCACTACAGCAGCGT 
GAGCACGCCGCCCTGCGTGCACGTCTACAAGCTGAGCGGCCCCGACX3ACGACCCCCTGCA 
CAAGCAGCCCCGCTTCTGGGCTAGCATGATGGAGGCAGCCAGCTGCCCCCCGGATTATGT 
TCCTCCAGAGATCHTCCATTTC^ 

CAAGCCCCACGCCTTGCAGCCAGAGAAGAAGCACCCCACCGTCCTCTTTGTATATGGAGG 
CCCCCAGGTGCAGCTGGTGAATAACTCCTTC^AAGGCATCAAGra 
ACTGGCCTCCCTGGGCTACGCXIGTGGTTGTGATTGACGGCAGGG^ 
GCITCGGTTCGAAGGGGCCCTGAAAAACCAAATGGGCCAGGTGGAGATCGAGGACCAGGT 
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GGAGGGCCTGCAGTTCGTGGCCGAGAAGTATGGCTTCATCGACCTGAGCCGAGTTGCCAT 

CCATGGCTGGTCCTACGGGGGCTTCCTCTCGCTCATGGGGCTAATCCACAAGCCCCAGGT 

GTTCAAGGTGGCCATCGCGGGTGCCCCGGTCACCGTCTGGATGGCCTACGACACAGGGTA 

CACTGAGCGCTACATGGACGTCCCTGAGAACAACCAGCACGGCTATGAGGCGGGTTCCGT 

GGCCCTGC^CGTGGAGAAGCTGCCCAATGAACCCAACCGCTTGCTTATCCTCCACGGCTT 

CCTGGACGAAAACGTGCACTTTTTCCACACAAACTTCCTCGTCTCCCAACTGATCCGAGC 

AGGGAAACCTTACCAGCTCCAGATCTACCCCAACGAGAGACACAGTATTCGCTGCCCCGA 

GTCGGGCGAGCACTATGAAGTCACGTTGCTGCACTTTCTACAGGAATACCTCTGAGCCTG 
CCCACCGGGAGCCGCCACAT 



A NOV64 polypeptide (SEQ ID NO:226) encoded by SEQ ID NO:225 is 863 amino 
acids in length and is presented using the one-letter amino acid code in Table 64B. The Psort 
profile for NOV64 predicts that this sequence has no signal peptide and is likely to be 
localized at peroxisomal microbodies with a certainty of 0.6400. In alternative embodiments, 
a NOV64 polypeptide is located to lysosomes with a certainty of 0.1000, or to the cytoplasm 
with a certainty of 0.4500. 



Table 64B. NOV64 Polypeptide Sequence (SEQ ID 
NO:226) 



MATTGTPTADRGD AAATDDPAARFQVQKHS WDGLRS I IHGSRKYS GL I VNKAPHDFQFVQ 
voo «^ ^**^^^kGMP YGS RENSLLYSE I PKKVJUCEALLLLSWKQMLDHFQATPHHGV 
^^EELLRERKRLGVFGITSYDFHSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEI 
CTQCSGPRMDPKICPADPAFFSFINNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAG 
VATFVI OEEFDRFTG YWWCPTAS WEGSEGLKTLRIL YEE VDESE VEVIHVP SPALEERKT 

GKYAWAMFIJDRPQQWLQLVLLPPALFIPSTENEEQ 

CPIKEEIALTSGEWEVI^GSKI 

LTTPGFSHSCSMS QNFDMFVSHYS SVS TPPC VHVYKLSGPDDDPLHKQPRFWASMMEAAS 
^PDYVPPEIFHFHTRSDVRLYGMIYKPHALQPEKKHPTVLFVYGGPQVQLVNNSFKGIK 
YLRIjiNTLASLGYAVWIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEGLQFVAEKYGFID 
vf» ifr?^****^ YGGFLSLMGLI HKPQVFKV AI AGAP VTVWMAYDTGYTERYMDVPENNQHG 
^^^^^^^^^^^^WRLL^HGFIJ^ENVHFFHTNFL VS QL IRAGKPYQLQ I YPNERH 



A BLAST analysis of NOV64 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV64 had high 
homology to other proteins as shown in Table 64C. 



Table 64C. BLASTX results from PatP database for NOV64 



Sequences producing High-scoring Segment Pairs: 

n^ : ^to^ 6 Hman 0RFX ORF1390 P ol YPeptide sequence 
patp:AAM38724 Human polypeptide 

patp:AAM40510 Human polypeptide 

patp:AAB47i87 Human DPP8 - Homo sapiens, 882 aa 

patp:AAY90299 Human peptid ase, HPEP-16 protein seouencs 



Smallest 
Sum 

High Probability 
Score P (N) 

3403 0.0 

1987 0.0 

1823 0.0 

2868 1.5e-298 

2547 1.6e-264 
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In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1601 of 2525 bases (63%) identical to a gb:GENBANK- 
ID:AF221634|acc:AF221634.1 mRNA from Homo sapiens (dipeptidyl peptidase 8 (DPP8) 
mRNA). The full amino acid sequence of the protein of the invention was found to have 521 
5 of 856 amino acid residues (60%) identical to, and 657 of 856 amino acid residues (76%) 
similar to, the 882 amino acid residue ptnr:TREMBLNEW-ACC:AAG29766 protein from 
Homo sapiens (Human) (DIPEPTIDYL PEPTIDASE 8). NOV64 also has homology to the 
other proteins shown in the BLASTP data in Table 64D. 



Table 64D. NOV64 BLASTP results 


Gene Index / 
Identifier 


Protein /Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|l786531l|g 
b|AAL47179.l| 
AF452102 X 
(AF452102) 


dipeptidyl peptidase 9 
[Homo sapiens] 


863 


862/863 
(99) 


862/863 
(99) 


0.0 


gi 17463229 |r 
ef XP 035636. 
2| (XM 035636) 


hypothetical protein 

XP_035636 

[Homo sapiens) 


689 


688/689 
(99) 


688/689 
(99) 


0.0 


gi 1 12855335 |d 
bj |BAB30295.1 
| (AK016546) 


DIPEPTIDYL PEPTIDASE 
8-putative [Mus 
musculus] 


883 


519/859 
(60) 


658/859 
(76) 


0.0 


gi|H095188|g 
b|AAG29766.l| 
AF221534_1 
(AF221634) 


dipeptidyl peptidase 8 
[Homo sapiens] 


882 


516/840 
(61) 


650/840 
(76) 


0.0 


gi |3513303 |gb 
|AAC33801.l| 
(AC005594) 


R26984_l 
[Homo sapiens] 


508 


492/543 
(90) 


495/543 
(90) 


0.0 



10 



15 



This BLASTP data is displayed graphically in the ClustalW in Table 64E. A multiple 
sequence alignment is given, with the NOV64 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 64D. 



Table 64E. ClustalW Alignment of NOV64 



NOV64 

gi 
gi 
gi 



178653111 
17483229 | 
12855335) 
11095188| 
gi 3513303 | 



(SEQ ID NO: 226) 
(SEQ ID NO: 619) 
(SEQ ID NO: 620) 
(SEQ ID NO: 621) 
(SEQ ID NO: 622) 
(SEQ ID NO: 623) 



NOV64 

gi | 17865311 | 
gi | 174832291 
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gi| 12855335| 


1 


gi |11095188 


1 


gi 


|3513303| 


1 


NOV64 


49 


gi 


1786531l| 


49 


gi 


17483229) 


1 


gi 


12855335 | 


61 


gi 


11095188 | 


60 


gi 


3513303 | 


1 



HOV64 


109 


gi 


17865311 


109 


gi 


17483229 


1 


gi 


12855335 


121 


gi 


11095188 


120 


gi 


3513303 | 


1 


N0V64 


169 


gi 


17865311 


169 


gi 


17483229 


1 


gi 


12855335 


181 


gi 


11095188 


180 


gi 


3513303) 


1 



NOV64 

gi| 17865311| 
gi j 17483229 | 
gij 12855335) 
gi|H095188| 
gi|3513303| 




190 




200 

L 



210 
I 



220 



230 



240 



c 


SGP 


RMDPK 


UCPADP 


AgF§ 


F? 


c 


SGP 


RMDPK 


UCPADP 


A§FS 


FI 




SGP 


RMDPK 


MCPADP 


AEF£ 


FI 




PNI 


RMDPK 


gCPADP 


mm 


FI 


B 


PNI 


RMDPK 


JCPADP 


FI 



250 



260 



NOV64 


229 


gi| 17865311 


229 


gij 17483229 


55 


gi|l2855335 


241 


gij 11095188 


240 


gij 3513303 | 


1 




270 



GYWWCP 
GYWWCP 
GYWWCP 
GYWWCP 
GYWWCP 




310 



320 



330 



340 



350 



NOV64 


289 Effip 


gi 


17865311 


289 


HV 


P 


gi 


17483229 


115 


HV 


P 


gi 


12855335 


300 


HV 


T 


gi 


11095188 


299 


HV 




gi 


3513303) 


1 








IVSTQEKELVQPFSSLFPK 



370 



380 



390 



349 
349 
175 
360 
359 
20 



400 
} 



410 
I 



420 
I 



VEYIARAGWTgGKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEEQRIASA^V^RNVQ 
VEYIARAGWTSSGKYAWAMFLDRPnnWT 



VEYIARAGWT^GKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEEQRLASARAVPRNVQ 
VEYIARAGWTggGKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEEQRLASARAVPRNVQ 

VEYIARAG^^^BAWAMFLDRPQQWLQLVLLPPALFI PSTENSEQRLAS ARAVPRNVQ : 



408 
408 
234 
IT 419 
4"18 
72 



430 



NOV64 


409 


gi 


17865311 


409 


gi 


17483229 


235 


gi 


12855335 


• 420 


gi 


11095188 


419 


gi 


3513303 | 


73 



I 



440 



I . 



450 
. . I . . 



460 
. _ I 



470 
I 



480 



PYVWEEVTNWlNVHDIFYPF-PQSEGEDELCFLRAl^CKTGFCHLYKVTAVLKSQG^m? 

PYWYEEVTNWINWD'IFYPFPQSEGEDELCFLRA^CKTGFCHLYKVTAVLKSQGYDW 

PYWYEEVTZ^lNVmiFYPFPQSEGEDELCFLRAI^ECKTGFCHLYKVTAVLKSQGYDW 
PjggYEEgTjgwiNgHDIFjg 



468 
468 
294 
478 
477 
132 



490 



500 



510 



j — | — | — ) — , , 



520 



530 



540 



539 



WO 02/068649 



PCT/US02/02785 



NOV64 


469 




17865311 | 


469 


gi 


17483229) 


295 


gi 


12855335| 


479 


gi 


11095188 j 


478 


gi 


3513303) 


133 



SEPFSPGEjgFKCPIKEEIALTSGEWEVLjjRHGS 

sepfspgeSIfkcpikeeialtsgetovlBrhgsS^ 

s epfs p gejj |fkcpikeeialtsgewevlgrhgsgivttoeetklvyfqgtkdtplehhly 

S^^P^gFKCPIKEEI^TSGEWEVI^RHGsSlWVjgE^KLVYFgGTKDgPLEHHLY 

,mmr,mm*~ m T „ ^rm^ „ „ _ , . — 2rhg^i2^2eJ|^lvyf§gtkd§plehhly 

IOTHEETKLVYFQGTKDTPLEHHLY 



SEPFSPGE 



gFKCPIKE EIj 

HSBe 



flTSGEWEV 

Ilt! 



528 
528 
354 
538 
537 
173 



550 



NOV64 


529 


gi 


17865311 


| 529 


gi 


17483229 


355 


gi 


12855335| 539 


gi 


11095188 


| 538 


gi 


3513303) 


174 


NOV64 


589 


gi 


17865311 


589 


gi 


17483229 


415 


gi 


12855335 


599 


gi 


11095188 


598 


gi 


3513303 | 


234 




610 



ygPPEIFHFHTRSDVRLYGMIYKPHALQPjgKKHPTVLFVYGGPQ 
SsfSpPEIFHFHTRSDVRLYGMIYKPHALQPGKKHPTVLFVYGGPQ 




620 
.J.. 



630 



640 
..I.. 



650 



660 



PDYBPPEIFgFj^2^^3 LYGM 2 YKp H3LQPGKK§PTVLF|YGGPQ 

<IFHFHTRSDVRLYGMIYKPHALQPGKKHPTVLFVYGGPQ 



648 
648 
474 
658 
657 
284 



NOV64 


649 


gi 


17865311 


649 


gi 


17483229 


475 


gi 


12855335 


659 


gi 


11095188 


658 


gi 


3513303 | 


285 


NOV64 


709 


gi 


17865311 


709 


gi 


17483229 


535 


gi 


12855335 


719 


gi 


11095188 


718 


gi 


3513303) 


345 



680 
..I.. 



I , 



690 



I . 



700 



I 



710 
I 



720 
I 



VQL VNNS FKGI KYL R LNTL AS LG Y AVW I DGRGS CQRGL RFEGALKNQMGQ VE IEDQVEG 
VQLVlSnvJSFKGIKYLRLNTLASLGYAVVVIDGRGSCQRGLRFSGALKNQMGQVEIEDQVEG 
VQLVNNSFKGIKYLRLNTLASLGYAVWIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEG 
VQL VNNS FKG jjjK Y jjjRLNTL AS LG Y jjlVVV ZD^ RG S cftRGLS FEGAja K PjjjjMGQfl E I ^DQ VEG 

vqlvi^fkgBkySrlot^ 

VQLVNNSFKGIKYLRLNTLASLGYAVVVIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEG 



708 
708 
534 
718 
717 
344 



730 
I I .... I 



740 



I 



750 



760 
.. L. 



I 



770 
..I 



I . 



780 
. . 1 



lqfvaekygfidlsrvaihgwsyggflslmglihkpqvfkvaiagapvtvwmaydtgyte" 

LQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTE 

LQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGL IHKPQV FKVAIAGAPVTVWMAYDTGYTE 

LQgA^YgFIDLgRVglHGWSYGGgLSLMgL^E^FgVAIAGAPVTflwTOYDTGYTE 
LQjSjA^YjgFIDLjgRVglH^ 

LQFVAEK YGFJDLSRVAIHGWSYGGFLSLMGLlljKPQVFKVAIAGAPVTVtVMAYDTGYTE 



768 
768 
594 
778 
777 
404 



790 



800 



810 



820 
..I.. 



830 



I 



84 0 



NOV64 


769 


gi 


17865311 


769 


gi 


17483229 


595 


gi 


12855335 


779 


gi 


11095188 


778 


gi 


3513303 | 


405 



R YMD VPENNQHG YE AGS VALHVE KLPNE PNRLL I LHG FLDENVH FFHTNFL VS QL I RAGK 
RYMDVPEl^QHGYEAGSVALHVEKLPNEPNRLLILHGFLDSNVHFFHTNFLVSQLIRAGK 
RYMDVPENNQHGYEAGSVALHVEKLPNEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGK 



RYM^Pj^^GYafcsVAS^KjapB^ 
RYM^pjSB^GYfflGSVAS^EKaPgEPNRLLjjL 

RYMDVPEN NQHGYEAGSVALHVEKLPNEPNRLLILHGFLDENTOFFHTNFT.VSQT.TRAnTC 



828 
628 
654 
838 
837 
464 



NOV64 


829 


PYQL 


gi 


17865311 


829 


PYQL 


gi 


17483229 


655 


PYQL 


gi 


12855335 


839 


?yJl 


gi 


11095188 


838 


PYgL 


gi 


3513303) 


465 


PYQL 



850 



860 
. . | . . 



870 
..I.. 



880 



I 



890 



2VALPPVSE 



QIYPNERHSIRCPESGEHYEVTLLHFLQEYL 
QIYPNERHSIRCPESGEHYEVTLLHFLQEYL 
QIYPNERHSIRCPESGEHYEVTLLHFLQEYL 
QIYPgERHSIRKPESGEHYETOLLHgLQEjL 
QIYPgERHSIR3pESGEHYESf§LLH|LQE*L 
IQI'YPNERHSI'RCPESGEHYEVTLLHFLQEYL 1 



- 863 

863 

689 

3SRIAALKVI 883 
3SRIAALKVI 882 
508 



Table 64F lists the domain description from DOMAIN analysis results against NOV64. 
This indicates that the NOV64 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 64F. Domain Analysis of NOV64 


gnl|Pfam|pfamO0930, DPPIV N term, Dioeotidvl oentidase TV fDPP rtrt 


N- terminal 


region . 


This family is an alignment of the region to the N- terminal 


side of 


the active 


site. The Prosite motif does not correspond to this Pfam entry. 


SEQ ID 


NO: 


863 








CD-Length ■ 504 residues 




Score = 112 bits (280), Expect = 9e-26 


* 


NOV64 : 


200 


FFSFimSDLWVANIETGEERttTFCHQGLSNVLDDPKSACJVATFVIQEEFDRPTG-YWW 


258 






+F+ +++L++ + +G ++T G SN + + G+ +V +EE WW 




Sbj ct : 


122 


KLAFVRDNNLYIQKLPSGPAIQITT- -DGKSNDIFN GIPDWVYEEEILSTDYALWW 


175 


N0V64 : 


259 


CPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPALEE--RKTDSYRYPRTGSKNPKIA 


316 






P + Y ++SEV VI P + + +YP+ G NP + 




Sbj Ct : 


176 


SPDGD : FLAYLRFNDSEVPVIEYPFYTDDSQYPEDMEIKYPKAGDPNPTVK 


225 


NOV64 : 


317 


LKLAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRDGKYAWAMFLDRPOQF^ 


376 


Sbjct : 




L + G VS + +SL YI R W + + A +L+R Q 




226 


LFWNLAD-- - GASVSE IPLPASLASGDYYITRVAWVTNERIA- VQWLNRDQNIS 


276 


NOV64 : 


377 


OLVLLPPALFIPSTENEEQRLASARAVPRNVQPYVVYEEVTNTOINVHDIFYPF 


436 






h h A +S V +N +E+ W+ + P +G 




Sbjct: 


277 




317 


NOV64 : 


437 


DELCFIiRANECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFKCPIKEEIALTSGEWEV 


496 


Sbjct: 




+L ++ + G+ HL E + K PI ALT G WEV 




318 




352 


N0V64 : 


497 


LARHGSKIWVNEETKLVYFQGTKDTPLEHHLYWSY EAAGEITOLTTPGFSHSCSM 


552 


Sbjct: 




+ + V+ +T VYF T++ E HLY +S + G+ +S S 




353 


T NILGVDSKTDTVYFTATEEGSGERHLYSISLKGGKTTLSCQLDSERCGY-YSASF 


407 


NOV64 : 


553 


SQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWASMME AASCPPDYV 


606 






S N ++ YS P SDL +E A 




Sbjct: 


408 




456 


N0V64 : 


607 


PPEIFHFHTRSDTOLYGMIYKPHALQPEKKHPTVLFVYGGPQVQLV 652 








E + L + KP P KK+P + FVYGGP Q V 




Sbjct: 


457 


SKEFGKIKIADGITLNYQMIKPANFDPSKKYPVLFFVYGGPGSQQV 502 





NOV64 is a member of the family of dipeptidyl peptidases (DPPs). This group of 
enzymes catalyzes the removal of dipeptides from the N termini of polypeptides. This novel 
5 gene has greatest homology to a recently discovered protein, DPP8 (Abbott et al, Eur J 

Biochem 2000 Oct;267(20):6140-50). DPP8 in turn is related to DPP4, which is a cell surface 
peptidase involved in T-cell activation (Kahne et al., Int J Mol Med 1999 Jul;4(l):3-15). 
Other members of the peptidase family have been targeted as putative drug targets, for 
instance, in situations where they might cleave polypeptides beneficial in the prevention or 
10 reduction of a disease condition. 

The NOV64 disclosed in this invention is predicted to be expressed in at least the 
following tissues: bone, bone marrow, brain (cerebellum,substantia nigra, thalamus), 
bronchus, cartilage, cervix, chorionic villus, coronary artery, colon, breast, heart, kidney, liver, 
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lung, lymph node, lymphoid tissue, ovary, placenta, pituitary gland, respiratory bronchiole, 
retina, skeletal muscle, skin, small intestine,spinal cord, spleen, testis, thymus, thyroid, 
umbilical vein, urinary bladder, vulva, adrenal gland/suprarenal gland, synovium/synovial 
membrane, and uterus. This information was derived by determining the tissue sources of the 
5 sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV64 is provided in Example 2. 

The nucleic acids and proteins of NOV64 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

1 0 the compositions of the present invention will have efficacy for treatment of patients suffering 
from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, osteoarthritis, and other diseases, 

1 5 disorders and conditions of the like. A NOV64 nucleic acid encoding the dipeptidyl 
peptidase-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

The novel nucleic acid of the invention encoding a dipeptidyl peptidase-like protein 
20 includes the nucleic acid whose sequence is provided in Table 64A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 64A while still encoding a protein that maintains 
its dipeptidyl peptidase-like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
25 to the sequence of Table 64A, including nucleic acid fragments that are complementary to any 
of the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
30 carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 37% of the bases may be so changed. 
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The novel protein of the invention includes the dipeptidyl peptidase-like protein whose 
sequence is provided in Table 64B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 64B while 
still encoding a protein that maintains its dipeptidyl peptidase-like activities and physiological 
5 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 40% 
of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
10 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV65 

NOV65 includes two dual specificity phosphatase-like proteins, designated herein as 
NOV65a and NOV65b. 

15 

NOV65a 

A disclosed NOV65a (alternatively referred to herein as CG56651-01) includes the 71 1 
nucleotide sequence (SEQ ID NO: ) shown in Table 65A. A NOV65a ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
20 nucleotides 652-654. The disclosed NOV65a maps to human chromosome 1 . 



Table 65A. NOV65a Nucleotide Sequence (SEQ ID 
NO:227) 

ATGCTTCCCAAACGCGTGAGGGAGAAGATGGATGACACCAGCCTCTATAATACGCCCTGT 
GTCCTGGACCTACAGCGGGCCCTGGTTCAGGATCGCCAAGAGGCGCCCTGGAATGAGGTG 
GAT GAGGTCTGGCC CAATGTCTT CAT AGCTGAGAAGAGTGTGG CTGTG AACAAGGGGAGG 
CTGAAGAGGCTGGGAATCACCCACATTCTGAATGCTGCGCATGGCACCGGCGTTTACACT 
GGCCCCGAATTCTACACTGGCCTGGAGATCCAGTACCTGGGTGTAGAGGTGGATGACTTT 
CCTGAGGTGGACATTTCCCAGCATTTCCGGAAGGCGTCTGAGTTCCTGGATGAGGCGOTG 
CTGACTTACAGAGGGAAAGTCCTGGTCAGCAGCGAAATGGGCATCAGCCGGTCAGCAGTG 
CTGGTGGTCGCCTACCTGATGATCTTCCACAACATGGCC^TC 

GTGCGTAAGAAGCGGGCCATCTACCCCAATGAGGGCTTCCTGAAGCAGCTGCGGGAGCTC 
AATGAGAAGTTGATGAGGAGAGAGAAGAGGACTATGGCCGGGAGGGGGGATCAGCTGAGG 
CTGAGGAGGGCGAGGGCACTGGGAGCATGCTCGGGGCCAGAGTGCACGCCCTGACGGTGG 
AAG AGGAGGAC GACAGCGC CAG C CACCTGAGTGG CTCCTC CCTGG GG AAG G 

The NOV65a polypeptide (SEQ ID NO:228) encoded by SEQ ID NO:227 is 217 
amino acids in length and is presented using the one-letter amino acid code in Table 65B. The 
25 Psort profile for NOV65a predicts that this sequence is likely to be a Type II membrane 

protein, and is likely to be localized at the plasma membrane with a certainty of 0.4400. In 
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alternative embodiments, a NOV65a polypeptide is located to the endoplasmic reticulum 
(membrane) with a certainty of 0.8500, or to the nucleus with a certainty of 0.7400. 



Table 65B. NOV65a Polypeptide Sequence (SEQ ID 
NO:228) 

MLPKRVREKMDDTSLYNTPCVLDLQRALV^ 
LKJtfjGITHILNAAHGTGVYTGPEFYTGLEIQYLGVEVDD 

LTYRGKVLVSSEMGI SRSAVLWAYLMIFHNMAILEAIOT'VRKKRAIYPNEGFIiKQIJlEL 
NEKLMRREKRTMAGRGDQLRLRR?\RALGACSGPECTP 

5 NOV65b 

The disclosed NOV65b (alternatively referred to herein as CG56652-02) includes the 
3212 nucleotide sequence (SEQ ID NO: ) shown in Table 65C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 3 1 93-3 1 95. The disclosed NOV65b maps to human chromosome 1 . 

10 ' 

Table 65C. NOV65b Nucleotide Sequence (SEQ ID 
NO:229) 

ATGCTTCCCAAACGCGTGAGGGAGAAGATGGATGACACCAGCCTCTATAATACGCCCTGT 
GTC CTGGACC T AC AGCGGG C C CTG GTTCAGGATCG C CAAGAGGCGCC CTGG AATGAGGTG 
GAT G AGGT CTGG C CCAATG T CTTC ATAGCTGAGAAGAGTGTGGCTGTGAACAAGGGGAGG 
CTG AAGAGGCTGGG AAT CACC CAC ATTCTGAATGC TG CGC ATGG CACCGGCGTTTACACT 
GGC C C CGAATT CT ACACTGGC CTGGAGAT CCAGTACCTGGGTGTAG AG GTGGATGACTTT 
C CTGAGGTGGACATTT C CCAG CATTTCCGGAAGGCGTCTGAGTT CCTG GATGAGGCGCTG 
CTGACTTACAG AGGGAAAGT C CTGGTCAGCAG CGAAATGGG CAT CAG C CGGT C AGCAGTG 
CTGGTGGTCGCCTACCTGATGATCTTCCAC^CATGGCCATCCTGGAGGCTTTGATGACC 
GTGCGTAAGAAGCGGGCCATCTACCCCAATGAGGGCTTCCTGAAGCAGCTGCGGGAGCTC 
AATGAGAAGTTGATGGAGGAGAGAGAAGAGGACTATGGCCGGGAGGGGGGATCAGCTGAG 
GCTGAGGAGGGCGAGGGCACTGGGAGCATGCTCGGGGCCAGAGTGCACGCCCTGACGGTG 
GAAGAGGAGGACGACAGCGCCAGCCACCTGAGTGGCTCCT CCCTGGGGAAGGC CACC CAG 
GCCTCCAAGCCCCTCACCCTCATAGACGAGGAGGAGGAGGAGAAACTGTACGAGCAGTGG 
AAGAAGGGGCAGGGCCTCCTCTCAGACAAGGTCCCCCAGGATGGAGGTGGCTGGCGCTCA 
GCCTCCTCTGGCCAGGGTGGGGAGGAGCTCGAGGACGAGGACGTGGAGAGGATCATCCAG 
GAGTGGCAGAGCCGAAACGAGAGGTACCAAGCAGAAGGGTACCGGAGGTGGGGAAGGGAG 
GAGGAGAAGGAGGAGGAGAGCGACGCTGGCTCCTCGGTGGGGAGGCGGCGGCGCACCCTG 
AGCGAGAGCAG CG C CTGGGAG AGCGTGAGCAG CCACG ACATCTGGGT C C TG AAGCAGCAG 
CTGGAGCTGAACCGCCCGGACCACGGCAGGAGGCGCCGCGCAGACTCGATGTCCTCGGAG 
AGCACCTGGGACGCATGGAACGAGAGGCTGCTGGAGATTGAGAAGGAGGCTTCCCGGAGG 
TACCACGCCAAGAGCAAGAGAGAGGAGGCGGCAGACAGGAGCTCAGAAGCAGGGAGCAGG 
GTGCGGGAGGATGATGAGGACAGCGTGGGCTCTGAGGCCAGTTCCTTCTACAACTTCTGC 
AGCAGGAACAAGGACAAGCTCACTGCCCTGGAAAGATGGAAGATCAAGAGAATCCAATTT 
GGATTTCACAAGAAAGACTTGGGAGCGGGAGACAGCAGCGGTGAGCCCGGTGCAGAGGAG 
GCAGTAGGGGAGAAGAACCCCTCCGACGTCAGCCTGACAGCCTACCAGGCCTGGAAGCTG 
AAAC^CCAGAAGAAGGTGGGCAGTGAGAAC^AGGAGGAGGTGGTGGAGCTCAGCAAGGGG 
GAGGACTCGGCCTTGGCTAAGAAGAGACAACGGAGGCTGGAGCTGCTGGAGAGAAGCCGG 
CAGACGCTGGAGGAGAGCCAGTCTATGGCAAGCTGGGAGGCGGACAGCTCCACGGCCAGC 
GGGAGCATTCCCCTGTCTGCGTTCTGGTCTGCAGACCCCTCAGTCAGCGCTGATGGGGAC 
ACGACGTCAGTACTGAGCACCCAGAGCCACCGCTCCCACCTGTCTCAGGCTGCAAGCAAC 
ATAGCGGGGTGTTCAACCTCCZACCCCACCACACCCCTGCCTAACCTGCCAGTGGGGCCT 
GGAGACACCATTTCCATTGCCAGTATCCAGAACTGGATTGCCAATGTAGTCAGTGAGACC 
CTTGCTCAGAAGCAAAATGAAATGCTGCTGTTGTCCCGCT 

AAGGCAGTACCAGCGGCTAGCTGCCTGGGGGATGACCAAGTCTCCATGCTTAGTGGACAC 
AGCAGCTCCTCCTTGGGTGGCTGCCTGTTGCCTCAGAGCCAGGCAAGACCCAGCTCTGAC 
ATGCAGTCTGTGCTGTCCTGCAACACCACACTGAGCTCACCCGCGGAAAGTTGCAGAAGC 
AAAGTGAGGGGGACCAG CAAG CCCATCTTCAG CCTCTTTG CTGACAATGTGGACCTAAAG 
GAACTTGGCCGGAAGGAGAAGGAGATGCAGATGGAGCTTAGGGAGAAGATGTCTGAGTAC 
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AAAATGGAAAAGCTGGCCTCAGACAACAAACGCAGCTCCCTCTTCAAGAAGAAGAAGGTC 
AAGGAAGATGAGGATGATGGTGTGGGTGATGGGGATGAGGACACTGACAGTGCCATAGGG 
AGCTT CCGATATTCTT CC CGCAGTAATTCCCAGAAAC CTGAAACAGAC ACATGCTCCTCC 
CTGGCTGTCTGTGATCACTATGCAAGTGGCAGCAGAGTTGGCAAAGAGATGGATAGCAGT 
ATTAATAAGTGGCTCAGTGGCCTCAGGACGGAGGAAAAACCTCCTTTCCAAAGTGACTGG 
TCTGGAAGTTCCAGAGGGAAGTACACCAGATCGTCCCTGCTCAGGGAGACAGAGTCTAAA 
TCCTCCAGTTAC AAGTTTT CC AAATCC CAGTC AGAGGAACAGGACACCTCCTCCTACCA^ 
GAGGC AAATGGCAACT CTGTAAGAAGC ACTTCACGGTTCT CAT CTTCCTC CACCAGGGAG 
GG C AGAGAGATGCACAAGTTCTCCAGGTCCACGTACAACGAGACCTCAAGTTC CCGAGAG 
GAGAGCCCAGAGCCCTACTTCTTCCGCCGGACCCCAGAGTCCTCAGAAAGGGAAGAGTCC 
CCAGAACCACAGCGCCCAAATTGGGCCAGGTCCAGGGACTGGGAAGATGTGGAAGAGTCA 
TCCAAGTCAGACTTCTCTGAATTTGGAGCCAAGAGGAAGTTCACCCAGAGCTTTATGAGG 
TCTGAAGAAGAGGGAGAGAAAGAGAGGACAGAAAACAGAGAAGAAGGGAGGTTTGCATCT 
GGACGGCGGTCCCAGTATCGGAGAAGCACTGACAGGGAGGAAGAGGAAGAAATGGACGAT 
GAAGC CAT CATTG CTG CTTGGAGACGCCGGCAAGAAGAAACCAGGACCAAGCTGCAGAAA 
AGGAGGGAGGACTGAGCTGGGGAAAATCTGAG 



The NOV65b polypeptide (SEQ ID NO:230) encoded by SEQ ID NO:229 is 1064 
amino acids in length and is presented using the one-letter amino acid code in Table 65D. The 
Psort profile for NOV65b predicts that this sequence is a Type II membrane protein, and is 
likely to be localized at the plasma membrane with a certainty of 0.7900. In alternative 
embodiments, a NOV65b polypeptide is located to Goligi bodies with a certainty of 03000 or 
to the nucleus with a certainty of 0.8200. 



Table 65D. NOV65b Polypeptide Sequence (SEQ ID 
NO:230) 

MLPKRVREK^DTSLYOTPCVLDLQRALVQD 

LKRLGITHILNAAHGTGVYTGPEFYTGLEIQYLGVEVDDFPEVDISQHFRKASEFLDEAL 

LTYRGKVLVS SEMGI S RSAVLWAYLM I FHNMAI LEALMTVRKKRAI YPNEGFLKQLREL 

NEKLMEEREEDYGREGGSAEAEEGEGTGSMLGARVHALTVEEEDDSASHLSGSSLGKATQ 

ASKPLTLIDEEEEEKLYEQWKKGQGLLSDKVPQDGGGWRSASSGQGGEELEDEDVERIIQ 

EWQSRNERYQAEGYRRWGREEEKEEESDAGSSVGRRRRTLSESSAWESVSSHDIWVLKQQ 

LELNRPDHGRRRRAD SMSSES TWDAWNERLLE IEKEASRR YHAKS KREEAADRSSEAGSR 

VREDDEDSVGSEMSFYNFCSRNKDKLTALERWKIKRIQFGFHKKDLGAGDSSGEPGAEE 

AVGEKNPSDVSLTAYQAWKLKHQKKVGSENKEEVVELSKGEDSALAKKRQRRL 

QTLEESQSMASWEADSSTASGSIPLSAFWSADPSVSADGDTTSVLSTQSHRSHIiSQAASN 

I AGCSTSNPTTPLPNLPVGPGDT I S I AS IQNWIANWSETLAQKQNEMLLLSRS PSVASM 

KAVPAASCLGDDQVSMLSGHSSSSLGGCLLPQSQARPSSDMQSVLSOnTLSSPAESCRS 

KVRGTSKPIFSLFADNVDLKELGRKEKEMQMELREKMSEYKMEKLASDNKRSSLFKKKK^ 

KEDEDDGVGDGDEDTDSAIGSFRYSSRSNSQKPETDTCSSLAVCDHYASGSRVGKEMDSS 

INKWLSGLRTEEKPPFQSDWSGSSRGKYTRSSLLRETESKSSSYKFSKSQSEEQDTSSYH 

EIANGNSVRSTSRFS S SSTREGREMHKFSRST YNETS S SREES PEP YFFRRTPES S EREES 

PEPQRPNWARSRDWEDVEESSKSDFSEFGAKRKFTQSFMRSEEEGEKERTENREEGRFAS 

GRRSQYRRSTDREEEEEMDDEAI IAAWRRRQEETRTKLQKRRED 



10 

A BLAST analysis of NOV65 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV65 had high 
homology to other proteins as shown in Table 65E. 



5 
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Table 65E. BLASTX results from PatP database for NOV65 

Smallest 
Sum 





High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P 


(N) 




patp:AAE04836 Human SGP018 phosphatase polypeptide 


858 


l 


.5e- 


*85 


patp:AAB40919 Human ORFX ORF683 polypeptide sequence 


767 


6 


5e« 


-76 


patp:AAE04837 Human SGP003 phosphatase polypeptide 


410 


4 


4e- 


-38 


patp:AAY68779 Amino acid sequence of a human 


389 


7 


4e- 


•36 


patp:AAB42334 Human ORFX ORF2098 polypeptide sequence 


389 


7 


4e- 


•36 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 328 of 531 bases (61%) identical to a gb:GENBANK- 
ID:AB027004|acc:AB027004.1 mRNA from Homo sapiens (mRNA for protein phosphatase). 
5 The full amino acid sequence of the protein of the invention was found to have 80 of 1 74 
amino acid residues (45%) identical to, and 1 15 of 174 amino acid residues (66%) similar to, 
the 198 amino acid residue ptnr:SPTREMBL-ACC:Q9UII6 protein from Homo sapiens 
(Human) (PROTEIN PHOSPHATASE). NOV65 also has homology to the other proteins 
shown in the BLASTP data in Table 65F. 



Table 65F. NOV65 BLASTP results 


Gene Index/ 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi|l4602535|g 
b|AAH09778.l| 
AAH09778 
(BC009778) 


protein phosphatase 
[Homo sapiens] 


198 


81/178 
(45) 


117/178 
(65) 


3e-37 


gi| 17454087 |r 
ef |XP_061191. 
1) (XM 061191) 


similar to protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


370 


82/186 
(44) 


121/186 
(64) 


4e-37 


gi| 7705959 | re 
f|NP 057448.1 
I {NM 016364) 


protein phosphatase 
[Homo sapiens] 


198 


81/178 
(45) 


116/178 
(64) 


le-36 


gi|73050ll|re 
f |NP_03 8877.1 
_L (NM 013849) 


dual specificity 
phosphatase 13 
[Mus musculus] 


198 


80/178 
(44) 


115/178 
(63) 


3e-36 


gi|l283924l|d 
bj |BAB24480.1 
| (AK006247) 


dual specificity 
phosphatase 13 -putative 
[Mus musculus] 


198 


79/178 
(44) 


114/178 
(63) 


3e-35 



This BLASTP data is displayed graphically in the Clustal W in Table 65G. A multiple 
sequence alignment is given, with the NOV65a and b proteins being shown on lines 1 and 2 in 
a Clustal W analysis comparing the protein of the invention with the related protein sequences 
15 shown in Table 65F. 



Table 65G. ClustalW Alignment of NOV65 
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NOV65a 


1 


NOV65b 


1 


gi 


14602535| 


1 


gi 


17454087 | 


1 


gi 


7705959 | 


1 


gi 


7305011 | 


1 


gi 


12839241) 


1 



NOV65a 


13 


NOV65b 


13 


gi 


14602535 | 


12 


gi 


17454087] 


61 


gi 


7705959 | 


12 


gi 


7305011 j 


12 


gi 


1283924l| 


12 



NOV65a 

NOV65b 

gi | 14602535 

gi | 17454087 

gi | 7705959 | 

gi | 7305011 1 

gi ) 12839241 



NOV65a 


13 


NOV65b 


13 


gi 


14602535) 


16 


gi 


17454087) 


121 


gi 


7705959) 


16 


gi 


7305011) 


16 


gi 


12839241) 


16 



NOV65a 


118 


NOV65b 


118 


gi| 14602535) 


125 


gi j 17454087 | 


239 


gi (7705959) 


125 


gij 7305011) 


125 


gi 1 12839241 1 


125 



NOV65a 


178 RE 


NOV65b 


178 RE 


gi 


14602535 | 


185 pV 


gi 


17454087 | 


299 pE 


gi 


7705959] 


185 &V 


gi 


7305011 j 


185 pV 


gi 


12839241| 


185 pV 



(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 



228) 
230) 
624) 
625) 
626) 
627) 
628) 



10 



20 



30 



40 



-MEPKRVR EKMDDT 13 

-MDTKRVjfjE fcKMDDT 13 

MW^r'-#St?? l^linl 



MLEVDPVTVPVllPJ?^ 




70 



80 



90 



100 

|....|....|. 



110 



120 



llgl: 




- 13 

_ _ 13 

- 16 

KWPKPVGFSQIKEKVGTKGVKSKESCRKERKSLGSKMTSGEVKTSLKNAYS 12 0 
16 

16 

16 



130 



140 



NOV65a 


59 






NOV65b 


59 






gi| 14602535) 


65 






gij 17454087) 


179 






gi|7705959| 


65 






gij 7305011 j 


65 




lie 


gi ) 12839241) 


65 









320 



330 



340 



350 



360 
....] 
K 189 

TCREGGSAEAEEGEGTGSMLGARVHALTVEEEDDSASHLSGSSLGK 237. 

- -- 198 

jSQRQDGEN GS 321 

198 

198 

198 
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NOV65a 


200 


NOV65b 


298 


gi 


14602535) 


198 


gi 


17454087 | 


337 


gi 


7705959) 


198 


gi 


7305011) 


198 


gi 


12839241 | 


198 



370 380 390 400 410 420 

NOV65a 190 RTMAGR GDQLR 200 

NOV65b 238 ATQ AS KPLTL I DE E EEE KL YE QWKKGOGLLSDKVPQDGGGWRS AS SGQGGEEL ED E OVER 297 

gi 1 14602535 | 198 198 

gi j 17454087 j 322 RHEAGSDSF QGQESQT 337 

gi|7705959| 198 - --- 199 

gi j 73 05011 j 198 - i 98 

gi 1 12839241 1 198 - - 198 



NOV65a 
NOV65b 



14602535) 

17454087) 

7705959| 

7305011) 

12839241) 



NOV65a 
NOV65b 



14602535) 
17454087 j 
7705959) 
7305011) 
12839241) 



NOV65a 

NOV65b 

gi 1 14602535 I 

gi I 17454 087 j 

gi (7705959 I 

gi j 7305011) 

gi j 12839241 I 



NOV65a 

NOV65b 

gi 1 14602535 I 

gi j 17454087 j 

gi j 7705959 I 

gi j 7305011 j 

gi 112839241 1 



I 



430 



440 



450 



470 



480 



460 

I I 

LRRARALGACSGPECTP 217 

IIQEWQSRNERYQAEGYRRWGREEEKEEBSDAGSSVGIiRRRTLSESSAWESVSSHDIWVL 357 
■_ - 198 

RQKPKRVAAVG DA GRQGPGMEMAWRNQNVI KAF 370 

198 

" " 198 

" - - 198 



490 



500 



510 



520 



530 



540 



217 " 217 

358 KQQLELNRPDHGRRRRADSl^SESTWDAWNERLLEIEKEASRRYHAKSKREEAADRSSEA 417 

198 . 198 

370 - 370 

198 19Q 

198 -- - i9 8 

198 19Q 
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560 



570 



580 



590 ■ 



600 

-...|.-..|..-.|....|....|..--|...-|.-..|....|...-|....|-...| 
217 - - - 217 

418 GSRVREDDEDSVGSEASSFYNFCSRNKDKLTALERWKIKRIQFGFHKKDLGAGDSSGEPG 477 

198 — 198 

370 " 370 

198 " 198 

* 98 - -- 198 

198 " 198 

610 620 630 640 650 660 

— I — )....).... I — I — 1 — )....).... 1 — 1 — 1 

217 - 217 

478 AEEAVGEKNPSDVSLTAYQAWKLIO10KKVGSENKEEVVELSKGEDSALAKKRQRRLELLE 537 

198 198 

37 ° — " 370 

198 " 198 

198 " 198 

198 198 



670 



680 



690 



NOV65a 


217 


NOV65b 


538 


gi 


14602535) 


198 


gi 


17454 087) 


370 


gi 


7705959) 


198 


gi 


7305011) 


198 


gi 


12839241) 


198 



700 
|....|.. 



710 



720 



217 
597 
198 
370 
198 
198 
198 



730 



740 



770 



780 



750 760 
....|..-.|....|....|....|....|....|....|..... 

217 - - — 217 

598 ASNIAGCSTSl^TPLPNLPVGPGDTISIASIQNWIANVVSETLAQKQNEMIiIiLSRSPSV 657 

188 " " - - 198 

370 - 370 

198 7 — - --- 198 

198 198 

198 - - 198 
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NOV65a 
NOV65b 

gi 
gi 
gi 
gi 
gi 



14602535| 
17454087| 
7705959) 
7305011) 
12839241 | 



217 
658 
198 
370 
198 
198 
198 



I 



790 
..|.. 



800 



810 
..|.. 



820 
..|.. 



830 



840 



|....| 



217 

ASMKAVPAASCLGDDQVSMLSGHSSSSLGGCLLPQSQARPSSDMQSVLSCNTTLSSPAES 717 
198 

l -- 370 

: - - 198 

- 198 

- - - 198 



850 



860 



870 



880 



890 



900 



718 CRSKVRGTSKPIFSLFADNVDLKELGRKEKEMQMELREKMSEYKMEKLASDNKRSSLFKK 777 
14602535 | 198 198 



NOV65a 


217 


NOV65b 


718 


gi 


14602535 | 


198 


gi 


17454087) 


370 


gi 


7705959| 


198 


gi 


7305011) 


198 


gi 


12839241 | 


198 



217 



370 
198 
198 



12839241 1 198 - - - . i 98 



NOV65a 
NOV65b 

gi 
gi 
gi 



14602535 | 
17454087 | 
7705959 | 
73050111 



910 



920 
..|.. 



NOV65a 


217 


NOV65b 


778 


gi 


14602535 


198 


gi 


17454087 


370 


gi 


7705959| 


198 


gi 


730501l| 


198 


gi 


12839241) 198 



930 
..|.. 



940 



950 



960 



217 

KKVKEDEDDGVGDGDEIOT)SAIGSFRYSSRSNSQKPETDTCSSIiAVCDHYASGSRVGKEM 83 7 

- 198 

" 370 

198 

198 

_ 198 



I 



970 



NOV65a 


217 


NOV65b 


838 


gi 


14602535) 


198 


gi 


17454087) 


370 


gi 


7705959) 


198 


gi 


7305011) 


198 


gi 


12839241) 


198 



I 



980 



990 



1000 



1010 



1020 



217 

DSSINKWIiSGLRTEEKPPFQSDWSGSSRGKYTRSSLLRETESKSSS YKFSKSQSEEQDTS 897 
198 

370 

198 

198 

198 



1030 



1040 



1050 



1060 



NOV65a 


217 


NOV65b 


898 


gi 


14602535) 


198 


gi 


17454087] 


370 


gi 


7705959) 


198 


gi 


7305011) 


198 


gi 


12839241) 


198 



1070 



1080 



" - 217 

SYHEANGNSVRSTSRFSSSSTREGREMHKFSRSTYNETSSSREESPEPYFFRRTPESSER 957 
_ -- 198 

- ~ 370 

- 198 

- 198 

198 



1090 



NOV65a 


217 


NOV65b 


958 


gi 


14602535) 


198 


gi 


17454087) 


370 


gi 


7705959) 


198 


gi 


7305011) 


198 


gi 


12839241) 


198 



1100 



1110 



1120 



■|....|....|....|. 



I 



1130 



1140 



)....) 



217 

EESPEPQRPNWARSRDWEDVEESSKSDFSEFGAKRKFTQSFMRSEEEGEKERTENREEGR 1017 

198 

370 

198 

__ - i9 8 

" 198 



1150 



1160 



217 



I I 



1170 



1180 



| |....| | | 



I 



217 



1018 FASGRRSQYRRSTDREEEEEMDDEAIIAAWRRRQEETRTKLQKRRED 1064 

"8 -- 198 

370 370 

" " 198 

1** -~ - 198 
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gi 1 1283924l| 198 198 



Table 65H lists the domain description from DOMAIN analysis results against 
NOV65. This indicates that the NOV65 sequence has properties similar to those of other 
proteins known to contain this domain. 

5 

Table 65H. Domain Analysis of NOV65 

gnl] Pf am | pf amO 07 82 , VSPc, Dual specificity phosphatase, catalytic domain. 

Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly 

similar to that of tyrosine- specif ic phosphatases, except for a "recognition" 

region. SEQ ID NO: 864 

CD-Length « 139 residues, 97.8% aligned 
Score = 113 bits (282), Expect = le-26 

NOV65 : 42 EVWPNVFI AEKS VAVNKGRLKRLGITHI LNAAHGTGVYTGPE FYTGLE IQYLGVEVDDFP 101 

E+ 'P++++ AN L +LGITH++N F YL + VDD 

Sbj ct : 4 EILPHLYLGSYPTASNIAFLSKLGITHVINVTEEVPNSKNSGF LYLHI PVDDNH 57 

NOV65: 102 EVDISQHFRKASEFLDEALLTYRGKVLVSSEMGISRSAVLWAYLMIFHNMAILEALMW 161 

E DIS + +A EF+++A GKVLV + GISRSA L++AYLM N+++ EA V 

Sbjct: 58 ETDI SPYLDEAVEFIEDAR - QKGGKVLVHCQAGI SRSATLI IAYLMKTRNLSLNEAYSFV 116 

NOV65: 162 RKKR-AIYPNEGFLKQLRELNEK 183 

+++R I PN GF +QL E K 
Sbjct: 117 KERRPI 1 5 PNFGFKRQLIEYERK 139 

The NOV65 gene of invention is a member of the family of dual specificity protein 
phosphatases (DSPs; Martell et al., Mol Cells 1998 Feb 28;8(1):2-1 1). DSPs recognize either 
serine/threonine (Ser/Thr) or tyrosine (Tyr) moieties as targets for dephosphorylation. These 

10 enzymes regulate mitogenic signal transduction and can thereby regulate the cell cycle. Some 
members of this family are effective tumor suppressors, for example, PTEN. PTEN is required 
during embryonic development and later in life, and mutations in this gene give rise to 
different kinds of inherited and sporadic cancers (Eng, Recent Prog Horm Res 1999;54:441- 
52; discussion 453). In Drosophila, members of the DSP family, such as puckered, have 

15 important roles in development (Martin-Blanco et al., Genes Dev 1998 Feb 15;12(4):557-70). 
The crystal structure of one member of the DSP family has been elucidated (Yuvaniyama at 
al., Science 1996 May 3 1;272(5266): 1328-31) and this family has been successfully targeted 
for small molecule drug development (Ducruet et al., Bioorg Med Chem 2000 Jun;8(6):1451- 
66). In addition, overexpression of a DSP has been demonstrated to be a potential therapy for 

20 cardiac hypertrophy (Bueno et al., Circ Res 2001 Jan 19;88(l):88-96). NOV65 has closest 

homology to a phosphatase that is differentially regulated in the testis during spermatogenesis 



550 



WO 02/068649 PCT7US02/02785 

and is therefore thought to be involved in sperm development and maturation (Nakamura et 
al, Biochem. J. 344 Pt 3, 819-825 (1999)). . 

The disclosed NOV65 is predicted to be expressed in at least the following tissues: 
heart, skeletal muscle, colon, fetal lung, head, and ovary. This information was derived by 
5 determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, public EST sources, literature sources, and/or RACE 
sources. Further expression data for NOV65 is provided in Example 2. 

The NOV65 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

10 cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary 
stenosis, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, fertility, polycystic ovarian syndrome, cancer, tissue 
degeneration, bacterial/viral/parasitic infection, systemic lupus erythematosus, autoimmune 

15 disease, asthma, emphysema, scleroderma, allergy, ARDS, muscular dystrophy, Lesch-Nyhan 
syndrome, myasthenia gravis, Hirschsprung's disease, Crohn's Disease, appendicitis as well as 
other diseases, disorders and conditions. The NOV65 nucleic acid encoding the phosphatase- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 

20 assessed. The novel nucleic acid of the invention encoding a protein phosphatase-like protein 
includes the nucleic acid whose sequence is provided in Table 65A or 65C, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 65A or 65C while still encoding a 
protein that maintains its protein phosphatase-like activities and physiological functions, or a 

25 fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence indicated in Table 65A or 65C, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 

30 chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
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binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 39% of the bases may be so changed. 

The novel protein of the invention includes the protein phosphatase-like protein whose 
sequence is provided in Table 65B or 65D. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
65B or 65D while still encoding a protein that maintains its protein phosphatase-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 54% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



15 



20 



NOV66 

The disclosed NOV66 (alternatively referred to herein as CG56633-01) includes the 
1036 nucleotide sequence (SEQ ID NO:23 1) shown in Table 66A. A NOV66 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 28-30 and ends with a stop codon 
at nucleotides 913-91 5 . The disclosed NOV66 maps to human chromosome 3 . 



Table 66 A. NOV66 Nucleotide Sequence (SEQ ID 
NO:231) 



CCCGTCCTCGCTGGGTTGTCCCGGTCCATGTATCTGGTCATGGTGCTGAGGAACCTGCTC 

ATCATCCTGGCTGTCAGCTCTGACTCCCACCTCCACACCCCCA 

AACCTGTGCTGGGCTGACATCGGTTTCACCT 

ATGCAGTCTCATAGCAGAGTCATCTCTTATGCGGGCT 

GTCCTTTTTGCATGTATAGAAGACATGCTC 

GCCATCTGTCGCCCTCTGCACTACCCAGTCATC 

GT1UTGGTGTCCTTTTTCCITAGCCTGTTGGATTCCCAGCTGCACAGTTTGATTGTGTTA 
CAATTCACCTTCTTCAAGAATGTGGAAATC 

CTCAACCTTGCCTGTTCTGACAGCGTCATCAATAGCATATTCTTATATTTCGATAGTACT 
ATGTTTGGTTTTCTTCCC^TTTCAAGGATCCT 

ATTCTAAGGATTTCATCGTCAGATGGGAAGTATAAAGCCTTCTCCACCTGTGGCTCTCAC 

CTAGCAGTTGTTTGCTTATTTTATGGAACAGGCATTGGCGTGTACCTGACTTCAGCTGTG 

TCACC^CCCCCCAGGAGTGGTGTGGTGGCGTCAGTGATGTACGCTGTGGTCACCCC(^TO 

CTGAACCCTTTCATCTATAGCCTGAG 

CGCAGCAGAACAGTCGAATCTCATGATCTC 

GGGCAACCACATTAAATCCCTGCATCTGCAAATCCTGCT 

GGCTTGATGGCTTTTATTCCTTTCCGCATTTC 

CCTTTAACTGGAATGG 



A NOV66 polypeptide (SEQ ID NO:232) encoded by SEQ ID NO:231 is 295 amino 
acids in length and is presented using the one-letter amino acid code in Table 66B. The Psort 
profile for NOV66 predicts that this sequence is a Type Ilia membrane protein, has a signal 
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peptide and is likely to be localized at the plasma membrane with a certainty of 0.6400. In 
alternative embodiments, a NOV66 polypeptide is located to Golgi bodies with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 03700. The Signal P 
predicts a likely cleavage site for a NOV66 peptide is between positions 17 and 18, i.e., at the 
5 dash in the sequence AVS-SD. 



Table 66B. NOV66 Polypeptide Sequence (SEQ ID 
NO:232) 

MYLVMVLRNLLIILAVSSDSHLHTPMYFFLSN^ 

YAGCLTRMSFLVLFACIEDMLLTAMAYDCFVAICRPLHYPVIVNPHLSVFLVLVSFFLSL 
LDS QLHS LIVLQFTFFKNVE I SNFVCEPS QLLNLACSDS VINS I FLYFDSTMFGFLPI SR 
ILLSYYKiyPSILRISSSDGKYKAFSTCGSHLAVVCLFYGTGIGVYLTSAVSPPPRSGVV _ _ . 

ASVOTAVVTPMLNPFIYSLRNRDIQSALWRLR5RTVESHDLFHPFSCVGKKGQPH 

A BLAST analysis of NOV66 was run against the proprietary PatP GENESEQ Protein 
• Patent database. It was found, for example, that the amino acid sequence of NOV66 had high 
1 0 homology to other proteins as shown in Table 66C. 



Table 66C BLASTX results from PatP database for NOV66 






Smallest 






Sum 


Sequences producing High-scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAG7l875 Human olfactory receptor polypeptide 


1466 


5.6e-150 


patp:AAE04583 Human G-protein coupled receptor-39 


1377 


1.5e-140 


patp:AAU24551 Human olfactory receptor AOLFR38 


1377 


1.5e-140 


patp:AAG71816 Human olfactory receptor polypeptide 


1363 


4.6e-139 


patp:AAU24549 Human olfactory receptor AOLFR36 


1354 


4.1e-138 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 973 of 1036 bases (93%) identical to a gb:GENBANK- 

1 5 ID:AF042089|acc:AF042089.1 mRNA from Homo sapiens (chromosome 3, olfactory receptor 
pseudogene cluster 1, complete sequence, and myosin light chain kinase (MLCK) pseudogene, 
partial sequence). The full amino acid sequence of the protein of the invention was found to 
have 192 of 265 amino acid residues (72%) identical to, and 221 of 265 amino acid residues 
(83%) similar to, the 264 amino acid residue ptnr:SPTREMBL-ACC:043789 protein from 

20 Homo sapiens (Human) (OLFACTORY RECEPTOR). NOV66 also has homology to the 
other proteins shown in the BLASTP data in Table 66D. 
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Table 66D. NOV66 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi| 17466082 |r 
ef |XP_070192. 
l| (XM_070192) 


similar to ol factory 
receptor, family 7, 
subfamily C, member 3 
(H. sapiens) 
[Homo sapiens] 


349 


221/250 
(88) 


229/250 
(91) 


e-lll 


gi|l7482057|r 
ef |XP_064778. 
l| <XM_064778) 


similar to G protein- 
coupled receptor homolog 
clone G3 (H. sapiens) 
[Homo sapiens) 


251 


220/250 
(88) 


229/250 
(91) 


e-105 1 


gi 17448458 |r 
ef XP_070402. 
l| (XM_070402) 


similar to OLFACTORY 
RECEPTOR 7C2 (OLFACTORY 
RECEPTOR 19-18) (OR19- 
18) (H. sapiens) [Homo 
sapiens] 


528 


221/242 
(87) 


221/242 
(91) 


e-10l 


gi| 7443955 |pi 
r | |PC4369 


olfactory receptor, HT2 . 
- human (fragment) 


264 


192/265 
(72) 


221/265 
(82) 


6e-92 


gi| 4092819 |gb 

|AAD03353.l| 

(AC006271) 


BC319430_5 
[Homo sapiens] 


263 


191/264 
(72) 


220/264 
(82) 


44-91 



This BLASTP data is displayed graphically in the ClustalW in Table 66E. A multiple 
sequence alignment is given, with the NOV66 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 66D. 



Table 66K ClustalW Alignment of NOV66 



NOV66 



gi 
gi 
gi 
gi 
gi 



17466082 
17482057 
17448458 
7443955 | 
4092819) 



NOV66 



(SEQ ID NO: 232) 
(SEQ ID NO: 629) 
(SEQ ID NO:630) 
(SEQ ID NO: 631) 
(SEQ ID NO:632) 
(SEQ ID N0:633) 

10 



20 



30 



40 



50 



60 



gi 


17466082 




gi 


17482057 




gi 


17448458 




gi 


7443955) 




gi 


4092819) 




NOV66 


1 


gi 


17466082 


12 


gi 


17482057 


' 1 


gi 


17448458 


61 


gi 


7443955) 


1 


gi 


4092819) 


1 



~ - - - MVSDIRKEDGMN- 12 

± 

MYLVTVLRNVLIILAVSSDSHLHTPMYFFLSSLCWADIGFTS 60 

- 1 

1 



70 



I.... I. 



80 
■ |.. 



90 



|....|, 



-VLPLKYIPNVG — 



100 
-VN--- 



110 



120 



|....| 



1 

27 
1 

12C 

1 

1 



130 



140 



150 



160 



170 



180 
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NOV66 


1 


Si 


174 66082 


27 


gi 


17482057 


1 1 


gi 


174484581 121 


gi 


7443955 | 


1 


gi 


4092819 | 


1 


NOV66 


1 


gi 


17466082 


39 


gi 


.17482057 


1 


gi 


17448458 


181 


gi 


7443955) 


1 


gi 


4092819) 


1 


NOV 6 6 


1 


gi 


17466082 


■ 77 


gi 


17482057 


1 


gi 


17448458 


241 


gi 


7443955) 


1 


gi 


4092819) 


1 



.).... I 



,|....|....|. 



-FAGVYL- 



-ASETLP- 39 

1 

LSNLSLPD 18 C 

1 

1 



190 200 210 

.. |....|.. ..)....).. ..|....|, 



220 



230 

|....|.:..|. 



240 



- GSFAHPEATSRGAVATGTTHLASAVEPN- 



-GDSWCK- 



-QRSP-- 77 
1 



250 
|....|.. 



260 



270 



I 



280 



290 



RVS - VQDPQLQPALALLSLSLSj 

241 I FGFLPI SGTLFSYYKI VSS ILRVSS SEDPELQSVLALLSLSLS 




LI 


I LAV 


16 


FS 


I LAV 


114 


LS 


I LAV 


16 


LS 


ILAV 


300 






1 
1 



310 



NOV66 


17 


SSD 


gi 


17466082) 


115 


SSD 


gi 


17482057 j 


17 


SSD 


gi 


17448458) 


301 


SSD 


gi 


7443955 | 


1 




gi 


4092819) 


1 





NOV66 


77 


gi 


17466082 


175 


gi 


17482057 


77 


gi 


17448458 


3 61 


gi 


7443955 | 


53 


gi 


4092819 j 


52 


NOV66 


137 


gi 


17466082 


235 


gi 


17482057 


137 


gi 


17448458 


421 


gi 


7443955) 


113 


gi 


4092819) 


112 


NOV66 


197 


gi 


17466082 


295 


gi 


17482057 


197 


gi 


17448458 


481 


gi 


7443955) 


173 


gi 


4092819) 


172 


NOV66 


257 


gi 


17466082 


349 


gi 


17482057 


251 


gi 


17448458 


528 


gi 


7443955 | 


233 




550 



560 



.|....|. 



570 
..|.. 



349 
251 
528 
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gi | 4092819 | 232 YSLRNKDIQSALCRLHGRI IKSHHLHPFCYMG 263 



Table 66F lists the domain description from DOMAIN analysis results against NOV66. 
This indicates that the NOV66 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 

Table 66R Domain Analysis of NOV66 

gnl 1 Pfam|pf amOOOOl , 7tm_jL, 7 transmembrane receptor 
(rhodopsin family) . SEQ ID NO: 810 

CD-Length » 254 residues, 99.6% aligned 
Score « 88.6 bits (218), Expect « 5e-19 



N0V66: 


9 


NLL 1 1 LAVS SDSHLHT PMYFFLSNLC WADI G FTSAMVP KM I VDMQSHS RVI S YAGCLTRM 


68 






NLL+IL + L TP FL NL AD+ F +p + + V AC 




Sbjct: 


2 


NLLVIL VILRTKKLRTPTNI FLLNLAVADLL FLLTLPP WAL YYLVGGD WVFGDALCKLVG 


61 


N0V66: 


69 


S FLVLFAC I EDMLL TAMAYDCFVAI CR PLHY P VI VNPHLSVFL VL VSFFLSLLDSQUESIj 


128 






+ V+ +LLTA++ D ++AI PL Y I P + L+L+ + L+LL S L 




Sbjct: 


62 


ALFWNGYAS ILLLTAI S IDRYLAI VHPLRYRRI RTPRRAKVL ILLVWVLALLLSLPPLL 


121 


N0V66: 


129 


I VLQFTFFKNVEI SNFVCEPSQLLNLACSDSVINSI FLYFDSTMFGFLPI SRILLSYYKI 


188 






T + +P++ ++++ LP+ IL+ Y +1 




Sbjct: 


122 


FSWLRTVEEGNTTVCLIDFPEESV KRSYVLLSTLVGFVLPLLVILVCYTRI 


172 


N0V66 : 


189 


VP SILRISSSDGKYKAFSTCGSHLAWCLFYGTGIGVYL TSAVSPPP 


235 






+ L+SS + A +V+I + L ++ 




Sbjct: 


173 


LRTLRKRARSQRSLKRRSSSERKAAKMLLVVVWFVLCWLPYHIVLLLDSLCLLSIWRVL 


232 


N0V66 : 


236 


RSGWASVMYAWTPMLNPFIY 257 








+ ++ ++ a V LNP IY 




Sb j Ct : 


233 


PTALLITLWLAYVNSCLNPIIY 254 





The olfactory system is able to distinguish several thousand odorant molecules. 
Olfactory receptors are believed to be encoded by an extremely large subfamily of G protein- 
coupled receptors. These receptors share a 7-transmembrane domain structure with many 

1 0 neurotransmitter and hormone receptors. They are responsible for the recognition and G 
protein-mediated transduction of odorant signals. The genes encoding these receptors are 
devoid of introns within their coding regions. Schurmans et al. (1993) cloned a member of this 
family of genes, OLFR1, from a genomic library by cross-hybridization with a gene fragment 
obtained by PCR. By isotopic in situ hybridization, they mapped the gene to 17pl3-pl2 with a 

15 peak at band 17pl3. A minor peak was detected on chromosome 3, with a maximum in the 
region 3ql3-q21. After Mspl digestion, a RFLP was demonstrated. Using this in a study of 3 
CEPH pedigrees, they demonstrated linkage with D17S126 at 17pter-pl2; maximum lod = 3.6 
at theta = 0.0. Used as a probe on Southern blots under moderately stringent conditions, the 
cDNA hybridized to at least 3 closely related genes. Ben-Arie et al. (1994) cloned 16 human 
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OLFR genes, all from 17pl3.3. The intronless coding regions are mapped to a 350-kb 
contiguous cluster, with an average intergenic separation of 15 kb. The OLFR genes in the 
cluster belong to 4 different gene subfamilies, displaying as much sequence variability as any 
randomly selected group of OLFRs. This suggested that the cluster may be one of several 
5 copies of an ancestral OLFR gene repertoire whose existence may have predated the 
divergence of mammals. Localization to 17pl3.3 was performed by fluorescence in situ 
hybridization as well as by somatic cell hybrid mapping. 

The ability to distinguish different odors depends on a large number of different 
odorant receptors (ORs). Sullivan et al. (1996) noted that ORs are expressed by nasal olfactory 

10 sensory neurons; each neuron expresses only 1 allele of a single OR gene. In the nose, 

different sets of ORs are expressed in distinct spatial zones. Neurons that express the same OR 
gene are located in the same zone; however, in that zone they are randomly interspersed with 
neurons expressing other ORs. This distribution suggested to the authors that, when the cell 
chooses an OR gene for expression, it may be restricted to a specific zonal gene set, but it may 

1 5 select from that set by a stochastic mechanism. Proposed models of OR gene choice fall into 2 
classes: locus-dependent and locus-independent. Locus-dependent models posit that OR genes 
are clustered in the genome, perhaps with members of different zonal gene sets clustered at 
distinct loci. In contrast, locus-independent models do not require that OR genes be clustered. 
To assess the feasibility of these models, Sullivan et al. (1996) determined the expression 

20 zones, sequences, and chromosomal locations of a number of mouse OR genes. They mapped 
OR genes to 1 1 different regions on 7 chromosomes. These loci lie within paralogous 
chromosomal regions that appear to have arisen by duplications of large chromosomal 
domains followed by extensive gene duplication and divergence. These studies showed that 
OR genes expressed in the same zone map to numerous loci. Moreover, a single locus can 

25 contain genes expressed in different zones. These findings raised the possibility that OR gene 
choice is locus-independent or involved consecutive stochastic choices. 

Nekrasova et al. (1996) overexpressed human (OR17-4) and rat (olp4) olfactory 
receptor genes in insect cells, purified them, and characterized them biochemically. They 
identified monomelic, dimeric, and trimeric forms of the proteins corresponding to molecular 

30 weights of 32, 69, and 94 kD by electrophoresis. The oligomers were resistant to reduction and 
alkylation and were therefore thought to be held together by SDS-resistant hydrophobic 
interactions, consistent with observations of other G protein-coupled receptors. 

Glusman et al (1996) described the results of complete sequencing of an OR-rich 
cosmid spanning the center of the OR gene cluster on 17pl3.3. The resulting 40-kb sequence 
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revealed 3 known OR coding regions, 2 OR genes which may have originated from a tandem 
duplication event, and a new OR pseudogene fused to another OR gene. 

Issel-Tarver and Rine (1 996) characterized 4 members of the canine olfactory receptor 
gene family. The 4 subfamilies comprised genes expressed exclusively in olfactory epithelium. 
5 Analysis of large DNA fragments using Southern blots of pulsed field gels indicated that 

subfamily members were clustered together, and that 2 of the subfamilies were closely linked 
in the dog genome. Analysis of the 4 olfactory receptor gene subfamilies in 26 breeds of dog 
provided evidence that the number of genes per subfamily was stable in spite of differential 
selection on the basis of olfactory acuity in scent hounds, sight hounds, and toy breeds. 

10 Issel-Tarver and Rine (1997) performed a comparative study of 4 subfamilies of 

olfactory receptor genes first identified in the dog to assess changes in the gene family during 
mammalian evolution, and to begin linking the dog genetic map to that of humans. These 4 
families were designated by them OLF1, OLF2, OLF3, and OLF4 in the canine genome. The 
subfamilies represented by these 4 genes range in size from 2 to 20 genes. They are all 

1 5 expressed in canine olfactory epithelium but were not detectably expressed in canine lung, 

liver, ovary, spleen, testis, or tongue. The OLF1 and OLF2 subfamilies are tightly linked in the 
dog genome and also in the human genome. The smallest family is represented by the canine 
OLF1 gene. Using dog gene probes individually to hybridize to Southern blots of genomic 
DNA from 24 somatic cell hybrid lines. They showed that the human homologous OLF1 

20 subfamily maps to human chromosome 1 1 . The human gene with the strongest similarity to 
the canine OLF2 gene also mapped to chromosome 11. Both members of the human subfamily 
that hybridized to canine OLF3 were located on chromosome 7. It was difficult to determine to 
which chromosome or chromosomes the human genes that hybridized to the canine OLF4 
probe mapped. This subfamily is large in mouse and hamster as well as human, so the rodent 

25 background largely obscured the human cross-hybridizing bands. It was possible, however, to 
discern some human-specific bands in blots corresponding to human chromosome 19. They 
refined the mapping of the human OLF1 homolog by hybridization to YACs that map to 
1 Iql 1. In dogs, the OLF1 and OLF2 subfamilies are within 45 kb of one another (Issel-Tarver 
and Rine (1 996)). Issel-Tarver and Rine (1997) demonstrated that in the human OLF1 and 

30 OLF2 homologs are likewise closely linked. By studying YACs, Issel-Tarver and Rine (1997) 
found that the human OLF3 homolog maps to 7q35. A chromosome 19-specific cosmid library 
was screened by hybridization with the canine OLF4 gene probe, and clones that hybridized 
strongly to the probe even at high stringency were localized to 19pl3.1 and 19pl3.2. These 
clones accounted, however, for a small fraction of the homologous human bands. 
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Rouquier et al. (1998) demonstrated that members of the olfactory receptor gene 
family are distributed on all but a few human chromosomes. Through fluorescence in situ 
hybridization analysis, they showed that OR sequences reside at more than 25 locations in the 
human genome. Their distribution was biased for terminal bands of chromosome arms. Flow- 
5 sorted chromosomes were used to isolate 87 OR sequences derived from 16 chromosomes. 
Their sequence relationships indicated the inter- and intrachromosomal duplications 
responsible for OR family expansion. Rouquier et al. (1998) determined that the human 
genome has accumulated a striking number of dysfunctional copies: 72% of these sequences 
were found to be pseudogenes. ORF-containing sequences predominate on chromosomes 7, 
10 16, and 17. 

Trask et al. (1998) characterized a subtelomeric DNA duplication that provided insight 
into the variability, complexity, and evolutionary history of that unusual region of the human 
genome, the telomere. Using a DNA segment cloned from chromosome 19, they demonstrated 
that the blocks of DNA sequence shared by different chromosomes can be very large and 

1 5 highly similar. Three chromosomes appeared to have contained the sequence before humans 
migrated around the world. In contrast to its multicopy distribution in humans, this 
subtelomeric block maps predominantly to a single locus in chimpanzee and gorilla, that site 
being nonorthologous to any of the locations in the human genome. Three new members of the 
olfactory receptor (OR) gene family were found to be duplicated within this large segment of 

20 DNA, which was found to be present at 3q, 15q, and 19p in each of 45 unrelated humans 

sampled from various populations. From its sequence, one of the OR genes in this duplicated 
block appeared to be potentially functional. The findings raised the possibility that functional 
diversity in the OR family is generated in part through duplications and interchromosomal 
rearrangements of the DNA near human telomeres. 

25 Mombaerts (1 999) reviewed the molecular biology of the odorant receptor (OR) genes 

in vertebrates. Buck and Axel (1991) discovered this large family of genes encoding putative 
odorant receptor genes. Zhao et al. (1998) provided functional proof that one OR gene encodes 
a receptor for odorants. The isolation of OR genes from the rat by Buck and Axel (1991) was 
based on 3 assumptions. First, ORs are likely G protein-coupled receptors, which 

30 characteristically are 7-transmembrane proteins. Second, ORs are likely members of a 

multigene family of considerable size, because an immense number of chemicals with vastly 
different structures can be detected and discriminated by the vertebrate olfactory system. 
Third, ORs are likely expressed selectively in olfactory sensory neurons. Ben-Arie et al. 
(1994) focused attention on a cluster of human OR genes on 17p, to which the first human OR 
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gene, OR1D2, had been mapped by Schurmans et al. (1993). According to Mombaerts (1999), 
the sequences of more than 150 human OR clones had been reported. The human OR genes 
differ markedly from their counterparts in other species by their high frequency of 
pseudogenes, except the testicular OR genes. Research showed that individual olfactory 
5 sensory neurons express a small subset of the OR repertoire. In rat and mouse, axons of 
neurons expressing the same OR converge onto defined glomeruli in the olfactory bulb. 

Gilad et al. (2000) reported the population sequence diversity of genomic segments 
within a 450-kb cluster of olfactory receptor (OR) genes on chromosome 17. They found a 
dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the 

10 one hand and the closely interspersed intact genes on the other. They suggested that weak 
positive selection is responsible for the observed patterns of genetic variation. This was 
inferred from a lower ratio of polymorphism to divergence in genes compared with 
pseudogenes or introns, high nonsynonymous substitution rates in OR genes, and a small but 
significant overall reduction in variability in the entire OR gene cluster compared with other 

1 5 genomic regions. The dichotomy among functionally distinct segments within a short genomic 
distance requires high recombination rates within this OR cluster. 

NOV66 is predicted to be expressed in at least the following tissues: lung, liver, ovary, 
spleen, testis. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 

20 public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV66 is provided in Example 2. 

The nucleic acids and proteins of NOV66 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 

25 septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary 

stenosis, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, fertility, polycystic ovarian syndrome, cancer, tissue 
degeneration, bacterial/viral/parasitic infection, systemic lupus erythematosus, autoimmune 
disease, asthma, emphysema, scleroderma, allergy, ARDS, muscular dystrophy, Lesch-Nyhan 

30 syndrome, myasthenia gravis, Hirschsprung's disease, Crohn's Disease, appendicitis as well as 
other diseases, disorders and conditions. The NOV66 nucleic acid encoding the GPCR-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. A 
NOV66 nucleic acid of the invention encoding a Olfactory receptor-like protein includes the 
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nucleic acid whose sequence is provided in Table 66A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 66A while still encoding a protein that maintains its 
Olfactory receptor -like activities and physiological functions, or a fragment of such a nucleic 
5 acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence disclosed in Table 66A, including nucleic acid fragments that are complementary 
to any of the nucleic acids just described. The invention additionally includes nucleic acids or 
nucleic acid fragments, or complements thereto, whose structures include chemical 

10 modifications. Such modifications include, by way of non-limiting example, modified bases, 
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. In the mutant or variant nucleic acids, and their 

1 5 complements, up to about 7% of the bases may be so changed. 

The novel protein of the invention includes the olfactory receptor-like protein whose 
sequence is provided in Table 66B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 66B while 
still encoding a protein that maintains its Olfactory receptor -like activities and physiological 

20 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 28% 
of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 

25 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV67 

The disclosed NOV67 (alternatively referred to herein as CG56571-01) includes the 
30 1 072 nucleotide sequence (SEQ ID NO:233) shown in Table 67A. A NOV67 ORF begins 

with a Kozak consensus ATG initiation codon at nucleotides 41-43 and ends with a stop codon 
at nucleotides 989-99 1 . The disclosed NOV67 maps to human chromosome 7. 
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Table 67A. NOV67 Nucleotide Sequence (SEQ ED 
NO:233) 



CATTCCCTCCTAACCACCTAGATTGAAGAAGTGAGGTTCAATGTCCCAACTGGGAAGGGA 
CAACATAACCTGGGTGAGTGAGTTCATCCTAATGGGTCTCTCCAGTGACAGGCAGACCCA 
GGCTGGACTCTTTATCTTATTTGGGGCTGCCTACCTGCTGACCCTGCTGGGCAATGGGCT 
CATCCTGCTCCTGATCTGGCTGGACGTGAGACTCCACCTGCCCATGTATTTCTTCCTCTG 
CAACCTCTCACTTGTGGACATCTG CTACACCT CCAGCAGGGTCCCTCAGATGCTGGTGCA 
CTGCACCAGCAA7VAGAAAGACCATCTCCTTTGCCCGATCTGGGACCCAGCTCTTTTTCTC 
CCTGGCCCTCGGAGGGACCGAGTTTTTGTTGCTGGCCGCAATGGCCTATGACCGCTACGT 
GGCTGTTTGCGACCCCCTGTGTTACATAGCAGTGATGAGCCC^ 

GGCAGCTGTCTCTTGGCTAGTGGGCCTGGCTAATTCTGCTATGGAGACGGCACTGACCAT 
G CAC CTGC CCAC CTGTGGG CACAACGTGCTGAACCATGTGG CCTGTGAGACACTGGCACT 
GGTCAGGTCGGCCTGCGTGGACATCACCTTCAATCAGGTGGTCATAGTGGCCTCCAGTGT 
GGTGGTGCTGCTGGTGCCCTGCTGCCTGGTCTCGCTGTCCTACACCCTCATTGTAGTTGC 
CGTCCTGCAGATCCACTCCACCCAGGGGCACCGCAAGGCCTOTGGGACCTGTGCCTCCCA 
CCT(^CTGTGGTCTCCATATCCTATGGGATGGCCCTCTTTACCTAaTGCAGCCTC^CTC 
CATGGCCTCAGCTGAGCAGGAAAAGGTGATGGTACTCTCTTATGCTGTGGTGACCCCCAT 
GTTGAATCCTTTCATCTACAGTCTGCGGAACAAGGATGTGAAGGCAGCTCTGAGTCGAGC 
TCTGATGAGGAGCTCTGAATTAAAACATTAGAGAGTGGTTTGAGTAACAAGAAGGCCTCA 
CTCTGAAAACAGTGGGCATTGGACTGTGCTCTCCAGTATAACGTGTGTACGC 
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A NOV67 polypeptide (SEQ ID NO:234) encoded by SEQ ID NO:233 is 312 amino 
acids in length and is presented using the one-letter amino acid code in Table 67B. The Psort 
profile for NOV67 predicts that this sequence is a Type IHb membrane protein, has a signal 
peptide, and is likely to be localized at the plasma membrane with a certainty of 0.6000. In 
alternative embodiments, a NOV67 polypeptide is located to Golgi bodies with a certainty of 
0.4000, to the endoplasmic reticulum (membrane) with a certainty of 0.3000, or to the 
mitochondrial membrane with a certainty of 0.3522. The Signal P predicts a likely cleavage 
site for a NOV67 peptide is between positions 58 and 59, i.e., at the dash in the sequence 
VRL-HL. 



Table 67B. NOV67 Polypeptide Sequence (SEQ ID 
NO:234) 



MS QLGRDNI TWVS EF ILMGLS SDRQTQAGLFILFGAAYLIiTLLGNGL I LLL I WLDVRLHL 
PMYFFLCNLSLVDICyTSSRVPQI^VHCTSKRKTISFARCGTQLFFSLALGGTEFLLIJ^ 
MAYDRYVAVCDPLCYIAVMSPRLCMALAAVSWLVGL 
ACETLALVRSACVDITFNQWIVASSVWLLVP 

FGTCASHLTWSISYGMALFTYMQPRSMASAEQEKVMVLSYAVVTPMLNPFIYSLRNKDV 
KAALSRALMRSSELKH 
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A BLAST analysis of NOV67 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found* for example, that the amino acid sequence of NOV67 had high 
homology to other proteins as shown in Table 67C. 



Table 67C BLASTX results from PatP database for NOV67 



Smallest 
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Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 




patp:AAG7l514 Human olfactory receptor polypeptide 


1576 


1.2e- 


161 


patp:AAG7233 0 Human OR- like polypeptide query sequence 


924 


1.5e- 


92 


patp:AAG72925 Human olfactory receptor data exploratorium 


924 


1.5e- 


•92 


patp:AAG72977 Olfactory receptor -like polypeptide 


924 


1.5e- 


•92 


patp:AAG71408 Human olfactory receptor polypeptide 


923 


1.9e- 


92 



In a search of sequence databases^ it was found, for example, that the nucleic acid 
sequence of this invention has 536 of 649 bases (82%) identical to a gb:GENBANK- 
ID:AF073974|acc:AF073974.1 mRNA from Mus musculus domesticus (Mus miisculus 

5 domesticus clone OR28M olfactory receptor gene). The full amino acid sequence of the 

protein of the invention was found to have 179 of 31 1 amino acid residues (57%) identical to, 
and 228 of 31 1 amino acid residues (73%) similar to, the 317 amino acid residue 
ptnr:SWISSNEW-ACC:Q13607 protein from Homo sapiens (Human) (OLFACTORY 
RECEPTOR 2F1 (OLFACTORY RECEPTOR-LIKE PROTEIN OLF3)). NOV67 also has 

10 homology to the other proteins shown in the BLASTP data in Table 67D. 



Table 67D. NOV67 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi| 9297120 |sp 
|Q13607|02F1_ 
HUMAN. 


OLFACTORY RECEPTOR 2F1 
(OLFACTORY RECEPTOR-LIKE 
PROTEIN OLF3) 


317 


179/311 
(57) 


228/311 
(72) 


7e-86 


gi| 6912558] re 
f|NP 036501.1 
| (NM_012369) 


olfactory receptor, 
family 2, subfamily F, 
member 1; olfactory 
receptor, family 2, 
sub family F, member 5; 
olfactory receptor, 
family 2, subfamily F, 
member 4 [Homo sapiens] 


317 


179/311 
(57) 


228/311 
(72) 


9e-B6 


gi|2495055|sp 
|Q95156|0LF3_ 
CANFA 


OLFACTORY RECEPTOR- LIKE 
PROTEIN OLF3 


317 


176/305 
(57) 


228/305 
(72) 


2e-85 


gi|l4423778|s 
p|O95006|O2F2 
_HUMAN 


OLFACTORY RECEPTOR 2F2 
(OLFACTORY RECEPTOR 7-1) 
(OR7-1) (PID:g249505l) 
[Homo sapiens] 


317 


175/308 
(56) 


220/308 
(70) 


7e-82 


gi|5453066|gb 
|AAD43423.l| 
(AF073974) 


olfactory receptor 
[Mus musculus] 


216 


167/216 
(77) 


189/216 
(87) 


3e-78 



This BLASTP data is displayed graphically in the ClustalW in Table 67E. A multiple 
sequence alignment is given, with the NOV67 protein being shown on line 1 in a ClustalW 
15 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 67D. 
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Table 67E. ClustalW Alignment of NOV67 



NOV67 



g± 
gi 
gi 
gi 
gi 



9297120 
6912558 
2495055 
14423778 
5453066 | 



NOV67 

gi | 9297120 | 
gi j 6912558 | 
gi | 2495055 j 
gi 1 14423778 | 
gi j 5453066 1 



NOV67 

gi | 9297120 | 
gi | 6912558 j 
gi | 2495055 j 
gi 1 14423778 | 
gi j 5453066 | 



MOV67 



9297120 
6912558 
2495055 
14423778 | 
5453066 [ 



(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 



NO-.234) 
NO:634) 
NO:635) 
NO:636) 
NO:637) 
NO:638) 
10 



NOV67 


61 


gi 


9297120) 


58 


gi 


6912558) 


58 


gi 


2495055 j 


58 


gi 


14423778) 


58 


gi 


5453066] 


1 



121 
118 
118 
118 
118 
51 



NOV67 


181 


gi 


9297120 


178 


gi 


6912558 


178 


gi 


2495055 


178 


gi 


14423778) 178 


gi 


5453066 


111 



241 
238 
238 
238 
238 
171 



NOV67 


301 


gi 


9297120 


298 


gi 


6912558 


298 


gi 


2495055 


298 


gi 


14423778) 298 


gi 


5453066 


216 
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Table 67F lists the domain description from DOMAIN analysis results against NOV67. 
This indicates that the NOV67 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 67F. Domain Analysis of NOV67 




gnl | Pfam|pf amOOOOl, 7ttn 1, 7 transmembrane receptor 




(rhodopsin family). SEQ ID NO: 810 






Score « 


CD-Length. = 254 residues, 93.3% aligned 
76.3 bits (186), Expect = 3e-15 




NOV67 : 
Sbjct: 


61 
18 


PMYFFLCNLSLVDI CYTSSRVPQMLVHCTSKRKTI SFARC6TQLFFSI1ALGGTEFLLLAA 
P FL NL++ D++ + PL+ AC + G LLL A 
PTNI FLLNIiAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLVGALFVVNGYAS ILLLTA 


120 
77 


NOV67 : 
Sbjct: 


121 
78 


MAYDR YVAVCDPLC Y I AVM S PRLCMAIJ^VS WLVGLANS AMET ALTMHL PTCGHNVLNHV 
++ DRY+A+ PL Y + +PR L + W++ L S + N + 
I S IDRYLAI VHPLRYRRIRT PRRAK\njILLVWVLALLLSLPPLLFS WLRTVEEGNTTVCL 


180 
137 


NOV67 : 
Sbjct: 


181 
138 


ACETLALVRSACVDITFNQ VVT\^^SVVVLLVPCCLVSLS)n^LIV^AVLQXliST^2SRKA 
V+ + V ++ ++ V+++ L +L L+ S+ + A 
IDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTLRKRARSQRSLKRRSSSERKAA 


240 
197 


N0V67 : 
Sbjct: 


241 
198 


FGTCASHLTWSISYG MALFTYMQPRSMASAEQEKVMVLSYAWTPMLNPFIY 293 

+ V + L + ++ L A V LNP IY 
KMl^VVVVVFVLCWLPYHIVLLLDSLC^ 254 



5 

The olfactory system is able to distinguish several thousand odorant molecules. 
Olfactory receptors are believed to be encoded by an extremely large subfamily of G protein- 
coupled receptors. These receptors share a 7-transmembrane domain structure with many 
neurotransmitter and hormone receptors. They are responsible for the recognition and G 

1 0 protein-mediated transduction of odorant signals. The genes encoding these receptors are 

devoid of introns within their coding regions. Schurmans et al. (1 993) cloned a member of this 
family of genes, OLFR1, from a genomic library by cross-hybridization with a gene fragment 
obtained by PCR. By isotopic in situ hybridization, they mapped the gene to 17pl3-pl2 with a 
peak at band 1 7pl 3. A minor peak was detected on chromosome 3, with a maximum in the 

15 region 3ql3-q21. After Mspl digestion, a RFLP was demonstrated. Using this in a study of 3 
CEPH pedigrees, they demonstrated linkage with D17S126 at 17pter-pl2; maximum lod = 3.6 
at theta = 0.0. Used as a probe on Southern blots under moderately stringent conditions, the 
cDNA hybridized to at least 3 closely related genes. Ben-Arie et al. (1994) cloned 16 human 
OLFR genes, all from 17pl3.3. The intronless coding regions are mapped to a 350-kb 

20 contiguous cluster, with an average intergenic separation of 1 5 kb. The OLFR genes in the 
cluster belong to 4 different gene subfamilies, displaying as much sequence variability as any 
randomly selected group of OLFRs. This suggested that the cluster may be one of several 
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copies of an ancestral OLFR gene repertoire whose existence may have predated the 
divergence of mammals. Localization to 17pl3.3 was performed by fluorescence in situ 
hybridization as well as by somatic cell hybrid mapping. 

The ability to distinguish different odors depends on a large number of different 
5 odorant receptors (ORs). Sullivan et ah (1996) noted that ORs are expressed by nasal olfactory 
sensory neurons; each neuron expresses only 1 allele of a single OR gene. In the nose, 
different sets of ORs are expressed in distinct spatial zones. Neurons that express the same OR 
gene are located in the same zone; however, in that zone they are randomly interspersed with 
neurons expressing other ORs. This distribution suggested to the authors that, when the cell 

10 chooses an OR gene for expression, it may be restricted to a specific zonal gene set, but it may 
select from that set by a stochastic mechanism. Proposed models of OR gene choice fall into 2 
classes: locus-dependent and locus-independent. Locus-dependent models posit that OR genes 
are clustered in the genome, perhaps with members of different zonal gene sets clustered at 
distinct loci. In contrast, locus-independent models do not require that OR genes be clustered. 

1 5 To assess the feasibility of these models, Sullivan et al. (1 996) determined the expression 

zones, sequences, and chromosomal locations of a number of mouse OR genes. They mapped 
OR genes to 1 1 different regions on 7 chromosomes. These loci lie within paralogous 
chromosomal regions that appear to have arisen by duplications of large chromosomal 
domains followed by extensive gene duplication and divergence. These studies showed that 

20 OR genes expressed in the same zone map to numerous loci. Moreover, a single locus can 

contain genes expressed in different zones. These findings raised the possibility that OR gene 
choice is locus-independent or involved consecutive stochastic choices. 

Nekrasova et al. (1996) overexpressed human (OR17-4) and rat (olp4) olfactory 
receptor genes in insect cells, purified them, and characterized them biochemically. They 

25 identified monomeric, dimeric, and trimeric forms of the proteins corresponding to molecular 
weights of 32, 69, and 94 kD by electrophoresis. The oligomers were resistant to reduction and 
alkylation and were therefore thought to be held together by SDS-resistant hydrophobic 
interactions, consistent with observations of other G protein-coupled receptors. 

Glusman et al. (1996) described the results of complete sequencing of an OR-rich 

30 cosmid spanning the center of the OR gene cluster on 17pl3.3. The resulting 40-kb sequence 
revealed 3 known OR coding regions, 2 OR genes which may have originated from a tandem 
duplication event, and a new OR pseudogene fused to another OR gene. 

Issel-Tarver and Rine (1996) characterized 4 members of the canine olfactory receptor 
gene family. The 4 subfamilies comprised genes expressed exclusively in olfactory epithelium. 
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Analysis of large DNA fragments using Southern blots of pulsed field gels indicated that 
subfamily members were clustered together, and that 2 of the subfamilies were closely linked 
in the dog genome. Analysis of the 4 olfactory receptor gene subfamilies in 26 breeds of dog 
provided evidence that the number of genes per subfamily was stable in spite of differential 
5 selection on the basis of olfactory acuity in scent hounds, sight hounds, and toy breeds. 

Issel-Tarver and Rine (1997) performed a comparative study of 4 subfamilies of 
olfactory receptor genes first identified in the dog to assess changes in the gene family during 
mammalian evolution, and to begin linking the dog genetic map to that of humans. These 4 
families were designated by them OLF1, OLF2, OLF3, and OLF4 in the canine genome. The 

10 subfamilies represented by these 4 genes range in size from 2 to 20 genes. They are all 

expressed in canine olfactory epithelium but were not detectably expressed in canine lung, 
liver, ovary, spleen, testis, or tongue. The OLF1 and OLF2 subfamilies are tightly linked in the 
dog genome and also in the human genome. The smallest family is represented by the canine 
OLF1 gene. Using dog gene probes individually to hybridize to Southern blots of genomic 

15 DNA from 24 somatic cell hybrid lines. They showed that the human homologous OLF1 
subfamily maps to human chromosome 1 1. The human gene with the strongest similarity to 
the canine OLF2 gene also mapped to chromosome 11. Both members of the human subfamily 
that hybridized to canine OLF3 were located on chromosome 7. It was difficult to determine to 
which chromosome or chromosomes the human genes that hybridized to the canine OLF4 

20 probe mapped. This subfamily is large in mouse and hamster as well as human, so the rodent 
background largely obscured the human cross-hybridizing bands. It was possible, however, to 
discern some human-specific bands in blots corresponding to human chromosome 19. They 
refined the mapping of the human OLF1 homolog by hybridization to YACs that map to 
1 lql 1. In dogs, the OLF1 and OLF2 subfamilies are within 45 kb of one another (Issel-Tarver 

25 and Rine (1996)). Issel-Tarver and Rine (1997) demonstrated that in the human OLF1 and 

OLF2 homologs are likewise closely linked. By studying YACs, Issel-Tarver and Rine (1997) 
found that the human OLF3 homolog maps to 7q35. A chromosome 19-specific cosmid library 
was screened by hybridization with the canine OLF4 gene probe, and clones that hybridized 
strongly to the probe even at high stringency were localized to 19pl3.1 and 19pl3.2. These 

30 clones accounted, however, for a small fraction of the homologous human bands. 

Rouquier et al. (1998) demonstrated that members of the olfactory receptor gene 
family are distributed on all but a few human chromosomes. Through fluorescence in situ 
hybridization analysis, they showed that OR sequences reside at more than 25 locations in the 
human genome. Their distribution was biased for terminal bands of chromosome arms. Flow- 
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sorted chromosomes were used to isolate 87 OR sequences derived from 16 chromosomes. 
Their sequence relationships indicated the inter- and intrachromosomal duplications 
responsible for OR family expansion. Rouquier et al. (1 998) determined that the human 
genome has accumulated a striking number of dysfunctional copies: 72% of these sequences 
5 were found to be pseudogenes. ORF-containing sequences predominate on chromosomes 7, 
16, and 17. 

Trask et al. (1998) characterized a subtelomeric DNA duplication that provided insight 
into the variability, complexity, and evolutionary history of that unusual region of the human 
genome, the telomere. Using a DNA segment cloned from chromosome 19, they demonstrated 

10 that the blocks of DNA sequence shared by different chromosomes can be very large and 
highly similar. Three chromosomes appeared to have contained the sequence before humans 
migrated around the world. In contrast to its multicopy distribution in humans, this 
subtelomeric block maps predominantly to a single locus in chimpanzee and gorilla, that site 
being nonorthologous to any of the locations in the human genome. Three new members of the 

1 5 olfactory receptor (OR) gene family were found to be duplicated within this large segment of 
DNA, which was found to be present at 3q, 15q, and 19p in each of 45 unrelated humans 
sampled from various populations. From its sequence, one of the OR genes in this duplicated 
block appeared to be potentially functional. The findings raised the possibility that functional 
diversity in the OR family is generated in part through duplications and interchromosomal 

20 rearrangements of the DNA near human telomeres. 

Mombaerts (1999) reviewed the molecular biology of the odorant receptor (OR) genes 
in vertebrates. Buck and Axel (1991) discovered this large family of genes encoding putative 
odorant receptor genes. Zhao et al. (1998) provided functional proof that one OR gene encodes 
a receptor for odorants. The isolation of OR genes from the rat by Buck and Axel (1991) was 

25 based on 3 assumptions. First, ORs are likely G protein-coupled receptors, which 

characteristically are 7-transmembrane proteins. Second, ORs are likely members of a 
multigene family of considerable size, because an immense number of chemicals with vastly 
different structures can be detected and discriminated by the vertebrate olfactory system. 
Third, ORs are likely expressed selectively in olfactory sensory neurons. Ben-Arie et al. 

30 (1994) focused attention on a cluster of human OR genes on 1 7p, to which the first human OR 
gene, OR1D2, had been mapped by Schurmans et al. (1993). According to Mombaerts (1999), 
the sequences of more than 150 human OR clones had been reported. The human OR genes 
differ markedly from their counterparts in other species by their high frequency of 
pseudogenes, except the testicular OR genes. Research showed that individual olfactory 
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sensory neurons express a small subset of the OR repertoire. In rat and mouse, axons of 
neurons expressing the same OR converge onto defined glomeruli in the olfactory bulb. 

Gilad et al. (2000) reported the population sequence diversity of genomic segments 
within a 450-kb cluster of olfactory receptor (OR) genes on chromosome 17. They found a 
5 dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the 
one hand and the closely interspersed intact genes on the other. They suggested that weak 
positive selection is responsible for the observed patterns of genetic variation. This was 
inferred from a lower ratio of polymorphism to divergence in genes compared with 
pseudogenes or introns, high nonsynonymous substitution rates in OR genes, and a small but 

10 significant overall reduction in variability in the entire OR gene cluster compared with other 
genomic regions. The dichotomy among functionally distinct segments within a short genomic 
distance requires high recombination rates within this OR cluster. 

NOV67 is predicted to be expressed in at least the following tissues: brain, testis, 
ovary, skeletal muscle, neuronal tissue. This information was derived by determining the 

1 5 tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV67 is provided in Example 2. 

The NOV67 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein. The 

20 NOV67 nucleic acid encoding the GPCR-like protein of the invention, or fragments thereof, 
may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
acid or the protein are to be assessed. The novel nucleic acid of the invention encoding an 
olfactory receptor-like protein OLF3-like protein includes the nucleic acid whose sequence is 
provided in Table 67A, or a fragment thereof. 

25 The invention also includes a mutant or variant nucleic acid any of whose bases may 

be changed from the corresponding base shown in Table 67A while still encoding a protein 
that maintains its olfactory receptor-like protein OLF3-like activities and physiological 
functions, or a fragment of such a nucleic acid. Hie invention further includes nucleic acids 
whose sequences are complementary to the sequence disclosed in Table 67A, including 

30 nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of non-limiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
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chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 18% of the bases may be so changed. 

The novel protein of the invention includes the olfactory receptor-like protein OLF3- 
like protein whose sequence is provided in Table 67B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 67B while still encoding a protein that maintains its olfactory receptor-like protein 
OLF3-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 43% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV68 

The disclosed NOV68 (alternatively referred to herein as CG56844-01) includes the 
2580 nucleotide sequence (SEQ ID NO:235) shown in Table 68A. A NOV68 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 21-23 and ends with a TAG 
codon at nucleotides 1 896-1898. The disclosed NOV68 maps to human chromosome 9. 



Table 68A. NOV68 Nucleotide Sequence (SEQ ID 
NO:235) 

CAGGCCCCCACGTGGACAGCATGGACCGCGGCACGCTCCCTCT<MCTGTTGCCCTGCTGC 

TGGCCAGCTGCAGCCTCAGCCCCACAAGTCTTGCAGAAACAGTC^ 

CTGTGGGCCCCGAGAGGGGCGAGGTGACATATACCACTAGCCAGGTCTCGAAGGGCTGCG 

TGGCTCAGGCCCCCAATGCCATCCTTGAAGTCCATGTCCTCTTCCTGGAGTTCCCAACGG 

GCCCGTCACAGCTGGAGCTGACTCTCCAGGCATCCAAGCAAAATGGCACCTGGCCCCGAG 

AGGTGCTTCTGGTCCTC7VGTGTAAACAGCAGTGTCTTCCTGCATCTCCAGGCCCTGGGAA 

TCCCACTCC^CTTGGCCTACAATTCCAGCCTGGTCA 

ACACCACAGAGCTGCCATCCTCCTCCTCCTCCTCACTGTCCTTCTGCATGCTGGAAGCCA 
GCCAGGACATGGGCCGCACGCTCGAGTGGCGGCCGCGTACTCCAGCCTTGGTCCGGGGCT 
GCCACTTGGAAGGCGTGGCCGGCCA(^GGAGG03CACATCCTGAGGGTCCTGCCGGGCC 
ACTCGGCCGGGCCCCGGACGGTGACGGTGAAGGTGGAACTGAGCTGCGCACCCGGGGATC 
TCGATGCCGTCCTCATCCTGCAGGGTCCCCCCTACGTGTCCTGGCTCATCGACGCCAACC 
ACAACATGCAGATCTGGACCACTGGAGAATACTCCTTCAAGATCTTTC(^GAGAAAAACA 
TTCGTGGCTTCAAGCTCCCAGACACACCTC^AGGCCTCCTGGGGGAGGCCCGGATGCTCA 
ATGCCAGCACTGTGGCATCCTTCGTGGAGCTACCGCTGGCCAGCATTGTCTCACTTCATG 
CCTCCAGCTGCGGTGGTAGGCTGCAGACCTC^CCCGCACCGATCCAGACCACTCCTCCCA 
AGGACACTTGTAGCCCGGAGCTGCTCATGTCCTTGATCCAGACAAAGTGTGCCGACGACG 
CCATGACCCTGGTACTAAAGAAAGAGCTTGTTGCGCATTTGAAGTGCACCATCACGGGCC 
TGACCTTCTGGGACCCC^CTGTGAGGCAGAGGACAGGGGTGAC^GTTTGTCTTGCG^ 
GTGCTTACTCCAGCTGTGGCATGCAGGTGTCAGCAAGTATGATCAGCAATGAGGCGGTGG 
TCAATATCCTGTCGAGCTCATCACCACAGCGGAAAAAGGTGCACTGCCTCAACATGGACA 
GCCTCTCTTTCCAGCTGGGCCTCTACCTCAGCCCACACTTCCTCCAGGCCTCCAACAC^ 
TCGAGCCGGGGCAGCAGAGCTTTGTGCAGGTCAGAGTGTCCCCATCCGTCTCCGAGTTCC 
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TGCTCCAGTTAGACAGCTGGCACCTGGACTTGGGGCCTGAGGGAGGCACCGTGGAACTCA 
TCCAGGGCCGGGCGGCCAAGGGCAACTGTGTGAGCCTGCTGTCCCCAAGCCCCGAGGGTG 
ACCCGCGCTTCAGCTTCCTCCTCCACTTCTACACAGTACCCATACCCAAAACCGGCACCC 
TCAGCTGCACGGTAGCCCTGCGTCCCAAGACCGGGTCTCAAGACCAGGAAGTCCATAGGA 
CTGTCTTCATGCGCTTGAACATCATCAGCCCTGACCTGTCTGGTTGCACAAGCAAAGGCC 
TCGTCCTGCCCGCCGTGCTGGGCATCACCTTTGGTGCCTTCCTCATCGGGGCCCTGCTCA 
CTGCTGCACTCTGGTACATCTACTCGCACACGCGTTCCCCCAGCAAGCGGGAGCCCGTGG 
TGGCGGTGGCTGCCCCGGCCTCCTCGGAGAGCAGCAGCACCAACCACAGCATCGGGAGCA 
CCCAGAGCACCCCCTGCTCCACCAGCAGCATGGCATAGCCCCGGCCCCCCGCGCTCGCCC 
AGCAGGAGAGACTGAGCAGCCGCCAGCTGGGAGCACTGGTGTGAACTCACCCTGGGAGCC 
AGTCCTCCACTCGACCCAGAATGGAGCCTGCTCTCCGCGCCTACCCTTCCCGCCTCCCTC 
TCAGAGGCCTGCTGCCAGTGCAGCCACTGGCTTGGAACACCTTGGGGTCCCTCCACCCCA 
CAGAACCTTCAACCCAGTGGGTCTGGGATATGGCTGCCCAGGAGACAGACCAGTTGCCAC 
GCTGTTGTAAAAACCCAAGTCCCTGTCATTTGAACCTGGATCCAGCACTGGTGAACTGAG 
CTGGGCAGGAAGGGAGAACTTGAAACAGATTCAGGCCAGCCCAGCCAGGCCAACAGCACC 
TCCCCGCTGGGAAGAGAAGAGGGCCCAGCCCAGAGCCACCTGGATCTATCCCTGCGGCCT 
CCACACCTGAACTTGCCTAACTAACTGGCAGGGGAGACAGGAGCCTAGCGGAGCCCAGCC 
TGGGAGCCCAGAGGGTGGCAAGAACAGTGGGCGTTGGGAGCCTAGCTCCTGCCACATGGA 
GCCCCCTCTGCCGGTCGGGCAGCCAGCAGAGGGGGAGTAGCCAAGCTGCTTGTCCTGGGC 
CTGCCCCTGTGTATTCACCACCAATAAATCAGACCATGAAACCAGTGAAAAAAAAAAAAA 



A NOV68 polypeptide (SEQ ID NO:236) encoded by SEQ ID NO:235 is 625 amino 
acids in length and is presented using the one-letter amino acid code in Table 68B. The Psort 
profile for NOV68 predicts that this sequence is a Type Ilia membrane protein, has a signal 
5 peptide, and is likely to be localized at the plasma membrane with a certainty of 0.6400. In 
alternative embodiments, a NOV68 polypeptide is located to Golgi bodies with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 03700. The Signal P 
predicts a likely cleavage site for a NOV68 peptide is between positions 25 and 26, Le. 9 at the 
dash in the sequence SLA-ET. 

io ; 

Table 68B. NOV68 Polypeptide Sequence (SEQ ID 
NO:236) 

MDRGTLP1AVALLLASCSLSPTSLAETVH 
ILEVHVLFLEFPTGPSQLELTLQASKQNGTW^ 

NSSLVTFQEPPGVNTTELPSSSSSSLSFCMLEASQDMGRTI1EWRPRTPAI1VRGCHI1EGVA 
GHKEAHILRVLPGHS AGPRTVTVKVELS CAPGDLDAV^ 

TGEYSFKIFPEKNIRGFKLPDTPQGLLGEARMLNASIVASFTOLPLASIVSLHASSCGGR 
LQTSPAPIQTTPPKDTCSPELLMSLIQTKCADDAMTLVLKKELVAHLKCTITGLTFWDPS 
CEAEDRGDKFVLRSAYSSCGMQVSASMISNEAWNILSSSSPQRKKVHCLNMDSLSFQLG 
LYLSPHFLQASNTIEPGQQSFVQVRVSPSVSEFLLQLDSCHLDLGPEGGTVELIQGRAAK 
GNCVSIjLSPSPEGDPRFSFIjLHFYTVPIPKTGTLSCTVALRPKTGSQDQEVHRTVFMRIiN 
IISPDLSGCTSKGLVLPAVLGITFGAFLIGALLTAALWYIYSHTRSPSKREPVVAVAAPA 
SSESSSTNHSIGSTQSTPCSTSSMA 



A BLAST analysis of NOV68 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV68 had high 
homology to other proteins as shown in Table 68C. 

15 



Table 68C. BLASTX results from PatP database for NOV68 



Smallest 
Sum 
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High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAR54828 Endoglin - Homo sapiens, 658 aa. 


2500 


1.5e-259 


patp-.AAR99802 Endoglin - Homo sapiens, 658 aa. 


2500 


1.5e-259 


patp:AAY82190 Human endoglin SEQ ID NO: 2 - Homo sapiens 


2500 


1.5e-259 


patp:AAR37808 Rat betaglycan - Synthetic, 853 aa. 


215 


2.2e-24 


patp:AAR74601 Rat betaglycan contg. transforming growth factor 215 


2.2e-24 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 2128 of 2128 bases (100%) identical to a gb:GENBANK- 
ID:HUMENDO|acc:J05481.1 mKNA from Homo sapiens (Human endoglin mRNA, 3* end). 
5 The full amino acid sequence of the protein of the invention was found to have 509 of 624 
amino acid residues (81%) identical to, and 531 of 624 amino acid residues (85%) similar to, 
the 658 amino acid residue ptnr:SWISSNEW-ACC:P17813 protein from Homo sapiens 
(Human) (ENDOGLIN PRECURSOR (CD105 ANTIGEN)). NOV68 also has homology to 
the other proteins shown in the BLASTP data in Table 68D. 



Table 68D. NOV68 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi |3041681 ( sp 
|P17813|EGLN 
HUMAN 


ENDOGLIN PRECURSOR 
(CD105 ANTIGEN) 


658 


621/658 
(94) 


632/658 
(94) 


0.0 


gi|l5679936|g 
b|AAH1427l.l| 
AAH14271 
(BC014271) 


endogl in ( Os 1 er- Rendu - 
Weber syndrome 1) 
[Homo sapiens} 


658 


620/658 
(94) 


621/658 
(94) 


0.0 


gi| 105920 |pir 
| |A36262 


endoglin precursor - 
human 


645 


607/644 
(94) 


608/644 
(94) 


0.0 


gi|4557555|re 
f |NP__000109.1 
| (NM_000118) 


endoglin precursor; 

Endoglin 

[Homo sapiens] 


625 


581/618 
(94) 


582/618 
(94) 


0.0 


gi| 6679649) re 
f |NP_03 1958.1 
| (NM007932) 


endoglin 

[Mus musculus] 


653 


452/660 
(68) 


540/660 
(75) 


0.0 



This BLASTP data is displayed graphically in the ClustalW in Table 68E. A multiple 
sequence alignment is given, with the NOV68 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 68D. 



Table 68E. ClustalW Alignment of NOV68 


NOV68 


(SEQ ID NO: 236) 


gi|3041681| 


(SEQ ID NO: 639) 


gij 15679936) 


(SEQ ID NO: 640) 
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NOV68 


1 


gi 


3041681| 


1 


gi 


15679936| 


1 


gi 


105920) 


1 


gi 


4557555 | 


1 


gi 


6679649| 


1 


NOV68 


60 


gi 


3041681] 


60 


gi 


15679936) 


60 


gi 


105920 | 


47 


gi 


4557555 | 


60 


gi 




61 



105920) (SEQ ID NO:641> 

4557555) (SEQ ID NO: 642) 
6679649 (SEQ ID NO: 643) 




30 



40 



I 



50 



60 



laetvhcdlqpvgpergevtyttsqvskgcvaqap^ 
laetvhcdlqpvgpergevtyttsqvskgcvaqapn 
laetvhcdlqpvgper|Jevtyttsqvskgcvaqapn 
laetvhcdlqpvgpergevtyttsqvskgcvaqapn 
laetvhcdlqpvgpergevtyttsqvskgcvaqapn 
i^SvScdlqpvSpRrgevtSttsqvsSgcvaqaSn 



59 
59 
59 
46 
59 
60 



120 



NOV68 


120 


gi 


3041681) 


120 


gi 


15679936) 


120 


gi 


105920) 


107 


gi 


4557555 | 


120 


gi 


6679649 [ 


121 


NOV68 


146 


gi 


3041681) 


179 


gi 


15679936) 


179 


gi 


105920) 


166 


gi 


4557555 | 


179 


gi 


6679649) 


181 



NOV68 


206 


gi 


3041681) 


239 


gi 


15679936) 


239 


gi 


105920) 


226 


gi 


4557555) 


239 


gi 


6679649) 


241 


NOV68 


266 


gi 


3041681) 


299 


gi 


15679936) 


299 


gi 


105920) 


266 


gi 


4557555) 


299 


gi 


6679649) 


299 


NOV68 


326 


gi 


3041681) 


359 


gi 


15679936) 


359 


gi 


105920) 


• 346 


gi 


4557555) 


359 


gi 


6679649) 


359 




310 



I. 



320 
..I.. 



330 



340 



350 



360 



llgearmlnasivasfvelplasivslhasscggrlqtspapiqttppkdtcspellmsl 
llgearmlnasivasfvelplasivslhasscggrlqtspapiqttppkdtcspellmsl: 
llgearmlnasivasfvelplasivslhasscggrlqtspapiqttppkdtcspellmsl 
llgearmlnas ivasfvelplas I vslhas s cggrlqtspapi qttppkdtcs pellmsl 
llgearmlnasivasfvelplasivslhasscggrlqtspapiqttppkdtcspellmsl 

lSJeArSlNAS I VHS F VELPLffisS^ 



325 
358 
358 
345 
358 
358 



400 



410 



420 
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NOV68 


386 


9i 


3041681 | 


419 


9i 


15679936| 


419 


9i 


105920 | 


406 


9i 


4557555 | 


419 


9i 


6679649) 


419 



HOV68 



gi 
gi 
gi 



3041681 I 
15679936) 
105920 | 
4557555| 
6679649 j 



SMISNEAWWILSSSSPQRKKVHCLNMDSLSFQLGLYLSPHFLQASNTIEPGQQSFVQVE 
SMISNEAVVNILSSSSPQRKKVHCLNMDSLSFQLGLYLSPHFLQASNTIEPGQQSFVQVR 
SMISNEAVWILSSSSPORKKVHCLNMDSLSFQLGLYLSPHFLQASNTIEPGQQSFVQVR 
SMI SNE AWNI LS S S S PQRKKVHCLNT4DSLS FQLGLYLS PHFLQASNTI E PGQQS FVQVR 
SMISNE AWNIL SSSSPQRKKVHCLNlvlDSLSFQLGLYLSPHFLQASNTIEPGQQSFVQVR 
"^SNEffi|ll^sgsrapJlRKKVScilMDSLSFQLGLYLSPHFLQASNTIEijGQQlFVQ 



445 
478 
478 
465 
478 
gs 478 



KOV68 


446 


9i 


3041681| 


479 


gi 


15679936) 


479 


gi 


105920 | 


466 


gi 


4557555) 


479 


gi 


6679649| 


479 



KOV68 


506 


gi 


3041681| 


539 


gi 


15679936) 


539 


gi 


105920 | 


526 


gi 


4557555 | 


539 


gi 


6679649| 


539 




610 



620 



630 



566 S 
599 
599 
586 
599 
594 



L 



L 



640 



I 



650 



I 



660 



FLIGALLTAALWYIYSHTRSPSKREPWAVAAPASSESSSTNHSIGSTQSTPCSTSSMA 
AFLIGALLTAALWYIYSKTRSPSKREPWAVAAPASSESSSTNHSIGSTQSTPCSTSSMA 
AFLIGALLTAALWYIYSHTRSPSKREPWAVAAPASSESSSTNHSIGSTQSTPCSTSSMA 
AFLIGALLTAALWYIYSHTR SPSK REPVVAVA APAS SE SSSTNHSI GSTQSTPCSTSSMA 
AFLIGALLTAALWYIYgHTR BB3l RBIp|r~ 

AFLIGAtiLTAALWYIYSHTRgPSKREPWAVAAPASSESSSTNHSIGSTQSTPCSTSSMA 



625 
658 
658 
645 
625 
653 



NOV68 


625 


625 


gi 


3041681 | 


658 


658 


gi 


15679936) 


658 


658 


gi 


105920 | 


645 


645 


gi 


4557555| 


625 


625 


gi 


6679649) 


653 


653 



Table 68F lists the domain description from DOMAIN analysis results against NOV68. 
This indicates that the NOV68 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 68F. Domain Analysis of NOV68 

gnl|Smart | smart00241 , ZP, Zona pellucida (ZP) domain; ZP proteins are responsible for 
sperm-adhesion fo the zona pellucida. ZP domains are also present in raultidomain 
transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type 
III (betaglycan) . SEQ ID NO: 1391 

CD-Length « 253 residues, 96.4% aligned 



Score = 42.4 bits (98), Expect = 8e-05 



N0V68 : 


329 


KC^UDDAMTLVLKKELVAHLKCTI TGLTFWDPSCEAEDRG - - - DKFVLRS AYSS CGMQVS A 


385 






+C +D M + + +L+ + GLT DPSC G + CG + 




Sbjct: 


1 


QCGEDRMVVSVS TDLLFPGGI YVKGLTLGDPS CRPVFVGANSAWS FEVPLNECGTRRQV 


60 


N0V68 : 


386 


S MISNEAWNILSSSSP QRKKVHCLNMD 


413 






+ + SN W+ + + L D 




Sbjct: 


61 


NPDGIVYSOTLWSPIFHPLFITRDDRANYHVQCFYPESEKVSLRADVSTIPPTPLSW 


120 
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NOV68 : 
Sb j Ct : 


414 
121 


- - SLSFQLGLYLSPHF- - - LQASNTIEPGQQSFVQVRVSPSVSE- FLLQLDSCHLDLGP - 

+ LY F Q+++T++ G + + L + +C+ G 
EGPPTCTYSLYKDDSFGSPYQSADTVQLGDPVYHEWSCDGRDDPSLGLLVHNCYATPGSD 


466 
180 


NOV68: 
Sb j Ct : 


467 
181 


- - EGGTVELIQGRAAKGNCVSLLSPSPEGDPRFSFLLHFYTVPIPKTGTLS CTVALRPKT 
G +1 + +SP F + + C + L K+ 
PFSGPKYFIIDNGCPVDRYLDSVSPYSSPSHYARFSVKVFKFADRSLVYFHCQITLCDKS 


524 
240 


NOV68: 


525 


GSQD 528 




Sb j Ct : 


241 


DGSS 244 





Transforming growth factor-beta (TGF-beta) plays an important role in angiogenesis 
and vascular function. Endoglin, a transmembrane TGF-beta binding protein, is highly 
expressed on vascular endothelial cells and is the target gene for the hereditary haemorrhagic 

5 telangiectasia type I (HHT1), a dominantly inherited vascular disorder. The specific function 
of endoglin responsible for HHT1 is believed to involve alterations in TGF-beta responses. 
The initial interactions on the cell surface between endoglin and TGF-beta receptors may be 
an important mechanism by which endoglin modulates TGF-beta signalling, and thereby 
responses. On human microvascular endothelial cells, endoglin is co-expressed and is 

1 0 associated with betaglycan, a TGF-beta accessory receptor with which endoglin shares limited 
amino acid homology. This complex formation may occur in either a ligand-dependent or a 
ligand-independent manner. In addition, three higher order complexes containing endoglin, 
type II and/or type I TGF-beta receptors, also can occur on these cells. Thus endoglin may 
modify TGF-beta signalling by interacting with both betaglycan and the TGF-beta signalling 

1 5 receptors at physiological receptor concentrations and ratios ( Wong et aL, 2000, Eur J 
Biochem vol. 267 : 5550-60). 

Endoglin is a homodimeric membrane glycoprotein. In association with transforming 
growth factor (TGF)-ss receptors I and II, endoglin can also bind TGF-ssl and -ss3 and form a 
functional receptor complex. In human vascular tissue, endoglin immunolabeling is shown to 

20 be higher in endarterectomy specimens removed from diseased coronary arteries than in 

normal internal mammary arteries. In vitro, antisense oligonucleotides to endoglin is shown to 
decrease its expression and antagonized the TGF-ss-mediated inhibition of human and porcine 
SMC migration. Thus, upregulation of endoglin occurs during arterial repair and in established 
atherosclerotic plaques and may be required for modulation of SMC migration by TGF-ss (Ma 

25 X et aL, 2000, Arterioscler Thromb Vase Biol vol. 20 :2546-52). 

Hereditary hemorrhagic telangiectasia (HHT) is an inherited autosomal dominant 
vascular dysplasia caused by mutations in either endoglin (HHT1) or activin-like kinase 
receptor-1 (ALK-1) (HHT2). The majority of the mutations in endoglin cause frameshifts and 
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premature stop codons. Although initial reports suggested a dominant-negative model for 
HHT1, more recent reports have suggested that mutations in endoglin lead to 
haploinsufficiency . Expression of the missense mutants alone revealed that they are misfolded 
and that most show no cell surface expression. When co-expressed with wild-type endoglin, 
5 the missense mutants are able to dimerize with the normal endoglin protein and are trafficked 
to the cell surface. Thus either dominant-negative protein interactions or haploinsufficiency 
can cause HHT1 (Lux et ah, 2000, Hum Mol Genet vol 9 : 745-55). 

NOV68 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 

10 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, ' 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 

15 public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV68 is provided in Example 2. 

The NOV68 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 

20 suffering from: arterial injuries, cerebral arteriovenous malformalities, pregnancy 

complications, carcinomas such as breast and mammary carcinoma as well as other diseases, 
disorders and conditions. The NOV68 nucleic acid encoding the endoglin-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. The novel nucleic 

25 acid of the invention encoding a Endoglin (CD1 05 antigen)-like protein includes the nucleic 
acid whose sequence is provided in Table 68A, or a fragment thereof. The invention also 
includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 68A while still encoding a protein that maintains its 
Endoglin (CD 105 antigen>like activities and physiological functions, or a fragment of such a 

30 nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
to the sequence disclosed in Table 68A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
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include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
5 In the mutant or variant nucleic acids, and their complements, up to about 0% of the bases may 
be so changed. 

The novel protein of the invention includes the Endoglin (CD 105 antigen)-like protein 

whose sequence is provided in Table 68B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
10 68B while still encoding a protein that maintains its Endoglin (CD105 antigen)-like activities 

and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 

up to about 19% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
15 methods. These antibodies may be generated according to methods known in the art, using 

prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

below. 

NOV69 

NOV69 includes two ILl-like proteins, designated herein as NOV69a and NOV69b. 
20 These are splice variants of sequence accession number CG56950-0 1 . 

NOV69a 

The disclosed NOV69a (alternatively referred to herein as CG56950-01) includes the 
414 nucleotide sequence (SEQ ID NO: ) shown in Table 69A. A NOV69a ORF begins with a 
25 Kozak consensus ATG initiation codon at nucleotides 100-102 and ends with a TGA codon at 
nucleotides 412-414. The disclosed NOV69a maps to human chromosome 7. 

Table 69 A. NOV69a Nucleotide Sequence (SEQ ID 
NO:237) 

ATGGAAAAAG CATTGAAAATTGAC ACACCT CAGCAGGGGAGCAT T CAGGAT AT CAATCAT 
CGGGTGTGGGTTCTTCAGGACCAGACGCTCATAGCAGTCCCGAGGAAGGACCGTATGTCT 
CCAGTCACTATTGCCTTAATCTCATGCCGACATGTGGAGACCCTTGAGAAAGACAGAGGG 
AACCCCACACTGCAGCTGAAGGAAAAGGATATAATGGATTTGTACAACCAACCCGAGCCT 
GTGAAGTCCTTTCTCTTCTACCACAGCCAGAGTGGCAGGAACTCCACCTTCGAGTCTGTG 
GCTTTCCCTGGCTGGTTCATCGCTGTCAGCTCTGAAGGAGGCTGTCCTCTCATCCTTACC 
CAAGAACTGGGGAAAGCCAACACTACTGACTTTGGGTTAACTATGCTGTTTTAA 
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A NOV69a polypeptide (SEQ ID NO:238) encoded by SEQ ID NO:237 is 137 amino 
acids in length and is presented using the one-letter amino acid code in Table 69B. The Psort 
profile for NOV69a predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0,4500. In alternative embodiments, a NOV69a 
5 polypeptide is located to lysosomes with a certainty of 0.15 14, or to peroxisomal microbodies 
with a certainty of 02384. 



Table 69B. NOV69a Polypeptide Sequence (SEQ ID 
NO:238) 

MEKALKIDTPQQGS I QDINHRVWVLQDQTLI AVPRKDRMS PVTI ALI SCRHVETLEKDRG 
NPTLQLKEKDIMDLYNQPEPVKSFLFYHSQSGRNSTFESVAFPGWFIAVSSEGGCPLILT 
QELGKANTTDFGLTMLF 

NOV69b 

1 0 The disclosed NOV69b (alternatively referred to herein as CG56 1 36-02) includes the 

411 nucleotide sequence (SEQ ID NO: ) shown in Table 69C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 409-41 1. The disclosed NOV69b maps to human chromosome 2ql2-14.1. 



Table 69C NOV69b Nucleotide Sequence (SEQ ID 
NO:239) 

ATGGAAAAAGCATTGAAAGTTGACACACCTCAGCGGGGGAGCATTCAGGATATCAATCAT 
CGGGTGTGGCTTCn^CAGGACCAGACGCTCATAGCAGTCCCGAGGAAGGACCGTATGTCT 
CCAGTCACTATTGCCTTAATCTCATGCCGACATGTGGAGACCCTTGAGAAAGACAGAGGG 
AACCCCATCTACCTGGGCCTGAATGGACTCAATCTCTGCCTGATGTGTGTTCAAGTCGGG 
GACCAGCCCACACTGCAGATGAACCAGAGTGGCAGGAACTCCACCTTCGAGTCTGTGGCT 
TTCCCTGGCTGGTTGATCGCTGTCAGCTCTGAAGGAGGCTGTCCTCTCATCCTTACCCAA 
GAACTGGGGAAAGCCAACACTACTGACTTTGGGTTAACTATGCTGTTTTAA 

15 ~ ' ~ " ' ~ " ~ ~ ™~ ~ ~ " 

A NOV69b polypeptide (SEQ ID NO:240) encoded by SEQ ID NO:239 is 136 amino 
acids in length and is presented using the one-letter amino acid code in Table 69D. The Psort 
profile for NOV69b predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.6500, In alternative embodiments, a NOV69b 

20 polypeptide is located to lysosomes with a certainty of 0.2305. 



Table 69D. NOV69b Polypeptide Sequence (SEQ ID 
NO:240) 

MEKALKVDTPQRGS I QDINHRVWVLQDQTLI AVPRKDRMSPVTI ALI SCRHVETLEKDRG 
NPIYLGLNGLl^CXMCVQVGDQPTLQMNQSGRNSTFESVAFPGWLIAVSSEGGCPLILTQ 
ELGKANTTDFGLTMLF 
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A BLAST analysis of NOV69 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV69 had high 
homology to other proteins as shown in Table 69E. 

5 



Table 69E. BLASTX results from PatP database for NOV69 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp:AAW86286 Rodent interleukin (IL)-l epsilon polypeptide 


263 


l.le-38 


patp:AAY24 049 Amino acid sequence of a murine SPOIL- 1 1 


263 


l.ie-38 


patp:AAE06662 Mouse "interleukin- lepsilon .(IL-lepsilon) 


263 


l.le-38 


patp:AAY7G217 Human Interleukin-l epsilon protein 


414 


1.7e-38 


patp:AAY702i8 Human Interleukin-l epsilon 


414 


1.7e-38 | 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 258 of 263 bases (98%) identical to a gb:GENBANK- 
ID:AF201831|acc:AF201831.1 mRNA from Homo sapiens (FIL1 epsilon mRNA). The full 
amino acid sequence of the protein of the invention was found to have 82 of 89 amino acid 
residues (92%) identical to, and 87 of 89 amino acid residues (97%) similar to, the 158 amino 
acid residue ptnr:SFTREMBL-ACC:Q9UHA7 protein from Homo sapiens (Human) (FIL1 
EPSILON). NOV69 also has homology to the other proteins shown in the BLASTP data in 
Table 69R 



Table 69F. NOV69 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 
(%) 


Positive 
(%) 


Expect 


gi|7657092 |re 
f )NP_055255.1 
| (NM014440) 


Interleukin- 1 
Superfamily 1 [Homo 
sapiens] 


158 


137/158 
(86) 


137/158 
(86) 


7e-73 


gi| 9506601 |re 
f |NP_062323.1 
| (NM_019450) 


interleukin 1 family, 
member 6 (epsilon) [Mus 
musculus] 


160 


77/156 
(49) 


104/105 
(66) 


4e-38 


gi| 9665234 |re 
f |NP_062564.1 
(NM 019618) 


interleukin- 1 homolog l 
[Homo sapiens] 


169 


76/147 
(51) 


97/147 
(65) 


2e-35 


gi| 7657090 |re 
£ |NP_055253.1 
| (NM_014438) 


Interleukin-l 
Superfamily □ [Homo 
sapiens] 


157 


62/145 
(42) 


85/145 
(57) 


2e-27 


gi| 12844800 |d 
bj |BAB26505.1 
| (AK009787) 


homolog to FILl 
ETA-putative 
[Mus musculus] 


183 


58/144 
(40) 


77/144 
(53) 


4e-23 



This BLASTP data is displayed graphically in the ClustalW in Table 69G. A multiple 
sequence alignment is given, with the NOV69 protein being shown on line 1 in a ClustalW 
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analysis comparing the protein of the invention with the related protein sequences shown in 
Table 69F. 



Table 69G. ClustalW Alignment of NOV69 



NOV69a 


(SEQ 


ID N0:238) 


NOV69b 


(SEQ 


ID NO:240) 


gi 


7657092 | 


<SEQ 


ID N0:644) 


gi 


9506601) 


(SEQ 


ID NO:645) 


gi 


9665234| 


(SEQ 


ID NO: 646) 


gi 


7657090) 


(SEQ 


ID NO: 647) 


gi 


12844800) 


(SEQ 


ID N0:648) 



NOV69a 


1 


NOV69b 


1 


gi 


7657092) 


1 


gi 


9506601) 


1 


gi 


9665234) 


1 


gi 


7657090) 


1 


gi 


12844800) 


1 



10 



20 



30 



-MRGTPGDA 

MNPQ 

MMAFPPQSCVHVLPPKSIQMWEPNHNTMHGSSQi 




MEKALKID! 

MEKAL^VDj 

MEKALjfet 

- -MNKEKELgAAggSLRKV^ 
DGGGRAVYQSMC~ 



70 



80 



90 



100 



110 



120 



NOV69a 


36 


NOV69b 


36 


gi 


7657092) 


36 


gi 


9506601) 


38 


gi 


9665234) 


48 


gi 


7657090) 


35 


gi 


12844800) 


61 



NOV69a 


75 


NOV69b 


76 


gi 


7657092 | 


96 


gi 


9506601) 


98 


gi 


9665234) 


108 


gi 


7657090) 


95 


gi 


12844800) 


121 




NOV69a 


135 




137 


NOV69b 


134 




136 


gi 


7657092 


156 




158 


gi 


9506601 


158 




160 


gi 


9665234 


167 




169 


gi 


7657090 


155 




157 


gi 


12844800) 181 




183 



Table 69H lists the domain description from DOMAIN analysis results against 
NOV69. This indicates that the NO V69 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 69H. Domain Analysis of NOV69 

qnl|Pfam|pfam00340 , IL1, Interleiikin-1 / 18. This family includes 

iaterleukin-1 and interleukin-18. SEQ ZD ND$1392 

CD-Length = 142 residues 

Score = 60.1 bits (144), Expect = 8e-ll 

NOV69 : 57 KDRGNPTLQLKEKDIMDLYNQPEPVKSFLFYHSQSGRNSTFESVAFPGWFIAVSSEGGCP 116 

K+ P LQL+ + E K F F ++ G FES A+P WFIA E P 

Sbjct: 62 KEGDEPVLQLEMVEPPKyiKNSEMDKRFFFEKTEIGSKVyFESAAYPNWIATKQEEDRP 121 

NOV69: 117 LILTQELGKANTTDF 131 
j + L +++ TDF 
Sbjct: 122 VFLftNGPPESDITDF 136 \ : 

There are two structurally distinct forms of IL1 : ILl(alpha), which is the acidic form 
with pI5, and ILl(beta) (IL1B; 147720), the neutral form with pI7. Both are 17-kD proteins 
coded by separate genes. The ILIA gene has 10,206 bp with 7 exons and 6 introns (Furutani et 
5 al., 1986). By Southern transfer analysis of DNAs from human-rodent somatic cell hybrids, 
Modi et al. (1 988) assigned the ILIA gene to chromosome 2. Regional localization to 2ql 3- 
q21 was achieved by in situ hybridization. Lafage et al. (1989) confirmed assignment to 2ql3 
by in situ hybridization. 

The ILIA and IL1B proteins, which are synthesized by a variety of cell types including 
10 activated macrophages, keratinocytes, stimulated B lymphocytes, and fibroblasts, are potent 
mediators of inflammation and immunity. Lord et al. (1991) demonstrated that both the alpha 
and beta forms, but particularly the beta form, are transcribed in polymorphonuclear 
leukocytes stimulated with LPS. Both ILIA and IL1B stimulate osteoclast activity in vitro and 
are potent bone resorbing factors. Sabatino et al. (1988) studied the effects of 72-hour 
15 subcutaneous infusions of interleukins 1-alpha and -beta on plasma, calcium, and bone 
morphology. Both interleukins 1 caused a marked, dose-dependent increase in plasma 
calcium. Increased numbers of osteoclasts and bone resorption surfaces were observed on 
quantitative histomorphometry of bone. The results suggest a role for IL1 in the modulation of 
extracellular fluid calcium homeostasis. Hogquist et al. (1991) demonstrated that interleukin-1 
20 ismvolvedmapoptosis(ceUdeam).Bommealphaandmebetafornisarere^ 
consequence of cell injury regardless of the insult. 

Bailly et al. (1993) elucidated a polymorphism that consists of a variable number of 
repeats of a 46-bp sequence within intron 6 of the ILIA gene. Among 72 unrelated persons, 
they identified 6 different alleles ranging from 5 to 18 repeats; the most frequent allele, present 
25 in 62%, contained 9 repeats. They suggested that the polymorphism may be of significance in 
gene function, since each repeat contains 3 potential binding sites for transcription factors. 
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Gray et al. (1986) showed that in the mouse also there are at least 2 interleukin-1 
genes, Ill(alpha) and Ul(beta). Boultwood et al. (1989) used in situ chromosome hybridization 
to show that the 2 111 genes in the mouse are located in the F region of chromosome 2. It had 
previously been shown by studies in mouse-hamster somatic cell hybrids and in recombinant 
5 inbred strains that the 2 genes are tightly linked on murine chromosome 2, approximately 4.7 
cM distal to beta-2-microglobulin. By pulsed field gel electrophoresis, Silver et al. (1990) 
showed that the mouse Ilia and II lb genes are contained in a genomic fragment of about 70 
kb. Further studies suggested that II lb lies 5-prime to Ilia, that the 2 genes are oriented in the 
same direction, and that they are separated by about 50 kb. From restriction mapping of the 

10 human genomic region, Nicklin et al. (1 994) concluded that, relative to one terminal CpG 

island, the 3 genes mapped to the following intervals: ILIA was between +0 and +35 kb, IL1B 
between +70 and +1 10 kb, and IL1RN (147679) between +330 and +430 kb. Since the 
assignment of IL1RN to 2ql4.2 appears to be the most definitive localization, the ILIA and 
IL1B genes can be presumably be said to be also on 2ql4. Cox et al. (1998) carried out studies 

15 with multiallelic markers that grouped the 3 genes into a biallelic system for use in association 
studies. They identified a common, 8-locus haplotype of the DL1 gene cluster. 

Hurwitz et al. (1992) studied the role of IL1 in the ovary, using a solution 
hybridization/RNase protection assay to test for expression of the IL1 gene, its type I receptor 
(IL1R; 147810), and its receptor antagonist (IL1RN). They presented findings which, taken 

20 together, revealed the existence of a complete, highly compartmentalized, hormone-dependent 
intraovarian IL1 system. 

Since IL1 is an important cytokine in the control of the inflammatory response central 
to the pathology of rheumatoid arthritis (1 80300), Cox et al. (1999) used the combined sib- 
TDT (transmission/disequilibrium test) and TOT, in addition to parametric and nonparametric 

25 linkage methods, to investigate candidate genes of the IL1 gene cluster in the 2ql3 region. 
Several tightly linked IL1 cluster markers yielded suggestive evidence for linkage in the 
combined TDT in those families in which affected sibs did not share 2 HLA-DRB1 alleles 
identical by descent. The evidence was significant in those with severe disease, as assessed by 
the presence of bone erosions. In contrast, there was no evidence of linkage using 

30 nonparametric linkage analysis, but parametric analysis revealed weak evidence of linkage 
when marker-trait disequilibrium was incorporated into the analysis. The data provided 
preliminary evidence for linkage of genes of the IL1 cluster to rheumatoid arthritis and 
suggested a possible role for this region in severe erosive disease. 
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Kornman et al. (1997) suggested that genetic polymorphisms of the ILIA and IL1B 
genes may be associated with severity of periodontitis in adult nonsmokers. The IL1B 
polymorphism was referred to as IL1B+3953 and the ILIA polymorphism was referred to as 
IL1 A-889. Nonsmokers aged 40 to 60 carrying the , 2 t allele (in either homozygous or 

5 heterozygous state) at both loci were observed to have nearly 19 times the risk of developing 
• severe periodontitis compared to subjects homozygous for the T allele at either or both of 
these loci. Because of the implication of interleukin-I in adult periodontitis, Diehl et al. (1999) 
undertook an evaluation of the role of these ILIA and IL1B polymorphisms in early-onset 
periodontitis (EOP; see 170650) in 28 African- American families and 7 Caucasian-American 

1 0 families with 2 or more affected members. The 2 major EOP subtypes, localized juvenile 
periodontitis and generalized early-onset periodontitis, encompassing rapidly progressive 
periodontitis and generalized juvenile periodontitis, were analyzed separately and together. 
They obtained highly significant evidence of linkage disequilibrium for both groups of 
generalized EOP subjects. A similar trend was noted for the localized form. The IL1 alleles 

1 5 associated with high risk of EOP had been suggested previously to be correlated with low risk 
for severe adult periodontitis. Linkage disequilibrium with generalized EOP was equally 
strong for smoking and nonsmoking subjects. ILIA and IL1B polymorphisms were in strong 
linkage disequilibrium with each other in Caucasians but not in African Americans. Haplotype 
analyses evaluating both polymorphisms simultaneously indicated that the IL1B variant is 

20 likely to be more important for EOP risk. Sib pair linkage analyses, by contrast, provided only 
marginal support for a gene of very major effect on EOP risk attributable to these IL1 
polymorphisms. Diehl et al. (1999) interpreted their results as indicating that EOP is a 
complex, oligogenic disorder, with interleukin-1 genetic variation contributing an important 
but not exclusive influence on disease risk. 

25 NOV69 is predicted to be expressed in at least the following tissues: spleen, lymph 

node, thymus, tonsil and leukocyte tissues. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV69 is provided in Example 2. 

30 The NOV69 nucleic acids and proteins of are useful in potential therapeutic 

applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
suffering from: inflammatory and immune system-related diseases such as rheumatoid arthritis 
and inflammatory bowel disease, periodontitis, hypothyroidism, congenital, due to thyroid 
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dysgenesis or hypoplasia; osteoarthritis of distal interphalangeal joints; selective T-cell defect; 
nephrophthisis, juvenile; purpura fulminans, neonatal; susceptibility to infections such as 
viral and bacterial; thrombophilia due to protein C deficiency; as well as other diseases, 
disorders and conditions. 
5 The NOV 69 nucleic acid encoding the ILl-like protein of the invention, or fragments 

thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. The novel nucleic acid of the invention 
encoding a Interleukin 1 epsilon-like protein includes the nucleic acid whose sequence is 
provided in Table 69A or 69C, or a fragment thereof. The invention also includes a mutant or 

10 variant nucleic acid any of whose bases may be changed from the corresponding base shown 
in Table 69A or 69C while still encoding a protein that maintains its Interleukin 1 epsilon-like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to the sequence of Table 
69A or 69C, including nucleic acid fragments that are complementary to any of the nucleic 

1 5 acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

20 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 2% of the bases may 
be so changed. 

These materials are further useful in the generation of antibodies that bind 
25 immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV70 

30 NOV70 includes two OS9-like proteins, designated herein as NOV70a and NOV70b. 

NOV70a 
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The disclosed NOV70a (alternatively referred to herein as CG56878-01) includes the 
2739 nucleotide sequence (SEQ ID NO:241) shown in Table 70A. A NOV70a OKF begins 
with a Kozak consensus ATG initiation codon at nucleotides 86-88 and ends with a stop codon 
at nucleotides 2090-2092. The disclosed NOV70a maps to human chromosome 12ql3. 

5 . 

Table 70A. NOV70a Nucleotide Sequence (SEQ ID 
NO:241) 

TTGCACTCTCCCACACCCTTTTCTTTTCGTCCGCTC^ 

CTCTGCATAAGAAGGGGAACGAAAGATGGCGGCGGAAACGCTGCTGTCCAGTTTGTTAGG 

ACTGCTGCTTCTGGGACTCCTGTTACCCGCAAGTCTGACCGGCGGTGTCGGGAGCCTG^ 

CCTGGAGGAGCTGAGTGAGATGCGTTATGGGATCGAGATCCTGCCGTTGCCTGTCATGGG 

AGGGCAGAGCCAATCTTCGGACGTGGTGATTGTCTCCTCTAAGTACAAACAGCGCTATGA 

GTGTCGCCTGCCAGCTGGAGCTATTCACTTCCAGCGTGAAAGGGAGGAGGAAACACCTGC 

TTACCAAGGGCCTGGGATCCCTGAGTTGTTGAGCCCAATGAGAGATGCTCCCTGCTTGCT 

GAAGACAAAGGACTGGTGGACATATGAATTCrTGTTATGGACGCCACATCCAGC^^ 

CATGGAAGATTCAGAGATCAAAGGTGAAGTCCTCTATCTCGGCTACTACCAAT(^GCCT^ 

CGACTGGGATGATGAAACAGC<^GGCCTCCAAGCAGCATCGTCTTAAACGCTACCACAG 

CCAGACCTATGGCAATGGGTCCAAGTGCGACCTTAATGGGAGGCCCCGGGAGGCCGAGGT 

TCGGTTCCTCTGTGACGAGGGTGCAGGTATCTCTGGGGACTACATCX3ATCGCGTGGACGA 

GCCCTTGTCCTGCTCTTATGTGCTGACCATTCGCACTCCTCGGCTCTGCCCCCACCCTCT 

CCTCCGGCCCCC^CCCAGTGCTGC^^ 

TGAGGAGTACATGGCCTACGTTCAGAGGCAAGCCGTAGACTCAAAGCAGTATGGAGATAA 
AATCATAGAGGAGCTGCAAGATCTAGGCCCCCAAGTGTGGAGTGAGACCAAGTCTGGGGT 
GGCACCCCAAAAGATGGt^GGTGCGAGCCCGACCAAGGATGACAGTAAGGACTCAGATTT 
CrTGGAAGATGCTTAATGAGCCAGAGGACCAGGCCCCAGGAGGGGAGGAGGTGCCGGCTGA 
GGAGCAGGACCCAAGCCCTGAGGCAGCAGAT^ 

GAACAACGTGCAGGTCAAAGTCATTCGAAGCCCTGCGGATTTGATTCGATTCATAGAGGA 
GCTGAAAGGTGGAACAAAAAAGGGGAAGCCAAATATATC^ 

TGCTG(^GAAGTCCCTCAGAGGGAACCAGAGAAGGAAAGGGGTGATCCAGAACGGCAGAG 
AGAGATGGAAGAAGAGGAGGATGAGGATGAGGATGAGGATGAAGATGAGGATGAACGGCA 
GTTACTGGGAGAATTTGAGAAGGAACTGGAAGGGATCCTGCTTCCGTCAGACCGAGACCG 
GCTCCGTTCGGAGGTGAAGGCTGGCATGGAGCGGGAACTGGAGAACATCATCCAGGAGAC 
AGAGAAAGAGCTGGACCCAGATGGGCTGAAGAAGGAGTCAGAGCGGGATCGGGCAATGCT 
GGCTCTCACATCCACTCTCAACAAACTCATCAAAAGACT 

GCTGGTGAAGAAGCACAAGAAAAAGAGGGTTGTCCCCAAAAAGCCTCCCCCATCACCCCA 
ACCTACAGAGGAGGATCCTGAGC^CAGAGTCCGGGTCCGGGTCACCAAGCTCCGTCTCGG 
AGGCCCTAATCAGGATOTGACTGTCCTCGAGATGAAACGGGAAAACCC^(^GCTGAAACA 
AATCGAGGGGCTGGTGAAGGAGCTGCTGGAGAGGGAGGGACT 

TGAGATCAAAATTGTCCGCCCATGGGCTGAAGGGACTGAAGAGGGTGCACGTTGGCTGAC 

TGATGAGGACACGAGAAACCTCAAGGAGATCTTCTTGfiATATCTTGGTGC 

AGAGGCCCAGAAGGAACGCCAGCGGCAGAAAGAGCTGGAGAGCAATTACCGCCGGGTGTG 

GGGCTCTCCAGGTGGGGAGGGCACAGGGGACCTGGACGA^ 

CTACACTTGACCCTTCACGGAATCCAGACTCT^ 

TCCCCACCCTGGAACCCCTGAGGGCCAAACAGCAGAGTGGAGCTGAGCT 

GGGCAACTCTGTGGGTGTGGGGGCCCTGGGTGAATGCTGCTGCCCCTGCTGGCAGCCACC 

TTGAGACCTCACCGGGCCTGTGATATTTGCTCTCCTGAACTCTCACTCAATCCTCTTCCT 

CTCCTCTGTGGCTTTTCCTGTTATTGTCCCCTAATGATAGGATATTCCCTGCTGCCTACC 

TGGAGATTCAGTAGGATCTTTTGAGTGGAGGTGGGTAGAGAGAGCAAGC^ 

CTTAGC^^C^CTGAGCAAGCAGGCCCC(^CCTGCCCTTAGTGATGTTTGGAGTCGTT^ 

ACCCTCTTCn*ATTGAATTCCCTTGGGATTTCCTTCTCCCTTO 

CTACAATITGTGCTTCrGAGTTGAGGAGCCTTCACCTCTGTT 

TGCTGCCTATCACCTCCAGmCAATCCCAGCGAAAAAGGTGTGAAGCACC(^CCATGTTC 
TTGAACAATCAGGTTTCTAAATAAACAACTGGACCATCA 

A NOV70a polypeptide (SEQ ID NO:242) encoded by SEQ ID NO:241 is 668 amino 
acids in length and is presented using the one-letter amino acid code in Table 70B. The Psort 
profile for NOV70a predicts that this sequence has a signal peptide and is likely to be 
10 localized outside the cell with a certainty of 0.8200. hi alternative embodiments, a NOV70a 
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polypeptide is located in the nucleus with a certainty of 0.1260. The Signal P predicts a likely 
cleavage site for a NOV70a peptide is between positions 25 and 26, i.e., at the dash in the 
sequence SLT-GG. 



Table 70B. NOV70a Polypeptide Sequence (SEQ ID 
NO:242) 

MAAETLLSSLLGIJjLIjGIiLL^ 

VIVSSKYKQRYECRLPAGAIHFQI^REEETPAYQGPGIPELLSPMI®APCLLKTKDWWTY 
EFCYGRHICKJYHMEDSEIKGEVLYLGYYQSAFD 

CDLNGRPREAEVRFLCDEGAGI SGDYIDRVDEPLS CS YVLT IRTPRLCPHPLLRP P PS AA 
PQAILCHPSLQPEEYMAYVQRQAVDSKQYGDKIIEELQDLGPQVWSETKSGVAPQKMAGA 
SPTKBDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPEA 

RSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPEKERGDPERQREMEEEEDE 
DEDEDEDEDERQLLGEFEKELEGILLPSDRDRLRSEVKAGMERELENIIQETEKELDPDG 
LKKESERDRAMIJ^TSTLNKLIKRLEEKQSPELVKKHKKra 

RTOVRVTKLRLGGPNQDLTVIjEMKRENPQLKQIEGLVKELLEREGLTAAGKIEIKIVRPW 
AEGTEEGARWLTDEDTRNLKE I FFNILVPGAEEAQKERQRQKELESNYRR WGS PGGEGT 
GDIiDEFDF 

5 

NOV70b 

The disclosed NOV70b (alternatively referred to herein as CG56868-04) includes the 
2702 nucleotide sequence (SEQ ID NO: ) shown in Table 70C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 86-88 and ends with a TGA codon at 
10 nucleotides 2036-2038. The disclosed NOV70b maps to human chromosome 12ql3. 



Table 70C. NOV70b Nucleotide Sequence (SEQ ID 
NO:243) 



CTCTG CATAAGAAGGGG AA CGAAAGATGGCGGCGG AAACG CTG CTGTC CAGTTTGTT AGG 
ACTGCTGCTTCTGGGACTCCTGTTACCCGCAAGTCTGACCGGCGGTGTCGGGAGCCTGAA 
CCTGGAGGAGCTGAGTGAGATGCGTTATGGGATCGAGATCCTGCCGTTGCCTGTCATGGG 
AGGGCAGAGCCAATCTTCGGACGTGGTGATTGTCTCCTCTAAGTACAAACAGCGCTATGA 
GTGTCGCCTGCCAGCTGGAGCTATTCACTTCCAGCGTGAAAGGGAGGAGGAAACACCTGC 
TTACCAAGGGCCTGGGATCCCTGAGTTGTTGAGCCCAATGAGAGATGCTCCCTGCTTGCT 
GAAGACAAAGGACTGGTGGACATATGAATTCTGTTATGGACGCCACATCCAGCAATACCA 
CATGGAAGATTCAGAGATCAAAGGTGAAGTCCTCTATCTCGGCTACTACCAATCAGCCTT 
CGACTGGGATGATGAAACAGCCAAGGCCTCCAAGCAGCATCGTCTTAAACGCTACCACAG 
CCAGACCTATGGC AATGGGTC CAAGTGCGACCTTAATGGGAGGCC CCGGGAGGCCGAGGT 
TCGGTTCCTCTGTGACGAGGGTGCAGGTATCTCTGGGGACTACATCGATCGCGTGGACGA 
GCCCTTGTCCTGCTCTTATGTGCTGACCATTCGCACTCCTCGGCTCTGCCCCCACCCTCT 
CCTCCGGCCCCCACCCAGTGCTGCACCACAGGCCATCCTCTGTCACCCTTCCCTACAGCC 
TGAGX3AGTACATGGCCTACGTTCAGAGGCAAGCCGACTCAAAGCAGTATGGAGATAAAAT 
CATAGAGGAGCTGCAAGATCTAGGCXrcCCAAGTGTGGAGTGAGACCAAGTCTGGGGTGGC 
ACCCC^AAAGATGGCAGGTGCGAGCCCGACCAAGGATGACAGTAAGGACTCAGATTTCTG 
GAAGATGCTTAATGAGCCAGAGGACCAGGCCCCAGGAGGGGAGGAGGTGCCGGCTGAGGA 
GCAGGACCCAAGCCCTGAGGCAGCAGATTCAGCTTCTGGTGCTCCCAATGATTTTCAGAA 
CAACGTGCAGGTCAAAGTCATTCGAAGCCCTGCGGATTTGATTCGATTCATAGAGGAGCT 
GAAAGGTGGAACAAAAAAGGGGAAGCCAAATATAGGCCAAGAGCAGCCTGTGGATGATGC 
TGCAG AAGT CC CT CAGAGGGAAC CAGAGAAGGAAAGGGGTGAT CC AGAACGG CAGAGAGA 
GATGGAAGAAGAGGAGGATGAGGATGAGGATAAGATGAGGATGAACGGCAGTTACTGGGG 
AGAATTTGAGAAGGAACTGGAAGGGATCCTGCTTCCGTCAGACCGAGACCGGCTCCGTTC 
GGAGACAGAGAAAGAGCTGGACCCAGATGGGCTGAAGAAGGAGTCAGAGCGGGATCGGGC 
AATGCTGGCTCTCACATCCACTCTCAACAAACTCATCAAAA 

TCCAGAGCTGGTGAAGAAGCACAAGAAAAAGAGGGTTGTCCCCAAAAAGCCTCCCCCA 
ACCCCAACCTACAGAGGAGGATCCTGAGCACAGAGTCCGGGTCCGGGTCACCAAGCTCCG 
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TCTCGGAGGC CCT AATCAGGAT CTGACTGTCCT CGAG ATGAAACGGGAAAACCCACAGCT 
GAAACAAATCGAGGGGCTGGTGAAGGAGCTGCTGGAGAGGGAGGGACTCACAGCTGCAGG 
GAAAATTGAGAT CAAAATTGTC CGCCCATGGGCTGAAGGG ACTGAAGAGGGTGCACGTTG 
GCTGACTGATGAGGACACGAGAAACCTCAAGGAGATCTTCTTCAATATCTTGGTGCCGGG 
AGCTGAAGAGGCCCAGAAGGAACGCCAGCGGCAGAAAGAGCTGGAGAGCAATTACCGCCG 
GGTGTGGGGCTCTCCAGGTGGGGAGGGCACAGGGGACCTGGACGAATTTGACTTCTGAGA 
CCAACACTACACTTGACCCTTCACGGAATC CAGACTCTTC CTGGACTGGCTTGCCTCCTC 
CCCACCTCCCCACCCTGGAACCCCTGAGGGCCAAACAGCAGAGTGGAGCTGAGCTGTGGA 
CCTCTCGGGCAACTCTGTGGGTGTGGGGGCCCTGGGTGAATGCTGCTGCCCCTGCTGGCA 
GC CAC CTTGAGACCTCACCGGGCCTGTGATATTTGCTCTC CTGAACTCTCACTCAATCCT 
CTTCCTCTCCTCTGTGGCTTTTCCTGTTATTGTCCCCTAATGATAGGATATTCCCTGCTG 
CCTACCTGGAGATTCAGTAGGATCTTTTGAGTGGAGGTGGGTAGAGAGAGCAAGGAGGGC 
AGGACACTTAGCAGGCACTGAGCAAGCAGGCCCCCACCTGCCCTTAGTGATGTTTGGAGT 
CGTTTTACCCTCTTCTATGGAATTGCCTGTGGATTCCTTCTCCCTTCCCTGCCCACCGTG 
TCCTACAATTGTGCTCTGAGTGAGAGCCTTCCTCTCTGCTAGGAAGGTTATGTGCCTTAC 
TCCGCAATCGGAAAGTTAGCCACGTTCTAATCGTTATACAAGGCTAAAAAAAATAAATAT 
TTATACCCGTTTTTCCCTGATTTATTTTTAAATATTATATTATTTTTAATATAATTTGTG 

GG 



A NOV70b polypeptide (SEQ ID NO:244) encoded by SEQ ID NO:243 is 650 amino 
acids in length and is presented using the one-letter amino acid code in Table 70D. The Psort 
profile for NOV70b predicts that this sequence has a signal peptide and is likely to be 
5 localized outside the cell with a certainty of 0.8200. In alternative embodiments, a NOV70b 
polypeptide is located to lysosomes with a certainty of 0. 1 900, or to the nucleus with a 
certainty of 0.1260. The Signal P predicts a likely cleavage site for a NOV70b peptide is 
between positions 25 and 26, i.e., at the dash in the sequence SLT-GG. 



Table 70D. NOV70b Polypeptide Sequence (SEQ ID 
NO:244) 

MAAETLLSSLLGIiLLGLLLPASLTGGVGSLNLEELSEMRYGIEILPLPVMGGQSQSSDV 
VIVSSKYKQRYECIOiPAGAIHFQI^REEETPAYQGPGIPELLSPMRDAPCLLKTKDWVmr 
EFCYGRHIQQYHMEDSEIKGEVLYLGYYQSAFDWDDETAKASKQHRLKRYHSQTYGNGSK 
CDLNGRPREAEVRFLCDEGAGI SGD YIDRVDEPL SCS YVLTIRTPRLCPHPLLRPPPS AA 
PQAILCHPSLQPEEYMAYVQRQADSKQYGDKIIEELQDLGPQVWSETKSGVAPQKMAGAS 
PTKDDSKDSDFWKMI^PEDQAPGGEEVPAEEQDPSPEAADSASGAPITOFQNNVQVKVIR 
SPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPEKERGDPERQREMEEEEDED 
EDKMRmGSYWGEFEKELEGILLPSDRDRLRSETEKELDPDGLKKESERDRAMIJVLTSTL 
NKLIKRLEEKQSPELVKKHKKKRVVPKKPPPSPQPTEEDPEHRVRWVTKIiRLGGPNQDIi 
TVLEMKRENPQLKQIEGLVKELLEREGLTAAGKIEIKIVRPWAEGTEEGARWLTDEDTRN 
LKEIFFNILVPGAEEAQKERQRQKELESNYRRVWGSPGGEGTGDLDEFDF 



A BLAST analysis of NOV70 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV70 had high 
homology to other proteins as shown in Table 70E. 

15 

Table 70E. BLASTX results from PatP database for NO V70 

" ' " — — Smallest 

Sum 

High. Probability 
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Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 




patp:AAG76089 Human colon cancer antigen protein 


423 


3 


4e- 


-52 


patp:AAG41826 Arabidopsis thaliana protein fragment 


298 


2 


6e- 


•25 


patp:AAG4l827 Arabidopsis thaliana protein fragment 


279 


2 


9e- 


-23 


patp:AAG41828 Arabidopsis thaliana protein fragment 


279 


2 


9e- 


-23 


patp:AAU32255 Novel human secreted protein #2746 


232 


3 


4e- 


-18 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 2735 of 2739 bases (99%) identical to a gbrGENBANK- 
ID:HSU41635|acc:U41 635.1 mRNA from Homo sapiens (Human OS-9 precurosor mRNA, 
5 complete cds). The full amino acid sequence of the protein of the invention was found to have 
667 of 668 amino acid residues (99%) identical to, and 667 of 668 amino acid residues (99%) 
similar to, the 667 amino acid residue ptnr:SWISSPROT-ACC:Q13438 protein from Homo 
sapiens (Human) (PROTEIN OS-9 PRECURSOR). The sequence of this invention has 1 
amino acid inserion, compared to ptnr:SWISSPROT-ACC:Q13438 protein from Homo 
1 0 sapiens (Human) (PROTEIN OS-9 PRECURSOR). NOV70 also has homology to the other 
proteins shown in the BLASTP data in Table 70F. 



Table 70F. NOV70 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi|5803109|re 
f |NP_006803..1 
(NM_006812) 


amplified in 
osteosarcoma [Homo 
sapiens] 


667 


667/668 
(99) 


667/668 
(99) 


0.0 


gi|l265352l|g 
b|AAH00532.l| 
AAH00532 
(BC000532) 


Similar to amplified in 

osteosarcoma 

[Homo sapiens] 


612 


512/513 
(99) 


512/513 
(99) 


0.0 


gi | 17986213 |g 
b|AAC39523.2| 
(U81031) 


OS9 

[Homo sapiens] 


474 


474/475 
(99) 


474/475 
(99) 


0.0 


gi | 13905114 |g 
b|AAH06844.l| 
AAH06844 
(BC006844) 


Unknown (protein for 
IMAGE: 3598453) 
[Mus musculus] 


571 


435/630 
(69) 


495/630 
(78) 


0.0 


gi|l3676348|g 
b|AAH06506.ll 
AAH06506 
(BC006506) 


Similar to amplified in 
osteosarcoma 
[Homo sapiens] 


376 


373/374 
(99) 


374/374 
(99) 


e-161 



This BLASTP data is displayed graphically in the ClustalW in Table 70G. A multiple 
15 sequence alignment is given, with the NOV70a and b proteins being shown on lines 1 and 2 in 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 70F. 



Table 70G. ClustalW Alignment of NOV70 
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NOV70a 
NOW Ob 



5803109) 
12653521 
17986213 
13905114 
13676348 



NOV70a 
NOV70b 



(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 



;242) 
:244) 
;649) 
:650) 
:651) 
:652) 
:653) 



NOV70a 


1 


NOV70b 


1 


gi 


5803109 | 


1 


gi 


12653521 


1 


gi 


17986213 


1 


gi 


13905114 


1 


gi 


13676348 


1 



10 20 30 40 

....|....|....|....|....|....|....|....|....|. 
MAAETLLS S LLGLLLLGLLLPAS LTGGVGS LNLEELSEMRYG I E I L 
MAAETLLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIEIL 
MAAETIiLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIEIL 
MAAETLLS SLLGLLLLGLLLPASLTGGVGSLNLEEL SEMRYGI EIL 



50 



60 



PLPVMGGQSQ 


rSDV 


60 


PLPVMGGQSQ 


JSDV 


60 


PLPVMGGQSQ 


:SDV 


60 


PLPVMGGQSQ 


*SDV 


60 






1 






PLPVMGGQSQ 


SSDV 


14 






1 



70 
-I.. 



I , 



80 



90 



100 



110 



120 



NOV70a 


61 


NOV70b 


61 


gi 


5803109) 


61 


gi 


12653521 


61 


gi 


17986213 


1 


gi 


13905114 


15 


gi 


13676348 


1 


NOV70a 


121 


NOV70b 


121 


gi 


5803109) 


121 


gi 


12653521 


121 


gi 


17986213 


1 


gi 


13905114 


75 


gi 


13676348 


1 


NOV70a 


181 


NOV70b 


181 


gi 


5803109) 


181 


gi 


12653521 


181 


gi 


17986213 


1 


gi 


13905114 


135 


gi 


13676348 


1 



VgVSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGIPELLSPMRDAPCLLKTKDWWTY^ 
vS VSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGI PELLS PMRDAPCLLKTKDWWTY 
vS VS S KYKQRYE CRLP AGAIHPQREREEET PAYQGPG I PELLS PMRDAPCLLKTKDWWTY 
VjjvSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGIPELLSPMRDAPCLLKTKDWWTY 



V5VSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGIPELLSPMRDAPCLLKTKDWWTY 



120 
120 
120 
120 
1 

74 
1 



130 



I - 



I 



140 



150 



160 



170 



180 



efcygrhiqqyhmedseikg|vlylgsyqs|f|vrodetakaskqhrlkryhsqtygngs^ 
efcygrhiqqyhmedseikgSvlylgSyqsBfSwddetakaskqhrlkryhsqtygngsk 
efcygrhiqqyhmedseikgtsvlylgqyqsbhswddetakaskqhrlkryhsqtygngsk' 



^FCYGRHIQQYHMEDSEIKGSVLYLG<t>YQSgFSjWDDETAKASKQHRLKRYHSQTYGNGSK 



180 
180 
180 
180 
1 

134 
1 



210 



220 



230 



240 



241 
241 



gi 


5803109 | 


241 


gi 


12653521 


241 


gi 


17986213 


48 


gi 


13905114 


. 195 


gi 


13676348 


1 


NOV70a 


301 


NOV70b 


300 


gi 


5803109) 


300 


gi 


12653521 


300 


gi 


17986213 


107 


gi 


13905114 


254 




310 



320 



330 



350 



360 



I 



SPTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPE 
SPTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPE 
SPTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPE 
SPTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPE 
S PTK DDs kDSDFWK MLNE PEDQ APGG EEVPAEEQD PSP E 
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gi|!3676348| 9 



NOV70a 
NOV70b 



NOV70a 
NOV70b 



SPTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPEDAADSASGAPNDFQNNVQVKV 



67 



370 



I 



I. 



380 

..I.. 



390 



I. 



400 



410 



360 
359. 





5803109 | 


359 


gi 


12653521 


359 


gi 


17986213 


166 


gi 


13905114 


312 


gi 


13676348 


68 


NOV70a 


417 


NOV70b 


416 


gi 


5803109) 


416 


gi 


12653521 


416 


gi 


17985213 


223 


gi 


13905114 


371 


gi 


13676348 


125 



irspadlirfieelkggtkkgkpnigqeqpvddaaevpqrepe 
irspadlirfieelkggtkkgkpnigqeqpvddaaevpqrepe 
irspadlirfieelkggtkkgkpnigqeqpvddaaevpqrepe 
irspadlirfieelkggtkkgkpnigqeqpvddaaevpqrepe 
irspadlirfieelkggtkkgkpnigqeqpvddaaevpqrepel 

irspadlir2ieelkEE kgkp SE22 e Q p S dd !S3 e S p Q re S e 

i rs padli rf i eelkggtkkgkpni gqeqp vddaaevp qre pe 




420 



ERQREMEE 
ERQREMEE 
ERQREMEE 
ERQREMEE 
E RQRE MEE 

ERQREMEE 



416 
415 
415 
415 
222 
370 
124 



450 



460 




490 



500 



510 



520 



530 

. - 1 - - 



I. 



540 
••I 



473 
455 



gi 


5803109) 


472 


gi 


12653521 


472 


gi 


17986213 


279 


gi 


13905114 


431 


gi 


13676348 


181 


NOV70a 


533 


NOV70b 


515 


gi 


5803109) 


532 


gi 


12653521 


532 


gi 


17986213 


339 


gi 


13905114 


491 


gi 


13676348 


241 



ekeldpdglkkeserdramlaltstlnklikrleekqspelvkkhkkkrwpkkpppspq 
ekeldpdglkkeserdramlaltstlnklikrleekqspelvkkhkkkrwpkkpppspq 
ekeldpdglkkeserdramlaltstlnklikrleekqspelvkkhkkkrwpkkppp5pq 
e keldpdglk kes erdramlaltstlnkli krlee kqs pel vkkhkkkr wpkkp pp s pq 
ekeldpdglkkeserdramlaltstlnklikrleekqspelvkkhkkkrwpkkpppspq 
e keldp^gl^keser^aQlalts tlSJkli krl^e^qs pelv^k^kk^r wpJkpp p s p|^ 

EKELDPDGLKKESERDRAMIALTSTLNKLIKRLEEKQSPELVKKHKKKRWPKKPPPSPb 



550 
..|.. 



560 



570 



580 



I. 



[ 



590 



I 



600 
.-I 



pteedpehrvrvrvtklrlggpnqdltvlemkrenpqlkqieglvkelleregltaagki 
pteedpehrvrvrvtklrlggpnqdltvlemkrenpqlkqieglvkelleregltaagki 
p't eeopehrvrvrvtklrlggpnqdltvlemkrenpqlkqieglvkelleregltaa gki 

pt eedpehrvrvrvtklrlggpnqdltvlemkrenpqlkqieglvkelleregltaa gki 

pteedpehrvrvrvtklrlggpnqdltvlemkrenpqlkqieglvkelleregltaagki 
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NOV70a 


593 


NOV70b 


575 


gi 


5803109) 


592 


gi 


12653521 


537 


gi 


17986213 


399 


gi 


13905114 


496 


gi 


13676348 
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eikivrpwaegthegarwltdedtrnlkeiffnilvpgaeeaqkerqrqkelesnyrrvw 
eikivrpwaegteegarwltdedtrnlkeiffnilvpgaeeaqxerqrqkelesnyrrwj 
eikivrpwaegteegarwltdedtrnlkeiffnilvpgaeeaqkerqrqkelesnyrrvw 
eikivrpwaegteegarwltdedtrnlkeiffnilvpgaeeaqkerqrqkelesnyrrvw 
eikivrpwaegteegarwltdedtrnlkeifenilvpgaeeaqkerqrqkelesnyrrvw 
eikivrp^aeg^ee^rwltdedtrnlkeiffnilv^gaeea^kerqrq^elesnyrrvw 
eikivrpwaegteegarwltdedtrnlkeiffnilvpgaeeaqkerqrqkelesnyrrvw 
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Amplification and overexpression of genes involved in cellular growth control occur 
frequently in human tumors. Using a chromosome microdissection-based hybrid-selection 
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strategy, the OS-9 gene has been identified within 12ql3-15, a region frequently amplified in 
human cancers. The full-length OS-9 cDNA sequence consists of 2785 bp from which an open 
reading frame (ORF) with 667 amino-acid residues has been deduced. The predicted 
polypeptide is water soluble and acidic. The OS-9 gene encodes a 2.8-kb mRNA transcribed in 

5 all 16 human tissues examined, suggesting that OS-9 is ubiquitously expressed in human 
tissues. OS-9 is co-amplified with CDK4 in three of five sarcoma tissues. Homology analysis 
of the amino-acid -sequence has revealed significant similarities between OS-9 and two ORFs 
deduced from genomic sequences in Caenorhabditis elegahs and Saccharomyces cerevisiae. 
The region of similarity extends over 200 residues (approximately one-third of each ORF), 

10 and eight cysteines were conserved in all three ORFs. These observations suggest that this 

region comprises a functional domain present in a novel evolutionarily conserved gene family 
defined by OS-9. 

The OS-9 genomic DNA has been isolated and characterized from a human BAC 
library. Sequencing of the genomic DNA has shown that the gene spanned approximately 30.4 

1 5 kbp and had 1 5 exons. The 1 ,0 1 0 bp sequence of the 5' upstream region has also been 
determined. The potential binding-sequence motifs TATA and CCAAT for general 
transcription factors have been found in the 5' upstream region. Primer extension analysis has 
revealed two putative transcription start sites. 

Three isoforms of OS-9 cDNA have been isolated from a myeloid leukemia HL-60 

20 cDNA library and characterized. Isoform 1 consisted of 2,700 bp, from which a 667 amino 
acid sequence was deduced and found to be identical with that of OS-9 cDNA from 
osteosarcoma cells. Isoform 2 cDNA lacked a 165 nucleotide sequence in the coding region. 
Isoform 3 cDNA had an additional 45 bp deletion in the coding region. Isoforms 2 and 3 
encode 612 and 597 amino acid polypeptides, respectively. Comparison of their cDNA 

25 sequences with the genomic structure has indicated that three isoforms are splice variants. 
Reverse transcription-polymerase chain reaction analysis has shown predominant expression 
of isoform 2 mRNA in myeloid leukemia HL-60 cells, osteosarcoma OsA-CL cells and 
rhabdomyosarcoma Rh30 cells. Northern blotting has revealed similar levels of expression of 
OS-9 gene in various tumor cell lines of sarcoma cells, carcinoma cells and myeloid leukemia 

30 cells, but 3-4 times higher expression in OsA-CL cells and Rh30 cells containing a 

homogeneously staining region of 12ql3-15. OS-9 expression decreased in differentiation- 
induced HL-60 cells. The above data suggests a possible involvement of OS-9 in cell growth 
and tumour development. 
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The NOV70 disclosed in this invention is predicted to be expressed in at least the 
following tissues: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, 
fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary 
5 gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal cord, spleen, 
stomach, testis, thyroid, trachea, uterus, amnion, aorta, ascending colon, bone, bronchus, 
cartilage, cervix, colon, cornea, coronary artery, dermis, duodenum, epidermis, epididymis, 
hair follicles, hypothalamus, islets of langerhans, kidney cortex, liver, lung, lymph node, 
lymphoid tissue, esophagus, ovary, parathyroid gland, peripheral blood, pineal gland, 

10 respiratory bronchiole, retina, skin, thymus, tonsils, umbilical vein, urinary bladder, vulva, 
whole organism. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV70 is provided in Example 2. 

1 5 The NOV70 nucleic acids and proteins of are useful in potential therapeutic 

applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention may have efficacy for the treatment of patients 
suffering from leukemias, sarcomas and other types of cancer, as well as other diseases, 
disorders and conditions. OS-9 was co-amplified with CDK4 in three of five sarcoma tissues 

20 (Mol Carcinog 1 996 Apr; 1 5(4):270-5). Three isoforms of OS-9 cDNA were found in a 

myeloid leukemia HL-60 cDNA library and reverse transcription-polymerase chain reaction 
analysis has shown predominant expression of isoform 2 mRNA in myeloid leukemia HL-60 
cells, osteosarcoma OsA-CL cells and rhabdomyosarcoma Rh30 cells. Northern blotting has 
revealed similar levels of expression of OS-9 gene in various tumor cell lines of sarcoma cells, 

25 carcinoma cells and myeloid leukemia cells (J Biochem (Tokyo) 1998 May; 123(5): 876-82). 

The NOV70 nucleic acid encoding the OS-9-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. The novel nucleic acid of the invention 
encoding a OS-9-like protein includes the nucleic acid whose sequence is provided in 70A or 

30 70C, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in 70A or 70C while still 
encoding a protein that maintains its OS-9-Iike activities and physiological functions, or a 
fragment of such a nucleic acid. 
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The invention further includes nucleic acids whose sequences are complementary to 
the sequence of 70A or 70C, including nucleic acid fragments that are complementary to any 
of the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
5 Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
10 about 1% of the bases may be so changed. 

The novel protein of the invention includes the OS-9-like protein whose sequence is 
provided in Table 70B or 70D. The invention also includes a mutant or variant protein any of 
whose residues may be changed from the corresponding residue shown in Table 70B or 70D 
while still encoding a protein that maintains its OS-9-like activities and physiological 
1 5 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 1 % of 
the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
20 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV71 

The disclosed NOV71 (alternatively referred to herein as CG56906-01) includes the 
25 2081 nucleotide sequence (SEQ ID NO:245) shown in Table 71 A. A NOV71 ORF begins 

with a Kozak consensus ATG initiation codon at nucleotides 55-57 and ends with a stop codon 
at nucleotides 1978-1980. The disclosed NOV71 maps to human chromosome Xpl 1. 

Table 71 A. NOV71 Nucleotide Sequence (SEQ ID 
NO:245) 

GCGGCCGCGGCCTCCX3CCTCCT 

GAGCTCGCTACTGAGAAGGAGGCGGAGGAGAGCCACCGGCAAGACAGTO 
ACCTTCATCCTGCTGCTCACGCTCACCATC 

GTGCGCTTTCTGCACGAGACCGGGCTGGCCATGATCTATGGGCTCATOT 

CTGAGGTATGGTACCCCTGCTACCAGTGGC^^ 

GACAGGGCCTTCAGTACCTTATTAGTGAATGTCM 

AAAGGAGAAATCAGTCCTGGCAAGATCAACAGCGTAGAGCAGAATGATATGCTACGGAAG 
GTAACATTCGATCC^GAAGTATTTTTCAACATT 
GGATACAGCTTAAAGAAGAGACACTTTTTCAGAAATCTTGGATCT 
TTCTTGGGGACTGCTGTTTCATGCTTCATTATTGGAAATCTCATGTATGGTGTGGTGAAG 
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CT CATGAAG ATTATGGGACAG CT CTCAGAT AAATT TT ACTACACAGATTGTCTCTTTTTT 
GGAGCAATCATCTCTGCCACTGACCCAGTGACTGTGCTGGCGATATTTAATGAATTGCAT 
GCAGACGTGGATCTTTACGCACTTCTTTTTGGAGAGAGCGTCCTAAATGATGCTGTTGCC 
ATTGTACTGTCCTCGTCTATTGTTGCCTACCAGCCAGCGGGACTGAACACTCACGCCTTT 
GATGCTGCTGCCTTTTTTAAGTCAGTTGGCATTTTTCTAGGTATATTTAGTGGCTCTTTT 
ACCATGGGAGCTGTGACTGGTGTTGTGACTGCTCTAATATCCTTTTTACAGAATGCCAAC 
GTGACTAAGTTTACCAAACTGCACTGCTTCCCCCTGCTGGAGACGGCGCTGTTCTTCCTC 
ATGTCCTGGAGCACGTTTCTCTTGGCAGAAGCCTGCGGATTTACAGGTGTTGTAGCTGTC 
CTTTTCTGTGGAATCACACAAGCTCATTACACCTACAACAATCTGTCGGTGGAATCAAGA 
AGTCGAACCAAGCAGCTCTTTGAGGTGTTACATTTCCTGGCAGAGAACTTCATCTTCTCC 
TACATGGGCCTGGCACTGTTTACCTTCCAGAAGCACGTTTTCAGCCCCATTTTCATCATC 
GGAGCTTTTGTTGCCATCTTCCTGGGCAGAGCCGCGCACATCTACCCGCTGTCCTTCTTC 
CT CAACTT GGG CAGAAGGC AT AAG ATTGG CTGGAATTTT C AAC ACATGATG ATGTTTTCA 
GGCCTCAGGGGAGCAATGGCATTTGCGTTGGCCATCCGTGACACGGCATCCTATGCTCGC 
CAGATGATGTTCACGACCACCCTTCTCATTGTGTTCTTCACTGTCTGGATCATTGGAGGA 
GGCACGACACCCATGTTGTCATGGCTTAACATCAGGTTGGACGGCCCAGATTCTGCCAGA 
GG AAAC CGGACAAAACAGGAGAGCG CATGGATATTCAGG C TGT GGTAC AG CTTTGATCAC 
AATTACCTGAAGCCCATCCTCACACACAGTGGTCCCCC^CTAACCACCACGCTCCCCGCC 
TGGTGTGGCTTACTAGCTCGATGTCTGACCAGTCCCCAGGTGTACGATAACC1AAGAGCCA 
CTGAGAGAGGAAGACTCTGATTTCATCCTGACCGAAGGCGACCTGACATTGACCTACGGG 
GACAGCACAGTGACTGCAAATGGCTCCTCAAGTTCGCACACCGCCTCCACGAGTCTGGAG 
GGCAGCCGGAGAACGAAGAGCAGCTCGGAGGAAGTGCTGGAGCGAGACCTGGGAATGGGA 
G AC CAGAAGGTTT CGAGCCGGGGCAC C CGCCTAGTGT TTC CC CTGGAAGATAATGC TTGA 
CTTTCCCCCCAAGCCCTGGCGCGATGGGGTAGGCTCCCGATGGGGTGAGGACAGCTGCAA 
GCCCTAGTGTTGTTGGAGGTGGGGCAGTGACTAGATTGAAC 

A NOV71 polypeptide (SEQ ID NO:246) encoded by SEQ ID NO:245 is 641 amino 
acids in length and is presented using the one-letter amino acid code in Table 71B. The Psort 
profile for NOV71 predicts that this sequence is likely to be a Type III 6 membrane protein, 
5 has a signal peptide, and is likely to be localized to the plasma membrane with a certainty of 
0.8200. In alternative embodiments, a NOV71 polypeptide is located to Golgibodies with a 
certainty of O.4600, or to the endoplasmic reticulm (membrane) with a certainty of 0.6850. 
The Signal P predicts a likely cleavage site for a NOV71 peptide is between positions 37 and 
38, i.e., at the dash in the sequence IWL-FK. 

10 ; 

Table 71B. NOV71 Polypeptide Sequence (SEQ ID 
NO:246) 

MEEIATEKEAEESHRQDSVSLLTFILLLTLTILTIWLFKHRRVRFLHETGLAMIYG 
VILRYGTPATSGRDKSLSCTQEDRAFSTLLVNVSGKFFEYTLKGE IS PGKINSVEQNDML 
RKVTFDPEVFFNII*LPPI IFHAGYSLKKRHFFRNXiGS I LAYAFLGTAVS CFI IGNLMYGV 
VKLMKIMGQLSDKFYYTDCLFFGAI I SATDPVTVLAI FNELHADVDLYALLFGESVLNDA 
VAIVLSS SI VAYQPAGLNTHAFDAAAFFKSVGIFLGI FSGS FTMGAVTGWTALI SFLQN 
ANVTKFTKLHCFPI^ETALFFLMSWSTFLLAEA 

SRSRTKQLFEViaFIJ^FIFSYMGIiALFTFQKHWSPIFIIGAFVAIFLGRAAHIYPLS 
FFLNl^RRHKIGWNFQHMMMFSGLRGAMAFALAIRDTi^ I 
GGGTTPMLSWLNIRLDGPDSARGNRTKQESAWIFRLWYSFDHNYLKPILTHSGPPLTTTL 
PAWCGLLARCLTSPQVYDNQEPLREEDSDFILTEGDIiTLTYGDSTVTANGSSSSHTASTS 
LEGSRRTKSSSEEVLERDLGMGDQKVSSRGTRLVFPLEDNA 

A BLAST analysis of NOV71 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV71 had high 
homology to other proteins as shown in Table 71C. 

15 
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Table 71 C. BLASTX results from PatP database for NOV71 



Sequences producing High-scoring Segment Pairs: 

Homo sapiens, 



Smallest 
Sum 

High Probability 
Score P (N) 



patp:AAB9Q555 Human secreted protein, 
patp:AAB90637 Human secreted protein, 
patp:AAU02883 Human HsNHE-6 polypeptide 
patp:AAB90590 Human secreted protein, 
patp:AAB90591 Human secreted protein, 



2410 
2410 
1693 
902 
902 



5.1e-250 
5.1e-250 
5.2e-201 
2.le-169 
2.1e-169 
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In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 747 of 1 121 bases (66%) identical to a gb:GENBANK- 
ID:AF030409|acc:AF030409.1 mRNA from Homo sapiens (sodium-hydrogen exchanger 6 
(NHE-6) mRNA, nuclear gene encoding mitochondrial protein). The full amino acid sequence 
of the protein of the invention was found to have 391 of 518 amino acid residues (75%) 
identical to, and 443 of 518 amino acid residues (85%) similar to, the 669 amino acid residue 
ptnr:SWISSNEW-ACC:Q92581 protein from Homo sapiens (Human) 
(SODIUM/HYDROGEN EXCHANGER 6 (NA(+)/H(+) EXCHANGER 6) (NHE-6)). 
NOV71 also has homology to the other proteins shown in the BLASTP data in Table 7 ID. 



Table 71D. NOV71 BLASTP results 


Gene Index / \ 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 14211919 |r 
ef |NP_115980. 
l| (NM_032591) 


solute carrier family 9 
( sodium/hydrogen 
exchanger), isoform 7; 
nonselective sodium 
potassium/proton 
exchanger [Homo sapiens] 


725 


631/681 
(92) 


632/681 
(92) 


0.0 


gi|33l9946|em 
b|CAA18155.l| 
(AL022165) 


dJ71L16.5 (KIAA0267 LIKE 
putative Na( + )/H{+) 
exchanger) [Homo 
sapiens] 


616 


575/625 
(92) 


576/625 
(92) 


0.0 


gi|l665827|db 
j |BAA13449.l| 
(D87743) 


Similar to Human Na+/H+ 
exchanger 2 (A57644) 
[Homo sapiens] 


666 


451/649 
(69) 


513/649 
(78) 


0.0 


gij 5454070) re 
f |NP_006350.1 
| (NM_006359) 


solute carrier family 9 
( sodium/hydrogen 
exchanger) , isoform 6 
[Homo sapiens] 


669 


451/649 
(69) 


513/649 
(78) 


0.0 


gi|l7474970|r 
ef |XP_062645. 
l[ (XM 062645) 


similar to solute 
carrier family 9 
( sodium/hydrogen 
exchanger), isoform 7; 
nonselective sodium 
pot as s ium/proton 
exchanger (H. sapiens) 
[Homo sapiens] 


485 


292/412 
(70) 


305/412 
(73) 


e-148 
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This BLASTP data is displayed graphically in the ClustalW in Table 71 E. A multiple 
sequence alignment is given, with the NOV71 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 71D. 



Table 71E. ClustalW Alignment of NOV71 



NOV71 



gi 
gi 
gi 
gi 
gi 



14211919 
3319946) 
1665827 | 
5454070) 
17474970 



NOV71 


1 


gi 


14211919 | 


1 


gi 


3319946| 


1 


gi 


1665827| 


1 


gi 


5454070 | 


1 


gi 


17474970 | 


1 



NOV71 


8 


gi 


14211919 | 


61 


gi 


3319946 | 


1 


gi 


1665827) 


58 


gi 


5454070 1 


61 


gi 


17474970 | 


8 



NOV71 



14211919 

3319946 

1665827 

5454070 

17474970 



NOV71 


128 


gi 


14211919) 


181 


gi 


3319946) 


72 


gi 


1665827) 


171 


gi 


5454070) 


174 


gi 


17474970) 


121 


NOV71 


188 


gi 


14211919 | 


241 


gi 


3319946) 


132 


gi 


1665827) 


206 


gi 


5454070) 


209 


gi 


17474970) 


121 



NOV71 



(SEQ ID NO: 246) 
(SEQ ID NO: 654) 
(SEQ ID NO: 655) 
(SEQ ID NO: 656) 
(SEQ ID NO: 657) 
(SEQ ID NO: 658) 
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MEPGDAARPGSGRATGAP PPRLLLLPLLLGWGLRVAAAAS AS SSGAAAEDS S. 



RGWRRAPIiRRGVGSS PRARRLMRPL WLLIoAVGVFDWAGASJ 

' MARRGWRRAPLRRGVGSSPRARRLMRPLWLLLAVGVFDWAGASDGGGl 
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FILLLTLTILTIWLFKKRR 
FILLLTLTI LT I WL F KHRR 





13 0 



14 0 



I 



150 



I. 



160 



170 
..| 



180 



68 
121 
12 

118 HVE 

121 HVPgDVNNVTLSCEVQSSPI 
68 



PgTSGRDKjLgCTQEDgAFSTLLVgVSGKFFEYTLKgEISPGKINSVialQNDMLR 

pgrsGRDK j lSctqedBafstllvwsgkffeytlxBeispgkinsvBqndmlr 
p3ts grdk§l"ctqed§afs tllv°vsgkffeytlk"e i s pgkins vSqndmlr 

Sib. 



JVJJNVTLSCEVQSjSPj] 



— , „ H TLLVp^FBL^lgBBs .^A^T"g" g 

^TgGRDKBLjgCTQED^^ 



KVTFDP 127 
KVTFDP 180 
KVTFDP 71 

171 

- 174 

121 



190 



200 



210 




JILLPP I I fhagyslkkrhffru 
jillppiifhagyslkkrhffrk 

EVFFjSlLLPP 1 1 FHAGYSLKKRHFFR1 



220 
- - I . . 



230 
..I 



240 



LGSI LAYAFLGTAwS CFmIGi 
ILGS ILAYAFLGTAgS CFOIG! 

ilgsilayaflgtaS^pf 
Ilgsilayaflgt 
|lgs i l ay aflgtaps c i g \ 



AgSCFMIGgJMYGEvgL 
'AgSCFMIGiSiMYGRvgL 

s cfSig-^m ygBvBl 
'aiiscfSigSmygRvHl 



Jim 187 

m 240 
tM 131 
ifr 205 
208 
121 



I 



250 
..I.. 



260 



I. 



270 
..I.. 



I. 



280 



290 
..| .. 



300 



3QLg5gFY2TDCLgFGAI0SATDPVTVLAIFSELiISDV!SLYALLFGESVLNDAVAIVLSS 



GQL §ffi FY S TDCL S FGAI 8 SATDPWVLAIF S EL S D 

GQLJ^FYyTDClflFGAlMSATDPVTVLAI^^ 
GOL^ffiFYpTDCI^FGAlBsATDPVTVLAIFitiELiSpV^LYALLFGESVLNDAVAIVLSS 



LYALLFGESVLNDAVAIVLSS 



247 
300 
191 
265 
268 
121 



310 



320 



330 



248 



340 



FKSVGIFLGIFSGSFTMGAVTGWTALi 



350 

.(.... I 

SFLQNANj 



360 
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gi 


14211919| 


301 


gi 


3319946 




192 


gi 


1665827 




266 


gi 


5454070 




269 


gi 


17474970) 


121 


NOV71 




308 


gi 


14211919) 




gi 


3319946 




243 


gi 


1665827 | 


318 


gi 


5454070 1 


321 


gi 


17474970 | 


153 


KOV71 




368 


gi 


14211919) 


412 


gi 


3319946 




303 


gi 


1665827 




378 


gi 


5454070 




381 


gi 


17474970) 


213 



NOV71 



14211919 | 

3319946 

1665827 

5454070 

17474970) 



NOV71 



14211919 

3319946 

1665827 

5454070 

17474970 



SIVAYQPAGffiNj 
SIVAYQPAGffiNj 

sivayqpagSn! 
sivayqpaqw 



JhSfdSBaS: 
|hhfd^SaS: 



FKSVGIFLGIFSGSFTMGAVTGHygAj 

fksvgiflgifsgsftmgavtghvSa^ 

FKSHgIFLGI FSGS FgMGAgTGWTAL 

fksBgiflgifsgsfSmgaStgwtal 



NOV71 



14211919 

3319946 

1665827 

5454070 

17474970 



FKSyGIFLGIFSGS FgJMGA^TGVVTAL 

ks vgi flgi fsgHftmgavtg WTAL 




370 
. . I . . 



380 



390 



400 



410 



420 



| 



I ■ 



KLgCFPLLETALFFLMSWSTFLLAEACGFTGWAVLFCGITQAHYTYNKLSVESRSRTKQ 
KLgCFPLLETALFFLMSWSTFLLAEACGFTGWAVLFCGITQAHYTYNNLSVESRSRTKQ 
KijtaFPLLETALFFLMSWSTFLLAEACGFTGW 

KifeFELLETSLFFLMSWSTFLLAEAffiGFTGWAVLFCGITQAHYTYNm 

kl^f^letglfflmswstfllaeaflgftgwavlfcgitqahytyi^saesjgrtkq 
klScfplletalfflmswstfllaeacgftgwavlfcgitq^ 



428 
472 
363 
438 
441 
268 



508 
592 
483 
526 
529 
388 



NOV71 


568 


gi 


14211919 | 


652 


gi 


3319946) 


543 


gi 


1665827 | 


586 


gi 


5454070) 


589 


gi 


1747*4970 | 


412 



367 
411 
302 
377 
380 
212 



43 0 



440 



450 



J. 



460 



470 



480 



LFESLWFLAENFIFSYMGLALFTFQKHVFSPHFIIGAFVAIFLGRAAHIYPLSFFLNLGRi 



lfeSdTiflaenfifsymglalftfqkhvfspmfiigafvaiflgraahiyplsfflnlgr 
lfeSlSflaenfifsymglalftfqkhvfsp|fiigafvaiflgraahiyplsfflnlgr! 

LFESLSFLAENFIFSYMGLgLFTFQgHVFgp|F^GAFVAIFLGRAA|lYPLSgLNLGR! 
LFE[|lJ§FLAENFI FS YMGLflLFTFQgHVFgpBF^GAFVAI FLGRAAiJl YPLS^LNLGR 
I,FF^^BAENFIFSSMnLALFTFQKHVF5PBFIIGAFVA5FLGRAAHIYPLSFFLgLGR 



427 
471 
362 
437 
440 
267 



490 



500 



510 



520 



530 



540 



RHKI GWNFQHMMMFS GLRG AMAFALAI RDTAS Y ARQMM FyTTLL I VFFT VW I IGGGTT PM 
RHKI GWNFQHMMMFSGLRGAMAFALAIRDTAS YARQMMFyTTLLI VFFTVWI IGGGTTPM 
RKKI GWNFQHMMMFS GLRG AMAFALAI RDTAS YARQMM FyTTLL I VFFT VWI IGGGTT PM 
P^KIG^FQHMMMFgGLRGAMAFALAIRDTAgYARQMMFgTTLLIVFFTVWraGGGTTgM 
R@KI G@N FQKMMMF§GLRGAMAF ALAI RDTAbYARQMMFbTTLL I VFFTVWflgGGGTTgM 



PRyTf?Wl>7FOW^MFRnT.RGAMAFALAl{foTASYAROMBF^Ty33lVFFTgWIIGGG'I 



487 
531 
422 
497 
500 
327 



NOV71 




488 


LS 


gi 


14211919) 


532 


hS 


gi 


3319946 




423 


LS 1 


gi 


1665827 




498 


LSI 


gi 


5454070 




501 


ls! 


gi 


17474970) 


328 


LS 




560 



PSEEDQNEHHWQYFRVi 
PSEEDQNEHHWQYFRVI 



I KEPSKEDHNEHHRQYFRV* 



610 

..!.. 



I 



620 



QESAWIFRnPWYgFDHNYLKPgLTHSGPPLTTTLPA^jCGl 

qesawifrSwyBfdhnylkpBlthsgppltttlpaScgwJaR' 
qesawifrSwyBfdhnylkpBlthsgppltttlpaBcgiJBaR' 
esa^2fr|wySfdhnylkp5lthsgppltttlpaHcgE 
esawSfrRwySfdhnylkpslthsgppltttlpaScgl 




.cltspqvydnqeplregd 
cltspqvydnqeplreBd 



iPQVYDNQEPLREgi 



567 
651 
542 
585 
588 
411 



670 



680 



690 



700 



710 



720 



sdfiltegdltltygdstvtang 
sdfiltegdltltygdstvtang 
sdfiltegdltltygdstv tang 
sdSil^gdHSltygdstvi" 

SDgILgjGDJ£LTYGDSTVi 

SDFILTEGDLTLTYGDSTVTANG 




730 



740 




641 

725 

616 

g^EPPLNLLDNTRHGPA 666 
*TE PPLNLLDNTRHGPA 669 
485 
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Table 71F lists the domain description from DOMAIN analysis results against NOV7L 
This indicates that the N0V71 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 71F. Domain Analysis of NO V71 

gnl lPfam|pfam00999 , Na H Exchanger. Sodium/hvdroqen exchanger family. Na/H 

antiporters are key transporters in maintaining the pH of actively 

metobolizing cells. The molecular mechanisms of antiport are unclear. These 

antiporters contain 10-12 transmembrane regions (M) at the amino -terminus and 

a large cytoplasmic region at the carboxyl terminus. The transmembrane 

regions M3-M12 share identity with other members of the family* The M6 and M7 

regions are highly conserved. Thus/ this is thought to be the region that is 

involved in the transport of sodium and hydrogen ions. The cytoplasmic region 

has little similarity throughout the family. SEQ ID NO: 865 

CD-Length = 400 residues, 88.5% aligned 
Score = 276 bits (706), Expect = 3e-75 

N0V71; 123 VTFDPEVFFNI LLPPI I FHAGYSLKKRHFFRNLGS I LAYAFLGTAVSCFI IGNLMYGWK 182 

V D EVFF ILLPPI+F AG L R FRNLGSIL A LG + IG LMY +V 
Sbjct: 46 VDLDSEVFFEI LLPPILFEAGLELDLRELFRNLGSILLLALLGVLI PALGIGGLMYALVP 105 

N0V71: 183 LMKIMGQLSDKFYYTDCLFFGAI I SATDPVTVIAI FNEL - HADVDLYALLFGES VLNDAV 241 

++ + + L FGAI+SATDPV VLA+ EL L L+FGES+LND V 

Sb j Ct : 106 ILGL- - DFLAALLFGAILS ATDP VAVLAVLKELGRVPKRLGTL I FGE SLLNDGV 157 

NOV71: 242 AIVLSSSI VAYQPAGLNTHAFDAAAFFKSVGI FLGIFSGS FTMGAVTGWTALISFLQNA 301 

A+VL + ++++ G A+A F + FL+FG +GVG + +LI 
Sbjct: 158 AWLLAVL ISFALGG AVEAFDIFLGILSFLWFLGGILIGLVLGYLLSLI 207 

NOV71: 302 NVTKFT^LHCFPLLETALFFIiMSWSTFLLAEACGFTGVVAVLFCGI TQAHYTYNNLSVES 361 

T+FT L+E L L+++ +LLAE G +G++AV G+ ++Y N+S +S 

Sbjct: 208 - - TRFTFRE - DRLI EPLLVLLLAYLAYLLAE ILGLSGI LAVFAAGLALSNYVEANI SEKS 264 

N0V71: 362 RSRTKQLFEVLHFLAENFI FS YMGLALFTFQKHVFSPIFI IGAFVAI FLGRAAHIYPLS F 421 

R+ K ++VL FL E IF +GL+L H ++ 1+ A V I L RA ++ L+ 

Sb j ct : 265 RTTEKYFWKVLSFLFEPLI FVLLGLSLDLSVLHNWNIALILLAI VL ILLARAIGVFLLTL 324 

N0V71: 422 FLNIjGRRHKI GWNFQHMMMFSGLRGAMAFAIiAI RDTAS Y - - ARQMMFTT3I»LIVFFTVW I 479 

LN RR KI + Q ++ + GLRGA+A ALA+ + AR ++ TT +++V TV + 
Sbjct: 325 LLNFFRREKIPFGDQLVIGWGGLRGAVALALALSGPLTSGPARDLI LTTAI I WLVTVLV 384 

N0V71: 480 IGGGTTPMLSWLNIR 494 
G P++ L ++ 

Sbjct: 385 QGITLKPLVKKLRVK 399 



Na+/H+ exchangers are integral membrane ion transporter proteins that exchange 
extracellular Na+ for intracellular H+ with a stoichiometry of one for one. They have multiple 
cellular functions, including maintenance of intracellular pH, cell volume control, and 
1 0 reabsorption of sodium across renal, intestinal, and other epithelia. Multiple Na+/H+ 
exchanger isoforms (NHE1-NHE6) exist, exhibiting considerable differences in their 
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membrane localization, biochemical and pharmacologic properties, and responsiveness to 
various stimuli. For example, NHE1, the most predominant isoform expressed in heart, 
contributes significantly to myocardial intracellular pH. Hyperactivation of NHE1 during 
episodes of cardiac ischemia and reperfiision has been shown to disrupt the intracellular ion 
5 balance, leading to cardiac dysfunction and damage in several animal models, which can be 
prevented by pharmacological antagonists of NHEH. Increased activity of sodium/hydrogen 
exchange also provides a potentially important mechanism for the development of 
hypertension. 

In 1998, Numata et al. identified the gene encoding a novel isoform of 
10 sodium/hydrogen exchanger that they called NHE6 or SLC9A6 (3). The NHE6 protein has 
similar topology to the other NHEs in that it has 12 putative membrane-spanning segments 
within the N-terminal region and a hydrophilic C terminus. However, NHE6 also has a 
putative mitochondrial inner membrane-targeting signal at its N terminus. The NOV71 protein 
is homologous to the NHE6 protein, except that it is predicted to localize to the plasma 
1 5 membrane instead of the mitochondrial inner membrane. The NHE6-like gene maps to human 
chromosome Xpl 1 . Based on its expression pattern, NOV71 may play a role in renal or 
metabolic diseases and immune function through its sodium/hydrogen exchange activity at the 
plasma membrane. 

• NOV71 is predicted to be expressed in at least the following tissues: kidney, tonsils, 
20 germinal B cells, uterus, pituitary gland, brain, skeletal muscle, heart, lung, liver, pancreas, 
small intestine, colon, kidney, spleen, thymus, peripheral blood leukocytes, testis, ovary, 
placenta and prostate. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
25 NOV71 is provided in Example 2. 

The NOV71 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, cancer, 
trauma, bacterial and viral infections, in vitro and in vivo regeneration, endometriosis, fertility, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, immunodeficiencies, 
30 graft versus host disease, diabetes, autoimmune disease, renal artery stenosis, interstitial 

nephritis, glomerulonephritis, polycystic kidney disease, systemic lupus erythematosus, renal 
tubular acidosis, IgA nephropathy, hypercalcemia, Lesch-Nyhan syndrome, endocrine 
dysfunctions, diabetes, obesity, growth and reproductive disorders, tonsillitis as well as other 
diseases, disorders and conditions. 
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The NOV71 nucleic acids encoding the sodium/hydrogen exchanger NHE6-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. The 
novel nucleic acid of the invention encoding a Sodium/Hydrogen Exchanger 6-like protein 
5 includes the nucleic acid whose sequence is provided in Table 71 A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 71 A while still encoding a protein that maintains 
its Sodium/Hydrogen Exchanger 6-like activities and physiological functions, or a fragment of 
such a nucleic acid. The invention further includes nucleic acids whose sequences are 

1 0 complementary to the sequence of Table 71 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

15 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 34% of the bases 
may be so changed. 

20 The novel protein of the invention includes the Sodium/Hydrogen Exchanger 6-Iike 

protein whose sequence is provided in Table 71B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 71B while still encoding a protein that maintains its Sodium/Hydrogen Exchanger 6- 
like activities and physiological functions, or a functional fragment thereof. In the mutant or 

25 variant protein, up to about 25% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies'' section 

30 below. 

NOV72 

The disclosed NOV72 (alternatively referred to herein as CG56910-01) includes the 
1094 nucleotide sequence (SEQ ID NO:247) shown in Table 72A. A NOV72 ORF begins 
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Kozak consensus ATG initiation codon at nucleotides 14-16 and ends with a stop codon at 
nucleotides 1082-1084. The disclosed NOV72 maps to human chromosome 6. 



Table 72A. NOV72 Nucleotide Sequence (SEQ ID 
NO:247) 

ACTTCTATAAGACATGGATAGATGCAAACATGTAGGGCGGTTAGCCCCAGGCGTCACGGG 
CCTGCGCAACCTGGGCAACACCTGCTACATGAACTCCATCCTCCAGGTGCTCAGCCACCT 
CCAGAAGTTCCGAGAATGTTTCCTCAACCTTGACCCTTCCAAAACGGAACATCTGAGTTC 
AAAGCACATTTCCCTCTGCCGTGAACTGCACACCCTCTTCCGAGTCATGTGGTCCGGGAA 
GTGGGCCCTAGTGTCGCCCTTCGCCATGCTCCACTCAGTGTGGAGCCTGATCCCTGCCTT 
CCGCGGCTACGACCAACAGGACGCGCAGGAATTTCTCTGCGAGCTGCTGCACAAGGTGCA 
GCAGGAACTCGAGTCTGAGGGCACCACACGCCGGATCCTCATCCCCTTCTCCCAGAGGAA 
GCT CAC CAAACAGGTC TTAAAGGTGGTGAATAC CAT ATTTCATGGGCAGCTG C T CAGTC A 
GGTATGTCTGGTCACATGTATATCATGCAATTACAAATCCAATACCATTGAGCCCTTTTG 
GGACCTATCCCTGGAATTCCCTGAACGCTATCACTGCATAGAAAAGGGGTTTGTCCCTTT 
GAATC AAACAG AGTG C TTG CTCACTGAG ATGCTGGC CAAATT CAC AG AGACAGAGG C CCT 
GGAAGGGAGAATCTACGCTTGTGACCAGTGTAACAGTGAGTGCTGTGTGAAGCAGTTAAT 
GATCTACAGACTACCTCAG^TTCTCCGGCTGCACCTTAAAAGATTCCATCGAGAGAAGAT 
TGGGGTCCATGTCGTCTTTGACCAGGTATTAACCATGGAACC^ 

GCTCTCCTCTCTTGACAAAGAGACCTTTGCCTATGATCTCTCCGCAGTGGTCATGCAT(^ 
CGGGAAAGGGTTTGGCTCAGGACACTACACAGCCTATTGCTACAAGACAGAGGGAGGTTT 
TTGGGTCCACTGCAATGACTCAAAGCTGAATGTATGCAGTGTCGAGGAAGTGTGCAAAAC 
CCAGGCCTACATCCTTTTTTACGCGCTGACTGAGATGGCGCTGAGTGAATGTGGAAGGTG 
CTAAGACCCAGTCT 



A NOV72 polypeptide (SEQ ID NO:248) encoded by SEQ ID NO:247 is 356 amino 
acids in length and is presented using the one-letter amino acid code in Table 72B. The Psort 
profile for NOV72 predicts that this sequence has no signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.5050. In alternative embodiments, a NOV72 
polypeptide is located to lysosomes with a certainty of 0.1000, to the mitochondrial matrix 
space with a certainty of 0.1000, or to paroxisomal microbodies with a certainty of 0.3547. 



Table 72B. NOV72 Polypeptide Sequence (SEQ ID 
NO:248) 



^RCKHVGRLAPGVTGLRNLGNTCYMNS I LQVLSHLQKFRE CFLNLDPS KTEHLS SKHI S 

LCRELHTLFRVMWSGKWALVSPFAMLHSVWS 

SEGTTRRILIPFSQRKIiTKQVLKVVNTira^ 

EFPERYHCIEKGPVPI^QTECLLTEMLAKFTETEALEGRIYACDQCNSECCVKQLMIYRL 
PQVLRLHLKRFHREKIGVHWFDQVLTMEPYCCRDM^ 

GSGHYTAYCYNTEGGFWVHCNDSKLNVCSVEEVCKTQAYILFYALTEMALSECGRC 



A BLAST analysis of NOV72 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV72 had high 
1 5 homology to other proteins as shown in Table 72C. 



Table 72C BLASTX results from PatP database for NOV72 



Smallest 



5 



10 
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Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P 


(N) 




patp:AAB92670 Human protein sequence 


1133 


2 


.oe- 


-121 


patp:AAB42259 Human ORFX ORF2023 polypeptide sequence 


497 


2 


7e- 


-47 


patp:AAYl3ll5 Human secreted protein encoded by 5' EST 


492 


9 


le- 


-47 


patp:AAM79l94 Human protein 


487 


3 


.le- 


-46 


patp:AAB74672 Human protease and protease inhibitor 


464 


8 


4e- 


•44 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 215 of 342 bases (62%) identical to a gb.GENBANK- 
lD:AF073344|acc:AF073344.1 mRNA from Homo sapiens (ubiquitin-specific protease 3 
5 (USP3) mRNA). The full amino acid sequence of the protein of the invention was found to 
have 124 of 345 amino acid residues (35%) identical to, and 183 of 345 amino acid residues 
(53%) similar to, the 353 amino acid residue ptnr:SWISSNEW-ACC:088623 protein from 
Mus musculus (Mouse) (UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 2 (EC 
3.1.2.15) (UBIQUITIN THIOLESTERASE 2) (UBIQUITIN-SPECIFIC PROCESSING 
1 0 PROTEASE 2) (DEUBIQUITINATING ENZYME 2) (41 KDA UBIQUITIN-SPECIFIC 

PROTEASE)). NOV72 also has homology to the other proteins shown in the BLASTP data in 
Table 72D. 



Table 72D. NOV72 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

<%) 


Expect 


gi | 15208127 |d 
bj |BAB63088.1 
| (AB070143) 


hypothetical protein 
[Macaca f ascicularis] 


550 


282/317 
(88) 


287/317 
(89) 


e-157 


gi | 16174134 |r 
ef |XP_057397. 
1 1 (XM 057397) 


similar to Unknown 
(protein for MGC:20741) 
(H. sapiens) 


640 


255/289 
(88) 


260/289 
(89) 


e-140 


gi| 14149817 |r 
ef |NP_115523. 
l| (NM 032147) 


hypothetical protein 
DKFZp434D0127 
[Homo sapiens] 


712 


221/307 
(71) 


247/307 
(79) 


e-119 


gi| 15451368 |d 
bj |BAB64488.1 
| (AB071094) 


hypothetical protein 
[Macaca fascicularisj 


585 


205/239 
(85) 


210/239 
(87) 


e-108 


gi | 16041104 |d 
bj (BAB69719.1 
| (AB072750) 


hypothetical protein 
[Macaca fascicularisj 


497 


111/213 
(52) 


123/213 
(57) 


2e-45 



1 5 This BLASTP data is displayed graphically in the ClustalW in Table 72E. A multiple 

, sequence alignment is given, with the NOV72 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 72D. 
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Table 72E. ClustalW Alignment of NOV72 



NOV72 



15208127 
16174134 
14149817 
15451368 
16041104 



NOV72 

gi 1 15208127 
gi I 16174134 
gij 14149817 
gi j 15451368 
(16041104 



NOV72 

gi[ 15208127 
gij 16174134 
gi jl4149817 
gij 15451368 
gij 16041104 



N0V72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 



N0V72 

gi|l5208127 
gi | 16174134 
gij 14149817 
gi j 15451368 
gi | 16041104 



N0V72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


N0V72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 



(SEQ ID NO:248) 
(SEQ ID NO: 659) 
(SEQ ID NO: 660) 
(SEQ ID NO: 661) 
(SEQ ID NO: 662) 
(SEQ ID NO: 663) 



10 



NOV72 



20 



30 



40 



50 




60 



70 



80 



90 



100 




110 



120 



TO 


3 


PE 


GDLKLLR 

f~i-r\T Zr"T T T) 


S 


JD 


N 
N 


AT 
PE 


GD-LiKllLlK 

GDLKLLK 


R 
S 




N 


AT 


GDLKLLR 


S 


140 


150- 



200 




250 

























QGLAQSTI IEIVSg 






QGIiAQSTIIEIVPJj 




260 



270 



280 



290 



300 




310 



320 



330 



340 



350 
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.gi 


15208127 




gi 


16174134 




gi 


14149817 




gi 


15451368 




gi 


16041104 





NOV72 



gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


NOV72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


NOV72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


NOV72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


NOV72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


NOV72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 


NOV72 


gi 


15208127 


gi 


16174134 


gi 


14149817 


gi 


15451368 


gi 


16041104 




FPKATNGKTQLSGKP^SATEI^LRSDRAEAC 
|FPKATNGKTQLSGKP^SSATI^ 
QWLAM TASEKTRS - CKHPPVTOajVVY^^CQEKDTGFVCS 
jFPKATNGKTQLSGKPjp^SAI^t^RSDRAEACE^E 
jWLA^ASEKKRS FSKHP P V£p3^ 



400 




gfc m 

GFC m 

RQSSLSSGLS 

GFC WK 

RQSSLSSGLS 




460 



470 



I 



ILIPFSQRKLTKQVLKWNTIFHGQLLSQ 
ILIPFSQRKLTKQVLKWNTIFHGQLLSQ 
ILIPFSQRKLTKQVLKWNTIFHGQLLSQ 

LIp3sQRKL0KQVLjSwNiJilFHGQLLSQ 
ILIPFSQRKLTKQVLKWNTIFHGQLLSQ 

LIPBSQRKLBKQVLjSWNjJjIFHGQLLSQS 




SVEG- 



510 



520 
I....L. 



530 



540 
.J.. 



550 

..Ij' 



LSLEFPERYHCISKGFVPLNQTECLLTEMLAKFT5TEALEGB^BjlY 
LSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEALEG^^^KlY 
L SLE FPERYHCI5K G FVPLNQTE CLLTEMLAKFTETEALSgBS^Jn T Y 
LSLEFPERYBcggKBEg^ggci gTEMI AKFTETEAI ^t^^'" 
LSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEALEG 

fp! 



KIYV( 



CDQCN 

-m 




570 
|....|.. 



580 



590 



KQLMIYRLPQVLRLHLKRF 



600 



SNPKPL 
HKSNPKPt 

SKPppr§3Kp jv 

SNPKPI, 



vlgeagkqlmiyrlpqvlrlhlkrfrwsgrnhr 
ivl9ea§kqlmiyrlpqvlrlhlkrfrwsgrnhr 
vlHeaSkqlmi^lpqvlrlhlkrfrwsgrn^r 
vlseajJkqlmiyrlpqvlrlhlkrfrwsgrnhr 

rn! 



620 



630 




660 



690 




700 



670 680 
-|....|....|....|....| 
FWVHCNDSKLNVCSVEEVCKTQAYILFY 

CNXJSKLNVCSVEEVCKTQAYILFYTQRTVQGNAR 

.CALLCGVGDTERG 

'WVHCNDSKLSMCTMDEVCKAQAYILFYTQRVTENGHS 
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710 720 


NOV72 


....).... |....|.. ..]....).. 


gi 


15208127 


I SETHLQAQVQSSNNDEGRPQTFS - - - 


gi 


16174134 




gi 


14149817 


KLLPPELLLGSQHPNEDADTSSNEILS 


gi 


15451368 




gi 


16041104 





Table 72F lists the domain description from DOMAIN analysis results against NOV72. 
This indicates that the NOV72 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 

Table 72F. Domain Analysis of NOV72 

j~~ gnl|Pfam|pfam00443 # UCH-2, Ubiquitin carboxyl- terminal 
hydrolase family 2. SEQ ID NO: 866 

CD- Length = 68 residues, 91.2% aligned 
Score = 59.7 bits (143), Expect = 3e-10 

N0V72 : 2 87 YDLSAVVMHHGKGFGSGHYTAYCYNTEGGFWVHCNDSKLNVCSVEEVCK -TQAY 339 

Y+L AW+H G GHY AY G W +D K++ + EEV + + AY 

Sb j Ct : 5 YEL YAVWHSG - SL S GGHYI AYVKKENDG- WYKFDDDK VSRVTEEEVLEFSGGGET S SAY 62 

NOV72: 340 ILFY 343 
ILFY 

Sbjct; 63 ILFY 66 

Ubiquitin is a highly conserved 76-amino acid protein involved in the regulation of 
intracellular protein breakdown, cell cycle regulation, and stress response. Ubiquitin is 
released from degraded proteins by disassembly of the polyubiquitin chains, which is mediated 

10 by ubiquitin-specific proteases (USPs). The ubiquitin-specific proteases are a family of 
largely dissimilar enzymes with two major conserved sequence regions, containing either a 
conserved cysteine residue or two conserved histidine residues, respectively. The murine Unp 
oncoprotein and its human homologue, Unph, both contain regions similar to the conserved 
Cys and His boxes common to all the Ubps. 

15 Unp and Unph have been shown to be active deubiquitinating enzymes, able to cleave 

ubiquitin from both natural and engineered linear ubiquitin-protein fusions, including the 
polyubiquitin precursor. Mutation of the conserved Unp Cys and His residues abolishes this 
activity, and identifies the likely His residue in the catalytic triad. Unp is tumorigenic when 
overexpressed in mice, leading to the suggestion that Unp may play a role in the regulation of 

20 ubiquitin-dependent protein degradation. It was demonstrated that the high-level expression of 
Unp in yeast does not disrupt the degradation of the N-end rule substrate Tyr-beta- 
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galactosidase (betagal), the non-N-end rule substrate ubiquitin-Pro-betagal, or the degradation 
of abnormal, canavanine-containing proteins. 

Data suggests that Unp is not a general modulator of ubiquitin-dependent proteolysis. 
However, Unp may have a role in the regulation of the degradation of a specific, as yet 
5 undescribed, substrate(s). The novel human Ubiquitin-Specific Protease-like Proteins of the 
invention contains two Ubiquitin carboxyl-terminal hydrolase domains. Therefore, it is 
anticipated that NOV72 has a role in regulation of specific ubiquitins and could be a 
potentially important target for drugs. Such drugs may have important therapeutic 
applications, such as treating numerous tumors. 
10 NOV72 is predicted to be expressed in at least the following tissues: bladder and 

cervix. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, public EST 
sources, literature sources, and/or RACE sources. Further expression data for NOV72 is 
provided in Example 2. 

1 5 The NOV72 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, the 
compositions of the present invention will have efficacy for the treatment of patients suffering 
from: cystitis, incontinence, fertility, systemic lupus erythematosus, autoimmune disease, 
asthma, emphysema, scleroderma, allergy, ARDS, Von Hippel-Lindau (VHL) syndrome, 

20 Alzheimer's disease, stroke, tuberous sclerosis, hypercalcemia, Parkinson's disease, 

Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 
ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 
neurodegeneration. as well as other diseases, disorders and conditions. The NOV72 nucleic 
acid encoding the ubiquitin specific protease-like protein of the invention, or fragments 

25 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a ubiquitin-specific protease-like 
protein includes the nucleic acid whose sequence is provided in Table 72A, or a fragment ' 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

30 be changed from the corresponding base shown in Table 72A while still encoding a protein 
that maintains its ubiquitin-specific protease-like activities and physiological functions, or a 
fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 72A, including nucleic acid fragments that are complementary to any of 
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the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 

5 carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 38% of the bases may be so changed. 

The novel protein of the invention includes the ubiquitin-specific protease-like protein 

10 whose sequence is provided in Table 72B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
72B while still encoding a protein that maintains its ubiquitin-specific protease-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 65% of the amino acid residues may be so changed. 

15 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from Jiydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

20 

NOV73 

The disclosed NOV73 (alternatively referred to herein as CG56822-01) includes the 
967 nucleotide sequence (SEQ ID NO:249) shown in Table 73 A. A NOV73 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 26-28 and ends with a stop codon at 
25 nucleotides 935-937. The disclosed NOV73 maps to human chromosome 2. 



Table 73A. NOV73 Nucleotide Sequence (SEQ ID 
NO:249) 

GCAGGTTCTTTTACAGGGAGCCACCA^ 

CGAAGCTTT AG AAAAATCAAAAGAAAC TGCACTCTCTCATTTATTTTT CAC CT AT C AGGG 
GATTCCTTACCCCATCACCATGTGCACCTCAGAAACTTTCCAAGCGCTGGACACCTTCGA 
AGCCAGACATGATGACATCGTGCTAGCATCTTATCCAAAGTGCGGTTCAAACTGGATTCT 
CCACATTGTCAGTGAATTAATATATGCTGTTTCTAAAAAAAAGTATAAATATCCAGAATT 
C CCAGTTCTTG AATGTGGGGAT T CAGAAAAAT ATCAGAGAATGAAAGG CTTTC CAT CAC C 
AAGGATTTTGGCAACT CAC CT CCACTATGACAAAT T ACCTGGGTC T AT CTT CGAGAATAA 
AGCCAAGATATTGGTGATATTTCGAAACCCTAAAGATAC^ 

CCACAACGATGTCCCCG^TATTCCAAGCTATGGCTCTTGGGATGAATTCTTCAGACAGTT 
CATGGTGTTTTT AGTTTC TTGGGGAAGGTATTTTGATTTTGCAAT CAATTGGAACAAACA 
TCTTGATGGCGACAATGTTAAGTTCATATTATATGAAGACCTGAAAGAGGTGAGATTATT 
AGGAATAAAACAGATTGCTGAGTTLTTGGGATTCTT^ 
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TATCTCAGTCCAGAGCACCTTCCAAGCCATGCGTGCGAAGTCTCAGGACACACACGGTGC 
TGTCGGCCCATTCCTTTTC CGCAAAGGTAAAGTCG CAGATTGGAAAAATTTGTTCAGTGA 
AATTC AGAAC CAGGAAATGGATGAAAAATTCAAAGAGTGCTTAGCAGGCACCT CCCTCGG 
AGCAAAGTTGAAGTATGAATCATATTGCCAGGGTTGATTCCAGTCAATTCAGCAGGCCTA 
GATTTAT 



A NOV73 polypeptide (SEQ ID NO:250) encoded by SEQ ID NO:249 is 303 amino 
acids in length and is presented using the one-letter amino acid code in Table 73B. The Psort 
profile for NOV73 predicts that this sequence has no signal peptide and is likely to be 
localized to peroxisomal microbodies with a certainty of 0.7480. In alternative embodiments, 
a NOV73 polypeptide is located to lysosomes with a certainty of 0.1000, or to the 
mitochondrial matrix space with a certainty of 0.1000. 



Table 73B. NOV73 Polypeptide Sequence (SEQ ID 
NO:250) 



MADKS KF I EYIDEALEKS KETALSHL FFT YQG I PY PITMCTSETFQALDTFEARHDDI VL 
AS YPKCGSNWILHI VSELI YAVS KKKYKYPEFPVLECGDSEKYQRMKGFPS PRILATHLH 
YDKLPGS I FENKAKILVI FRNPKDTAVS FIoHFHNDVPDI PSYGSWDEFFRQFMVFLVS WG 
RYFDFAINWNKHLDGDNVKFILYEDLKEVRLLGI KQI AEFLGFFLTGEQI QTI SVQSTFQ 
i^RAKSQDTHGAVGPFLFRKGKVADWKNL FSEI QNQEMDEKFKECLAGTSLGAKLKYES Y 
CQG 



1 0 A BLAST analysis of NOV73 was run against the proprietary PatP GENESEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV73 had high 
homology to other proteins as shown in Table 73C. 



Table 73C. BLASTX results from PatP database for NOV73 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAU07758 Human novel transferase protein, 


1562 


3.7e-160 


patp:AAU07760 Human novel transferase protein, 


1366 


2.2e-139 


patp:AAU07765 Human novel transferase protein, 


1013 


5.5e-102 


patp:AAU07763 Human novel transferase protein, 


930 


3.5e-93 


patp:AAU0776l Human novel transferase protein, 


734 


2.1e-72 



15 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 427 of 643 bases (66%) identical to a gb:GENBANK- 
ID:AF033189|acc:AF033189.1 mRNA from Gallus gallus (Gallus gallus sulfotransferase 
mRNA). The full amino acid sequence of the protein of the invention was found to have 151 
of 307 amino acid residues (49%) identical to, and 212 of 307 amino acid residues (69%) 

20 similar to, the 3 12 amino acid residue ptnr:SPTREMBL-ACC:057338 protein from Gallus 
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gallus (Chicken) (SULFOTRANSFERASE). NOV73 also has homology to the other proteins 
shown in the BLAST? data in Table 73D. 



Table 73D. NOV73 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 
(%) 


Positive 
(%) 


Expect 


gi| 17447308 |r 
ef |XP_065865. 
l| (XM_065865) 


similar to 
sulfotransf erase (H. 
sapiens) 
[Homo sapiens] 


303 


294/303 
(97) 


295/303 
(97) 


e-168 


gi|2687360|gb 
|AAB88818.l| 
(AF033189) 


sulfotransf erase 
[Gallus gallus] 


312 


151/307 
(49) 


212/307 
(68) 


4e-84 


gi| 12229955 |s 
p | Q9WUW8 | STK1 
RAT 


SULFOTRANSFERASE Kl 
(RSULT1C2) 


296 


94/294 
(31) 


146/294 
(48) 


3e-35 


gi| 18079235 ) r 
ef |NP_081211. 
l| (NM 026935) 


sulfotransferase family i 
cytosolic, 1C, member 1 
[Mus musculus] 


296 


92/292 
(31) 


147/292 
(49) 


5e-35 


gi 11262122 |p 
ir |JC7283 


hydroxyaryl amine 
sulfotransferase (EC 
2.8.2) 2A - rat 


296 


93/294 
(31) 


149/294 
(50) 


2e-34 



This BLASTP data is displayed graphically in the ClustalW in Table 73E. A multiple 
sequence alignment is given, with the NOV73 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 73D. 



Table 73E. ClustalW Alignment of NOV73 



N0V73 



17447308 J 

2687360) 

12229955| 

18079235| 

11262122 



(SEQ ID NO: 250) 
(SEQ ID NO: 664) 
(SEQ ID NO: 665) 
(SEQ ID NO: 666) 
(SEQ ID NO: 667) 
(SEQ ID N0:668) 



10 



N0V73 



gi 


17447308 | 


gi 


2687360 | 


gi 


12229955 


gi 


18079235 


gi 


11262122 


NOV73 


gi 


17447308] 


gi 


2687360) 


gi 


12229955 


gi 


18079235 


gi 


11262122 



MAD-KSKFIEYjED] 
MAD-KSKFIEYpD] 
MEKSRKKFIDVpjD] 





30 

| |. 
HUFFTYQGIPY 
HIjFFTYQGIPY 
' iFSYKGVLY 
'AG 
'AG 



80 




90 



100 



iE-SlYAVS: 
JE^IYAVSl 

EQNG] 

iqng] 



TFKJAK<QDD» 



110 



120 



130 




skSSndeelee 
^iqhr^Sf 

jC^-lEEIQH 



150 
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NOV73 


gi 


17447308| 


gi 


2687360) 


gi 


12229955 




gi 


18079235 




gi 


11262122 





NOV73 



17447308 
2687360 | 
12229955 
18079235 
11262122 



N0V73 



17447308 
2687360| 
12229955 
18079235 
11262122 



fpvlecgdsekyqrmkgf 
fpvlecgdsekyqrmkg 
fpSleigdtekyeSmkk| 

IE^ARPPQPSGVDKANi 
IE|WARPPQPSGVDKANE| 
IESARPPQPSGV&K 




NOV73 


P 


KD 


gi 


17447308| P 


KD 


gi 


2687360 | 


P 


KD 


gi 


12229955 




2s 


gi 


18079235 




KD 


gi 


11262122 




KD 





260 270 

I • • • - I I 

QDTHGA^G P-- 

QDTHGA^G P FLi 

Skthga^g S— 

IPMjTNRSltsiPKSVLDQS IS P! 
[PfiTNRSTAPKSILDQSISP: 
•jiTNFSTiPKTIMDQSISP: 

310 3 



NOV73 




gi 


17447308| 




gi 


2687360| 




gi 


12229955 




3K#G| 


gi 


18079235 






gi 


11262122 







Table 73F lists the domain description from DOMAIN analysis results against NOV73. 
This indicates that the NOV73 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 73F. Domain Analysis of NOV73 


gnl|Pfam|pfam00685, Sulfo transfer, SulfotranaferasA proton skq to wn.fl^ 






CD-Length * 269 residues, 93.3% aligned 






Score = 


176 bits (446), Expect » 2e-45 




NOV73 : 


49 


DTFEARHDDIVLASYPKCGSNWILHIVSELIYAVSKKKYKYPE FPVLECGDSEK- 


102 






+ F+AR DD+++A YPK G+ W+ I+S L V + + P LE E 




Sbjct: 


18 


NCFQARPDDVLIAGYPKSGTTWLQEILS - LHPNVGDFEPS PSDPLLFRNP WLE YPKGEDW 


76 


N0V73 : 


103 


YQRMKGFPS - PRILATHLHYDKLPGSIFEinCAKILVIFRNPKDTAVSFLHFHNDVPDI PS 


161 






Y+ +K PS PR++ THL + LP S +KAKI+ + RNPKD AVS+ HF D+P+ 




Sbjct: 


77 


YETLKPMPSSPRLIKTHLPI^LLPKSFLSSKAKIIYVLJ^PKDVAVSYYHFSRSHKDLPA 


136 


N0V73 : 


162 


Y - GSWDEFFRQFMVFLVSWGRYFDFAINWNKHLDGDNVKFILYEDLKEVRLLGI KQIAEF 


220 
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l»+++ii,r £+ V -t-Vj KrU + ri ■+ V r+ XctUijiui ijxtift&r 




Sbjct: 


137 


DPGTFEEFLEAFLNGKVLYGSYFDHVLGWVIELRPEPQVIiFLDYEDLKEDPAGEIKKIAEF 


196 


NOV73 : 


221 


LGFFLTGEQIQTISVQSTFQAMRA KSQDTHGAVGPFLFRKGKVADWKNLFS 


271 






LG L+ E++ + S+F M+ + G PF RKG V DWKN F+ 




Sbj Ct : 


197 


LGLPLSEEELDKLLDHSSFFLMKLNPLSNYETLCLGKSKGRKSPF-MRKGLVGDWKNYFT 


255 


N0V73 : 


272 


EIQNQEMDEKFKE 284 








QN++ D+ KE 




Sbjct: 


256 


PEQNEKFDKVIKE 268 





This family includes a range of sulfotransferase proteins including flavonyl 3- 
sulfotransferase, aryl sulfotransferase, alcohol sulfotransferase, estrogen sulfotransferase and 
phenol-sulfating phenol sulfotransferase. These enzymes are responsible for the transfer of 
sulphate groups to specific compounds. 

NOV73 is predicted to be expressed in at least the following tissues: epithelial, 
endothelial, muscle, and neuronal tissues. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV73 is provided in Example 2. 

The novel nucleic acid of the invention encoding a sulfotransferase-like protein 
includes the nucleic acid whose sequence is provided in Table 73 A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 73 A while still encoding a protein that maintains 
its sulfotransferase-like activities and physiological functions, or a fragment of such a nucleic 
acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 73 A, including nucleic acid fragments that are complementary to any of 
the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 34% of the bases may be so changed. 

The novel protein of the invention includes the Sulfotransferase-like protein whose 
sequence is provided in Table 73B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 73B while 
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still encoding a protein that maintains its Sulfotransferase-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 51% 
of the amino acid residues may be so changed. 

The NOV73 nucleic acids and proteins are useful in potential therapeutic applications 
5 implicated in various pathological disorders described further herein, for example, the 

compositions of the present invention will have efficacy for the treatment of patients suffering 
from: cystitis, incontinence, fertility, systemic lupus erythematosus, autoimmune disease, 
asthma, emphysema, scleroderma, allergy, ARDS, Von Hippel-Lindau (VHL) syndrome, 
Alzheimer's disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, 

10 Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 
ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 
neurodegeneration. as well as other diseases, disorders and conditions. The NOV73 nucleic 
acid encoding the sulfotransferase-like protein of the invention, or fragments thereof, may 
further be useful in diagnostic applications, wherein the presence or amount of the nucleic acid 

15 or the protein are to be assessed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

20 below. 

NOV74 

The disclosed NOV74 (alternatively referred to herein as CG56775-01) includes the 
732 nucleotide sequence (SEQ ID NO:251) shown in Table 74A. A NOV74 ORF begins with 
25 a Kozak consensus ATG initiation codon at nucleotides 52-54 and ends with a stop codon at 
nucleotides 673-675. The disclosed NOV74 maps to human chromosome 15. 



Table 74A. NOV74 Nucleotide Sequence (SEQ ID NO:251) 

CAGACTCCCCCTTGCTGGCTCCT 

GCTCCATCK3AAGATCTCAGTGACCTGCTCTCA 

CAATGAGGTCACCCTGAAGAT^ 

CTAGGCAGTATTCATGCCCTGAATACC^CTO 

CCAAGGACTCTGGCATCACCTACCrrGGGC^TCAAGGCCAATGAAG 

CITTGAAAGGGCTACAGACTTCACTGACCAG 

GAAGGTTACAGCCATCTCCAGCTCGTTATCATGTACCrTATGA 

TGACAAACTAGCCAAGGAGGGGAAGGTGAAACCCTGGGGTC 
TGGGAGAGGCCCTGGTTGAAGGTATCCTG^ 

TCAAGTCTGTCTCAACAAGTCCTGGGCCACTT 
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A NOV74 polypeptide (SEQ ID NO:252) encoded by SEQ ID NO:251 is 214 amino 
acids in length and is presented using the one-letter amino acid code in Table 74B. The Psort 
profile for NOV74 predicts that this sequence has no signal peptide, and is likely to be 
localized at the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV74 
5 polypeptide is located to peroxisomal microbodies with a certainty of 0.3625, or, to the 
mitochonrial matrix space with a certainty of 0.1000. 



Table 74B. NOV74 Polypeptide Sequence (SEQ ID 
NO:252) 

MPGPTAAMTGPFKJISMEDLSDIiLSDSSGCYSLPSQPCNEVTLKIYMGNTSVDQDIPKLQK 
LGS IHALNTTEGRS FMHINNANFSKDSGITYIjGIKANEVQEFNLSTYFERATDFTDQAIiA 
QNGQVLVQCWEGYSHLQLVIMYLMIVRS WTS SHLSIMRQNCE ISPNttDGFLAQLCHLNDKL 
AKEGKVKPWGAPTTFAREVQWERP WUCVS CDTVP | 

A BLAST analysis of NOV74 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV74 had high 
homology to other proteins as shown in Table 74C. 



Table 74C. BLASTX results from PatP database for NOV74 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAR56968 Human phosphatase VHR - Homo sapiens, 185 aa. 


664 


5.4e-65 


patp:AAW35330 Human cdc25B vaccinia HI related phosphatase 


664 


5.4e-65 


patp:AAB42873 Human ORFX ORF2637 polypeptide sequence 


664 


5.4e-65 


patp:AAG67449 Amino acid sequence of a human polypeptide 


664 


5.4e-65 


patp:AAG67628 Amino acid sequence of a human protein 


664 


5.4e-65 



In a search of public sequence databases, it was found, for example, that the nucleic 
1 5 acid sequence of this invention has 347 of 420 bases (82%) identical to a gb:GENBANK- 
ID:HUMDSPHS|acc:L05 147.1 mRNA from Homo sapiens (Human dual specificity 
phosphatase tyrosine/serine mRNA). The full amino acid sequence of the protein of the 
invention was found to have 135 of 185 amino acid residues (72%) identical to, and 150 of 
1 85 amino acid residues (81%) similar to, the 185 amino acid residue ptnnSWISSNEW- 
20 ACC:P51452 protein from Homo sapiens (Human) (DUAL SPECIFICITY PROTEIN 
PHOSPHATASE 3 (EC 3.1.3.48) (EC 3.1.3.16) (DUAL SPECIFICITY PROTEIN 
PHOSPHATASE VHR)). NOV74 also has homology to the other proteins shown in the 
BLASTP data in Table 74D. 
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Table 74D. NOV74 BLASTP results 


Gene Index/ 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|l745832l|r 
ef |XP_063782. 
1 | (XM 0S3782) 


similar to dual 
specificity phosphatase 
3; vaccinia virus 
phosphatase VHl-related; 
protein tyrosine 
phosphatase ,* 
serine/ threonine 
specific protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


597 


180/207 
(86) 


180/207 
(86) 


4e-92 


• 

gi|4758208|re 
f |NP_004081.1 
| (NM_004090) 


dual specificity 
phosphatase 3 ; vaccinia 
virus phosphatase VHl- 
related; protein 
tyrosine phosphatase; 
serine/ threonine 
specific protein 
phosphatase 
[Homo sapiens] 


185 


135/185 
(72) 


150/185 
(80) 


5e-71 


gi| 1633321 |pd 

D j lvxiK | A 


Chain A, Human Vhl- 
Related Dual -Specif icity 
Phosphatase 


184 

r 


137/184 
(72) 


149/184 
(80) 


2e-70 


gi|l815894l|p 
db|lJ4X|A 


Chain A, Human Vhl- 
Related Dual -Specificity 
Phosphatase C124s 
Mutant- Peptide Complex 


184 


133/184 
(72) 


148/184 
(80) 


3e-69 


gi| 12843112 |d 
bj |BAB25864.1 
| (AK008734) 


DUAL SPECIFICITY PROTEIN 
PHOSPHATASE 3 (EC 
3.1.3.48) (EC 3.1.3.16) 

(DUAL SPECIFICITY 
PROTEIN PHOSPHATASE 
VHR) -putative 

[Mus musculus] 


185 


126/184 
(68) 


146/184 
(78) 


3e-65 



This BLASTP data is displayed graphically in the ClustalW in Table 74E. A multiple 
sequence alignment is given, with the NOV74 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 74D. 



Table 74E. ClustalW Alignment of NOV74 



NOV74 (SEQ ID NO: 252) 



gi 
gi 
gi 
gi 



17458321 | (SEQ ID NO: 669) 

4758208) (SEQ ID NO.-670) 

1633321 j (SEQ ID NO: 671) 

18158941) (SEQ ID NO:672) 

12843112) (SEQ ID NO: 673) 



NOV74 

gi 



10 20 30 40 50 

.|....|....|--..|....|....|....|....|....| 



17458321 j MGGQEETVEEIQRMNSRSWWRIVICCDFTITLRTEIWKPEQRGEALTLQ 

4758208) - 

1633321) - 

18158941). 

12843112 I - 
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60 



70 



NOV74 





17458321 | 


gi 


4758208 | 


gi 


1633321 | 


gi 


1815894l| 


gi 


12843112 | 


NOV74 


gi 


17458321 | 


gi 


4758208) 


gi 


163332l| 


gi 


18158941) 


gi 


12843112) 


NOV74 


gi 


1745832l| 


gi 


4758208 | 


gi 


1633321 j 


gi 


18158941) 


gi 


12843112) 


NOV74 


gi 


17458321) 


gi 


4758208 | 


gi 


1633321) 


gi 


18158941) 


gi 


12843112 j 


NOV74 


gi 


1745832l| 


gi 


4758208) 


gi 


1633321) 


gi 


1815894l| 


gi 


12843112 j 



NOV74 

gi | 17458321 
gi j 4758208 | 
gi j 1633321 j 
gi | 18158941 
gi j 12843112 



NOV74 

gi | 17458321 | 
gi|4758208| 
gi j 1633321 | 
gi | 18158941 | 
gi | 12843112 j 



NOV74 



17458321 | 

4758208) 

1633321) 



80 
.|.. 



90 



100 



I |-....|....| | 

EFSSLSLCPPMPGGPADNKEYPRGQGRKAVPQPPLGTVWEQVCGGWHSAR 



110 120 130 140 150 

;;;;i;;;;l;;;;l;;;;f;--l;;--l---l----l--l--l 

GTSCTKSQCTLDQERGERSGTHRQNLSETSPNLPTAILGLLHPDLDVCKK 



160 170 
..|....|.,..|....|. 



180 



190 200 

•■I — 



KRCNRIYSGMEKTHPQRALVRTQNGINEKNQKWDKRRCLGPGCHSAGLMP 



210 220 
■|....|....|....|.. 



230 



240 250 
••I I I 



TAAYQLLAVLPGSPAPPDHSLRGSEEVLAHTESTGDENMRHPQTPGLSKA 



260 270 

..|....|....|.. 



280 290 300 

..|....|....|....|....| 



LTAMQGAAREVGGHWELSPRLPRTSPGTSSELSPHPLVPHPVHPPPINNQ 



310 



320 330 
..)....|....|.. 



340 350 
..|....|....| 



lrerqdsrpagkkapvatwpvsnlrekgpglrrrggsvpsipdaaima.it 
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NOV74 


gi 


17458321 [ 


gi 


4758208 | 


gi 


1633321 | 


gi 


18158941] 


gi 


12843112| 



gi | 18158941 | 
gi 1 12843112 J 



NOV74 



gi 
gi 
gi 
gi 
gi 



17458321 | 
4758208 | 
1633321) 
18158941) 
12843112 



N0V74 



17458321) 

4758208) 

1633321) 

18158941) 

12843112 



NOV74 

gi | 17458321 | 
gi (4758208 | 
gi j 1633321 | 
gi j 18158941) 
gi | 12843112 | 



E VTPRI YVGNAS VAQDI PKLQKLGI THVLNAAEGRS FMHVNTNANF YKDS 
EvBPRBYVGNASVAODlffiLQKLGITHVLNAAEGRSF^HVNTSASFYf^DS 



I. 



460 
g^EVB 



470 480 490 500 

--D 



GITYLGIKAK^QEF^SgYFEPJ^FTOQAIAQHNG^v 
GlTYLGIKAN^QEFNLSuYFEPJ^F3DQALAQfMG§SL 

g it ylgi kandtqefnls ayferagdfi dqalaqkngrvlvhcregysrs 
gitylgikandtqefnlsayferagdfidqaiaqkngrvlvhcregysrs 
gitylgikatotqefnlsayfepj^gdfidqaiiaqkngrvlvi^regysrs 
gitylgikamdtqefnlsayfefj^fidqaijJSkngrvlvhcregysrs 



510 



520 



550 



||--vrswt@s: 
S 



PTLVIAYLMMRQKMDVKS 
P TL VI A YLMMRQKMD VK S 
PTLVIAYLMMRQKMDVKS 
PTLVIAYLMMRQKMDVKS 




560 



570 



580 



600 




590 

I 

GAPTTFAREVQWERPWLK - -VS CDTVP 

GAPTTFAREVQWERPWLKPLLSSEPEHHGCETLVLSSGPQHPQP 
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Table 74F lists the domain description from DOMAIN analysis results against NOV74. 
This indicates that the NOV74 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 74F. Domain Analysis of NOV74 

gnl 1 Pf am | pf am00782 , DSPc, Dual specificity phosphatase, catalytic domain. Ser/Thr and 
Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of 
tyrosine-specific phosphatases, except for a "recognition" region. SEQ ID N0:868 

CD-Length = 139 residues, 100.0% aligned 

Score = 90.5 bits (223) , Expect = 9e-20 

NOV74: 36 PCNEVTLKI YMGNTSVDQDI PKLQKLGS IHALNTTEGRSFMHINNANFS KDSGITYLGIK 95 

+E+ +Y+G+ ++ L KLG H +N T SK+SG YL I 

Sbjct: 1 GPSEILPHLYLGSYPTASNLAFLSKLGITHVINVT EEVPNSKNSGFLYLHIP 52 

NOV74: 96 ANEVQEFI^STYFERATDFTDQALAQNGQVLVQCWEGYSHLQLVM 153 

++ E ++S Y + A +F + A + G+VLV C G S +1+ YLM R+ + + 
Sbjct: 53 VDDNHETDISPYLDEAVEFIEDARQKGGKVLVHCQAGI SRSATLI IAYLMKTRNLSLNEA 112 

N0V74: 154 LSIMRQNCE- ISPNDGFLAQLCHLNDK 179 

S +++ ISPN GF QL K 
Sbjct: 113 YSFVKERRPI I SPNFGFKRQLIEYERK ,139 
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Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
5 suggested that MAP kinase phosphatases may be tumor suppressors. DUSP6 (alias PYSTI), 
one of the dual-specificity tyrosine phosphatases, is localized on 12q21, one of the regions of 
frequent allelic loss in pancreatic cancer. This gene is composed of three exons, and two forms 
of alternatively spliced transcripts are ubiquitously expressed. Although no mutations were 
observed in 26 pancreatic cancer cell lines, reduced expressions of the full-length transcripts 

10 were observed in some cell lines, which may suggest some role for DUSP6 in pancreatic 
carcinogenesis. The mitogen-induced gene, DUSP2, encodes a nuclear protein, PAC1, that 
acts as a dual-specific protein phosphatase with stringent substrate specificity for MAP kinase. 
MAP kinase phosphorylation and consequent enzymatic activation is a central and often 
obligatory component in signal transduction initiated by growth factor stimulation or resulting 

1 5 from various types of oncogenic transformation. DUSP2 downregulates intracellular signal 
transduction through the dephosphorylation/inactivation of MAP kinases. 

NOV74 is predicted to be expressed in at least the following tissues: retina. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 

20 sources, and/or RACE sources. Further expression data for NOV74 is provided in Example 2. 
The NOV74 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 

25 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 

30 disorders including prostate cancer and other diseases, disorders and conditions of the like. 

The NOV74 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 
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The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 74 A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 74A while still encoding a protein 
5 that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 74A, including nucleic acid fragments that are complementary to any of 
the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
10 acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, byway of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
15 applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 18% of the bases may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 74B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
20 in Table 74B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 28% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
25 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV75 

The disclosed NOV75 (alternatively referred to herein as CG56783-01) includes the 
30 840 nucleotide sequence (SEQ ID NO:253) shown in Table 75A. A NOV75 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 31-33 and ends with a stop codon at 
nucleotides 769-771 . The disclosed NOV75 maps to human chromosome 1 . 
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Table 75A. NOV75 Nucleotide Sequence (SEQ ID NO:253) 

AGAACC CAAGG CT C C CTGG ATT TG CAGT C CATG AG CAACAAG C C CTG CTTACAGAC C CCTG GGAGG AG CA 
GATTCCACGAGGGGCTGGACCAAGTCTACCTTCCAAATGTGGCTGGACTCTCTGCAGCCCCCACCCAGAG 
ACTGCCCATCAGGGAAGAGATGGTGCCATCAAGAGGCTATGGGGAAGAGGTGGATGAGGTCTGGCCCAAT 
GTCTTCATAGCTGAGAAGAGTGTGGCTGTGAACAAGGGGAGGCTGAAGAGGCTGGGAATCACCCACATTC 
TGAATGCTGCXJCATGGCACCGGCGTTTACACTGGC^ 

GGGTGTAGAGGTGGATGACTTTCCTGAGGTGGACATTTCCCAGCATTTCCGGAAGGCGTCTGAGTTCCTG 
GATGAGGCGCTGCTGACTTACAGAGGACGTTTGACCAACGTGGGATTGAATGGGTCTGTCGGTCGCCTGC 
GGCGTAAGGAGTGTGTCCCACCTCGCTCGCAGGT C CTGGAGCGCACCGGCAGACCTCGAGGCGGAGCGGG 
GAAAGTCCTGGTCAGC^GCGAAATGGGCATCAGCCGGTCAGCAGTGCTGGTGGTCGCCTACCTGATGATC 
TT C CACAACAT GG CCAT CCTGGAGGCT TT G ATGAC C GTG CGT AAG AAG CGGGC CAT CT ACC CCAATG AGG 
GCTTCCTGAAGaGCTGCGGGAGCTCAATGAGAAGTTGATGGAGGAGAGAGAAGAGGACTATGGCCGGTA 
GGGGGGAT(^GCTGAGGCTGAGGAGGGCGAGGGCACTGGGAGCATGCTCGGGGCCAGAGTGCACGCCCTG 



A NOV75 polypeptide (SEQ ID NO:254) encoded by SEQ ID NO:253 is 246 amino 
acids in length and is presented using the one-letter amino acid code in Table 75B. The Psort 
profile for NOV75 predicts that this sequence has no signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.7000. In alternative embodiments, a 
NOV75 polypeptide is located, to the endoplasmic reticulum (membrane) with a certainty of 
0.1000, or to the nucleus with a certainty of 0.2000. 



Table 75B. NOV75 Polypeptide Sequence (SEQ ID 
NO:254) 



MSNKPCLQTPGRSRFHEGLDQVYLPNVAGLSAAPTQRLPIREEMVPSRGYGEEVDEVWPN 
WI AEKSVAVl^GRLKRLGITHI LNAAHGTGVYTGPE FYTGLEI Q YLGVEVDDFPEVD I S 
' QHFRKASEFLDEALLTYRGRLTNVGIiNGSVGRLRRKSCVPPRSQVLERTGRPRGGAGKVIi 
VS SEMGI SRS AVL WAYLMI FHNMAI LE ALMTVRKKRAI YPNEGFLKQLRELNEKLMEER 
EEDYGR . — - 



10 A BLAST analysis of NOV75 was run against the proprietary PatP GENESEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV75 had high 
homology to other proteins as shown in Table 75C. 



Table 75C. BLASTX results from PatP database for NOV75 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P'(N) 


patp:AAE04836 Human SGP018 phosphatase polypeptide 


3 89 


1.6e-67 


patp:AAB40919 Human ORFX ORF683 polypeptide sequence 


457 


1.0e-55 


patp:AAE04837 Human SGP003 phosphatase polypeptide 


218 


2.2e-34 


patp:AAY85620 Human dual specificity phosphatase- 9 


210 


1.0e-32 1 


patp:AAE04839 Human SGP060 phosphatase polypeptide 


210 


1.0e-32 



15 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 167 of 257 bases (64%) identical to a gb:GENBANK- 

ID:AB027004|acc:AB027004.1 mRNA from Homo sapiens (mKNA for protein phosphatase). 
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The full amino acid sequence of the protein of the invention was found to have 39 of 89 amino 
acid residues (43%) identical to, and 58 of 89 amino acid residues (65%) similar to, the 198 
amino acid residue ptnr:SPTREMBL-ACC:Q9QYJ7 protein from Mus musculus (Mouse) 
(PROTEIN PHOSPHATASE). NOV75 also has homology to the other proteins shown in the 
5 BLASTP data in Table 75D. 



Table 75D. NOV75 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi|l7454087|r 
efjXP 061191. 
l| (XM 061191) 


similar to protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


370 


73/185 
(39) 


104/185 
(55) 


4e-30 


gi|l3128968 |r 
ef |NP_07693 0. 
l| (NM_024025) 


hypothetical protein 
MGC1136; hypothetical 
protein MGC2627 
[Homo sapiens] 


211 


72/187 
(38) 


105/187 
(55) 


2e-29 


gi 1 15072533 |g. 
b|AAK77966.l| 
(AY040091) 


branching-enzyme 
interacting dual- 
specificity protein 
phosphatase BEDP [Homo 
sapiens] 


168 


75/201 
(37) 


108/201 
(53) 


69e-29 


gi| 7305011) re 
f |NP_038877.1 
| (NM 013849) 


dual specificity 
phosphatase 13 
[Mus musculus] 


198 


73/187 
(39) 


106/187 
(56) 


9e-29 


gi|l7390456 |g 
b|AAH18204.l| 
AAH18204 
(BC018204) 


Similar to hypothetical 
protein MGC1136 
[Mus musculus] 


211 


72/187 
(38) 


104/187 
(55) 


le-28 



This BLASTP data is displayed graphically in the ClustalW in Table 75E. A multiple 
sequence alignment is given, with the NOV75 protein being shown on line 1 in a ClustalW 
10 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 75D. 



Table 75E. ClustalW Alignment of NOV75 



NOV75 (SEQ ID NO: 254) 



gi 

gi 
gi 
gi 
gi 



17454087) (SEQ ID N0:674) 

13128968 j (SEQ ID NO:675) 

15072533) (SEQ ID NO:676) 

7305011) (SEQ ID NO: 677) 

173 90456) (SEQ ID NO: 678) 



10 



20 



30 



NOV75 



gi 
gi 
gi 
gi 
gi 



17454087 
13128968 
15072533 
7305011) 
17390456 



40 
■I.. 



MSNKPCIjQTPG- - 

MLEVDPVTVPVIMPNVKI^E&EVKEFAQVQ^GLGDVLFPLADGS: 

MCPGNWL WASMTFhlARFSR - -SS 

- MAETS 

- MDSkjQ 

MCPGNWLWASMTFgARFSR- - GSS 
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NOV75 


gi 


17454087 


gi 


13128968 


gi 


15072533 


gi 


7305011) 


gi 


17390456 


NOV75 


gi 


17454087 


gi 


13128968 


gi 


15072533 


gi 


7305011 | 


gi 


17390456| 
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NOV75 

gi 
gi 
gi 
gi 
gi 



17454087] 
13128968) 
15072533 j 
7305011| 
173904561 



NOV75 

gi 
gi 
gi 
gi 
gi 



17454087 | 
13128968 j 
15072533| 
7305011 | 
173904561 



NOV75 



17454087) 
13128968 | 
15072533) 
7305011) 
17390456) 



NOV75 

gi 1 17454087 | 
gi j 13128968 | 
gi j 15072533 j 
gi j 7305011 | 
gi j 17390456 | 



NOV75 



17454087 
13128968 
15072533 



70 



80 



90 



100 



fheBldqvylpn 

3pqawnvrnllglpilgpkwpkpvgfsqikekvgtk j vks!kescrkerk 

TR e TLEENP 

ELgGEDjt 

R p K-IHG- 

TR^SLEEMP 



150 



200 



250 



N0V75 


YT» 


gi 


17454087) 


DTg! 


gi 


13128968) 


RGTj 


gi 


15072533) 


QGBs 


gi 


7305011) 


DTga 


gi 


17390456 | 


RGSjj 




300 



350 



390 



400 



360 370 380 

....|-...|.. ..).... |....|.... |....|....|.... |....| 

DYGR - 

SQRQDGENGSRHEAGSDSFQGQESQTRQKPKRVAAVGDAGRQGPGMEMAW 



RNQNVIKAF 
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gi [7305011 | 
gi j 17390456 | 



Table 75F lists the domain description from DOMAIN analysis results against NOV75. 
This indicates that the NOV75 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 



Table 75F. Domain Analysis of NO V75 


gml|pfam|pfam00782, DSPc, Dual specif icitv phosphatase, catalytic domain. 


Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly 


similar to that of tyrosine-specific phosphatases , except for a "recognition" 


region. 


SEQ ID NO: 869 






CD-Length = 139 residues, 92.8% aligned 




Score = 98.2 bits (243), Expect = 5e-22 




N0V75 : 


5 6 EVWPNVFI AEKS VAVNKGRLKRLGITH I LNAAHGTGVYTGPEFYTGLE I QYLGVE VDDFP 


115 




E+ P++++ AN L +LGITH++N F YL + VDD 




Sbjct: 


4 E I LPHLYLGS Y PTASNLAFLSKLGI THVINVTEEVPNS KNS GF LYLHI PVDDNH 


57 


NOV75: 


116 E VD I S QHFRKASEFLDEALLTYRGRLTNVGLNGSVGI^RRKECVPPRSQVLERTGRPRGG 


175 




E DIS + +A EF+++A R • 




Sbjct: 




78 


N0V75 : 


176 AGKVLVSSEMGISRSAVLWAYLMIFHNMAILEALMTVRKKR-AIYPNEGFLKQ 228 






GKVLV + GISRSA L++AYLM N+++ EA V+++R I PN GF +Q 




Sb j ct : 


7 9 GGKVLVHCQAGISRSATLI IAYLMKTRNLSLNEAYSFVKERRPI ISPNFGFKRQ 132 





Mitogen-acti vated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 

10 processes such as proliferation, differentiation, and apoptosis. For this reason it has been 

suggested that MAP kinase phosphatases may be tumor suppressors. DUSP6 (alias PYST1), 
one of the dual-specificity tyrosine phosphatases, is localized on 12q21, one of the regions of 
frequent allelic loss in pancreatic cancer. This gene is composed of three exons, and two forms 
of alternatively spliced transcripts are ubiquitously expressed. Although no mutations were 

15 observed in 26 pancreatic cancer cell lines, reduced expressions of the full-length transcripts 
were observed in some cell lines, which may suggest some role for DUSP6 in pancreatic 
carcinogenesis. The mitogen-induced gene, DUSP2, encodes a nuclear protein, PAC1, that 
acts as a dual-specific protein phosphatase with stringent substrate specificity for MAP kinase. 
MAP kinase phosphorylation and consequent enzymatic activation is a central and often 

20 obligatory component in signal transduction initiated by growth factor stimulation or resulting 
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from various types of oncogenic transformation. DUSP2 downregulates intracellular signal 
transduction through the dephosphorylation/inactivation of MAP kinases. 

NOV75 is predicted to be expressed in at least the following tissues: heart, breast and 
ovarian tissue, pancreas, brain, liver, kidney, spleen, testis, ovary, and peripheral blood 
5 leukocytes. This information was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, public 
EST sources, literature sources, and/or RACE sources. Further expression data for NOV75 is 
provided in Example 2. 

The NOV75 nucleic acids and proteins of are useful in potential therapeutic 

10 applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 

15 lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV75 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 

20 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 75 A, or a fragment 
thereof The invention also includes a mutant or variant nucleic acid any of whose bases may 

25 be changed from the corresponding base shown in Table 75A while still encoding a protein 
that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 75A, including nucleic acid fragments that are complementary to any of 

30 the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
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that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 36% of the bases may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
5 protein whose sequence is provided in Table 75B. The invention also includes a mutant or 
variant protein any of whose, residues may be changed from the corresponding residue shown 
in Table 75B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 57% of the amino acid residues may be so changed. 
1 0 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



15 NOV76 

NOV76 ncludes two phospatase-like proteins, designated herein as NOV76a and 
NOV76b. 



NOV76a 

20 The disclosed NOV76a (alternatively referred to herein as CG56789-01) includes the 

2200 nucleotide sequence (SEQ ID NO:255) shown in Table 76A. A NOV76a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon 
at nucleotides 2101-2103. The disclosed NOV76 maps to human chromosome 12. 



Table 76A. NOV76a Nucleotide Sequence (SEQ ID NO:255) 

ACCATTACATCATCGTGGCAAATTAAAGAAGGAGGTGGGAAAAGAGGACTTATTOTTGTCATGGCCCATG 

AGATGATTGGAACT(^^TTGTTACTGAGAGGTTGGTGGCTCTGCTGGAAAGTGG^CGGAAAAAGTGOT 

GCTAATTGATAGCCGGCCATTTGTGGAATAC^TACATCCCACATTTTGGAAGCCATTAATATCAACTGC 

TCCAAG CTTATGAAGCGAAGGTTGC AACAGGACAAAGTGTT AATT ACAG AG CT CATC CAGCATTCAGCGA 

AACATAAGGTAAACGCTCAGGTTGACATTGATTGCAGTO 

AGATGTTGCCTCTCTCTCTTCAGACTGTTTOCTCACT 

TCTGTTCACCTGCTTGCAGGTTTATTCTTAGGTGGGTTT^ 

GTGAAGGAAAATCCACTCTAGTCCCTACCTG^TTTCTCAGCCTTGCTTACCTGTTGCCAACATTGGGCC 
AACCCGAATTCTTCCGAATCTTTATCTTGGCTGC 

AATGGGATTGGTTATGTGTTAAATGCCAGCAATACCTGTCCAAAGCCTGACTTTATCCCCGAGTCTCATT 
TCCTGCGTGTGCCTGTGAATGACAGCTTTTGTGAGAAAAT 
CATTGGTAAGTTGACTTATAC&G&GAAAGC^LAAAG 
GGGATCTCCCGCTCCGCCACCATCGCrrATCGC^ 

CTTACAGGAGATTTGTGAAAGAAAAAAGACCTACTATATCTCCAAACTTCAATTTTCTGGGCCAA 
GGACTATGAGAAGAAGATT AAG AAC CAGACTGG AGCAT CAG GGCCAAAG AG CAAACT CAAGCTGCTG CAC 
CTGGAGAAGCCAAATGAACCTGTCCCTGCTGTCTCAGAGGGTGGACAGAAAAGCGAGA 
CACCCTGTGCCGACTCTGCTACCTCAGAGGC^GCAGGACAAAGGCCCGTGCATCCCGCCAGCGTGCCCAG 
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CGTGCCCAGCGTGCAGCCGTCGCTGTTAGA.GGACAGCCCGCTGGTACAGGCGCTCAGTGGGCTGCACCTG 
TC CGCAGACAGGCTGGAAGACAGCAATAAG CTCAAGCGTTCCTTCTCTCTGGATATCAAATCAGTTTCAT 
ATTCAGCCAGCATGGCAGCATCCTTACATGGCTTCTCCTCATCAGAAGATGCTTTGGAATACTACAAACC 
TTCCACTACTCTGGATGGGACCAACAAGCTATGCCAGTTCTCCCCTGTTCAGGAACTATCGGAGCAGACT 
CCCGAAACCAGTCCTGATAAGGAGGAAGCCAGCATCCCCAAGAAGCTGCAGACCGCCAGGCCTTCAGACA 
GCCAGAGCAAGCGATTGCATTCGGTCAGAACCAGCAGCAGTGGCACCGCCCAGAGGTCCCTTTTATCTCC 
ACTGCATCGAAGTGGGAGCGTGGAGGACAATTACCACACC^GCTTCCTTTTCGGCCTTTCCACCAGCCAG 
CAG CAC CT CACG AAGTCTGCTGGCCTGGG C CTTAAGGGCTG G CACT CGG AT ATCTTGG C C C CCCAG AC CT 
CTACCCCTTCCCTGACCAGCAGCTGGTATTTTGCCACAGAGTCCTCACACTTCrrACTCTGCCTCAGCCAT 
CTACGGAGGCAGTGCCAGTTACTCTGCOTACAGCTGCAGCCAGCrGCCCACTTGCGGAGACCAAGTCrAT 
TCTGTGCGCAGGCGGCAGAAGCCAAGTGACAGAGCTGACTCGCGGCGGAGCTGGCATGAAGAGAGCCCCT 
TT GAAAAGCAGTTT AAACG CAGAAGCTGCC AAATGGAATTTGGAGAGAG CAT CATGT CAG AG AAC AGGTC 
ACGGGAAGAGCTGGGGAAAGT GGGCAGT CAGT CTAGCTTTTCGGGCAGC ATGGAAAT CATT G AGGT CTC C 
TG AGAAGAAAGACAC TTGTGAC TT CTATAGACAATTTTTTTTT CTTGTTCACAAAAAAATT C CCTGTAAA 
TCTGAAATATATATATGTACATACATATAT 



A NOV76a polypeptide (SEQ ID NO:256) encoded by SEQ ID NO:255 is 680 amino 
acids in length and is presented using the one-letter amino acid code in Table 76B. The Psort 
profile for NOV76a predicts that this sequence has no signal peptide and is likely to be 
5 localized at the nucleus with a certainty of 0.8800. In alternative embodiments, a NOV76a 
polypeptide is located to peroxisomal microbodies with a certainty of 0.3000, to the 
mitochondrial matrix space with a certainty of 0.1000, or to lysosomes with a certainty of 
0.1000. 



Table 76B. NOV76a Polypeptide Sequence (SEQ ID 
NO:256) 

"^Kahemigtqtvt 

dkvliteliqhsaioikvnaqvdidcsqkvvvydq^ 

svhllaglflggfaefsrcfpglcegkstlvptcisqpclpvanigptrilpnlylgcqr 
dvlnkelmqqngigyvlnasotcpkpdfipe 

ltytekakasngcvlvhciiagisrsatxaiayimkrmdmsldeayrrfvkekrptisp 

nfiigqlldyekkikn^gasgpkskiikiilhlekpnepvpavseggqksetplsppcadsa 

tseaagqrpvhpasvpsvpsvqpslledsplvqalsglhlsadrledsnklkrsfsldik 

SVSYSASMAASLHGFSSSEDAIiEYYKPSTTLDGTNIOjCQFSPVQELSEQTPETSPDKEEA 
SI PKKLQTARPSDSQSKRLHSVRTSS SGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQ 
QHLTKSAGLGLKGWHSDIIiAPC3TSTPSIjTSSWYFATESSHFYSASAIYGGSASYSAYSCS 
QLPTCGDQVYSVRRRQKPSDRADSRRSWHEESPFEKQFKRRS CQMEFGES IMSENRSREE 
LGKVGSQSSFSGSMEIIEVS 

10 

NOV76b 

The disclosed NOV76b (alternatively referred to herein as CG56789-02) includes the 
2071 nucleotide sequence (SEQ ID NO:257 ) shown in Table 76C. A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon at 
15 nucleotides 2047-2049. The disclosed NOV76b maps to human chromosome 12. 



Table 76C. NOV76b Nucleotide Sequence (SEQ ID 
NO:257) 

ACCATTACATCATCGTGGCAAATTAAAGAAGGAGGTGGGAAAAGAGGACTTATTGTTGTC 
ATGGCC CATGAGATGATTGGAACTCAAATTGTOACTGAGAGGTTGGTGGCT 
AGTGGAACGGAAAAAGTGCTGCTAATTGATAGCCGGCCATTTGTGGAATACAATACATCC 
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CACATTTTGGAAGCCATTAATATCAACTGCTCCAAGCTTATGAAGCGAAGGTTGCAACAG 
GACAAAGTGTT AATTACAG AG CT CATCCAG CATTCAGCGAAACAT AAGGTTGACATTGAT 
TGCAGTCAGAAGGTTGTAGTTTACGATCAAAGCTCCCAAGATGTTGCCTCTCTCTCTTCA 
GACTGTTTTCTCACTGTACTTCTGGGTAAACTGGAGAAGAGCTTCAACTCTGTTCACCTG 
CTTGCAGGTGGGTTTGCTGAGTTCTCTCGTTGTTTCCCTGGCCTCTGTGAAGGAAAATCC 
ACTCTAGTCCCTACCTGCATTTCTCAGCCTTGCTTACCTGTTGCCAACATTGGGCCAACC 
CGAATTCTTCCCAATCTTTATCTTGGCTGCCAGCGAGATGTCCTCAACAAGGAGCTGATG 
CAGCAGAATGGGATTGGTTATGTGTTAAATGCCAGCAATACCTGTCCAAAGCCTGACTTT 
ATCCCCGAGTCTCATTTCCTGCGTGTGCCTGTGAATGACAGCTTTTGTGAGAAAATTTTG 
C CGTGGTTGGACAAATCAGTAGATTTCATTGAGAAAGCAAAAG CCTCCAATGGATGTGTT 
CTAGTGCACTGTTTAGCTGGGATCTCCCG CTC CGCCACCATCG CTATCGC CTACATCATG 
AAGAGGATGGACATGTCTTTAGATGAAGCTTACAGATTTGTGAAAGAAAAAAGACCTACT 
ATAT CTCC AAAC TTCAATTTT C TGGG C CAACT C CTGGACT ATGAGAAGAAG AT TAAGAAC 
CAG ACTGGAG CAT CAGGGC CAAAGAGCAAACT CAAGC TGC TGC ACCTGG AGAAGCCAAAT 
GAACCTGTCCCTGCTGTCT CAGAGGGTGGACAGAAAAG CGAGACGCCCCTCAGTCCACCC 
TGTGCCGACTCTGCTACCTCAGAGGCAGCAGGACAAAGGCCCGTGCATCCCGCCAGCGTA 
CCCAGCGTGCAGCCCTCGCTGTTAGAGGACAGCCCGCrGGTACAGGCGCTCAGTGGGCTG 
CACCTGTCCGCAGACAGGCTGGAAGACAGCAATAAGCTCAAGCGTTCCTTCTCTCTGGAT 
ATCAAATCAGTTTCATATTCAGCCAGCATGGCAGCATCCTO 

GAAGATGCTTTGGAATACTACAAACCTTCCACTACTCTGGATGGGACCAACAAGCTATGC 
CAGTT CTC CC CTGTTCAGGAACTATCGGAGCAGACTCCCGAAACCAGTCCTGATAAGGAG 
GAAGCCAGCATCCCCAAGAAGCTGCAGACCGCCAGGCCTTCAGACAGCCAGAGCAAGCGA 
TTGCATTCGGTCAGAACCAGCAGCAGTGGCACCGCCCAGAGGTCCCTTTTATCTCCACTG 
CATCGAAGTGGGAGCGTGGAGGACAATTACCACACCAGCTTCCTTTTCGGCCTTTCCACC 
AGCCAGCAGCACCTCACGAAGTCTGCTGGCCTGGGCCTTAAGGGCTGGCACTCGGATATC 
TTGGCCCCCCAGACCTCTACCCCTTCCCTGACCAGCAGCTGGTATTTTGCCACAGAGTCC 
TCACACTTCTACTCTGCCTCAGCCATCTACGGAGGCAGTGCCAGTTACTCTGCCTACAGC 
TGCAGCCAGCTGCCCACTTGCGGAGACCAAGTCTATTCTGTGCGCAGGCGGCAGAAGCCA 
AGTGA»GAGCTGACrCGCGGCGGAGCTCGCATGAAGAGAGCCCCTTTGAAAAGCAGTTT 
AAACGC^GAAGCTGCCAAATGGAATTTGGAGAGAGCATCATGTCAGAGAACAGGTCACGG 
GAAGAGCTGGGGAAAGTGGGCAGTCAGTCTAGCTTTTCG<3GCAGCATGGAAATCATTGAG 
GTCTCCTGAGAAGAAAGACACTTGTGACTTC 



A NOV76b polypeptide (SEQ ID NO:258) encoded by SEQ ID NO:257 is 662 amino 
acids in length and is presented using the one-letter amino acid code in Table 76D. The Psort 
profile for NOV76b predicts that this sequence has no signal peptide and is likely to be 
5 localized to the nucleus with a certainty of 0.8800. In alternative embodiments, a NOV76b 
polypeptide is located to peroxisomal microbodies with a certainty of 0.3000. 



Table 76D. NOV76b Polypeptide Sequence (SEQ ID 
NO:258) 

DKVLITELIQHSAKHKVDIDCSQKVVVYDQ 

LAGGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTiaLPNLYLGCQRDVLNKELM 
QQNGIGYVLNASNTCPKPDFI PESHFLRVPVNDS FCEKILPWLDKSVDFIEKAKASNGCV 
LVHCLAGI SRSATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLIiDYEKICI KN 
QTGASGPKSKLKLLHLEKPNEP.VPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPASV 
PSVQPSIJjEDSPLVQAIiSGLHLSADRLEDSNKLRI^FSLDXKSVSYSASMAASIiHGFSSS 
EDALEYYKPSTTLDGTNKLCQFSPVQELSEQTPETSPDKEEASIPKKLQTARPSDSQSKR 
LHS VRT S S SGT AQRS LL S PLHRSGSVEDNYHTS FLFGLS TS Q QHLTKS AGLGLKGWHSDI 
LAPQTSTPSLTS SWYFATES SHFYSASAI YGGSASYSAYSCSQLPTCGDQVYS VRRRQKP 
SDRADSRRSWHEESPFEKQFKRRSCQMEFGESIMSENRSREELGKVGSQSSFSGSMEIIE 

vs 



A BLAST analysis of NOV76 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV76 had high 
homology to other proteins as shown in Table 76E. 
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Table 76E. BLASTX results from PatP database for NOV76 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs : 


Score 


P(N) 


patp;AAE04834 Human SGP002 phosphatase polypeptide 


3360 


0.0 


patp:AAU09016 Human dual specificity phosphatase 21117 


3360 


0.0 


patp:AAB20325 Human protein phosphatase and kinase protein 


2963 


1.3e-308 


patp:AAM25744 Human protein sequence 


2860 


l.le-297 


patp:AAW29l50 Dual-specif ic murine thr-tyr phosphatase 


1088 


1.5e-125 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1 149 of 1 150 bases (99%) identical to a gbrGENBANK- 

5 ID:AB052156|acc:AB052156.1 mRNA from Homo sapiens (MKP-7 mRNA for MAPK 
phosphatase-7). The full amino acid sequence of the protein of the invention was found to 
have 662 of 665 amino acid residues (99%) identical to, and 662 of 665 amino acid residues 
(99%) similar to, the 665 amino acid residue ptnr:SPTREMBL-ACC :Q9B Y84 protein from 
Homo sapiens (Human) (MAPK PHOSPHATASE-7). NOV76 also has homology to the 

1 0 other proteins shown in the BLASTP data in Table 76F. 



Table 76F. NOV76 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi| 12697945 |d 
bj |BAB21791.1 
| (AB051487) 


KIAA1700 protein 
[Homo sapiens] 


690 


665/680 
(97) 


665/680 
(97) 


0.0 


gi| 14756395 |r 
ef |XP_039106. 
1| (XM~039106) 


MAPK phosphatase- 7 [Homo 
sapiens] 


665 


665/680 
(97) 


665/680 
(97) 


0.0 


gi| 16550836 |d 
bj |BAB71060.1 
| (AK055973) 


unnamed protein product 
[Homo sapiens] 


665 


664/680 
(97) 


664/680 
(97) 


0.0 


gi|l3990989|d 
bj |BAB47240.1 
| (AB052157) 


MAP kinase phosphatase- 7 
EMus musculus] 


660 


601/680 
(88) 


626/680 
(91) 


o.o 


gi| 13625393 |g 
b|AAK35052.l| 
AF345951 1 
(AP345951) 


map kinase phosphatase-M 
Al isoform 
[Mus musculus] 


677 


533/644 
(82) 


568/644 
(67) 


0.0 



This BLASTP data is displayed graphically in the ClustalW in Table 76G. A multiple 
sequence alignment is given, with the NOV76 protein being shown on line 1 in a ClustalW 
15 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 76F. 
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Table 76G. ClustalW Alignment of NOV76 



NOV76a 

NOV76b 

gi 1 12697945 | 

gi j 14756395 j 

gi 1 16550836 j 

gi | 13990989 j 

gi |13625393 j 



NOV76a 
NOV76b 



12697945 
14756395 
16550836 
13990989 
13625393 



NOV76a 
NOV76b 





12697945 


gi 


14756395 


gi 


16550836 


gi 


13990989 


gi 


13625393 


N0V76a 


NOV76b 


gi 


12697945 


gi 


14756395 


gi 


16550836 


gi 


13990989 


gi 


13625393 



NOV76a 


NOV76b 


gi 


12697945 


gi 


14756395 


gi 


16550836 


gi 


13990989 


gi 


13625393 



NOV76a 

NOV76b 

gi | 12697945 

gi j 14756395 

gi j 16550836 

gi | 13990989 

gi j 13625393 



NOV76a 
NOV76b 



12697945 
14756395 
16550836 
13990989 



(SEQ ID NO: 256) 
{SEQ ID NO:258) 
{SEQ ID NO: 679) 
(SEQ ID NO: 680) 
(SEQ ID NO: 681) 
(SEQ ID NO: 682) 
(SEQ ID NO : 683) 




60 



70 



80 



90 



100 



VLLIDSRPFVEYNTSHILEAIISriNCSKLMKRRLQQDKVLITELIQHSAKK 
VLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQDKVLITELIQHSAKH 
VLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQDKVLITELIQHSAKH 
VLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQDKVLITELIQHSAKH 
VLLIDSRPFVEYNTSHIIiEAININCSKLMKRRLQQDKVLITELIQHSAKH 
VLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQDKVLITELIQHSAKH, 
VLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQPSjVLITEIjIQHSAKH 




110 



! 



120 

. . I . . 



I. 



13 0 



I 



140 



150 



DIDCSQKWVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLL 
DIDCSQKVWYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLL 
DIDCSQKWVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLL 
DIDCSQKVWYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLL 
DIDCSQXVVVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLL 
DIDCSQKVWYDQSSQDVSsLSSDCFLTVLLGKLEgSFNSVHLL 
D I D oSqKVVVYDQS S QDVffiSLS S DC FLTVLLGKLE* c ttm rtiT .t 



160 

LjFL 



170 



180 

.■•|.. 



190 

..|...-|. 



200 
-•I 



--G 



gGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGXSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGXSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTjJiLPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 



210 220 
I | I.. 



230 240 



250 
..I 



LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKgLMQQNGIGYyi^ASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKgLMQQNGIGYWiNASNTCPKPDFIPESHFLRVPVNDSFCE 



260 270 280 290 300 

l- -l- • • I ■ • • 

KLTYTW!Wii^}W4tj**iJU#lT e 4lil:J.4 1 lll* 1 lJAUJ!;!j 



KILPWLDKSVDFI 
KILPWLDKSVDFI 
KILPWLDKSVDFI 
KILPWLDKSVDFI 
KILPWLDKSVDFI 
KILPWLDKSVDFI 



ekakasngcvlvhclagisrsatiaiayim. 
ekakasngcvlvhplagisrsatiaiayimk 
ekakasngcvlvhclagisrsatiaiayimk 
ekakasngcvlvhclagisrsatiaiayimk 
eka-kasngcvlvhc^agisrsat iaiayimk 
ekakAsngcvlShglagisrsatiaiaYimk 
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gi | 13625393 | 



NOV76a 
NOV76b 



12697945 | 
14756395 | 
16550836) 
13990989) 
13625393 j 



NOV76a 
NOV76b 



12697945 
14756395 
16550836 
13990989 
13625393 



NOV76a 

NOV76b 

gi | 12697945 | 

gi j 14756395 | 

gi 1 16550836 ) 

gi|l3990989 j 

gi | 13625393 j 



NOV76a 

NOV76b 

gi 1 12697945 | 

gi 1 14756395 j 

gi 1 16550836 | 

gi j 13990989 j 

gi j 13625393 | 



NOV76a 
NOV76b 





12697945 


gi 


14756395 


gi 


16550836 


gi 


13990989 


gi 


13625393 


NOV76a 


NOV76b 


gi 


12697945 


gi 


14756395 


gi 


16550836 


gi 


13990989 


gi 


13625393 



NOV76a 
NOV76b 



gi 
gi 
31 



12697945 
14756395 
16550836 



KILPWLDKSVDFI 



310 



320 



330 



340 



350 




400 



410 



420 



430 



I 



440 



450 



SVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYS 
SVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYS 
SVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYS 
SVQPSLLEDSPLVQALSGItHLSADRLEDSNKLKRSFSLDIKSVSYS 
SVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYS 
sgQPSLLEDSPLVQALSGLSLS^^LEDsgKLKRSFSLDIKSVSYS 
sSQPSLLEDSPLVQALSGLffiLsllSLEDsSKLKRSFSLDIKSVSYS 



460 



470 



480 



490 



500 




560 



570 



580 



I . 



590 



I 



600 
••I 



DNYHTSFLFGLSTSQQHLTKSAGLGLKGWHSDILAPQTSTPSLTSSWYFA 
DNYHTSFLFGLSTSQQHLTKSAGLGLKGWHSDILAPQTSTPSLTSSWYFA 
DNYHT. S FL FGL S T S QQHLTK S AGLGLKG WH S D I L AP QT S TP S LT S S WYF A 
DNYHTSFLFGLSTSQQHLTKSAGLGLKGWHSDILAPQTSTPSLTSSWYFA 
DNYHT-S F-LFGL S TS.QQHLTKS AGLGLKGWHSD I LAPQT STP SLTS S WYFA 
DNYHTi^FLFG LSTSQQ^LTKSAGLG LKGWHSDILAPQ^S^PSLTSSWYFA 

dnyhtSelfgI ' 22 i& 



LKGWHSDILAPG 



610 



620 



I 



630 



640 



tesshfysasaiyggsasysayscsq: 
tesshfysasaiyggsasysayscsq: 
tesshfysasaiyggsasysayscsq: 
tesshfysasaiyggsasysayscsq' 
te s s hfys as atyggs as y§ ays cs q 



i 



mm 



iii 



III 



ii 



€50 
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gi|l3990989| 
gi | 13625393 | 



NOV76a 
NOV76b 



gi 
gi 
gi 
gi 



12697945 
14756395 
16550836 
13990989 
13625393 



NOV76a 
NOV76b 



12697945 
14756395 
16550836 
13990989 
13625393 



I 



660 



I. 



670 



680 



690 



1 



RSWHEES PFEKQFKRRSCQMEFGES I 
RSWHEESPFEKQFKRRSCQMEFGESI 
RS WHEE S P FEKQFKRRS CQM E FGE S I 
RSWHEESPFEKQFKRRSCQMEFGESI 
RSWHEESPFEKQFKRRSCQMEFGESI 
R SWHEES PFEKQFKRRSCQMEFGESI 

RRSBESEpf* 









3 


■ f~- 


:mi 








IK* 


'Mi 








me 


M< 


m 




: 


i 




\h 




* 








ME 






D 







L 



700 
--1 



KVGSQSSFS 
KVGSQSSFS 
KVGSQSSFS 
KVGSQSSFS 
KVGSQSSFS 
KVGSQSSFS 




ILLLTVLFPVHKK 



Table 76H lists the domain description from DOMAIN analysis results against 
NOV76. This indicates that the NOV76 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 76H. Domain Analysis of NOV76 



gnl 1 Pf am [ pf amO 07 82 . DSPc, Dual specificity phosphatase, catalytic domain. 

Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly 

similar to that of tyrosine -specific phosphatases, except for a "recognition" 

region. SEQ ID NO: 870 

CD-Length =139 residues, 100.0% aligned 
Score » 172 bits (436), Expect = 6e-44 

N0V76: 166 GPTRILPNLYLGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 225 

GP+ ILP+LYLG N + + GI +V+N + P +L +PV+D+ 

Sbjct: 1 GPSEILPHLYLGSYPTASKLAFLSKLGITHVINVTEEVPN-SKNSGFLYLHIPVDDNHET 59 



NOV76 : 226 KILPWId)KSVDFIGKLTYTEKAKASNGCVLVHC^ 285 
I P+LD++V+FI E A+ G VLVHC AGISRSAT+ IAY+MK ++SL+EAY 

Sbjct: 60 DISPYLDEAVEFI EDARQKGGKVLVHCQAGISRSATLI IAYLMKTRNLSLNEAY 113 

o 

NOV76: 286 RRFVKEKRPTISPNFNFLGQLLDYEKK 312 

FVKE+RP ISPNF F QL++YE+K 
Sbjct: 114 -SFVKERRPIISPNFGFKRQLIEYERK 139 



10 



Mitogen-activated protein kinases (MAPKs) are inactivated via dephosphorylation of 

either the threonine or tyrosine residue or both in the P-loop catalyzed by protein phosphatases 

which include serine/threonine phosphatases, tyrosine phosphatases, and dual specificity 

phosphatases. Nine members of the dual specificity phosphatases specific for MAPKs, termed 

MKPs, have been reported. Each member has its own substrate specificity, tissue distribution, 
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and subcellular localization. MKP-7 is most similar to hVH5, a member of previously known 
MKPs, in the primary structure. MKP-7 is predominantly localized in the cytoplasm when 
expressed in cultured cells, whereas hVH5 is both in the nucleus and the cytoplasm. MKP-7 
binds to and inactivates p38 MAPK and JNK/SAPK, but not ERK. Furthermore, MKPs have 
5 the substrate specificity toward the isoforms of the p38 family (alpha, beta, gamma, and delta). 
MKP-7 binds to and inactivates p38 alpha and -beta, but not gamma or delta. MKP-5 and 
CL100/MKP-1 also bind to p38 alpha and -beta, but not gamma or delta. 

NOV76 is predicted to be expressed in at least the following tissues: blood, brain, 
CNS, colon, heart, kidney, lung, and stomach. This information was derived by determining 

1 0 the tissue sources of the sequences that were included in the invention including but not 

limited to SeqCalling sources, public EST sources, literature sources, and/or RACE sources. 
Further expression data for NOV76 is provided in Example 2. 

The NOV76 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

15 cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary 
stenosis, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, hypercalcemia, ulcers, Hirschsprung's disease, Crohn's 
Disease, anemia, ataxia-telangiectasia, autoimmune disease, immunodeficiencies, Von Hippel- 

20 Lindau (VHL) syndrome, Alzheimer's disease, stroke, Parkinson's disease, Huntington's 

disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, leukodystrophies, 
behavioral disorders, addiction, anxiety, pain, neurodegeneration, neuroprotection, systemic 
lupus erythematosus, asthma, emphysema, allergy, ARDS, diabetes, renal artery stenosis, 
interstitial nephritis, glomerulonephritis, polycystic kidney disease, renal tubular acidosis, IgA 

25 nephropathy, as well as other diseases, disorders and conditions. NOV76 nucleic acids 

encoding the MAP kinase-like protein of the invention, or fragments thereof, may further be 
useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. 

The novel nucleic acid of the invention encoding a phosphatase-like protein includes 

30 the nucleic acid whose sequence is provided in Table 76A or 76C, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
, from the corresponding base shown in Table 76A or 76C while still encoding a protein that 
maintains its phosphatase-like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
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to the sequence of Table 76 A or 76C, including nucleic acid fragments that are complementary 
to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
5 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
10 be so changed. 

The novel protein of the invention includes the phosphatase-like protein whose 
sequence is provided in Table 76B or 76D. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
76B or 76D while still encoding a protein that maintains its phosphatase-like activities and 
1 5 physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 1% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
20 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV77 

The disclosed NOV77 (alternatively referred to herein as CG56804-01) includes the 
25 881 nucleotide sequence (SEQ ID NO:259) shown in Table 77A. A NOV77 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon at 
nucleotides 769-771. The disclosed NOV77 maps to human chromosome 14. 



Table 77A. NOV77 Nucleotide Sequence (SEQ ID NO:259) 

GAGGGTCGGCCGGCTGTGTAACACTCTCCCACCCCACCCACCAGCCCGCGGGCCAGCACCATGGAGGACG 
TGAAGCTGGAGTTCCCTTCCCTTCCACAGTGC^GGAAGACGCCGAGGAGTGGACCTACCCTGAGTGGAC 
CTACCCTATGAGACGAGAGATGCAGGAAATTTTACCTGGATTGT^ 

AAAAGCAAGGTACTACCTGTACTACAGAAACATGGAATAACCCATATAATATGCATACGACAAAATATTG 

AAGCAAACTTTATTAAACCAAACTTTC^GCAGTTATT^ 

AGTTGAAAATATAATACGTTTTTTCCCTATGTTTTGCCT 

CAAATGGGAGGTAAAGTTCTTGTGCATGGAAATGCAGGGATCTC 

ACATTATGGAAACATTTGGAATGAAGTACAGGTTCAGAGATGCT 
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TTGTATTAATCCTAATGCTGGATTTGTC CATCAACTTCAGGAATATGAAG CCATCTACCTAGCAAAATTA 
ACAATACAGATGATGTCACCACTCCAGATAGAAAGGTCATTATCfGTTCATTCTGGTACCACAGGTGGCA 
GTTTGAAGAGAACACATGAAGAAGAGGATGATTTTGGAACCATGCAAGTGGCGACTGCACAGAATGGCTG 
ACTTGAAGAGCAACATCATAGAGTGTGAATTTCTATTTGGGAAGGAGAAAATACAAGAGAAAATTATAAT 
GTAAAATGGTAAAAACATAAGTAGTTTTTTTTTCAATTACA 



A NOV77 polypeptide (SEQ ID NO:260) encoded by SEQ ID NO:259 is 236 amino 
acids in length and is presented using the one-letter amino acid code in Table 77B. The Psort 
profile for NOV77 predicts that this sequence has no signal peptide and is likely to be 
5 localized at the cytoplasm with a certainty of 0.6036. In alternative embodiments, a NOV77 
polypeptide is located to lysosomes with a certainty of 0.2040. 



Table 77B. NOV77 Polypeptide Sequence (SEQ ID 
NO:260) 

MEDVKLEFPSLPQCKEDAEEWTYPEWTY^ 

HGITHI ICIRQNIEANFIKPNFQQLFRYLVLDIADNPVENI IRFFPMFCLQTKEFIDGSL 
QMGGKVLVHGNAG I S RS AAFV I AYIMETFGMKYRFRD AFAYVQERRFCINPNAGFVHQLQ 
EYEAIYIiAKLTIQMMSPLQIERSLSVHSGTTGGSLKRTHEEEDDFGTMQVATAQNG 



A BLAST analysis of NOV77 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV77 had high 
homology to other proteins as shown in Table 77C. 



Table 77C. BLASTX results from PatP database for NOV77 



Smallest 
Sum 





High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 




patp:AAM39734 Human polypeptide 


1099 


4.3e- 


111 


patp:AAM4l520 Human polypeptide 


1099 


4.3e- 


111 


patp:AAE08552 Human phosphatase protein - Homo sapiens 


1099 


4.3e- 


111 


patp:AAO09017 Human dual specificity phosphatase 38692 


1099 


4.3e- 


111 


patp:AAY68795 Amino acid sequence of a human protein 


210 


6.9e- 


17 



In a search of sequence databases, it was found, for example, that the nucleic acid 
1 5 sequence of this invention has 228 of 249 bases (91 %) identical to a gbrGENBANK- 

ID:MMU34973|acc:U34973.1 mRNA from Mus musculus (protein tyrosine phosphatase-like 
mRNA, unspliced c-terminal product and spliced c-terminal end STYX). The foil amino acid 
sequence of the protein of the invention was found to have 214 of 236 amino acid residues 
(90%) identical to, and 221 of 236 amino acid residues (93%) similar to, the 223 amino acid 
20 residue ptnr.SPTREMBL-ACC:Q60970 protein from Mus musculus (Mouse) (PROTEIN 

TYROSINE PHOSPHATASE-LIKE). NOV77 also has homology to the other proteins shown 
. in the BLASTP data in Table 77D. 
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Table 77D. NOV77 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
{%) 


Expect 


gi 1 17476793 |r 
ef |XP_058659. 
l| (XM 058659) 


similar to putative <H. 
sapiens) [Homo sapiens) 


223 


223/236 
(94) 


223/236 
(94) 


e-118 


gi 

f 


12833088|d 
BAB22384.1 
:AK002822) 


phosphoserine/ threonine/ 
tyrosine interaction 
protein-putative 
[Mus musculus] 


223 


215/236 
(91) 


221/236 
(93) 


e-116 


r| 


2137698|pi 
149365 


protein tyrosine 
phosphatase - mouse 


233 


214/236 
(90) 


221/236 
(92) 


e-116 


gi| 9789981 | re 
f |NP_062611.1 
| (NM_019637) 


phosphoserine/ threonine/ 
tyrosine interaction 
protein; STNS 
(alternatively spliced 
intron of Styx) ; protein 
tyrosine phosphatase- 
like unspliced c- 
terminal product and 
spliced c- terminal end 
STYX [Mus musculus] 


205 


163/180 
(90) 


167/180 
(92) 


3e-87 j 


gi| 1842088 |gb 

|AAB4756l.l| 

(UB7169) 


tyrosine phosphatase- 
like protein homolog 
hSTYXb [Homo sapiens] 


66 


66/68 
(97) 


66/68 

(97) j 


8e-30 



This BLASTP data is displayed graphically in the ClustalW in Table 77E. A multiple 
sequence alignment is given, with the NOV77 protein being shown on line 1 in a ClustalW 
5 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 77D. 



Table 77E. ClustalW Alignment of NOV77 



NOV77 



gi 
gi 
gi 
gi 
gi 



17476793) 
12833088| 
2137698 | 
9789981| 
1842088 



(SEQ ID NO:260) 

(SEQ ID NO: 684) 

(SEQ ID NO: 685) 

(SEQ ID NO: 686) 

(SEQ ID NO: 687) 

(SEQ ID N0:686) 



NOV77 



gi 
gi 
gi 
gi 
gi 



17476793 

12833088 

2137698 

9789981 

1842088 



10 

J-. 



20 

.L. 



30 
.1.. 



40 
•I.. 



I , 



50 
. I 



MEDVKLEFPSfPQCK^DAEEWTY 

medvklefpsSpqckSdaeewty, 
medvklefpsSpqckBdaeewty 



MEDVKLEFPSgPQCKgDAEEWTY 

medvklefpsSpqckSdaeewty 



PMRREMQEmLPGLFLGPYSSAM 

pmrremqeBlpglflgpyssam. 
pmrremqeSlpglflgpyssam 
pmrremqeKlpglflgpyssam 
pmrremqeSlpglflgpyssam 



N0V77 



gi 
gi 
gi 
gi 



17476793 I 
12833088) 
2137698) 
9789981) 



I 



60 
. I . . 



70 



80 



LPgLQKHGITHilCIRQ: 

lpwlqkhgithiicirq: 

LPgLQKHGITHIICIRQ 
LPMLOKHGITHliCIRQ: 

lp§lqkhgixhiicirq: 




fikpnfqq 
fikpnfqq 

FIKPNF&p; 
FIKPNFQQ 
FIKPNFQQ 



90 



100 





; 








w 




J 






I ij » 








i 


a 










I 








1 




At 




i 
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gi | 1842088 | 



NOV77 


IIRFFPM 


gi 


17476793 | 


IIRFFPM 


gi 


12833088 j 


IIRFFPM 


gi 


2137S98] 


IIRFFPM 


gi 


978998l| 


IIRFFPM 


gi 


1842088 






NOV77 



17476793] 

12833088) 

2137698 

9789981 

1842088 



MKY 


kFi 


MKY 


B 


MKY 


I 


MKY 


S 


MKY 


9 


MKY 





160 



170 



180 



190 
. . I . . 



I 



200 



RDAFAYVQERRFCINPNAGFVHQLQEYEAIYLAKLTIQMMSPLQI 
RDAFAYVQERRFCINPNAGFVHQLQEYEAIYLAKLTIQMMSPLQI 
RDAFAYVQERRFCINPMAGFVHQLQEYEAIYLAKLTIQMMSPLQI 
RDAFAYVQERRFCINPNAGFVHQLQE YEAIY LAKLTIQMMSPLQI 
RDAFAYVQERRFCINPNAGFVHQLQJ 

RDAFAYVQERRFCINPNAGFVHQLQEYEAIYLAKLTIO 



210 



220 



NOV77 



17476793 | 
12833088 j 
2137698 
9789981 
1842086 




Table 77F lists the domain description from DOMAIN analysis results against NOV77. 
This indicates that the NOV77 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 77F. Domain Analysis of NOV77 



gnl| Smart |smart00195, DSPc, Dual specificity phosphatase, catalytic domain 



SEQ ID NO: 871 



CD-Length = 139 residues, 98.6% aligned 
Score = 142 bits (358), Expect - 2e-35 



NOV77 : 33 EMQEILPGLFLGPYSSAMKSKVLPVLQKHGITHI I CIRQNIEANFI KPNFQQLFRYLVLD 92 

EILP L+LG YS A L +L+K GITH+I + + + + F YL + 

Sb;)Ct: 1 GPSE I LPHLYLGSYSDASN LALLKKLGI THVINV TEEVPNSNKSGFLYLGIP 52 

NOV77: 93 IADNPVENIIRFFPMFCLQTKEFIDGSLQMGGKVLVHGNAGISRSAAFVIAYIMETFGMK 152 

+ DN 1 + P EFI + + + GGKVLVH AG+SRSA +IAY+M+ M 

SbjCt: 53 VDDNTETKISPYLPE AVEFIEDAEKKGGKVLVHCQAGVSRSATLI IAYLMKYRNMS 108 

NOV77: 153 YRFRDAFAYVQERRFC INPNAGFVHQLQE YE 183 

DA+ +V+ERR I+PN GF+ QL EYE 
Sbjct: 109 L--NDAYDFVKERRPIISPNFGFLRQLIEYE 137 



Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
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processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 

NOV77 is predicted to be expressed in at least the following tissues: lung, lymphoid 
tissue, spleen, tonsils, whole organism. This information was derived by determining the 
5 tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV77 is provided in Example 2. 

The NOV77 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

10 brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 

15 disorders including diabetes and obesity; lung diseases such as asthma; emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV77 nucleic acid encoding the MAP kinase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 

20 the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 76A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 76A while still encoding a protein 

25 that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 76A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

30 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 9% of the bases may 
be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 76B. The invention also includes a mutant or 
5 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 76B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 10% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
10 immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

15 NOV78 

The disclosed NOV78 (alternatively referred to herein as CG56810-01) includes the 
777 nucleotide sequence (SEQ ID NO:261) shown in Table 78A. A NOV78 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon at 
nucleotides 768-770. The disclosed NOV78 maps to human chromosome 2. 

20 - 

Table 78A. NOV78 Nucleotide Sequence (SEQ ID NO:261) 

TCA.CCTGAGCCTAGGAGTTCAAGATTGCAGTGGCCTATGATTGCATCATTGCACTCCAGCCTGGGTGACA 

TCACCTGAGCCTAGGAGTTCAAGATTGCAGTGGCCTATGATTGCATCATTGCACTCCAGCCTGGGTGACA 

GAAAGAGAC CCTGT CT CTG CATGATGATAATAATAATTAAAAGAG AGAGAGAGAGAGAGAAAACATATAG 

TAGAAGGAGCACTTCTGGTGTGGGATTGAAGCAGTATTATCATTCAAGTGAGTCGCATTTAAATTTTTTA 

CCTTT CTTGTTACCGATATCAACACCCCAGTCT CTTTTTAGGTACCCTAATTCATGGGATTTAAAACAAT 

CAT CT ±~r T TT C TT TCT CTTTAAGGTGACT CAT ATT CTTAATGTTG CAT ATGGAGTTGAAAATGCTTTC CT 

aGTGACTTTACATATAAGAGCATTTCTATATTGGATCTGCCTGAAACCAACATCCTGTCTTATTTTCCA 

GAATGTTTTGAATTTATTGAAGAAGCAAAAAGAAAAGTGAGTTTTGTTTTGATCCATA 

TGGTTCTTGTTCATTGTAATGCAGGCGTTTCCAGGGCT 

TGAACAAACCTCATTTACGAGTGCTTTTTCTTTGG 

GGCTTCATGGAGCAGCTTCGTACATATCAAGAGGGCAAAGAAAGCAATAAGTGTGACAGAATACAGGAGA 
ACAGTTCATGAGTTGCATTGTAGCAGACAATGGACAACTGTAGTTTCTGAA^ 

TTTCCCT , 

A NOV78 polypeptide (SEQ ID NO:262) encoded by SEQ ID NO:261 is 224 amino 
acids in length and is presented using the one-letter amino acid code in Table 78B. The Psort 
profile for NOV78 predicts that this sequence has no signal peptide and is likely to be 
25 localized to the endoplasmic reticulum (membrane) with a certainty of 0.6400. In alternative 
embodiments, a NOV78 polypeptide is located to the plasma membrane with a certainty of 
0.4960, or to the nucleus with a certainty of 0.2420. 
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Table 78B. NOV78 Polypeptide Sequence (SEQ ID 
NO:262) 

MIASLHSSLGDRKRPCLCMMI II IKREREI^KTYSRRSTSGVGLKQYYHSSESHLNFLPF 
LLPISTPQSLFRYPNSWDIJCQSSFFFLFKVTHII^ 

TNILSYFPECFEFIEEAKRKVSFVLIHSSAGVVLVHCNAGVSRAAAIVIGFLMNSEQTSF 
TSAFSLVKNARPSICPNSGFMEQLRTYQEGKESNKCDRIQENSS 



A BLAST analysis of NOV78 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV78 had high 
5 homology to other proteins as shown in Table 78C. 



Table 78C. BLASTX results from PatP database for NOV78 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAB29109 Human cellular proliferative response protein 


618 


4.0e-60 


patp:AAB73224 Human phosphatase AI031656_h - Homo sapiens 


618 


4 . 0e-60 


patp:AAB732l5 Murine phosphatase AA274457_m 


508 


1.8e-48 


patp:AAM422ii Human polypeptide 


290 


2.3e-25 


patp:AAB940l8 Human protein sequence 


201 


6.2e-16 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 103 of 160 bases (64%) identical to a gb:GENBANK- 

10 ED:HS106C24|acc:Z83313.1 mRNA from Homo sapiens (Human DNA sequence from PAC 
106C24, between markers DXS294 and DXS730 on chromosome X). The full amino acid 
sequence of the protein of the invention was found to have 71 of 172 amino acid residues 
(41%) identical to, and 99 of 172 amino acid residues (57%) similar to, the 203 amino acid 
residue ptnr:SPTREMBL-ACC:Q9NGLl protein from Drosophila melanogaster (Fruit fly) 

15 (MAP KINASE PHOSPHATASE- 1). NOV78 also has homology to the other proteins shown 
in the BLASTP data in Table 78D. 



Table 78D. NOV78 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|l8146956|d 
bj |BAB82499.1 
| (AB038770) 


protein phosphatase 
[Homo sapiens] 


217 


126/136 
(92) 


126/136 
(92) 


6e-65 


gi| 13277360 |r 
ef |NP_077758. 
l| (NM 024438) 


dual -specificity 
phosphatase 
[Mus musculus] 


220 


103/131 
(78) 


113/131 
(85) 


3e-54 
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gi|l2845353 |d 
bj |BAB26718.1 
| (AK010127) 


Dual specificity protein 
phosphatase containing 
protein [Mus musculus] 


220 


102/131* 
(77) 


113/131 
(85) 


le-53 


gi|l2858039|d 
bj |BAB31181.1 
| (AK018369) 


Dual specificity protein 
phosphatase containing 
protein 


162 


103/131 
(78) 


113/131 
(85) 


le-52 


gi|l814891l[d 
bj (BAB83499.1 
| (AB063X87) 


SKRPi [Homo sapiens] 


166 


74/74 
(100) 


74/74 
(100) 


7e-37 



This BLASTP data is displayed graphically in the ClustalW in Table 78E. A multiple 
sequence alignment is given, with the NOV78 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 78D. 



Table 78E. ClustalW Alignment of NOV78 



gi 


18146956 


gi 


13277360 


gi 


12845353 


gi 


12856039 


gi 


18148911 


NOV78 


gi 


18146956 


gi 


13277360 


gi 


12845353 


gi 


12858039 


gi 


18148911 



NOV78 



NOV78 

gi | 18146956 | 
gi j 13277360 I 
gi j 12845353 | 
gi|l2858039 j 
gi|l814891l| 



NOV78 



18146956 
13277360 
12845353 
12858039 
18148911 



NOV78 



18146956 
13277360 
12845353 
12858039 
18148911 



(SEQ ID NO:262) 
(SEQ ID NO:689) 
(SEQ ID NO: 690) 
(SEQ ID NO:691) 
(SEQ ID NO: 692) 
(SEQ ID NO: 693) 




110 
..I.. 



120 



130 



140 
..I.. 



150 



YGVENAFLSjjFTYKgl S ILDgPETNILSYFPECFEFIEgAKjjjA, 

aygvenaflsjjftyk|isildSpetnilsyfpecfefieBakBkd 

AYGVENAFLSgFTYKglSILDSpETNILSYFPECFEFIEgAKfflKD 
AYGVENAFLSjjFTYKyISILDSpETNILSYFPECFEFIE?AKfflKD 

aygvenafls^.ftyk^ isildBpetnilsyfpecfefieSakBkd, 



tfSFVLI 



160 



170 



160 



190 



200 

..I 



HSS 



gGWLVKCNAGVSRAAAIVIGFLNlgSE^gFTgAgSLVKgARPSICP 
Bg VVL VECNAG V S RAAA I VI G FLmSsE^SftBaBsLVkSaRP S I C P 

SgvvlvhcnagvsraaaivigflmSse^ 
Bgvvlvhci^gvsraaaivigflmSse^^ 

|G\^VECNAGVSRAAAIVIGFLm|seS^ET^^ 

BGWI;VHgNAGy^RAAA£VIGFLM^ 
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NOV78 

gi|l8146956 
gi | 13277360 
gi|12845353 
gi|12858039 
gi j 18148911 



I 



210 
. • I . . 



220 



Nfc GFMEQLRTYQgGKESNjgDip 
nS GFMEQLRTYCjgGKESNnBDjMa 

m gfmeqlrtyqSgkesnSdj^ 

I^GFMEQLRTYcBGKESNfflD|^; 

m gfkeqlrtyqSgkesnSdv^: 

NfeGFKiEQLRTYcSGKESNEDligS 



230 

QENSS 

ENSI 

paedhtggl 
paedItggl 
paedStggl 

ENSI 



Table 78F lists the domain description from DOMAIN analysis results against NOV78. 
This indicates that the NOV78 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 78R Domain Analysis of NOV78 



gnl [Pfam|pf am00782 , DSPc, Dual specificity phosphatase, catalytic domain, 

Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly 

similar to that of tyrosine -specific phosphatases, except for a "recognition" 

region. SEQ ID NO: 872 

CD- Length » 139 residues, 80.6% aligned 
Score = 112 bits (280), Expect = 2e-26 



N0V78: 88 



Sbjct: 26 



FKVTHILNVAYGVENAFLSDFTYKSISILDLPETNILSyFPECPEFIEBAKRKVSFVLIH 14 7 

+TH++NV VN+ SFY I+D ET+I Y E EFIE+A++K 
LGI THVINVTEEVPNSKNSGFLYLHI PVDDNHETDIS PYLDEAVBFI EDARQK 78 



207 



NOV78 : 148 SS AGWLVHCNAGVSRAAAI VI GFLMNS EQTS FTSAFSLVKNARPS I CPNS G FMEQLRTY 
G VLVHC AG+SR+A ++I +LM + S A+S VK RP I PN GF QL Y 
Sbjct: 79 - - GGKTOVHCQAGISRSATLI IAYLMKTRNLSLNEAYSFVKERRP IIS PNFGFKRQLIEY 236 

NOV78: 208 Q 208 
+ 

Sbjct: 137 E 137 



Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
10 processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 

NOV78 is predicted to be expressed in at least the following tissues: parathyroid gland, 
peripheral blood, whole organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
15 SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV78 is provided in Example 2. 

The NOV78 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 



640 



WO 02/068649 PCT7US02/02785 



brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
5 lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
NOV78 nucleic acids encoding the MAP kinase-like protein of the invention, or fragments 

10 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 78A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

1 5 be changed from the corresponding base shown in Table 78 A while still encoding a protein 
that maintains its dual specificity phosphatase-jike activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 78 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

20 The invention additionally includes nucleic acids or nucleic acid fragments, or 

complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 

25 used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 36% of the bases 
may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 78B. The invention also includes a mutant or 
30 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 78B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 59% of the amino acid residues may be so changed. 
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These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the u Anti-NOVX Antibodies" section 
5 below. 



NOV79 

The disclosed NOV79 (alternatively referred to herein as CG56862-01) includes the 
939 nucleotide sequence (SEQ ID NO:263) shown in Table 79A. A NOV79 ORF begins with 
10 a Kozak consensus ATG initiation codon at nucleotides 19-21 and ends with a stop codon at 
nucleotides 928-930. The disclosed NOV79 maps to human chromosome 20. 



Table 79A. NOV79 Nucleotide Sequence (SEQ ID NO:263) 

TAC^GCAGGGTCTCCCTATGTTTCCCAGGCTGG^ 

GGCCTCCCAAAGTTCTGGGATTACAGACCCTGCAG6CGTCGGGCCTGGGCCGTCAGGGCAGCT 

GATCGCTTCCCGGGCGGCGAGCTGGGGGTGCACCCGGACCGCCGCCGCCGGGATCATGGGCAATGGCATG 

ACCAAGGTACTTCCTGGACTCTACCTCGGAAACTTCATTG^ 

TCCTGTTTCTTTCAGATGCCAAAGACCTGGATCAGCTGGGCCGAAATAAGATCACACACATCATCTCTAT 
CCATGAGTCACCCCAGCCTCTGCTGCAGGATATCACCrACCTTCGCATCCCGGTCGCTGATACCCCTGAG 
GTACCCATGAAAAAGCACTTCAAAGAATGTATCAACTTCATCCACTC 

GCCTTCTGCACACCACGATTGTGACAGCGTATGTGATGACTGTGACGGGGCTAGGCTGGCGGGACGTG^ 

T GAAGC CATCAAGGC CACCAGGC CCATCGCCAAC CC CAACCCAGGC TTTAGG CAGCAG CTTGAAGAGTTT 

GGCTGGGCCAGTTCCCAGAAGGTACAGCTTCGCCGGCAGCTGGAGGAGCGCTTCGGCGAGAGCCCCTTCC 

G CGACGAGGAGGAGTTGCGCGCGCTGCTGCCGCTGTGCAAGCGCTGCCGGCAGGG CTCCGCGACCTCGGC 

CTCCTCCGCCGGGCCGCACTCAGCAGCCTCCGAGGGAACCGTGCAGCGCCTGGTGCCGCGCACGCCCCGG 

GAAGCCCACCGGCCGCTGCCGCTGCTGGCGCGCGTCAAGCAGACTTTCTCTTGCCTCCCCCGGTGT 

CCCGCAAGGGCGGCAAGTGAGGATGCAGT 



A NOV79 polypeptide (SEQ ID NO:264) encoded by SEQ H) NO:263 is 303 amino 
1 5 acids in length and is presented using the one-letter amino acid code in Table 79B. The Psort 
profile for NOV79 predicts that this sequence has a signal peptide and is likely to be localized 
to the cytoplasm with a certainty of 0.6500. In alternative embodiments, a NOV79 
polypeptide is located to lysosomes with a certainty of 0.2216. The Signal P predicts a likely 
cleavage site for a NOV79 peptide is between positions 24 and 25, i.e. 9 at the dash in the 
20 sequence VLG-LQ. 



Table 79B. NOV79 Polypeptide Sequence (SEQ ID 
NO:264) 

MFPRLVSNSWAHTVLLPWPPICVLGLQTLQASGLG 

GNGMTKVLPGLYIiGNFIGHPASQIGSSILFLSDAKDIJDQLGRNIQTHI IS IHESPQPIiLQ 
DITYLRIPVADTPEVPMKKHFKECINFIHCCRLN^ 

LEAIKATRPIANPNPGFRQQLEEFGWASSQKVQLKRQLEERFGESPFRDEEELRALLPLC 
KRCRQGSATSASSAGPHSAASEGTVQIUJVPRTPREAHRPLPLIARVKQTFSCLPRCLSRK 
GGK - 
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A BLAST analysis of NOV79 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV79 had high 
homology to other proteins as shown in Table 79C. 



Table 79C. BLASTX results from PatP database for NOV79 






Smallest 






Sum 


Sequences producing High- scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp;AAE04840 Human SGP008 phosphatase polypeptide 


1298 


3.5e-132 


patp:AAY68795 Amino acid sequence of a human protein 


433 


1.6e-40 


patp:AAB67l67 Human dual-specificity phosphatase DSP-3 


433 


1.6e-40 


patp:AAB6643l Human DSP-3 protein - Homo sapiens, 184 aa. 


433 


1.6e-40 


patp:AAB73216 Human phosphatase AA374753 h - Homo sapiens 


433 


1.6e-40 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 93 of 1 18 bases (78%) identical to a gb:GENBANK- 
lD:HUMFLNG6PD|acc:L44140.1 mRNA from Homo sapiens (chromosome X region from 
filamin (FLN) gene to glucose-6-phosphate dehydrogenase (G6PD). The full amino acid 
sequence of the protein of the invention was found to have 273 of 276 amino acid residues 
(98%) identical to, and 274 of 276 amino acid residues (99%) similar to, the 275 amino acid 
residue ptnr.TREMBLNE W-ACC :CAC 1 0008 protein from Homo sapiens (Human) 
(BA243J1 6.6 (NOVEL PROTEIN)). NOV79 also has homology to the other proteins shown 
in the BLASTP data in Table 79D. 





Table 79D. NOV79 BLASTP results 


Gene Index/ 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi| 18104 942 |r 
ef |NP_542178. 
l| <NM 080611) 


dual specificity 
phosphatase -like 15 
[Homo sapiens] 


243 


241/245 
(98) 


242/245 
(98) 


e-133 


gi | 17458347 |r 
ef |XP_05928B. 
l| <XM_059288) 


similar to DA243J16.6 
(novel protein with a 
dual specificity 
phosphatase, catalytic 
domain) (H. 

sapiens) [Homo sapiens] 


235 


226/252 
(89) 


227/252 
(89) 


e-121 


gi | 9910432 | re 
f|NP 064570.1 
| (NM 020185) 


mitogen- activated 
protein kinase 
phosphatase x; homolog 
of mouse dual 
specificity phosphatase 
LMW-BSP2; JNK- 
stimulating phosphatase 
1 [Homo sapiens] 


184 


88/186 
(47) 


119/186 
(63) 


3e-44 
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gi|l3183069|g 
b|AAK15038.l| 
AF237619_1 
(AF237619) 


dual specificity 
phosphatase TS-DSP2 [Mus 
musculus] 


184 


87/186 
(46) 


119/186 
(63) 


le-43 


gi| 14726046 |r 
ef |XP_046543. 
l| (XM_046543) 


mitogen- activated 
protein kinase 
phosphatase x [Homo 
sapiens] 


184 


86/186 
(46) 


112/186 
(59) 


2e-41 



This BLASTP data is displayed graphically in the ClustalW in Table 79E. A multiple 
sequence alignment is given, with the NOV79 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 79D. 



Table 79E. ClustalW Alignment of NOV79 



NOV79 


gi 


18104942) 


gi 


17458347) 


gi 


9910432) 


gi 


13183069) 


gi 


14726046) 



(SEQ ID NO: 264) 

(SEQ ID NO: 694) 

(SEQ ID NO: 695) 

(SEQ ID NO: 696) 

(SEQ ID NO: 697) 

(SEQ ID NO:698) 



10 20 30 40 50 

■.-.|..-.|....|....| I |....|....| |....| 

N0V7 9 MFPRLVSNSWAHTVLLPWPPKVLGLQTLQASGLGRQGS CDRIASRAASWG 



18104942) 

17458347) 

9910432) 

13183069) 

14726046 



NOV79 

gi 1 18104942 | 
gi 1 17458347 I 
gi | 9910432 | 
gi 1 13183069 | 
gi j 14726046 j 



NOV79 



18104942) 
17458347 j 
9910432) 
13183069) 
14726046) 



NOV79 

gi 
gi 
gi 
gi 
gi 



18104942) 
17458347) 
9910432 | 
13183069) 
14726046 | 



60 



70 



80 



90 



100 



CTRTAAPGI 



MGNGMMKffiLPGLYgGNF 
MGNGMjjKwLPGLYgGNF 

wgngmSkmlpglyRgnf 
wgSgm§sSlpglyognf 



IGHPASQIGSSILFLS 
IGHPASQIGSSILFLS 



110 



iKMLPGIiYMGNF 1 ' 



120 




130 



140 



150 




160 



170 



180 



190 



200 




210 



220 



230 



240 



250 
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NOV79 

gi | 18104942 | 
gi 1 17458347 | 
gi j 9910432 | 
gi j 13183069 | 
gi 1 14726046 j 



NOV79 



18104942] 
17458347) 
9910432 | 
13183069| 
14726046 



NOV79 

gi 1 18104942 | 
gi j 17458347 j 
gi j | 9910432 | 
gi|l3183069| 
gi j 14726046 j 





290 



300 



WASSQKVgLgR< 
WASSQ--KL§ ' 

IASSQ- 
IKHEVH- 



260 270 

CRQGSATSASSAGPHSAgSEgT^QRLVPRTPREAHRPLPLLARVKQTgsC 
CRQGSATSASSAGPHSA^gTj</^ 
CRQGSATSASSAGPHSA|sjE|^^ 

g^lfcK gjwA 

310 - 



280 
..|.. 




Table 79F lists the domain description from DOMAIN analysis results against NOV79. 
This indicates that the NOV79 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 79F. Domain Analysis of NOV79 



gnl | Smart | smart 00135 , DSPc, Dual specificity phosphatase, catalytic domain 
SEQ ID NO: 873 

CD- Length = 139 residues, 97.8% aligned 
Score - 108 bits (271), Expect = 3e-25 

N0V79: 63 GMTKVLPGLYLGNFIGHPASQIGSSILFLSDAKDLDQLGR NKITHIIS - IHESPQPLLQD 121 

G +++LP LYLG++ SDA +L L + ITH+I+ E P 



Sbjct: 1 GPSE ILPHLYLGSY - 



- SDASNLALLKKLGITHVINVTEEVPUSNKSG 4 5 



N0V79: 122 ITYLRIPVADTPEVPMKKHPKECINFIHCCRLNGGNCLVH TTIVTAYVMTVT 173 

YL IPV D E + + E + FI GG LVH T++ AY+M 

St>3 Ct : 46 FLYLGIPVDDNTETKISPYLPEAVEFIEDAEKKGGKVLVHCQAGVSRSATLIIAYLMKYR 105 

NOV79: 174 GLGWRDVLEAI KATRP IANPNPGFRQQLEE F 204 

+ D + +K RPI +PN GF +QL E+ 
Sbjct: 106 NMSLNDAYDFVKERRPI I SPNFGFLRQLIEY 136 



Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphoiylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 
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NOV79 is predicted to be expressed in at least the following tissues: brain, kidney, 
pancreas, testis, whole organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
5 expression data for NOV79 is provided in Example 2. 

The NOV79 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 

10 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 

15 disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV79 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 

20 protein includes the nucleic acid whose sequence is provided in Table 79 A, or a fragment 

thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 79A while still encoding a protein 
that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 

25 are complementary to the sequence of Table 79 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

30 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 22% of the bases 
may be so changed. 



646 



WO 02/068649 



PCI7US02/02785 



The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 79B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 79B while still encoding a protein that maintains its dual specificity phosphatase-like 
5 activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 2% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
1 0 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV80 

The disclosed NOV80 (alternatively referred to herein as CG56882-01) includes the 
15 2039 nucleotide sequence (SEQ ID NO:265) shown in Table 80A. A NOV80 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 100-102 and ends with a TGA 
codon at nucleotides 1947-1949. The disclosed NOV80 maps to human chromosome 10. 



Table 80A. NOV80 Nucleotide Sequence (SEQ ID NO:265) 

"CATGTGCTAGGTO 

TCACC CTC CCCAAGACCCATTGCCCCATCATGGCCGGGGACCGGCTCCCCAGGAAGGTGATGGACGCCAA 
GAAGCTGGCCAGCCTGCTGCGGGGCGGGCCTGGGGGGGGCCTGGTCATC 

TACAACAGCTGGCATGTGCTCAGCTCCGTCAACATCTGCTGCTCCAAGCTGGTGAAGTGGCGGTTGCAGA 
AGGGCAAGGTGACCATTGTGGAGTTCATCCAGCCGGCCGCACGCAGCCAGGTGGAGGCCACTGAGCCACA 
GGACGTGGTGGTCTATGACCAGAGCACGCGGGCCGCAGACAGCTTCCTCTCCATCCTGCTGAGCAAGCTG 
GATGGCTCCTTCCACAGCGTGGCCGGCTGCTT 

TCTCCTCCTGCTTCCCCGACCTCTGCGAGGGCGAGCCTGCTGCCCTGCTACCCATGAGCCTCTCCCAGTC 
CTGCCTGCTCGTGCCCAGCGTGGGCCTGACCCTCATCCTGCCTCACCTCTACCTGGGCTCGCAGGAAGAC 
GTCCTGAACAAGGATCTGATGACGCAGAATGGAATAAGCTAC 
AGCCTGACTTCATCTACCAGAGCCACTTCTTGCGGGTCCCC^ 

GCCCTGGCTGGACAAGT CCATCGAGTTCGTCGATAAAGCCAAGCTGTCCAGCTGCCAAGTCATCGTC CAC 

CGTCTGGCCGGCATCTCCTGCTGTGCCACTATCGCCATC 

CCGAAGACGCCTACAGGTTTGTGAAGGACCAGCGCCCGTCG^ 

GCTGCTGGAGGACCAGAGCAGCCCGAAGCTGCTGGCCGCCGTGCAGGGCGACGCGGGCACCCCCTCAGGA 
ATGCAGGAGCCTCCCCCCAGCCCTGCGGCCGGGGCCCCACTGCCATGGCTGCCACCACCTACCTCAGAGA 
CCGCTGCCACCAGGAGTGCGGCTGCCAGGGAGGGCGGCCCGAGCGCGGGCAGGAAGCCCCCGGCGCCCCC 
CACGGCCACCAGCACGCTGC^GC^GGGCCTGCGCAGCCTGCGCCTCTCCTCGGACCACCTGCAGGACACC 
AGCCGCCTCAAGCCCTCCTTCTCTCTCGACATCAAGTCGGCCT 

CGGGCCCAGCGACCCCGGCGAGGCCCCGAAGCTCTCTGAAAGCTGGACAGCC^GTCGGGGCCATGTTGOT 
CCTGCCCTCGCCCTGCCCGGACGCCGCGCCCGCGGCCCGGGCCCAGCGACCCCGGCGAGGCCCCGAAGCT 
AGCCAGTCGGGGCCATGTTGGGCCCCTGCCCGGACGCCGCGCCCAGGC^ 

CCCCGCGCGCGGCCTTAACTTCGGCTACGCGGCTGCCGGGCCCTGGCCAGCCX3GCCAGCCCCGGAGCCTG 

GACGCCACCGCTCGACTCCCTGAAGaSTCCTCGGTGCTTCAGCCCCGAGGGCGTGCAAGGGCCGGGCLAGG 

GTGCTGTTTGCGCCCTTCX^CCGGGCGGGCGCCCCGGAACCCAACGGCTGCAGCGACCTGCCACG^ 

AGGCAGCAAGGGCTGAGCCCGGGACGGGTCAGACGAGCTGGCCCGACGAGCTGGCCCCGGATTCGCTVCT^ 

CAAGTGCTGCAGCTGCCAGATGGAGTTCGAGGAGGGCATGGTGGAGGGGCGCGCGCGCGGCGAGGAGCTG 

GCCGCCCTGGGCAAGCAGGGGAGCTTCTrCGGGCAGCGT^ 

GCCCTCGGCTCCGCCGCCCGCAGCTGGGCAGTTATAAATATATATTATATATAATGCAAAGAAAGGCAAA 
TGGTTTTAC 
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A NOV80 polypeptide (SEQ ID NO:266) encoded by SEQ ID NO:265 is 616 amino 
acids in length and is presented using the one-letter amino acid code in Table 80B. The Psort 
profile for NOV80 predicts that this sequence is a Type lb membrane protein, has no signal 
peptide, and is likely to be localized at the plasma membrane with a certainty of 0.7000. In 
5 alternative embodiments, a NOV80 polypeptide is located to peroxisomal microbodies with 
a certainty of 0.3000, to the mitochondrial inner membrane with a certainty of 0.2143, or to 
the nucleus with a certainty of 0.3000. 



Table 80B. NOV80 Polypeptide Sequence (SEQ ID 
NO:266) 

MAGDRLPRKVMDAKKLASLLRGGPGGGLVIDSHS FLEYNS WHVLS SVNI C CSKLVKWRLQ 
KGKVTIVEFIQPAARSQVEATEPQDVVVYDQSTRAAD 

MAIITGGFATFSSCFPDLCEGEPAALLPMSLSQSCLLVPSVGLTLILPHLYLGSQEDVIiN 
KDLMTQNGISYVLYASNSCPKPDFIYQSHFLRW 

SCQVIVHRLAGISCCATIAIAYIMKTMGMSSEDAYRFVKDQRPSISPNFNFIiGQIiLEDQS 
SPKLLAAVQGDAGTPSGMQEPPPSPAAGAPLPWLPPPTSETAATRSAAAREGGPSAGRKP 
PAPPTATSTLQQGLRSLRLSSDHLQDTSRLKPSFSLDIKSAYAPSRRPGGPGPATPARPR 
SSLKAGQPVGAMLGLPSPCPDAAPAARAQRPRRGPEASQSGPCWAPARTPRPGTPTARRA 
TPRAAIiTSATRLPGPGQPASPGAWTPPLDSLKRPRCFSPEGVQGPGRVLFAPFGRAGAPE 
PNGCSDLPRREAARAEPGTGQTSWPDELAPDSHFKCCSCQMEFEEGMVEGRARGEELAAIi 
GKQGS FSGSVEVIEMS 

10 A BLAST analysis of NOV80 was run against the proprietary PatP GENESEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV80 had high 
homology to other proteins as shown in Table 80C. 



Table 80C BLASTX results from PatP database for NOV80 






Smallest 






Sum 


Sequences producing High-scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAW29l50 Dual -specific murine thr-tyr phosphatase 


1873 


4.le-l93 


patp:AAE04834 Human SGP002 phosphatase polypeptide 


950 


6.6e-108 


patp:AAU090l6 Human dual specificity phosphatase 21117 


950 


6.6e-108 


patp:AAM25744 Human protein sequence 


955 


7.8e-96 


patp:AAB20325 Human protein phosphatase and kinase protein 


949 


3.4e-95 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1 587 of 1 930 bases (82%) identical to a gb:GENBANK- 
ID:HSU271 93|acc:U27193. 1 mRNA from Homo sapiens (Human protein-tyrosine 
phosphatase mRNA). The full amino acid sequence of the protein of the invention was found 
to have 489 of 625 amino acid residues (78%) identical to, and 514 of 625 amino acid residues 
(82%) similar to, the 625 amino acid residue ptnr:SWISSNEW-ACC:Q13202 protein from 
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Homo sapiens (Human) (DUAL SPECIFICITY PROTEIN PHOSPHATASE 8 (EC 3.13.48) 
(EC 3.1.3.16) (DUAL SPECIFICITY PROTEIN PHOSPHATASE HVH-5)). (NOV80 also 
has homology to the other proteins shown in the BLASTP data in Table 80D. 





Table 80D. NOV80 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi| 4758212 | re 
f |NP_004411.1 
| (NMJJ04420) 


dual specificity 
phosphatase 8 ; Hi 
phosphatase, vaccinia 
virus homolog; protein 
tyrosine phosphatase; 
serine/ threonine 
specific protein 
phosphatase 
[Homo sapiens] 


625 


480/632 
\ /a; 


506/632 


0.0 


gi| 6679156) re 
f|NP 032774.1 
| (NM_008748) 


neuronal 

tyrosine/ threonine 
phosphatase 1 
[Mus musculus] 


663 


450/682 
(65) 


476/682 
(68) 


0.0 


gi 1 17471343 |r 
ef |XP O61101. 

l| <XM_061101) ! 


similar to dual 
specificity phosphatase 
8; HI phosphatase, 
vaccinia virus homolog; 
protein 

tyrosinephosphatase ; 
serine/ threonine 
specific protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


461 


324/422 
(76) 


333/422 
(78) 


e-140 


gi| 13639013 |r 
ef |XP_Q12007. 
2 | (XM 012007) 


dual specificity 
phosphatase 8 
[Homo sapiens] 


501 


343/473 
(72) 


363/473 
(76) 


e-131 


gi|l2697945|d 
bj |BAB21791.1 
| (AB051487) 


KIAA1700 protein 
[Homo sapiens] 


690 


270/668 
(40) 


373/668 
(55) 


e-103 



This BLASTP data is displayed graphically in the ClustalW in Table 80E. A multiple 
sequence alignment is given, with the NOV80 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 80D. 



Table 80E. ClustalW Alignment of NO V80 


NOV80 (SEQ ID NO: 266) 
gi 4758212 | (SEQ ID NO: 699) 
gi 6679156 j (SEQ ID NO:700) 
gi 17471343 (SEQ ID NO:70l) 
gi 13639013 (SEQ ID N0:702). 
gi 12697945 (SEQ ID NO: 703) 

10 20 


MAGDRLPRKVMDAKKLASLLRGGPG 


0 
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NOV80 


gi 


4758212 | 


gi 


6679X56 | 


gi 


17471343 | 


gi 


13639013 j 


gi 


12697945) 



gi 
gi 
gi 
gi 
gi 



4758212 

6679156 

17471343 

13639013 

12697945 



NOV80 



4758212 
6679156 
17471343 
13639013 



gi 1 12697945 



NOV80 


gi 


4758212) 


gi 


6679156) 


gi 


17471343 


gi 


13639013 


gi 


12697945 



NOV80 

gi 
gi 
gi 
gi 
gi 



4758212) 
6679156 | 
17471343) 
13639013) 
12697945) 



NOV80 

gi|4758212| 
gi) 6679156 j 
gi|l747l343 
gi j 13639013 
gi j 12697945 



NOV80 



4758212 

6679156 

17471343 

13639013 

12697945 



NOV80 


gi 


4758212) 


gi 


6679156) 


gi 


17471343 




gi 


13639013 




gi 


12697945 





MAGDRLPRKVMDAKKLASLLRGGPG 
MAGDRLPRKVMDAKKLASLLRGGPG 
MAGDRLPRKVMDAKKLASLLRGGPG. 



afehttitsswqikegggkrglivvJJJh^ 

100 





120 130 

r J»n,i*.e,-| 
svla^--sfHsillskS 

;SVLA|jD- -SP^SILLS 



140 



150 




i^dlDCS@K^H3^^S DVASL SlD- -CF2rVLLGK|EKS[|NggHgA 



160 



170 



L 



180 



190 



I 



200 



GGPATFSSCFPgLCEOaPAgLgPMSLSQgCLJJVPSVGIiTaiLPHLYLGSQ. 
GGFATFSSCFPgLCEG|pAgLSpMSLSQPCLPVPSVGLTRILPHLYLGSQ! 
GGFATFS SCFPgLCEGgPAy L§§MS LSQPCLPVPS VGLTRI LPHLYLGSQ 
GGFATFSSCFPBLCEGi3iPA™Ln^MSLSQ§CLi3vPSVGLT2lLPHLYLGSq 

MSLSQPCLPV P5V GLTRILPHLYLGSQ 
IIsQPCLPvJ^gHtrILPSlYLgBQ 
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NO V80 


gi 


4758212| 


gi 


6679156] 


gi 


17471343 | 


gi 


13639013 | 


gi 


12697945) 



410 



420 



430 



440 



450 



NOV80 

gi 4758212 | 
gi 6679156| 
gi 17471343 | 
gi 13639013 | 
gi 12697945) 



NOV80 



gi 
gi 
gi 
gi 
gi 



4758212) 
6679156 j 
17471343 
13639013 
12697945 



NOV80 

gi | 4758212 | 
gi | 6679156 j 
gi j 17471343 | 
gi j 13639013 j 
gi j 12697945 j 



NOV80 



gi 


4758212] 


gi 


6679156) 


gi 


17471343 


gi 


13639013 


gi 


12697945 


NOV80 


gi 


4758212) 


gi 


6679156) 


gi 


17471343 


gi 


13639013 


gi 


12697945 


NOV80 


gi 


4758212 | 


gi 


6679156 j 


gi 


17471343 


gi 


13639013 


gi 


12697945 



NOV80 

gi 
gi 

gi 



4758212) 
6679156) 
17471343 1 




SQS( 
PAG2 
PASS 
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gi 1 13639013 | PPPAA RPBMgY YI 
gi) 12697945 | H lffcSgl -- 



Table 80F lists the domain description from DOMAIN analysis results against NOV80. 
This indicates that the NOV80 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 



Table 80F. Domain Analysis of NOV80 


gnl| Smart I smart 001 95, DSPc, Dual specificity phosphatase, catalytic domain 


SEQ ID 


N0:873 








CD-Length =139 residues, 97.1% aligned 




Score = 


154 bits (388), Expect = 2e-38 




NOV80: 


162 


GLTLILPHLYLGSQEDVIiNKDIOTQNGISyVLYASNSCPKPDFIYQSHFLRVPISroNYCT 


221 






G + ILPHLYLGS D N L+ + GI++V+ + P +L +P++DN 




Sbjct: 


1 


GPSEILPHLYLGSYSDASNliAIiLKKLGITHVINVTEE VPN - SNKSGFLYLGI PVDDNTET 


59 


NOV80: 


222 


KLLPWLDKSIEFVDKAKLSSCQVIVHRLAGISCCATIAIAY 


281 






K+ P+L +++EF++ A+ +V+VH AG+S AT-f IAY+MK MS DAY FVK++ 




Sbjct: 


60 


KI S PYLPEAVE FIEDAEKKGGKVLVHCQAGVS RS ATLI IAYLMKYRNMSUIDAYDFVKER 


119 


NOV80: 


282 


RPS I S PNFNFLGQLLE 297 








RP ISPNF FL QL+E 




Sb j ct : 


120 


RPIISPNFGFLRQLIE 135 





Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
10 processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 

NOV80 is predicted to be expressed in at least the following tissues: kidney. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
15 sources, and/or RACE sources. Further expression data for NOV80 is provided in Example 2. 

The NOV80 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer, heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
20 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
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fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV80 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
5 the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 80A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 80A while still encoding a protein 

1 0 that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 80A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

15 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

20 In the mutant or variant nucleic acids, and their complements, up to about 1 8% of the bases 
may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 80B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

25 in Table 80B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 22% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

30 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV81 
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NOV81 includes two galactosyltransferase-like proteins, designated herein as NOV81a 
and NOV81b. 



NOV81a 

5 The disclosed NOV8 1 a (alternatively referred to herein as CG56283-0 1) includes the 

1247 nucleotide sequence (SEQ ID NO:267) shown in Table 81 A. A NOV81a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 33-35 and ends with a stop codon 
at nucleotides 1224-1226. The disclosed NOV81a maps to human chromosome 19. 



Table 81 A. NOV81a Nucleotide Sequence (SEQ ID NO:267) 

\ CTCCCGCGGCCGCCCCTTCCCTGGGCCGGGTCATGCGCTGCCCCAAGTGCCTTCTCTGCCTGTCAGCACT 
GCTCACACTCCTGGGCCTC7VAAGTGTACATCGAATGGACATCCGAGTCCCGGCTCAGCMGGCCTACCCC 
AGCCCTCGGGGCACCCCGCCAAGCCCCACGCCAGCCAACCCTGAGCCCACCCTACCTGCCAACCTCTCCA 
CCCGCCTGGGCCAGACTATCCCGCTGCCCTTTGCTTACTGGAACCAGCAGCAGTGGCGGCTGGGGTCCCT 
GCCCAGTGGGGACAGCACTGAAACGGGGGGCTGCC^GGCTTGGGGGGCCGCCGCCGCCACCGAGATCCCT 
GACTTCGCCTCCTACCCCAAGGACCTCCGCCGCTTCTTGCTGTCAGCAGCCTGCCGGAGCTTCCCACAGT 
GGCTGCCTGGAGGTGGTGGCGGCCAAGTCTCCAGCTGCTCAGATACTGATGTCCCCTACCTGCTGTTGGC 
CGTGAAGTCAGAACCAGGGCGCTTTGCAGAACGACAGGCCGTG^ 

GGGATCCGGCTGCTCTTCCTGCTAGGGTCTCCGGTAGGTGAGGCGGGGCCTGACCTAGACTCACTAGTGG 

CATGGGAGAGCCGTCGCTACAGTGACCTGCTGCTCTGGGACTTCCTCGACGTCCCATTC^CCAGACGCT 

CAAAGACCTGCTGCTGCTGGCCTGGCrrGGGCCGCCACTGCCCCACCGTGAGTTTTGTCTTGCGAGCT 

GACGATGCCTTTGTACACACCCCTGCCCTGCTGGCTCACCTGCGGGCCCTGCGACCTGCCTCGGCCCGAA 

GCCTCT ACCTGGGTGAGGTCTTTACCCAGG CCATGC CTCTCCGGAAGCCAGGAGGACCCTTCTATGTGCC 

CGAGTCCTTCTTCGAAGGTGGCTACCCAGCCTATGCAAGCGGGGGTGGCTACGTGATTGCCGGGCGCCTG 

GCACCCTGGCTGCTGCGGGCGGCAGCCCGTGTGGCACCCTTCCCCTTTGAGGACGTCTACACTGGCC1TT 

GCATCCGAGCCCTGGGCCTGGTGCCCCAGGCCCACCCAGGCTTCCTCACAGCCTGGCCAGCAGACCGCA 

TGCGGACCACTGTGCTTTCCGCAACCTGCTGCTGGTACGGCCCCTGGGCCCCCAGGCCAGCATTCGGCTC 

TGGAAACAACTGCAAGACCCAAGGCTCCAGTGCTGACTCTCATTGGGGAGGGCGGAG 



A NOV81a polypeptide (SEQ ID NO:268) encoded by SEQ ID NO:267 is 397 amino 
acids in length and is presented using the one-letter amino acid code in Table 8 IB. The Psort 
profile for NOV81a predicts that this sequence has a signal peptide and is likely to be 
localized to lysosomes with a certainty of 0.8650, or to the outside of the cell with a certainty 
of 0.8191 . The Signal P predicts a likely cleavage site for a NOV81a peptide is between 
positions 34 and 35, i.e., at the dash in the sequence SKA-YP. 



Table 81B. NOV81a Polypeptide Sequence (SEQ ID 
NO:268) 

MRCPKCLLCLSALLTLLGLKVYIEWTSESRLSKAYPSPRGTPPSPTPANPEPTLPANLST 
RLGQTI PLPFAYWNQQQWRLGSLPSGDSTETGGCQAWGAAAATEI PDFAS YPKDLRRFLL 
S AACRS FPQ WLPGGGGGQVS S CSDTDVP YIJjLAVKSE PGRFAERQAVRET WGS PAPGI RL 
LFLLGSPVGEAGPDLDSLVAWESRRYSDLLLWDFI^ 

FVLRAQDDAFVHTPALLAHLRALPPASARSLYLGEVFTQAMPLRKPGGPFYVPESFFEGG 
YP AYAS GGG YVI AGRLAP WLLRAAARVAP FP FED VYT GL C I RALGLVP QAHPG FLTAWP A 
DRTADHCAFRNLLLVRPLGPQAS IRLWKQLQDPRLQC 

NOV81b 



10 



15 
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The disclosed NOV81b (alternatively referred to herein as CG56283-02) includes the 
1368 nucleotide sequence (SEQ ID NO:269) shown in Table 8IC A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 4-6 and ends with a TGA codon at 
nucleotides 1 195-1 197. In Table 81C, putative untranslated regions are indicated by 
5 underlining, and the stop and start codons are indicated in bold. The disclosed NOV81b maps 
to human chromosomes 19. 



Table 81C. NOV81b Nucleotide Sequence (SEQ ID 
NO:269) 



GTCATGCGCTGCCCCAAGTGCCTTCTCTGCCTGTCAGCACTGCTCACACTCCTGGGCCTC 
AAAGTGTACATCGAGTGGACATCCGAGTCCCGGCTCAGCAAGGCCTACCCCAGCCCTCGG 
GGCACCCCGCCAAGCCCCACGCCAGCCAACCCTGAGCCCACCCTACCTGCCAACCTCTCC 
ACCCGCCTGGGCCAGACTATCCCGCTGCCCTTTGCTTACTGGAACCAGCAGCAGTGGCGG 
CTGGGGTCCCTGCCCAGTGGGGACAGCACTGAAACGGGGGGCTGCCAGGCTTGGGGGGCC 
GCCGCCGCC^CCGAGATCCCTGACTTCGCCTCCCACCCCAAGGACCTCCGCCGCTTCTTG 
CTGTCAGC^GCCTGCCGGAGCTTCCCACAGTGGCTGCCTGGAGGTGGTGGCAGCCAAGTC 
TCCAGCTGCTCAGATACTGATGTCCCCTACCTGCTGTTGGCCGTCAAGTCAGAACCAGGG 
CGCTTTGCAGAACGACAGGCCGTGAGAGAGACGTGGGGCAGTCCAGCTCCAGGGATCCGG 
CTGCTCTTCCTGCTAGGGTCTCCGGTGGGTGAGGCGGGGCCTGACCTAGACTCACTAGTG 
GCCTGGGAGAGCCGTCGCTACAGTGACCTGCTGCTCTGGGACTTCCTCGACGTCCCATTC 
AACCAGACGCTCAAAGACCTGCTGCTGCTGGCCTGGCTGGGCCGCCACTGCCCCACCGTG 
AGTTTTGTCTTGCGAGCTCAGGACGATGCCTTTGTACACACCCCTGCCCTGCTGGCTCAC 
CTGCGGGCCCTGCCACCTGCCTCGGCCCGAAGCCTCTACCTGGGTGAGGTCTTTACCCAG 
GCCATGCCTCTCCGGAAGCCAGGAGGACCCTTCTATGTGCCCGAGTCCTTCTTCGAAGGT 
GGCTACCCAGCCTATGCAAGCGGGGGTGGCTACGTCATTGCCGGGCGCCTGGCACCCTGG 
CTGCTGCGGGCGGCAGCCCGTGTGGCACCCTTCCCCTTTGAGGACGTCTACACrGGCCTT 
TGCATCCGAGCCCTGGGCCTGGTGCCCCAGGCCCACCCZkGGCTTCCTCACAGCCTGGCCA 
GCAGACCGCACTGCGGACCACTGTGCTTTCCGCAACCTGCTGCTGGTACGGCCCCTGGGC 
CCCCAGGCCAGCATTCGGCTCTGGAAACAACTGCAAGACCCAAGGCTCCAGTGCTGACTC 
TCATTGGGGAGGG CGGAGGTGCTGAC CTGGCCCCGGCCCTGGCCTGGG CCTCTGGGGCCG 
GCCCCTGGCTCAG CCCCTCCTTCCAGGTCTTGATGGGAGGGAGGAGGG CCCAG AAGCTGG 
ACAACTTAAGC CACTCCTTGGCCTCCCC CAGCCAGGTGAGTGAGCTAT 



A NOV81b polypeptide (SEQ ID NO:270) encoded by SEQ ID NO:269 is 397 amino 
10 acids in length and is presented using the one-letter amino acid code in Table 8 ID. The Psort 
profile for NOV81b predicts that this sequence has a signal peptide and is likely to be 
localized to lysosomes with a certainty of 0.8650, to the exterior of the cell with a certainty of 
0.8190. The Signal P predicts a likely cleavage site for a NOV81b peptide is between 
positions 34 and 35, i.e., at the dash in the sequence SAK-YP. 
15 

Table 81D. NOV81b Polypeptide Sequence (SEQ ID 
NO:270) 

MRCPKCIiLCLSALLTLLGLKVYIEOT 

i^GQTIPLPFAYWNQQQWIU^GSLPSGDSTETGGCQAWGAAAATEIPDFASHPKDLRRFLL 
SAACRSFPQWLPGGGGSQVSSCSDTDVPYLLLAVKSEPGRFAERQAVRETWGSPAPGIRL 
LFLLGSPVGEAGPDIJ3SLVAWESRRYSDLLLWDFLDVPFNQTI^LLLIAWLGRHCPTVS 
FVLRAQDDAFVHTPALLAHLRALPPASARSLYLGEVFT^^ 
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YPAYAS GGGYVI AGRLAPWLLRAAARVAPFP FEDVYTGLCIRALGL VPQAHPGFLTAWPA 
DRTADHCAFRNLLLVRPLGPQAS IRLWKQLQDPRLQC 



A BLAST analysis of NOV81 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV81 had high 
5 homology to other proteins as shown in Table 8 IE. 



Table 81E. BLASTX results from PatP database for NOV81 






Smallest 






Sum 


Sequences producing High-scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAB03619 Human bet a -1,3 -galactosyl transferase Znssp2 


2130 


2.4e-220 


patp:AAB03620 Murine beta- 1, 3-galactosyl transferase Znssp 


1585 


1.4e-162 


patp:AAM41987 Human polypeptide 


1528 


1.5e-156 


patp:AAE05767 Human secreted protein (SECP) 


641 


i.5e-62 


patp:AAM40201 Human polypeptide 


641 


1.5e-62 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 647 of 1088 bases (59%) identical to a gb:GENBANK- 
ID:AP001754|acc:AP00 1754.1 mRNA from Homo sapiens (genomic DNA, chromosome 21q, 
section 98/105). The foil amino acid sequence of the protein of the invention was found to 
have 127 of 343 amino acid residues (37%) identical to, and 194 of 343 amino acid residues 
(56%) similar to, the 397 amino acid residue ptnr:TREMBLNEW-ACC:AAD09763 protein 
from Mus musculus (Mouse) (BETA- 1 ,3-N-ACETYLGLUCOSAMINYLTRANSFERASE 
(EC 2.4.1.149)). NOV81 also has homology to the other proteins shown in the BLASTP data 
in Table 81F. 



10 



15 



Table 81F. NOV81 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 
(%) 


Positive 
(%) 


Expect 


gi 1 14290592 |g 
b|AAH09075.1 | 
AAH09075 
(BC009075) 


beta-l,3-N- 

acety lglucosaminyl trans f 
erase l [Mus musculus] 


397 


126/350 
(36) 


192/350 
(54) 


4e-58 


gi| 16973463 |g 
b|AAL32299.l| 
AF321831 1 
(AF321831) 


beta- 3- 

galactosyltransf erase 
[Danio rerio] 


406 


128/346 
(36) 


186/346 
(52) 


le-57 
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gi| 9938024) re 
f |NP_058584.2 
| (NMJ>16888) 


UDP-GlcNAc:betaGal beta- 
1,3-N- 

acetylglucosaminyltransf 
erase l; beta-l,3-N- 
acetylglucosaminyl trans f 
erase; beta-l,3-N- 
acetylglucosaminyltransf 
erase 1; UDP- 
Gal:betaGlcNAc beta 1,3- 
galactosyltransf erase , 
polypeptide 6 
[Mus musculus] 


397 


125/350 
(35) 


141/350 
(53) 


2e-57 


gi|9845238|re 
f |NP_006568.2 
| (NM_006577) 


beta-l f 3-N- 

acetylglucosaminyltransf 
erase bGnT-1; beta3gal- 
T5 gene; beta-l,3-N- 
acety Iglucosaminyl trans f 
erase bGnT-2 
[Homo sapiens) 


397 


121/351 
(34) 


187/351 
(52) 


6e-56 


gi| 9664889 |gb 
|AAF97254.l|A 
F288209_l 
(AF288209) 


beta 

galactosyltransf erase 
bGalT7 ' [Homo sapiens] 


393 


121/351 
(34) 


187/351 
(52) 


6e-56 



This BLASTP data is displayed graphically in the ClustalW in Table 81G. A multiple 
sequence alignment is given, with the NOV81 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 8 IF. 



Table 81G. ClustalW Alignment of NOV81 



Novaia 

NOV81b 

gi 
gi 
gi 
gi 
gi 



14290592) 
16973463| 
9938024 
9845238 
9664889 



NOV81a 
NOV81b 



gi 
gi 
gi 
gi 
gi 



14290592| 

15973463) 

9938024 

9645238 

9664889 



NOV81a 

NOV81b 

gi | 14290592 | 

gi j 16973463 | 

gi I 9938024 | 

gij 9845238 j 

gi | 9664889 | 



NOV81a 



(SEQ ID NO; 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 

(SEQ ID NO: 



268) 
270) 
704) 
705) 
706) 
707) 
708) 




110 



120 



130 

AAAA^Elpg^agYi^ 



140 



150 
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NOV81b 



gi 
gi 
gi 
gi 



14290592 

16973463 

9938024 

9845238 

9664889 



NOV8la 


NOV81b 


gi 


14290592 


gi 


16973463 


gi 


9938024 


gi 


9845238 


gi 


9664889 



NOV8ia 
NOV81b 



14290592 | 
16973463 I 
9938024 
9845238 
9664889 



NOV81a 
NOV81b 



14290592 

16973463 

9938024 

9845238 

9664889 



NOV8la 


NOV81b 


gi 


14290592 


gi 


16973463 


gi 


9938024 


gi 


9845238 


gi 


966^889 



NOV81a 


NOV81b 


gi 


14290592 


gi 


16973463 


gi 


9938024 


gi 


9845238 


gi 


9664889 



NOV81a 
NOV81b 



14290592) 
16973463 I 
9938024 
9845238 
9664889 



aaaaRexp! 




160 



170 



180 



190 



200 




360 



370 



380 




410 




ITVSEQLLLIGFTD 



r Bi g E PVYTG u c ^ 

Sv B^ pISbwtgBcBB^ 

svRlyp iddvytgmclSk 
MSl yp iddvytgmclSk 
3vBlyp iddvytgmclBk 

?Vj?iLYP IDDVYTGMCLgK 
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Table 81H lists the domain description from DOMAIN analysis results against 
NOV81 . This indicates that the NOV8 1 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 81H. Domain Analysis of NOV81 

gnl|Pfam|pfam01762 , Galactosyl T. Galactoayltransfara^. Thia family i^in^ a 
the galactosyl transferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- 
galactosyltransferase and UDP -Gal : beta -GlcNAc beta 1, 3 -galactosyl tranf erase. 
Specific galactosyltransferases transfer galactose to GlcNAc terminal chains 
in the synthesis of the lacto-series oligosaccharides types 1 and 2. SEQ ID 
N0:874 

CD-Length = 195 residues, 98.5% aligned 
Score = 77.4 bits (189), Expect « le-15 



NOV81 : 162 AERQAVRETWGSP- -APGIRL- -LFLLGSPVGEAGPDLDSLVAWESRRYSDLLLWDFLDV 2 17 

A R A+R+TW + + G R+ LFL+G + L LV E+R Y D+++ D D 

Sb j c t : 1 ARRNAIRKTWMNQNNSRGGRIKSLFLVG- - LAALDGKLKKLVMEEARLYGDI IWDLEDS 5 8 

NOV8 1 : 218 P FNQTL KDLLLLAWLGRHCP TV S FVLRAQDD AFVHT PALLAHLRAL P PAS ARS L YLGEVF 277 

N TLK L +L ++ CP + + DD FV+ LL+ L + + G + 

Sbj ct : 59 YLNLTLKTLTILLYWSKCPNAKLIGKIDDDVFVNPDNLLSLLEREY IDPSPLSFYGYI I 118 

NOV8 1 : 278 TQAMPLRKPGGPFYVPES FF - E GGYPAYASGGGYVIAGPJjAPWLLRAAARVAPFPFEDVY 336 

P+R +YVP + + YP Y SG Y+++ AP +L+A+ EDV 

Sbjct: 119 KNGEPVRTK1KSKWYVPPTAYPCSNYPPYLSGPFYILSRDAAPLILKASKHRRFIKIEDV1> 178 

NOV81: 337 - TGLCIRALGLVPQ 349 

TG+ LG+ 
Sbjct; 179 ITGILALDLGISRI 192 



There are 2 known types of carbohydrate chains in the lacto series of oligosaccharides: 
type 1 chains, which contain the Gal(beta-l-3)GlcNAc linkage, and type 2 chains, which 
contain the topoisomer GaI(beta-l-4)GlcNAc. The biosynthesis of both types of chains is 
catalyzed by specific galactosyltransferases (GalTs), which transfer galactose (Gal) to N- 

10 acetylglucosamine (GlcNAc)-terminating chains. Beta-4-GalT enzymes (e.g., GGTB2; are the 
galactosyltransferases responsible for type 2 chain biosynthesis, while beta-3-GalTs are the 
type 1 elongating enzymes. 

Kolbinger et al. (1998) searched an expressed sequence tag (EST) database with the 
amino acid sequence of a human beta-3-GalT, which they called beta-3-GalTl, and identified 

1 5 human brain cDNAs encoding a novel beta-3-GalT, which they named beta-3-GalT2. The 

deduced 422-amino acid beta-3-GalT2 protein has a predicted type II transmembrane topology 
with 5 potential N-glycosylation sites, and a predicted molecular mass of 49,202 Da. Beta-3- 
GalT2 shares 46% amino acid identity with beta-3-GalTl, but has a 17-amino acid extension 
at the carboxy terminus and longer cytoplasmic and stem regions. Beta-3-GalT2 directed the 
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synthesis of type 1 chains in mammalian cells and transferred Gal to GlcNAc- and Gal- 
terminating acceptors in enzymatic assays. 

Northern blot analysis demonstrated strong expression of a 3.5-kb beta-3-GalT2 
transcript, and weaker expression of a 2.8-kb transcript, in heart and brain. Amado et al. 
5 (1998) stated that a human beta-3-galactosyltransferase gene, called beta-3-GalTl by them, 
was isolated from a melanoma cell line using a transfection-cloning strategy. Beta-3-GalTl is 
a predicted 326-amino acid protein. By carrying out a BLAST search of an EST database with 
the beta-3-GalTl coding sequence, Amado et al. (1998) identified cDNAs encoding 3 other 
beta-3-galactosyltransferases, beta-3-GalT2, beta-3-GalT3, and beta-3-GalT4. The sequences 
10 of the 4 predicted proteins share 29 to 42% identity and have several conserved short sequence 
motifs. All 4 appear to be evolutionary related, since their coding regions are contained in a 
single exon. 

Using an insect cell expression system, Amado et al. (1998) showed that beta-3-GaITl 
and beta-3-GalT2 are UDP-galactose:beta-N-acetyl-glucosamine beta-1,3 

1 5 galactosyltransferases with similar kinetic properties. Northern blot analysis revealed that 
beta-3-GalTl is expressed as a 6.5-kb mRNA exclusively in brain. Hennet et al (1998) 
identified a mouse beta-3-GalTl homolog, designated beta-3-GalTI, and found that the coding 
region was contained in a single exon. 

NOV81 is predicted to be expressed in at least the following tissues: colon, blood, and 

20 lymphocyte. This infonnation was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, public 
EST sources, literature sources, and/or RACE sources. Further expression data for NOV81 is 
provided in Example 2. 

The NOV81 nucleic acids and proteins of are useful in potential therapeutic 

25 applications implicated in various pathological disorders described further herein, for example, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, autoimmume disease, 
allergies, immunodeficiencies, transplantation, graft versus host disease (GVHD), 
lymphaedema, anemia, ataxia-telangiectasia, autoimmume disease,immunodeficiencies, 
Hirschsprung's disease, Crohn's Disease, appendicitis as well as other diseases, disorders and 

30 conditions. NOV8 1 nucleic acids encoding the galactosyltransferase-like protein of the 

invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a beta-l,3-galactosyltransferase-like 
protein includes the nucleic acid whose sequence is provided in Table 81 A or 8 1C, or a 
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fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 81 A or 8 1C while still 
encoding a protein that maintains its beta-l,3-galactosyltransferase-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
5 nucleic acids whose sequences are complementary to the sequence of Table 81 A or 81 C, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

10 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 41% of the bases 

1 5 may be so changed. 

The novel protein of the invention includes the beta-l,3-galactosyltransferase-like 
protein whose sequence is provided in Table 8 IB or 8 ID. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 81B or 8 ID while still encoding a protein that maintains its beta-1,3- 

20 galactosyltransferase-like activities and physiological functions, or a functional fragment 

thereof. In the mutant or variant protein, up to about 63% of the amino acid residues may be so 
changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
25 methods* These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV82 

30 The disclosed NOV82 (alternatively referred to herein as CG56983-01) includes the 

348 nucleotide sequence (SEQ ID NO:271) shown in Table 82A. A NOV82 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 9-11 and ends with a stop codon at 
nucleotides 321-322, The disclosed NOV82 maps to human chromosome X. 

] 
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Table 82A. NOV82 Nucleotide Sequence (SEQ ID NO:271) 

CTATCCCTATGGTGTCGGTGTGCAGGCCGTGGCCTGCTGTGGCCATAGCACTTCTGGCTCTGCTGGTCTG 

CCTGGGGGCGCTGGTCGACACCTGCCCCATCAAACCCGAGGCTCCTGGCGAAGACGAGTCCCTGGAGGAG 

CTGAGCCACTATTATGCTTCCCTGTGCCACTACCTCAACGTGGTCACCAGACAGTTAATTTCAGAGAGAA 

ACCTACCAGACACCATTGTGTCCAAGGAAGTATTTTTCACAAGCACAAAGGAAAGACCTGTGA 

GAAGGAAGGTTGCCATCTGCAAGCCAAGGAGAGAAGCCTCTGAAAAAACCAAACCTGCTGGCACCTTG 



A NOV82 polypeptide (SEQ ID NO:272) encoded by SEQ ID NO:271 is 104 amino 
acids in length and is presented using the one-letter amino acid code in Table 82B. The Psort 
profile for NOV82 predicts that this sequence has a signal peptide and is likely to be localized 
5 at the exterior of the cell with a certainty of 0.8200. In alternative embodiments, a NOV82 
polypeptide is located to the endoplasmic reticulum (membrane) with a certainty of 0.1000. 
The Signal P predicts a likely cleavage site for a NOV82 peptide is between positions 28 and 
29, i.e. 9 at the dash in the sequence VDT-CP. 



Table 82B. NOV82 Polypeptide Sequence (SEQ ID 
NO:272) 

MVSVCRP WPAVAI ALLALLVCLGALVDTCP I KPEAPGEDESLEELSHYYASLCHYLNVVT 
RQLISERNLPDTIVSKEVFFTSTKERPVRTQKEGCHLQAKERSL 

10 ~ " ' ~ ~~ ' -.--^ ^^^ ^ 

A BLAST analysis of NOV82 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV82 had high 
homology to other proteins as shown in Table 82C. 



Table 82C. BLASTX results from PatP database for NOV82 






Smallest 






Sum 


Sequences producing High- scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAE09439 Human sbghPYYa protein - Homo sapiens 


329 


l.7e-29 


patp:AAB08020 Amino acid sequence of a human peptide yY 


301 


l.6e-26 


patp:AAG75364 Human colon cancer antigen protein 


293 


l.le-25 


patp:AAYi4602 Amino acid sequence of the baboon PY 


221 


4.7e-18 


patp:AAY43334 Neuropeptide Y - Synthetic, 97 aa. 


188 


1.5e-14 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 217 of 288 bases (75%) identical to a gb:GENBANK- 
ID:HUMPYYP3|acc:Dl 3902.1 mRNA from Homo sapiens (Human mRNA for peptide YY). 
The full amino acid sequence of the protein of the invention was found to have 62 of 94 amino 
20 acid residues (65%) identical to, and 73 of 94 amino acid residues (77%) similar to, the 97 
amino acid residue ptnr:SWISSNEW-ACC:P10082 protein from Homo sapiens (Human) 
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(PEPTIDE YY PRECURSOR (PYY)). NOV82 also has homology to the other proteins 
shown in the BLASTP data in Table 82D. 



Table 82D. NOV82 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
{%) 


Expect 


gi | 1172796 |sp| PI 
0082|PYY_HUMAN 


PEPTIDE YY PRECURSOR 
(PYY) (PEPTIDE 
TYROSINE TYROSINE) 


97 


62/94 
(65) 


73/94 
(76) 


2e-23 


gi | 131753 |sp|P10 
63l|PYY_RAT 


PEPTIDE YY PRECURSOR 
(PYY) (PEPTIDE 
TYROSINE TYROSINE) 


97 


61/94 
(64) 


72/94 
(75) 


8e-23 


gi| 4 758982 |ref |N 

P_004151.l| 

(NM 004160) 


peptide YY 
[Homo sapiens} 


90 


60/91 
(65) 


71/91 
(77) 


4e-22 


gi| 422871 |pir| |S 
34569 


peptide YY precursor 
(clone L2) - human 
(fragment) 


90 


60/91 
(65) 


71/91 
(77) 


4e-22 


gi)422870|pir| |S 
34568 


peptide YY precursor 
(clone Ll) - human 
(fragment) 


90 


59/91 
(64) 


70/91 
(76) 


2e-21 



5 This BLASTP data is displayed graphically in the ClustalW in Table 82E. A multiple 

sequence alignment is given, with the NOV82 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in. 
Table 82D. 



Table 82E. ClustalW Alignment of NOV82 



NOV82 <SEQ ID NO: 272) 

gi 1172796] (SEQ ID N0:709) 

gi 131753 | (SEQ ID NO:710) 

gi 4758982 | (SEQ ID N0:711) 

gi 42287l| (SEQ ID NO:712) 

gi 422870) (SEQ ID NO: 713) 



N0V82 

gi 1 1172796 | 
gi | 131753 | 
gi|4758982| 
gi|42287l| 
gi | 422870 | 



N0V82 



1172796) 
131753) 
4758982) 
422871 | 
422870 




NOV82 ERSL 
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1172796 | 
131753 | 
4758982 | 
42287l| 
422870 



Table 82F lists the domain description from DOMAIN analysis results against NOV82. 
This indicates that the NOV82 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 82F. Domain Analysis of NOV82 



gnllPfam|pfam001S9. hormone3, Pancreatic hormone peptide SEQ ID NO: 875 

CD-Length =36 residues, 91.7% aligned 
Score = 43.1 bits (100), Expect = 8e-06 



10 



N0V82 : 30 PIKPEAPGEDESLEELSHYYASLCHYLNWTRQ 62 

P KPE PG+D S E+L+ Y +L Y+N++TR 
Sbjct; 2 PSKPEYPGDDASPEDLAQYLRALRQYINLITRP 34 



Pancreatic hormone (PP) is a peptide synthesized in pancreatic islets of Langherhans, 
which acts as a regulator of pancreatic and gastrointestinal functions. The hormone is 
produced as a larger propeptide, which is enzymatically cleaved to yield the mature active 
peptide, which is 36 amino acids in length and has an amidated C-terminus. The hormone has 
a globular structure with residues 2-8 forming a left-handed poly-proline-II-like helix, residues 
9-13 a beta turn, and 14-32 an alpha-helix,held close to the first helix by hydrophobic 
interactions. Unlike glucagon, another peptide hormone, the structure of pancreatic peptide is 
preserved in aqueous solution. Both N- and C-termini are required for activity: receptor 
15 binding and activation functions may reside in the N- and C-termini respectively. 

PYY is secreted from endocrine cells in the lower small intestine, colon, and pancreas. 
It acts on the gastrointestinal tract as an inhibitor of gastric acid secretion, gastric emptying, 
digestive enzyme secretion by the pancreas, and gut motility (Leiter et al., 1987). A related 
peptide, pancreatic polypeptide is secreted only by cells within the endocrine and exocrine 
20 pancreas and specifically inhibits the secretion of enzymes and bicarbonate from the exocrine 
pancreas. A third member of this gene family is neuropeptide Y. 

Each of these proteins are synthesized with a signal peptide sequence followed by a 
36-amino acid active peptide and a carboxyterminal peptide. During maturation, the signal and 
carboxyterminal peptides are cleaved and a common carboxyterminal tyrosine in the mature 
25 peptide is amidated. Hort et al. (1995) cloned the human PYY gene by screening a genomic 
library with a PCR product produced from the rat locus. The gene contains 4 exons spanning 
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about 1.2-kb of DNA. Exon 1 represents 5-prime untranslated sequence and is 75% identical 
to the comparable rat sequence. 

PYY and PPY are about 10-kb apart and are mapped them by fluorescence in situ 
hybridization to 17q21. Based on a comparison of the 3 gene sequences,it has been concluded 
5 that NPY and PYY are the result of a gene duplication event, and that a subsequent tandem 
duplication produced the PPY gene. Pancreatic polypeptide, peptide tyrosine-tyrosine (PYY), 
and neuropeptide tyrosine (NPY), three members of a family of structurally related peptides, 
are mainly expressed in the endocrine pancreas, in endocrine cells of the gut, and in the brain, 
respectively. Synthetic human PYY prepared using a solid-phase synthetic technique was 

1 0 found to be structurally identical to the natural peptide. 

NOV82 is predicted to be expressed in at least the following tissues: endothelium, 
esophageal carcinoma. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 

1 5 NOV82 is provided in Example 2. 

The novel nucleic acids of the invention encoding a PEPTIDE YY -like proteins 
includes the nucleic acid whose sequence is provided in Table 82A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 82A while still encoding a protein that maintains 

20 its PEPTIDE YY -like activities and physiological functions, or a fragment of such a nucleic 
acid. The invention further includes nucleic acids whose sequences are complementary to the 
sequence of Table 82A, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

25 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

30 In the mutant or variant nucleic acids, and their complements, up to about 25% of the bases 
may be so changed. 

The novel protein of the invention includes the PEPTIDE YY -like protein whose 
sequence is provided in Table 82B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 82B while 
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still encoding a protein that maintains its PEPTIDE YY -like activities and physiological 
functions, or a functional fragment thereof In the mutant or variant protein, up to about 35% 
of the amino acid residues may be so changed. 

The NOV 82 nucleic acids and proteins are useful in potential therapeutic applications 
5 implicated in various pathological disorders described further herein, for example, esophageal 
carcinoma, inflammatory bowel disease, diverticular disease, as well as other diseases, 
disorders and conditions. NOV82 nucleic acids encoding the peptide YY-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 
10 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

15 

NOV83 

The disclosed NOV83 (alternatively referred to herein as CG56890-01) includes the 
1 701 nucleotide sequence (SEQ ID NO:273) shown in Table 83 A. A NOV83 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 19-21 and ends with a stop codon 
20 at nucleotides 1 669- 1 67 1 . The disclosed NOV83 maps to human chromosome 1 7. 



Table 83A. NOV83 Nucleotide Sequence (SEQ ID NO:273) 



TGCGCCCCGTGCTCAGCCATGGTGGACATGGGGGCCCTGGACAACCTGATCGC 
AGGCCCGGAAGCCCTCGGACTGCGACAGCAAAGAGCTG 

GCTGCAGGGCTGCGCGGAGCTCCGCCAGAAGCTGTCCCTGAACTTCCACAGCCTGTGTGAGCAGCAGCCC 

ATCGGTCGCCGCCTCTTCCGTGACTTCCTAGCCAC^GTGCCCACGTTCCGCAAGGCGGCyUVCCTTCCTAG 

AGGACGTGCAGAACTGGGAGCTGGCCGAGGAGGGACCCACCTU^GACAGCGCGCTGCAGGGGCTGGTGGC 

CACTTGTGCGAGTGCCCCTGCCCCGGGGAACCCGCAACCCTTCCTCAGCCAGGCCGTGGCCACCAAGTGC 

CAAGCAGCCACCACTGAGGAAGAGCGAGTGGCTGCAGTGACGCTGGCCAAGGCTGAGGCCATGGCTTTCT 

TGCAAGAGCAGCCCTTTAAGGATTrCGTGACCAGCGCCTTCTACGACAAGTTTCT 

CGAGATGGAACCAGTGTCAGACAAGTACITCACTGAGTTCAGAGTGCTGGGGAAAGG 

GTAAAAAACACTGGGAAGATGTATGCCTGTAAGAAACTGGACAAGAAGCGGCTGAAGAAGAAAGGTGGCG 

AGAAGATGGCTCTCTTGGAAAAGGAAATCTTGGAGAAGGTCAGCAGC 

TGCCTTTGAGAGCAAGACCCATCTCTGCCTTGTCATGAGCCTGATGAATGGGGGAGACCTCAAGTTCCAC 
ATCTAC^UVCGTGGGCACGCGTGGCCTGGACATGAGCCGGGTGATCTTTTACTCGGCCa^GATAGCCTGTG 
G(^TGCTGCACCTCCATGAACTCGGC^TCGTCTATCGGGACATGAAGCCTGAGAATGTGCTTCTGGATGA 
CCTCGGC^CTGCAGGTTATCTGACCTGGGGCTGGCCGTGG 

AGGCAGGCTGGAACCAATGGTTACATGGCTCCTGAGATCCTAATGGAAAAGGTAAGTTATTCCTATCCT 
TGGACT GGT TTG C C ATGGGATG CAGCATTTATGAAATGGTTGCTGGACGAACAC CATTCAAAGAT TACAA 
GGAAAAGGTCAGTAAAGAGGATCTGAAGCAAAGAACTCTGCAAGACGAGGTCAAATT CCAG CATGATAAC 
TTCACAGAGGAAGCAAAAGATATTTGCAGGCTCTTCTTGGCTAAGAAACCAGAGCAACGCTTAGGAAG^ 
GGAGAGAAAAGTCTGATGATCCC^GGAAACATCATTTCTTTAAAACGATCAACTTTCCTCGCCTGGAAGC 
TGGCCTAATTGAACCCCCATTTGTGCCAGACCCTTCAGTGGTTTAT^ 

GATTTCTCTGAGGTTCGGGGGGTGGAATTTGATGACAAAGATAAGCAGTTCTTCAAAAACTTTGCGACAG 
GTGCTGTTCCTATAGCATGGCAGGAAGAAATTATAGAAACGGGACTGTTTGAGGAACTGAATGACCCCAA 
CAGACC TACGGG TTGTGAGGAGGGT AATTCAT C CAAGT CTGGCGTGTGTTTGTT ATTGTAAAT TG CT CT C 
TTTACCAGACAGGCAGCAGGA 
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A NOV83 polypeptide (SEQ ID NO:274) encoded by SEQ ID NO:273 is 550 amino 
acids in length and is presented using the one-letter amino acid code in Table 83B. The Psort 
profile for NOV83 predicts that this sequence is likely to be localized to the nucleus with a 
5 certainty of 0.9685. In alternative embodiments, a NOV83 polypeptide is located to 
microbodies with a certainty of 0. 1 3 1 7. 



Table 83B. NOV83 Polypeptide Sequence (SEQ ID 
NO:274) 

EQQPIGRRLFRDFIATVPTFRKAATFLEDVQ^ 

NPQPFLSQAVATKCQAATTEEERVAAVTIiAKAEAMAFLQEQPFKDFVTSAFYDKFLQWKL 
FEMQPVSDKYFTEFRVLGKGGFGEVia^KMYAC 

VSSPFIVSLAYAFESKTHLCLVMSIiMNGGDLKFHIYNVGTRGLDMSRVIFYSAQIACGML 
HLHELGIVYRDMKPENVLIJ)DLGNCRLSDLGIAVEMKGGKPITQRQAGTNGYMAPEILME 
KVSYSYPVDWFAMGCSIYEMVAGRTPFKDYKEKVSKEDLKQRTLQDEVKFQHDNFTEEAK 
DICRLFLAKKPEQRLGSRREKSDDPRKHHFFKTINFPRLEAGLIEPPFVPDPSVVYAKDI 
AEIDDFSEVRGVEFDDKDKQFFKNFATGAVPIAWQEEIIETGLFEELNDPNRPTGCEEGN 
SSKSGVCLLL 



A BLAST analysis of NOV83 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV83 had high 
homology to other proteins as shown in Table 83C. 



Table 83C. BLASTX results from PatP database for NOV83 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp:AAU03502 Human protein kinase #2 - Homo sapiens 


2812 


1.3e-292 


patp:AAY57085 Human rhodopsin kinase amino acid sequence 


1255 


1.3e-127 


patp:AAY24423 GRK4 polymorphism GRK4-alpha protein 


1248 


7.0e-127 


patp:AAY24424 GRK4 polymorphism GRK4-beta protein 


1190 


1.7e-124 


patp:AAY24425 GRK4 polymorphism GRK4- gamma protein 


1215 


2.2e-123 



In a search of sequence databases, it was found, for example, that the nucleic acid 
1 5 sequence of this invention has 1 359 of 1 700 bases (79%) identical to a gb:GENBANK- 

ID: AF0630 1 6|acc: AF06301 6. 1 mRNA from Spermophilus tridecemlineatus (Spermophilus 
tridecemlineatus G protein-coupled receptor kinase GRK7 mRNA). The full amino acid 
sequence of the protein of the invention was found to have 463 of 549 amino acid residues 
(84%) identical to, and 502 of 549 amino acid residues (91%) similar to, the 548 amino acid 
20 residue ptnr:SPTREMBL-ACC:Q9Z2G7 protein from Spermophilus tridecemlineatus 
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(Thirteen-lined ground squirrel) (G PROTEIN-COUPLED RECEPTOR KINASE GRK7). 
NOV83 also has homology to the other proteins shown in the BLASTP data in Table 83D. 



Table 83D. NOV83 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


rti 1 "i *7 no ca i Q 1 n 
y x | j. / uzojie j y 

b|AAL33880.l| 

AF282269_1 

(AF282269) 


G protein- coupled 
receptor kinase 7 [Homo 
sapiens] 


553 


548/555 
(98) 


548/555 
(98) 


0.0 


gi|l7933490|g 

b|AAL06241. l| 
(AY049726) 


retina G protein- coupled 
receptor kinase 7 
[Bos taurusj 


552 


470/555 
(84) 


503/555 
(89) 


0.0 


gi [17026320 |g 
b|AAL33881.l| 
AF282270JL 
(AF282270) 


G protein- coupled 
receptor kinase 7 [Sus 
scrofa] 


553 


465/555 
(83) 


504/555 
(90) 


0.0 


gi)400i826|gb 

|AAC95001.l| 

(AF063016) 


G protein- coupled 
receptor kinase GRK7 
[Spermophilus 
trideceral ineatus ] 


548 


463/554 
(83) 


502/554 
(90) 


0.0 


gi|l744385l|r 
ef |XP_067593. 
l| (XM_067593) 


similar to G protein - 
coupled receptor kinase 
GRK7 (H. sapiens) 
[Homo sapiens) 


692 


326/350 
(93) 


328/350 
(93) 


e-172 



5 This BLASTP data is displayed graphically in the ClustalW in Table 83E. A multiple 

sequence alignment is given, with the NOV83 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 83D. 



Table 83E. ClustalW Alignment of NOV83 



N0V83 



17026318 
17933490 
17026320 
4001826 | 
17443851 | 



NOV83 


gi 


17026318) 


gi 


17933490| 


gi 


17026320) 


gi 


4001826 | 


gi 


1744385l| 


NOV83 


gi 


17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826| 



SEQ ID NO: 274) 
SEQ ID NO: 714) 
SEQ ID NO: 715) 
SEQ ID NO: 716) 
SEQ ID NO; 717) 
SEQ ID NO: 718) 



10 



20 



30 



40 



50 



|....| 



|....|....| 



MPGEYKWAPETSARDTHASVQVEAYQGI^VHI^ISTLIiKK/lAALPRKSS 



60 

-I- 



70 



60 



90 



100 



|./..|....|. 



I 1 
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gi 


17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826| 


gi 


17443851! 


NOV83 


gi 


17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826 | 


gi 


17443851 



gi 1 17443 851 1 RETSFLRTAGSAEQSALHGCHKDTRNMRTSVYKGSVPCSDVGCDKFELQR 



NOV83 

gi j 17026318 
gi |l7933490 
gi 1 17026320 
gi|4001826| 
gi | 17443851 | 



NOV83 



NOV83 

gi | 17026318 
gi | 17933490 
gi j 17026320 
gi|4001826| 
gi (17443851 



NOV83 

gi 
gi 
gi 
gi 
gi 



17026318 
17933490 
17026320 
4001826 | 
1744385l| 



NOV83 



17026318 
17933490 
17026320 
4001826 | 
17443851 



NOV83 

gi | 17026318 | 
gi j 17933490 | 
gi | 17026320 | 
gi j 4001826 | 
gi j 17443851 | 



NOV83 

gill7Q26318| 



110 
.|....|.. 



120 



130 



140 



150 



GHRGFVSFSKSKPEPEPEFRPSLQPPPIAAVSRAPDATRAAGPGRRESES 



160 



170 



180 



190 



200 



EFGNEELESIFSGLENNQRLQGLSLRYCGLGPQSGLRLGSVISQSA1CEL 



210 



220 



230 



.|....|....| 



240 
..|.. 



250 



FLDGNyLECSGALALLRPIAGFAETQGEDQPAPGSPDTGNPPQRLQGHRT 



) 

260 



270 



280 



300 



HCKSLGRCLTREGKAASSPPALLCFPWECAPCS, 




310 



320 



330 



340 



350 




360 



370 



RRLFRDFLATVP 
RRLFRD FLAT VP 
RRLFRDFLATVP 
RRLFRDFLATVP 
RRLFRDFLATVP 
RRLFRDFLATVP 



410 




380 



390 



400 



fledvqnwelaeegp 
fledvqnwelaeegp 
fle^vq^welaeegp 
fle§vq§welaeegp 
fledvqnwelaeegp 
fledvqnwelaeegp 




420 



43 0 



440 



450 




460 



470 
..!.. 



480 



490 



500 
••I 



jSgFYDKFLOWKLFEMOPVSDKYFTEFRVLGKGGFG MB»^ KNTGKM 
3sgFYDK??r J QW,KLt^EMQPV'SDKYFTEFRVLGKGGFGEVCAVQVKNTGKM 
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[sgFYDKFLQWKgjFEMQPVSDKYFHEFRVLGKGGFGEVCAVQVKNTGKM 

is Sfydkflqwk2femqpvsdkyf§efrvlgkggfgevcavqvkntgkivi 
s jfydsflqwklfemqpvsdkyftefrvlgkggfgevcavq^ntgkm 
[sBfydkflqwklfemqpvsdkyftefrvlgkggfgevcavqvkntgkm 



gi|l7933490| 
gi j 17026320) 
gi|4001826| 
gi j 17443851 1 



NOV83 





17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826) 


gi 


17443851| 


NOV83 


gi 


17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826) 


gi 


1744385l| 


NOV83 


gi 


17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826 | 


gi 


17443851 | 



NOV83 

gi | 17026318 | 
gi j 17933490 j 
gi 1 17026320 1 
gi j 4001826 | 
gi |17443851| 



NOV83 


gi 


17026318 


gi 


17933490] 


gi 


17026320 


gi 


4001826 | 


gi 


17443851| 


NOV83 


gi 


17026318 


gi 


17933490 


gi 


17026320 


gi 


4001826) 


gi 


17443851 | 



NOV83 



17026318 | 
17933490 | 
17026320) 
4001826) 
174438511 



Eft 
IiV 

m 

ST 



510 



520 
. . I . . 



530 



540 
..I.. 



550 
..I 



yackkldkkrlkkkggekmallekeilekvsspfivslayafeskthlcl 
yackkldkkrlkkkggekmallekeilekvsspfivslayafeskthlcl 
yackkldkkrlkkkSgekmallekeile3vsspfivslayafesk|hlcl 
yackkldkkrlkkkggekmal§ekeilekvsspf2vslayafesk§hlcl 
yackkldkkrlkkkggekmallekeilskvgjspfivslayafeskthlcl' 
yackkldkkrlkkkggekmallekeilekvsspfivslayafeskthlcl 



560 



570 



580 



590 



600 



VMSLMNGGDLKFHIYNVGTRGLDMSRVIFYSAQlgCGjjLHLHf>LGIVYRD 

i^slmnggdlkfhiynvgtrgldmsrvifysaqi8cgSi,hlhBlgivyrd 
vmslwggdlkfhiy^g^gldmsrvifysaqiBcgwlhlhBlgivyrd 
vmsl^ggdlkfhiy^g^gldmjSrvifysaq^cgSlhlhBlgivyrd 
vmslmnggdlkfhiynvgtrgl^srvify^qScgSlhlhSlgivyrd 
vmslmnggdlkfkiynvgtrgldmsrvifysaqiScgSlhlhBlgivyrd 



vmslmnggdlkfhi YNVGTRi 

VMSLMNGGDLKFHIYNVGTRi 



glSmsrvify|jaqScgS: 

GLDMSRVI F YS AQ iBcgS 



610 



I. 



620 



630 



640 



650 



KPENVLLDDLGNCRLSDLGLAVE 
KPEOTLLDDLGNCRLSDLGLAV J 
KPENVLLDDLGNCRLSDLGLA 
KPENVLLDDLGNCRLSDLGLA 
KPENVLLDDLGNCRLS DLGLAVE 
KPENVLLDDLGNCRLS DLGLAVEjil? 




660 



670 



I 



680 
..I.. 



690 



I. 



700 
..I 



sysypvdwfamgcsiyer^vagrtpfkdykekvskedlkqrtl^evgfq 
sysypvdwfamgcsiyemvagrtpfkdykekvskedlkqrtl^evgfq 
asysypvdwfamgcsiyeotagrtpr2°ykekvskei)lkqrtl^evSfq| 
asysypvdwfamgcs!yemvagrtpfkdykek0skedlkqrtlwev§fq 

^ SYSYP VD WFAMGCSIY EMVAG RTPFKDlgKEKVSKEDL KigRTS^EVgP 1 " 



710 



720 



730 



740 



750 




760 

* . I . • 



770 



780 



790 



800 
. . | 



GLMEPPFVPDPSWYAKDIgEIDDFSEVRGVEFDDKDKQFF^FATGAVP 
GLgEPPFVPDPSWYAKDIgEIDDFSEVRGVSFDDKDKQFFfflFATGAVP 
GL|jjEPPFVPDPSWYAKDlSEIDDFSEVRGVEFDDjS|DKQFF^FATGAVP 
GL^PPFVPDPSVVYAKD^ElgDFSEVRGVEFDDKDKQFFgSFATGAVP 
~ LflEPPFVP DP SWYAKD ^EIDDFSE V RGVSFDDKD KQF F^F^TG AVP 



810 
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Table 83F lists the domain description from DOMAIN analysis results against NOV83. 
This indicates that the NOV83 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 83F. Domain Analysis of NOV83 


enllSmart|smart00220, S TKc, Serine/Threonine protein kinases, catalytic domain; Phosphotransferases. 


Serine or threonine-specific kinase subfamily. SEQ ID NO: 876 








CD-Length = 256 residues, 100.0% aligned 




Score = 


208 bits (530), Expect = 6e-55 




NOV83 : 


191 


FTE FRVLGKGGFGE V KNTGKMYACKKLDiCKKIJC^ F • 


245 






+ VLGKG FG+V K TGK+ A K + K++LKKK E L E +IL+K+ P 




Sbjct: 


1 


YELLEVLGKGAFGKVYLARDKKTGKLVAI KVI KKEKLKKKKRE - RI LRE I KILKKLDHPN 


59 


N0V83 : 


246 


IVSLAYAFESKTHLCLVMSLMNGGDLKFHIYITVGT^ 


305 






XV L FE L LVM GGDL + G L FY+ QI + +LH 




Sb j Ct : 


60 


I VKLYDVFEDDDKLYLVMEYCEGGDLFDLLKKRG- - RLSEDEARFYARQILSALEYLHSQ 


117 


NOV83 : 


306 


GIVYRDMKPENVLLDDLGNOU^SDLGLAVEMKGGKPITQRQAGTNGYMAPEILMEKVSYS 


365 






GI++RD+KPEN+LLD G+ +L+D GLA ++ G + GT YMAPE+L+ K Y 




Sb j Ct : 


118 


GI IHRDLKPENILLDSDGHVIOjADFGIiAKQLDSGGT - GYG 


176 


N0V83 : 


366 


YPVDWFAMGCSIYEMVAGRTPFKDYKEKVSKEDLKQRTLQDE VKF- - QHDNFTEEAKDI C 


423 






VD +++G +YE++ G+ PF + I* ++ + F + EAKD+ 




Sb j Ct : 


177 


KAVDIWSLGVILYELLTGKPPFPGDDQ LLALFKKIGKPPPPFPPPEWKISPEAKDLI 


233 


NOV83 : 


424 


RLFLAKKPEQRLGSRREKSDDPRKHHFF 451 








+ L K PE+RL +++ +H FF 




Sbjct: 


234 


KKLLVKDPEKRXiT AEEALEHPFF 256 





5 

Serine/threonine protein kinases are an extensive family of enzymes that catalyzes the 
phosphorylation of serine or threonine residues on its target protein. Protein kinases share a 
conserved catalytic core common to both serine/ threonine and tyrosine protein kinases. This 
domain contains residues, which are specific to the distinct types of protein kinases 

10 The S6/H4 kinase purified from human placenta catalyzes phosphorylation of the S6 

ribosomal protein, histone H4, and myelin basic protein. In vitro activation of the p60 S6/H4 
kinase requires removal of an autoinhibitory domain by mild trypsin digestion and 
autophosphorylation of the catalytic domain (p40 S6/H4 kinase). The two 
autophosphorylation/autoactivation sites contain the sequences SSMVGTPY (site 1) and 

15 SVIDPVPAPVGDSHVDGAAK (site 2) (SEQ ID NO:1381). These sequences identify S6H4 
kinase as the rac-activated PAK65 (Martin, G. A., Bollag, G., McCormick, F. and Abo, A. 
(1995) EMBO J. 14, 1971-1978). Site 1 phosphorylation is most rapid, but activation does not 
occur until site 2 is autophosphorylated. The site 1 phosphorylation occurs by an 
intramolecular mechanism whereas site 2 autophosphorylation occurs by an intermolecular 

20 mechanism. A model is proposed in which phosphorylation of sites 1 and 2 occurs 

sequentially. The model proposes that trypsin treatment of the inactive holoenzyme removes 
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an inhibitory rac-binding domain which blocks MgATP access to the catalytic site. The 
pseudosubstrate domain at site 1 is autophosphorylated and subsequent bimolecular 
autophosphorylation at site 2 fully opens the catalytic site. Phosphorylation by a regulatory 
protein kinase may occur at site 2 in vivo. 
5 Rapid regulation of G-protein-coupled receptors (GPCRs) involves agonist- promoted 

receptor phosphorylation by GPCR kinases (GRKs) . This process is followed by arrestin 
binding and transient receptor internalisation. It has been shown that beta-adrenergic receptor 
kinase (beta ARK-1 or GRK2) follows a similar pattern of internalisation upon agonist 
activation of beta(2)-adrenergic receptors (beta(2)AR) and that beta ARK expression levels 

1 0 modulate receptor sequestration. 

Such studies indicate a functional relationship between receptor phosphorylation and 
sequestration, showing that beta ARK not only translocates from the cytoplasm to the plasma 
membrane in response to receptor occupancy, but also shares endocytic mechanisms with the 
beta(2)AR . These results suggest a role for beta ARK in the sequestration process, or 

1 5 involvement of receptor internalisation in the intracellular trafficking of the kinase. 

NOV83 is predicted to be expressed in at least the following tissues: retina, spleen. 
This information was derived by determining the tissue sources of the sequences that were 
included in the invention including but not limited to SeqCalling sources, public EST sources, 
literature sources, and/or RACE sources. Further expression data for NOV83 is provided in 

20 Example 2. 

The NOV83 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, hemophilia, 
hypercoagulation, idiopathic thrombocytopenic purpura, immunodeficiencies, graft vesus host 
disease, Von Hippel-Lindau (VHL) syndrome, diabetes, tuberous sclerosis, as well as other 

25 diseases, disorders and conditions. NOV83 nucleic acids encoding the GPCR-like protein of 
the invention, or fragments thereof, may further be useful in diagnostic applications, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a G protein-coupled receptor kinase 
GRK7-like protein includes the nucleic acid whose sequence is provided in Table 83 A, or a 

30 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 83A while still encoding a 
protein that maintains its G protein-coupled receptor kinase GRK7-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
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nucleic acids whose sequences are complementary to the sequence of Table 83A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
5 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 21% of the bases 

10 may be so changed. 

The novel protein of the invention includes the G protein-coupled receptor kinase 
GRK7-like protein whose sequence is provided in Table 83B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 83B while still encoding a protein that maintains its G Protein- 

15 Coupled Receptor Kinase GRK7-like activities and physiological functions, or a functional 
fragment thereof. In the mutant or variant protein, up to about 16% of the amino acid residues 
may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

20 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV84 

25 The disclosed NOV84 (alternatively referred to herein as CG56912-01) includes the 

2355 nucleotide sequence (SEQ ID NO:275) shown in Table 84A. A NOV84 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 1 1-13 and ends with a stop codon 
at nucleotides 2336-2338. The disclosed NOV84 maps to human chromosome 17. 



Table 84A. NOV84 Nucleotide Sequence (SEQ ID NO:275) 

AGGCTGGAGGATGGGGCAGCAGGTCCTAGCGGGGGAGGGCAGGGGAGCCTCTGGGCAGGTACGGCCTGAC 
GCCCCGGGTCCTCCCGCCCCGCCAGGCCTGACGGAGGACGAGGACGTGCGCGCCATGCTGCGGGGCTCCC 
GGCTCCGCAAGATCCGCTCGCX3CACGTGGC&CAAGGAGC 

CGTGTGGTTCCAGCGGCGCATCCCGCGTGCGCCATCGCAGCACATCGTCCTCCGCCCTGACCCGGCCCTC 

CTCTCAGTCTTCGTGCAGCACATCGAGGCGGTCCGCGAGG 

GGGGTGCCTTCGCGCCAGCGCGCTGCCTGACCM 

GGCGCCCACGGCTGAGGAAGCX3CAGCGCTGGGTGCGCGGTCTGACCAAGCTCCGCGCGCGCCTGGACGCC 
ATGAGCCAGCX5CGAGCGGCTAGACCAATATTGGATCCACTCCTATCTGCACCGGGCTGACTCCAACCAGG 
ACAGCAAGATGAGCTTCAAGGAGATO^GAGCCTGCTGAGAATGGTCAACGTGGACATGAACGACATGTA 
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CGCCTACCTCCTCTTCAAGCAGGAGTGTGAC(^CTCCAACAACGACCGTCTAGAGGGGGCTCAGATCGAG 

GAGTTCCTG CGGCGGCTGCTGAAGCGGCCGGAGCTGGAGGAG ATCTT CCATCAGT ACTCGGGCGAGGAC C 

GCGTGCTGAGTGCCCCTGAGCTGCTGGAGTTCCTGGAGGACCAGGGCGAGGAGGGCGCCACACTGGCCCG 

CGCCCAGCAGCTCATTCAGACCTATGAGCTCAACGAGACACCCTCTCCTGCCACCCCTATGACACTGGAT 

GGCTTCATGATGTACCTGTTGTCGCCGGAGGGGGCTGCCTTGGACAACACCCACACGTGTGTGTTCCAGG 

ACATGAACCAGCCCCTTGCCCACTACTTCATCTCTTCCTCCCACAACACCTATCTGACTGACT 

CGGGGGGCC CAGCAGCACCGAGGCCTATGTTAGGGC CTTTGC CCAGGGATGCCGCTG CGTGGAGCTGG AC 

TGCTGGGAGGGGCCAGGAGGGGAGCCCGTCATCTATCATGGCC^TACCCTCACCTCCAAGATTCTCTTCC 

GGGACGTGGTCCAAGCCGTGCGCGACCATGCCTTCACGCTGTCCCCTTACCCTGTCATCCTATCCCTGGA 

CAACC^CGACGGGCTGGAGCAGCAGGCTGCCATGGCCCGCCACCrrCTGCACCATCCTGGGGGACATGCT^ 

GTGACACAGGCGCTGGACTCCCCAAATCCCGAGGAGCTGCCATCCCCAGAGCAGCTGAAGGGCCGGGTCC 

TGGTGAAGGGAAAGAAGCTGCCCGCTGCTCGGAGCGAGGATGGCCGGGCTCTGTCGGATCGGGAGGAGGA 

GGAGGAGGATG ACGAGG AGG AAGAAGAGGAG GTGGAGGCTGCAG CGC AGAGGCGG CTGG C CAAGC AGAT C 

TCCCCGGAGCTGTCGGCCCTGGCTGTGTACTGCCACGCC^CCCGCCTGCGGACCCTGCACCCTGCCCCCA 

ACGCCCCACAACCCTGCCAGGTCAGCTCCCTCAGCGAGCGCAAAGCCAAGAAACTCATTCGGGAGGCAGG 

GAACAGCTTTGTCAGGCACAATGCCCGCCAGCTGACCCGCGTGTACCCGCTGGGGCTGCGGATGAACTCA 

GC CAACT ACAGT C CC CAGGAGATGTGGAACT CGGGCTGT CAG CTGGTGGC CT TGAACTTC CAG ACGC CAG 

GCTACGAGATGGACCTCAATGCCGGGCGCTTCCTAGTCAATGGGCAGTGTGGCTACGTCCTAAAACCTGC 

CTGCCTGCGGCAACCTGACTCGACCTTTGACCCCGAGTACCCAGGACCTCCCAGAACCACTCT 

CAGGTGCTGACTGCACAGCAGCTGCCCAAGCTGAATGCCGAGAAGCCACACTCCATTGTGGACCCCCTGG 

TGCGCATTGAGATCGATGGGGTGCCCGCAGACTGTGCCCGGCAGGAGACTGACTACGTGCTCAACAATGG 

CTTCAACCCCCGCTGGGGGCAGACCCTGCAGTTCCAGCTGCGGGCTCCGGAGCTGGCACTGGTCCGGTTT 

GTGGTGGAAGATTATGACGCCACCTCCCCCAATGACTTTGTGGGCCAGTTTACACTGCCTCTTAGCA 

TAAAGCAAGGGTACCGCCACATACACCTGCTTTCCAAGGACGGGGCCTCACTGTCACCAGCCACGCTCTT 

CATCCAAATCCGCATCCAGCGCTCCTGAGGGCCCACCTCACTCGC 



A NOV84 polypeptide (SEQ ID NO:276) encoded by SEQ ID NO:275 is 775 amino 
acids in length and is presented using the one-letter amino acid code in Table 84B. The Psort 
profile for NOV84 predicts that this sequence is likely to be localized to the cytoplasm with a 
5 certainty of 0.4500. In alternative embodiments, a NOV84 polypeptide is located to 
microbodies with a certainty of 0.3000. 



Table 84B. NOV84 Polypeptide Sequence (SEQ ID 
NO:276) 

MGQQVLAGEGRGASGQVRPDAPGPPAPPGLTEDEDVRAT^RGSRLRKIRSRTWHKERLYR 
LQEDGLSVWFQRRIPRAPSQHIVLRPDPALLSVFVQHIEAVREGHQSEGLRRPGGAFAPA 
RCLTIAFKGRRKNLDLAAPTAEEAQRWVRGLTK^ 

SNQDS KMS FKEIKSLIJ^MVNVDMNDMYAYLLFKQECDHSNNDRLEGAEIEEFLRRIiLK^ 
EL EE I FHQ YS GEDRVLS APELLE FLEDQGEEGATLARAQQL I QTYELNET PS PAT PMTLD 
GFMMYLLSPEGAAIJ)NTHTC^FQD™QPL7^FISSSHNTYLTDSQIGGPSSTEAYVRAF 
AQGCRCVELDCWEGPGGE P VI YHGHTLTS KI LFRD WQAVRDHAFTLS P YP VI LS LDNHD 
GLEQQAAMARHLCTILGDMLVTQALDSPNPEELPSPEQLKGRVLVKGKKLPAARSEDGRA 
LSDREEEEEDDEEEEEEVEAAAQRRLAKQ I S PELSALAVYCHATRLRTLHPAPNAPQPCQ 
VS SLSERKAKKL I REAGNS FVRHNARQLTRVYPLGLRMNS ANYS PQEM WNSGCQL VALNF 
QTPGYEMDLNAGRFLVNGQCGYVLKPACLRQPDSTFDPEYPGPPRTTLSIQVLTAQQLPK 
LNAEKPHSIVDPLVRIEIHGVPADCARQETDYVLNNGFNPRWGQTLQFQLRAP 
WED YDATS PNDFVGQFTLPLSS LKQG YRHI HLLSKDGAS LS PATLF I Q I RI QRS 



A BLAST analysis of NOV84 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV84 had high 
homology to other proteins as shown in Table 84C. 



Table 84C BLASTX results from PatP database for NOV84 



Smallest 
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Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAG63220 Amino acid sequence of a human protein 


3735 


0.0 


patp.-AAB475l6 Human phospholipase C, 16835 - Homo sapiens 


3734 


0.0 


patp:AAY81394 Rat phospholipase C-delta-1 - Rattus sp. 


1882 


4.6e-194 


patp.-AAE10440 Novel human phospholipase protein #7 


1783 


1.4e-183 


patp:AAEH925 Human CG121 (or C592) lipase protein #1 


1182 


6.9e-120 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 954 of 1415 bases (67%) identical to a gb:GENBANK- 
ID:OCPLCMR|acc:Z49747.1 mRNA from Oryctolagus cuniculus (O.cuniculus mRNA for 
5 phospholipase C). The full amino acid sequence of the protein of the invention was found to 
have 381 of 745 amino acid residues (51%) identical to, and 524 of 745 amino acid residues 
(70%) similar to, the 745 amino acid residue ptnr:SPTREMBL-ACC:Q60450 protein from 
Cricetulus griseus (Chinese hamster) (PHOSPHOLIPASE C-DELTA1). NOV84 also has 
homology to the other proteins shown in the BLASTP data in Table 84D. 

10 



Table 84D. NOV84 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|l4715017|g 
b|AAH10668.l| 
AAH10668 
(BC010668) 


Similar to phospholipase 
C, delta [Homo sapiens] 


613 


604/615 
(98) 


606/615 
(98) 


0.0 


gi 

ef 

2 | 


17481372 |r 
XP_053638. 
!XM 053638) 


hypothetical protein 
XP_05363 8 
[Homo sapiens] 


581 


472/500 
(94) 


475/500 
(94) 


0.0 


S. 


2137061 |pi 
PC4183 


l-phosphatidylinositol- 
4 , 5-bisphosphate 
phosphodiesterase (EC 
3.1.4.11) delta-1 - 
Chinese hamster 
(fragment) 


745 


378/758 
(49) 


522/758 
(67) 


0.0 


gi|839398l|re 
f |NP_058731.1 
| {NM 017035) 


phospholipase C-deltal 
[Rattus norvegtcus] 


756 


368/757 
(48) 


501/757 
(65) 


0.0 


gi| 9790167] re 
f |NPJ)62650.1 
| (NM_019676) 


phospholipase C, delta; 
PLC-delta 1; 

phospholipase C delta-1 
[ATus mus cuius] 


756 


374/757 
(49) 


503/757 
(66) 


0.0 



This BLASTP data is displayed graphically in the ClustalW in Table 84E. A multiple 
sequence alignment is given, with the NOV84 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 84D. 



Table 84E. ClustalW Alignment of NOV84 
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NOV84 

gi 1 14715017 | 
gi j 17481372 I 
gi (2137061 | 
gi|839398l| 
gi | 9790167 | 



(SEQ ID NO:276) 

(SEQ ID NO: 719) 

(SEQ ID NO:720) 

(SEQ ID NO: 721) 

(SEQ ID NO: 722) 

(SEQ ID NO.-723) 



NOV84 


gi 


14715017 


gi 


17481372 


gi 


2137061 


gi 


8393981 


gi 


9790167 



NOV84 



14715017 | 

17481372) 

2137061 

8393981 

9790167 



NOV84 



14715017) 

17481372) 

2137061 

8393981 

9790167 



NOV84 

gi | 14715017 
gi|l7481372 
gi|213706l| 
gi j 8393981 1 
gi|9790167| 



NOV84 




gi 


14715017) 


gi 


17481372 


gi 


2137061 




gi 


8393981 




gi 


9790167 





NOV84 



14715017 

17481372 

2137061 

8393981 

9790167 



NOV84 



14715017| 
17481372 j 
2137061) 
8393981) 




MGQQVLAGEGRGASGQVRPDAPGPPAP] 



-mdsgrdflt: 

-MDSGRDFL 1 




60 



70 



80 



90 



100 



|....| 



|SQHS[VLRPDPALLS' 




110 



120 



130 



140 



150 




160 



170 



180 



190 



200 




260 



270 



280 



290 



300 
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gi | 9790167 | 



NOV84 



14715017| 
17481372 j 
2137061 
8393981 
9790167 



NOV84 



14715017 

17481372 

2137061 

8393981 

9790167 



NOV84 



14715017 | 
17481372) 
2137061 | 
8393981) 
9790167 



NOV84 

gi | 14715017 | 
gi | 17481372 j 
gi (2137061 
gi (8393981 
gi j 9790167 



NOV84 



14715017| 
17481372 j 
2137061 
8393981 
9790167 





510 



520 



540 



:PKgDKL|C£v 




560 



570 



530 

h^trlrtlhpapnSp@p 

HATRXiRTLHPAP] 

kdStrlrpcSIl 
ksvhfgg: 



580 



550 




HNP-ARS 



590 



600 




610 



620 



630 



640 



650 



NOV84 



14715017) 
17481372 
2137061 | 
8393981 
9790167 
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gi 1 174813*7 | 
gij 2137061 
gi (8393981 
gij 9790167 



NOV84 



gi 
gi 
gi 
gi 



14715017| 
17481372 j 
2137061 
8393981 
9790167 




PffiLALVRFfflVEDYDS^SgNDFBGC 1 
P& AL VRFj*VED YD&^|s BnD F||g C 



760 



770 



780 



I. 



|slkqgyrh|hllsk|g 
|slkqgyrhuhllsk§g 





Table 84F lists the domain description from DOMAIN analysis results against NOV84. 
This indicates that the NOV84 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 84F. Domain Analysis of NOV84 



gnllSmartlsmartOOl 48, PLCXc, Phospholipase C, catalytic domain (part); domain X; Phosphoinositide- 

specific phospholipases C. These enzymes contain 2 regions (X and Y) which together form a TIM barrel-like 

structure containing the active site residues. Phospholipase C enzymes (PI-PLC) act as signal transducers 

that generate two second messengers, inositoI-l,4,5-trisphosphate and diacylglycerol. The bacterial enzyme 

appears to be a homologue of the mammalian PLCs. SEQ ID NO: 877 

CD-Length = 145 residues, 93.1% aligned 
Score « 186 bits (471), Expect » 6e-48 



NOVB4: 323 QDMNQPLAHYFISSSHNTYLTDSQIGGPSSTEAYVRAFAQGCRCVELDCWEGPGGEPVIY 382 

QDM++PL+HYFI+SSHNTYLT Q+ G SS E Y++A GCRCVELDCW+GP GEPVIY 
Sbjct:' 1 QDMSKPLSHYFINSSHNTYLTGKQLWGESSVEGYIQ 60 

N0V84: 383 HGHTLTSKI LFRDWQAVRDHAFTLSP YPVI LSLDNHDGLEQQAAMARHLCTI LGDMLVT 442 

HGHT T I +V++A++ AF SPYPVILSL+NH +QQA MA+ I GD+L T 
Sbjct: 61 HGHTFTLPI KLSEVLEAI KKFAFVTSP YPVI LSLENHCSPDQQAKMAQMFKEI FGDLL YT 120 

NOV84: 443 QALDSPNPEELPSPEQ 458 
S + E LPSPEQ 

Sbjct: 121 PPTTS-SLEYLPSPEQ 135 



Phosphoinositide-specific phospholipase C acts as a signal transducer that generates 
two messengers, diacylglycerol and inositol 1,4,5-trisphosphate, by hydrolyzing inositol 
phospholipids. Molecules belonging to the PLC family are divided into subfamilies, PLC-beta, 
PLC-gamma, and PLC-delta, whose amino acid sequences are highly conserved in two distinct 
regions designated X and Y. PLC-delta- 1 is distinguished from PLC-gamma by lack of the 
SH2 and SH3 domains that are essential for activation of PLC-gamma by tyrosine protein 
kinases, and from PLC-beta by lack of the C-terminal region of PLC-beta that is responsible 
for binding and activation by G proteins. 
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Cheng et al. (1995) cloned cDNA for human PLC-delta-1 and localized the gene to 
chromosome 3 by means of a human/rodent somatic cell panel (Lyu et al., 1996). In the course 
of a large-scale sequencing analysis of genomic DNA in the vicinity of the homozygous 
deletion on chromosome 3p found in a lung cancer cell line, Ishikawa et al. (1997) found that 
5 the gene encoding phospholipase C, delta- 1 (PLCD1) is located just distal to the region 

removed by the deletion. They found that the gene consists of 15 exons and spans about 22 kb. 
By fluorescence in situ hybridization, they localized the PLCDI gene to 3p22-p21.3. 
Shimohama et al. (1998) examined the entire sequences corresponding to protein-coding 
exons 2-15 of the hamster PLC-delta-1 gene in genomic DNA derived from the leukocytes of 

10 13 unrelated patients with early-onset sporadic Alzheimer disease. In 1 of these patients whose 
clinical features and course did not differ from those of the other 12 cases, they found a 
change of codon CGC (arg) to C AC (his), located in the pleckstrin homology domain of the 
PLCDI gene. They stated that this was the first mutation found in the human PLC genes. 
Site-directed mutagenesis of the glutathione-S-transferase (GST/PLCD1) fusion 

1 5 protein changing argl05 to his resulted in a 4-fold decrease in the affinity of specific binding 
and a reduction in hydrolyzing activity to about 40% of that of the wildtype enzyme. This 
remarkable loss of function could be interpreted in terms of a conformational change in the 
pleckstrin homology domain. Shimohama et al. (1998) found that the argl05-to-his mutation 
was present ih heterozygous state in the patient with AD. The mutation was not found in DNA 

20 extracted from leukocytes of 23 unrelated patients with familial AD, 23 unrelated patients with 
early-onset sporadic AD, 46 unrelated patients with late-onset sporadic AD, and 456 
nondemented control subjects. Thus the change did not appear to be a common polymorphism. 

However, determination of the possible pathologic role required transgenic studies of 
the mutant gene to determine the role of the enzyme and the mutation and a search for other 

25 mutations in the pleckstrin homology domain of PLC genes in human subjects with genetic 
disorders. In vitro single point mutagenesis, inositol phospholipid hydrolysis, and substrate 
protection experiments were used to identify catalytic residues of human 
phosphatidylinositide-specific phospholipase C delta 1 (PLC delta 1) isolated from a human 
aorta cDNA library. Invariant amino acid residues containing a functional side chain in the 

30 highly conserved X region were changed by in vitro mutagenesis. Most of the mutant enzymes 
were still able to hydrolyze inositol phospholipid with activity ranging from 10 to 100% of 
levels in the wild type enzyme. Exceptions were mutants with the conversion of Arg338 to 
Leu (R338L), Glu341 to Gly (E341G), or His356 to Leu (H356L), which made the enzyme 
severely defective in hydrolyzing inositol phospholipid. Phospholipid vesicle binding 
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experiments showed that these three cleavage-defective mutant forms of PLC delta 1 could 
specifically bind to phosphatidylinositol 4,5-bisphosphate (PIP2) with an affinity similar to 
that of wild type enzyme. Western blotting analysis of trypsin-treated enzyme-PIP2 complexes 
revealed that a 67-kDa major protein fragment survived trypsin digestion if the wild type 
enzyme, E341G, or H356L mutant PLC delta 1 was preincubated with 7.5 microM PIP2, 
whereas if it was preincubated with 80 microM PIP2, the size of major protein surviving was 
comparable to that of intact enzyme. However, mutant enzyme R338L was not protected from 
trypsin degradation by PIP2 binding. These observations suggest that PLC delta 1 can 
recognize PIP2 through a high affinity and a low affinity binding site and that residues Glu341 
and His356 are not involved in either high affinity or low affinity PIP2 binding but rather are 
essential for the Ca(2+)-dependent cleavage activity of PLC. 

NOV84 is predicted to be expressed in at least the following tissues: aorta, brain, 
colon, foreskin, heart, muscle, placenta, stomach, uterus, whole embryo, brain, colon, eye, 
head and neck, lung, muscle, ovary, pancreas, placenta, skin, stomach, uterus. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV84 is provided in Example 2. 

The NOV84 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, 
cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis 
, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation as well as other diseases, disorders and conditions. 
NOV84 nucleic acids encoding the phospholipase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a Phospholipase C delta 1 -like 
protein includes the nucleic acid whose sequence is provided in Table 84A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 84A while still encoding a protein 
that maintains its Phospholipase C delta 1 -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 84A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 
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The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
5 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 33% of the bases 
may be so changed. 

The novel protein of the invention includes the Phospholipase C delta 1-like protein 
10 whose sequence is provided in Table 84B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
84 while still encoding a protein that maintains its Phospholipase C delta 1 -like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
-to about 49% of the amino acid residues may be so changed. 
15 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

20 

NOV85 

The disclosed NOV85 (alternatively referred to herein as CG56955-01) includes the 
4091 nucleotide sequence (SEQ ID NO:277) shown in Table 85A. A NOV85 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 13-15 and ends with a stop codon 
25 at nucleotides 4078-4080. 



Table 85A. NOV85 Nucleotide Sequence (SEQ ID NO:277) 

ACT ACTTGTGGAATGTCACTG C CT CGGGGTAT TTCACAAGACAGGT CACCTCTTGT GAAAGT C CGAAGT A 
ATTCTCTGAAAGCTCCTTCCACGCATGTCACAAAACCATCATTTAGCCAGAA 

AGACCAAAGAC C AGTAAAT CACTTGCATCAGAACAGTCTGTTG AATCAG CAGACATGGGTAAGGAC TGAC 
AGTGCCCCCGATC^GCAAGTGGAGACTGGGAAATCCCCCTCTC 

CTCAGTCGAGTGAAAACGCTGGTACTTCAGATTTAGAACTACCTGTCAGTCAAAGGAATCAAGATTTAAG 
TTTACAAGAGGCTGAAACTGAGCAATCAGATACTTTAGATAATAAAGAAGCTGTCATCCTAAGGGAAAAA 
CCTCCATCTGGACGCCAGACACCGCAGCCTTTAAG^ 

AGACCGGGTCAGACACTACCTGCTGGCTGCCCAATGATGCACGTCGAGAGGTCCACATAAAAAGAATGGA 
GGAAAGAAAAGCCTCGAGTACCAGTCCGCCTGGCGATTCTTTGGCTTCCATCCCATTTATAGATGAACCA 
ACTAGCCCTAGCATTGATCATGATATTGCAC^TATCCCT 
AGGTCCCCTCCMAGC^CAGTTCCTCCrTGCCT 

ACATGACCACGAATCTGTTGGCC CTCCTAG CCTGGATGCTCAG CC C7VACTCAAAGACAG AAAGATCAAAA 
TCATATGATGAGGGTCTGGAT GATTACAGAGAAGATG CAAAATTGTC CTTTAAGCACGT ATCTAGTCTGA 
AGGGAAT CAAGAT CGC AGACAGCCAAAAGTCAT C^GAAGACTCTGGGTC CAGAAAAGATTCTTCCT C AGA 
GGTCTTCAGTGATGCTGCC^GGAAGGGTGGCTTCATTTCCGACCCCTTGTCACCGATAAGGGCAAGCGA 
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GTTGGTGGAAGTATTCGGCCATGGAAACAGATGTATGTTGTCCTTCGGGGTCATTCACTTTACCTGTACA 
AAGATAAAAGAGAGCAGACGACTCCGT CTGAGGAAGAG CAGCCCATCAGTGTTAATGCTTGCTTGATAGA 
CAT CTCTTACAGTGAGACCAAGAGGAAAAATGTGTTTCGACTCACCAC GTCCGACTGTGAATGCCTGTTT 
CAGGCTGAAGACAGAGATGATATGCTAGCTTGGATCAAGACGATCCAGGAGAGCAGCAACCTAAACGAAG 
AGGACACTGGAGTC^CTAACAGGGATCTAATTAGTCGAAGAATAAAAGAATACAACAATCTGATGAGC^A 
AGCAGAACAGTTGCCAAAAACACCTCGCCAGAGTCTCAGCATCAGGCAAACTTTGCTTGGTGCTAAATCA 
GAGCCAAAGACTCAAAGCCCACACTCTCCGAAGGAAGAGTCGGAAAGGAAACTTCTCAGTAAAGATGATA 
CCAGTCCCCCAAAAGACAAAGGCACATGGAGAAAAGGCATTCCAAGTATCATGAGAAAGACATTTGAGAA 
AAAGCCAACTGCTACAGGAACTTTCGGCGTCCGACTAGATGACTGCCCACCAGCTCATACTAATCGGTAT 
ATT CCAT T AAT AGTTGACAT ATGTTG CAAATTAGTTGAAG AAAGAGGT CTTGAATAT ACAGGT ATTT AT A 
GAGTTCCTGGAAATAATGCAGCCATCTCAAGTATGCAAGAAGAACTCAACAAGGGAATGGCTGATATTGA ' 
TATACAAGATGATAAATGGCGAGATTTGAATGTGATAAGCAGTTTACTAAAATCCTTCTTCAGAAAACT 
CC TGAGC CT CT C T T CACAAATGAT AAATATGC TG AT TTTATTG AAGCC AATCGTAAAGAAGATCCT CTAG 
AT CGTCTGAAAACAT T AAAAAGAC T AATTCAC GATTTGCCIXSAACATCATT ATGAAACACTT AAGTT CCT 
TT C AGCTCATCTGAAGACAG TGG C AGAAAATT CAGAGAAAAAAAATAAGATGG AAC CAAG AAAC CT AGCA 
AT AGTGT T TGGT CCCAC C CTTGTT CGAACATCAGAAGACAACATG ACC CACATGGT CACC CACATG CCTG 
AC CAGTAC AAG AT TGT AGAAACG CT CATC CAGCACCATGACTGGTTTT TCACAGAAGAAGGTGCTGAAG A 
GCCTCTTACAACAGTGCAGGAGGAAAGCACAGTAGACTCCCAGCCAGTGCCAAACATAGA 
ACCAACATTGGAAGGACAGGAGTCTCCCCAGGAGATGTATCAGATTGAGCTACTAGT^ 
CTAAGGGTTCTTGGGGATCTGGAAAGGATCAGTATAGCAGGteAACTGCTT 
TGCTAGTCGCAAGAGGAAGAAGCCGAAAGAAAAAGCAGAGCCTAGCAGCT^ 

GTATTTTTTAAGAAAGAAAATGTGGAAC^GTGTCA(^TGATACTAAAGAGGAGTCCAAAAAAGAAAGTG 
AGACACTGGGCAGAAAACAG AAGATC ATCATTGCCAAAGAAAACAG CAC^ CAG C ACGAC 

AAAAGATGAAAAGATATCACTAGGAAAAGAGAGCACGCCTTCTGAAGAACCCTCACCACC^ 
AAACACAACAAGTCACCAACTCTCAGCTGTCGCTTTGCCATCCT 

CACAGAAGTCCTCCCACCTTGAAGAGACAGGCTCTGACTCTGGCACTTTGCTCAGCACGTCTTCCC^ 
CT C CCTGGCAAGGTTTT CCATGAAGAAATCAACGAGTCCAGAAACGAAACATAG CGAGT TTTTGG CCAAC 
GTCAG CACCATCACCTCAGAT TATTCCACCACATCGTCTGCTACATAC TTGACTAG CCTGGACTCCAGTC 
GACTGAGC CCTGAGGTGCAATCCGTGGCAGAGAGCAAGGGGGACGAGG CAGATGACGAGAGAAGCGAACT 
CATCAGTGAAGGGCGGCCTGTGGAAACCGACAGCGAGAGCGAGTTTCCCGTGTTCCCCACAGCCTTGACT 
TCAGAGAGGCTTTTCCGAGGAAAACTGCAAGAAGTGACTAAGAGCAGCCGGAGAAATTCTGAAGGAAGTG 
AATTAAGTTGCACCGAGGGAAGTTTAACATCAAGTTTAGATAGCCGGAGACAGCT 

ACT CATCGAATGTGATACT CTTTCCAGGAAAAAATCAGCTAGATTCAAGTCAGATAGTGGAAGTCTAGGA 
GAT G C CAAG AATGAGAAAGAAG CACCTT CGTT AACTAAAGTGTTTGAT GTT ATGAAAAAAGGAAAGT CAA 
CTGGGAGTTTACTGACACCCACC^GAGGCGAATCCGAAAAACAGGAACCC^C^TGGAAAACGAAAATAGC 
AGATCGGTTAAAACTTGAGACCCAGAGCCCCTGCGGATGACATGTTTGGAGTAGGGAATCACAAAGTGAAT 
GC CGAGACTGCT AAAAGGAAAAGCAT CCGGCGCAGACATACACT AGGAGGGCACAG AGAT CT AC CGAAA 
.T CAG CGT TTTGAATTTT TGGAAAGTGCATGAGCAGAGCGGGG AG AGAG AAT CTGAACTTTCAGC TGTAAA 
CCGGTTAAAACCAAAATGCTCAGCCCAGGACCTTTCCATCTCAGACTGGCTGGCCAGGGAACGCCTACGC 
ACCAGT AC CT CTG ACCTTAGCAGAGGAGAAAT CGGAGAT C CCCAG ACAGAGAAC C CAAGCACACGAGAAA 
TAGCCACGACCGACACACCTTTGTCTCTTCATTGCAACACAGGCAGT^ 
AAACAGGCCCCTTCTTTCCATACCACCACAGTCACCTGACa^TAAACGGAGAAAGCTT^ 
AGCAAAAATGCT AGTT CTX5CAGCG AATGCC CAAC CTCAT AAACTGTCTGAAAC C CC AGG CACTAAAGCAG 
AGTTTCAT C CCTGTCTTTAAACTGGGGGTAT 

A NOV85 polypeptide (SEQ ID NO:278) encoded by SEQ ID NO:277 is 1355 amino 
acids in length and is presented using the one-letter amino acid code in Table 85B. The Psort 
profile for NOV85 predicts that this sequence is likely to be localized to the nucleus with a 
5 certainty of 0.7000. In alternative embodiments, a NOV85 polypeptide is located to the 
mitochondrial matrix space with a certainty of 0.3600, or to microbodies with a certainty of 
0.3000. 



Table 85B. NOV85 Polypeptide Sequence (SEQ ID 
NO:278) 

MSLPRGISQDR^ 

TWVRTDSAPDQQVETGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRNQDLSLQEAETE 
QSDTIJ^NKEAVILREKPPSGRQTPQPIJraQSYILAV^ 

RMEERKASSTSPPGDSLASIPFIDEPTSPSIDHDIAHIPASAVISASTSQVPSIATVPPC 
LTTSAPLIRRQLSHDHESVGPPSLDAQPNSKTERSKSYDEGLDDYREDAKLSFKHVSSLK 
GIKIADSQKSSEDSGSRKDSSSEVFSDAAKSGWLHFRPLOTDKGKRVGGSIRPWKQMYVV 
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LRGHS LYL YKDKREQTTP SEE EQP I S VNACL IDI S YS ETKRKNVFRLTTS DCE CLFQAED 

RDDMIiAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNLMSKAEQLPKTPRQSLSIRQT 

LLGAKSEPKTQSPHSPKEESERKLLSKDDTSPPKDKGTWRKGIPSIMRKTFEKKPTATGT 

FGVRLDDCPPAHTNRYIPLI VDI CCKLVEERGLEYTGI YRVPGNNAAI SSMQEELNKGMA 

DIDIQDDKWRDLNVISSLLKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRLIH 

DLPEHHYETLKFLSAHLKTVAENSEKKNKNEPRKIAIVFGPTLVRTSEDNMTHM^ 

QYKIVETLIQHHDWFFTEEGAEEPLTTVQEESTVDSQPVPNIDHLLTNIGRTGVSPGDVS 

DSATSDSTKSKGSWGSGKDQYSRELLVSSIFAAASRKRKKPKEKAQPSSSEDELDNVFFK 

KENVEQCHNDTKEESKKESETLGRKQKIIIAKENSTRKDPSTTKDEKISLGKESTPSEEP 

SPPHNSKHNKSPTLSCRFAILKESPRSLLAQKSSHLEETGSDSGTIJ^STSSQASliARFSM 

KKSTSPETKHSEFLANVSTITSDYSTTSSATYLTSLDSSRLSPEVQSVAESKGDEADDER 

SELISEGRPVETDSESEFPVFPTALTSERLFRGKLQEVTKSSRRNSEGSELSCTEGSLTS 

SLDSRRQLFSSHKLIECDTLSRKKSARFKSDSGSLGDAKNEKEAPSLTKVFDVMKKGKST 

GSLLTPTRGESEKQEPTWKTKXADRLKLRPRAPADDMFGVGNHKVNAETAKRKSIRRRHT 

LGGHRDATEI SVLNFWKVHEQSGERESELSAVNRLKPKCSAQDLS I S DWLARERLRTS T S 

DLSRGEIGDPQTENPSTREIATTDTPLSLHCNTGSSSSTLASTNRPLLSIPPQSPDQING 

ESFQNVSKNASSAANAQPHKLSETPGTKAEFHPCL 



A BLAST analysis of NOV85 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV85 had high 
homology to other proteins as shown in Table 85C. 

5 



Table 85C. BLASTX results from PatP database for NOV85 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAB97911 Human G-protein activating protein 


6012 


0.0 


patp:AAB41660 Human OREX 0RF1424 polypeptide sequence 


4368 


0.0 


patp:AAM93705 Human polypeptide 


852 


3.9e-84 


patp:AAU17l01 Novel s'ignal transduction pathway protein 


618 


6.3e-59 


patp:AAB64387 Amino acid sequence of human intracellular 


426 


2.le-38 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 3871 of 3875 bases (99%) identical to a gbrGENBANK- 
ID:AB037845|acc:AB037845.1 mRNA from Homo sapiens (mRNA for KIAA1424 protein). 
10 The full amino acid sequence of the protein of the invention was found to have 1285 of 1287 
amino acid residues (99%) identical to, and 1286 of 1287 amino acid residues (99%) similar 
to, the 1286 amino acid residue ptnr:SPTREMBL-ACC:Q9P2C3 protein from Homo sapiens 
(Human) (KIAA1424 PROTEIN). NOV85 also has homology to the other proteins shown in 
the BLASTP data in Table 85D. 

15 



Table 85D. NOV85 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 
(%) 


Positive 
(%) 


Expect 


gi |7243229 |db 
j |BAA92662.X| 
(AB037845) 


KIAA1424 protein 
[Homo sapiens] 


1286 


1285/1287 
(99) 


1286/1287 
(99) 
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gi|l473638l|r 
ef |XP_038564. 
l| (XM 038564) 


hypothetical protein 
XP_038564 
[Homo saryiens'] 


1173 


1172/1174 
(99) 


1173/1174 
(99) 


0.0 


gi | 7959263 | db 
j (BAA96025.1) 
(AB040934) 


KIAA1501 protein 
[Homo sapiens] 


735 


331/685 
(48) 


433/685 
(62) 


e-148 


gi 1 14748447 |r 
ef |XP_041405. 
l| (XM 041405) 


hypothetical protein 
XP_041405 
[Homo sapiens] 


304 


277/284 
(97) 


279/284 
(97) 


e-135 


gi 12856667 |d 
bj BAB3 0742.1 
| (AK017433) 


KIAA1424 PROTEIN 
(FRAGMENT) -putative [Mus 
musculus] 


385 


301/384 
(78) 


328/384 
(83) 


e-127 



This BLASTP data is displayed graphically in the ClustalW in Table 85E. A multiple 
sequence alignment is given, with the NOV85 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 85D. 



Table 85E. ClustalW Alignment of NOV85 



NOV85 (SEQ ID N0:278) 

gi 7243229 | (SEQ ID NO: 724) 

gi 14736381 | (SEQ ID NO: 725) 

gi 7959263 | (SEQ ID N0:726) 

gi 14748447 | (SEQ ID NO: 727) 

gi 12856667 j (SEQ ID NO:728) 
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NOV85 


gi 


7243229J 


gi 


14736381 


gi 


7959263 | 


gi 


14748447 


gi 


12856667 



NOV85 

gi 1 7243229 | 
gi|l473638l| 
gi | 7959263 | 
gi 1 14748447 | 
gi | 12856667 j 



NOV85 


gi 


7243229) 


gi 


14736381 


gi 


7959263 | 


gi 


14748447 


gi 


12856667 



NOV85 

gi | 7243229 | 
gi j 14736381 1 
gi j 7959263 | 
gi 1 14748447 | 



30 

|....|....|. 



40 



50 



PDQQVETGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRSQDLSLQEAE 



6.0 70 80 

■ |....|..~|....|....|....|. 



90 



100 



TEQSDTLDNKEAVILREKPPSQRQTPQPLRHQSYILAVNDQETGSDTTCW 



110 120 130 140 150 

LPNDARREVHIKRMEERKASSTSPPGDSIASIPFIDEPTSPSIDHDIAHI 



160 170 180 190 

|....|....|....|....|....|....|....|.. 



200 



PASAVISASTSQVPSIATVPPCLTTSAPLIRRQLSHDHESVGPPSLDAQP 
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gi 


| 12856667) 


N0V85 


gi 


7243229 | 


gi 


14736381) 


gi 


7959263 | 


gi 


14748447) 


gi 


12856667 j 


NOV85 


gi 


7243229 | 


gi 


14736381 | 


gi 


7959263 | 


gi 


14748447 | 


gi 


12856667 j 


NOV85 


gi 


7243229] 


gi 


14736381 | 


gi 


7959263) 


gi 


14748447 | 


gi 


12856667 | 


NOV85 


gi 


7243229 | 


gi 


14736381) 


gi 


7959263| 


gi 


14748447 | 


gi 


12856667 j 


NOV85' 


gi 


7243229| 


gi 


14736381] 


gi 


7959263 | 


gi 


14748447] 


gi 


12856667 j 


NOV85 


gi 


7243229] 


gi 


14736381] 


gi 


7959263] 


gi 


14748447) 


gi 


12856667 j 


NOV85 


gi 


7243229] 


gi 


14736381] 


gi 


7959263 | 


gi 


14748447] 


gi 


12856667] 



NOV85 

gi | 7243229 | 



210 220 230 240 250 

| | ]....] ]....] | | | | 

- -MSLPRGISQDRSP 

NSKTERSKSYDEGIiDDyREDAKLSFKHVSSLKGIKIADSQKSSEDSGSRK 



260 



270 



(....). 



280 



290 



300 



LVKVRSNSLKAPSTHVTKPSFSQKSFVSMRDQRPVNHLHQNSLLNQQTWV 
DS S SE VFSDAAKEGWLHFRPLVTDKGKRVGGS I RPWKQMYWLRGHSL YL 



310 



320 



350 



330 340 

| | | | ]..,.] | | ]...,] 

RTDSAPDQQVETGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRNQDLS 
YKDKREQTTPSEEEQPI SVNACLIDI S YSETKRKNVFRLTTSDCECLFQA 



360 



370 



380 



390 400 

.... | .... | .... | .... | | .... | .... | .... | .... | .... | 

LQEAETEQSDTLDNKEAVILREKPPSGRQTPQPIiRHQSYILAVNDQETGS 
EDRDDMLAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNLMSKAEQLP 



410 420 430 440 450 

-...|....|....|....|....|....|....|....|....|....| 

DTTCWLPNDARREVHIKRMEERKASSTSPPGDSLASIPFIDEPTSPSIDH 
KTPRQSLS IRQTLLGAKSEPKTQSPHS PKEESERKLLSKDDTSPPKDKGT 
MEERKASSTSPPGDSIASIPFIDEPTSPSIDH 



470 



480 



490 



500 



460 

| | | | | | .... | .... | | | 

DIAHIPASAVISASTSQVPSIATVPPCLTTSAPIiIRRQLSHDHESVGPPS 
WRKGI PSIMRKTFEKKPTATGTFGVRLDDCPPAHTNRYIPLI YDI CCKLV 



510 



520 
|....| 



530 



540 



550 



LDAQPNSKTERSKSYDEGLDDYREDAKLSFKHVSSLKGIK1ADSQKSSED 
EERGLEYTGIYRVPCajNAAISSMQEELMKGMADIBIQDDKWRDIiNVISSL 



560 



580 



590 



600 



570 

| | | | | | | | | | 

SGSRKDSSSEVFSDAAKEGWLHFRPLVTDKGKRVGGSIRPWKQMYVVLRG 
LKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRIjIHDLPEHHYE 
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gi | 14736381 | 
gi (7959263 j 
gi j 14748447 | 
gi|l2856667| 



NOV85 


gi 


7243229 | 


gi 


14736381 | 


gi 


7959263 | 


gi 


14748447 | 


gi 


12856667 | 



NOV85 

gi j 7243229 | 
gi j 1473 6381 1 
gi j 7959263 | 
gi j 14748447 | 
gi j 12856667 j 



NOV85 


gi 


7243229 | 


gi 


14736381) 


gi 


7959263 | 


gi 


14748447 | 


gi 


12856667 j 



NOV85 


gi 


7243229 | 


gi 


14736381) 


gi 


7959263 | 


gi 


14748447 | 


gi 


12856667 j 



NOV85 


gi 


7243229 | 


gi 


14736381) 


gi 


7959263) 


gi 


14748447) 


gi 


12856667) 



NOV85 

gi | 7243229 | 
gi|l473638l| 
gi (7359263 | 
gi j 14748447 | 
gi|l2856667| 



SGSRKDSSSEVFSDAAKEGWLHFRPLVTDKGKRVGGSIRPWKQMYVVLRG 



610 620 630 640 650 

| |....|....| | | | | | | 

HSLYLYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCE 
TLKFLS AHLKTVAENS E KNKME PRNLAI VFGPTLVRTS EDNMTHMVTHMP 
HSLYIiYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCE 



670 



680 



690 



700. 



660 

I I — -I I |....|....| | | 

CLFQ AEDRDDMLAW IKTIQES SNLNEEDTGVTNRDL I S RRI KEYNNLMS K 
DQYKIVETLIQHHDWFTEEGAEEPLTTVQEESTVDSQPVPNIDHLIiTNI 
CLFQAEDRDDMLAW IKTIQES SNLNEEDTGVTNRDLISRRI KEYNNLMSK 



750 



710 720 730 740 

....|....|..-.|~..|....|....|....| 

AEQLPKTPRQSLSIRQTLLGAKSEPKTQSPHSPKEESERKLLSKDDTSPP 
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Table 85F lists the domain description from DOMAIN analysis results against NOV85. 
This indicates that the NOV85 sequence has properties similar to those of other proteins 
known to contain this domain. 



688 



WO 02/068649 



PCT/US02/02785 



Table 85F. Domain Analysis of NOV85 


enllSmartlsmart00324. RhoGAP. GTPase-activatnr protein for Rho-like GTPasm; GTPase flrtivntnr prntujns 


towards Rho/Rac/Cdc42-like small GTPases. SEQ ID NO: 87 8 








CD-Length = 175 residues, 100.0% aligned 






Score =* 


169 bits (428), Expect - 9e-43 




NOV85: 


555 


RY I PL I VDI CCKLVEERGLEYTGI YRVPGNNAAI S SMQEELNKGMAD I D I QDD KWRDLNV 


614 






+ IP+IV+ C + +E+RGL+ GIYR G+ + + ++E + G D D++ 




Sbjct: 


1 


KP I PI IVEKCIEYLEKRGLDTEGIYRKSGSASRVKELREAFDSGPDPD- - LDLSE YDVHD 


58 


NOV85: 


615 


ISSLLKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRLIHDLPEHHYETLKFLS 


674 






++ LLK F R+LPEPL T + Y +FIEA + ED +RL+ L+ L+ LP + TL++L 
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VAGLLKL FLRELPEPL ITFEL YEEFI EAAKLEDEEERLRALRELLSLLPPANRATLRYLL 
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NOV85: 


675 


AHLKTVAENSEKKNKMEPRNIAIVFGPTLVRTSEDNMTHMVTHMPDQYKIVETLIQHHD 


733 






AHL VAE+SE +NKM RNLAIVFGPTL+R + + + Q K+VE LI++ D 




Sbj Ct: 


119 


AHLNRVAEHSE - ENKMTARNLAIVFGPTLLRPPDGESA- SLKDIRHQNKWEFLIENAD 


175 



Rho GTPases control a variety of cellular processes. There are three subtypes of Rho 
GTPases in the Ras superfamily of small G proteins: RHO, RAC, and CDC42. GTPase- 
activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP 
5 hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated 

signals. GAPs may also serve as effector molecules and play a role in signaling downstream of 
Rho and other Ras-like GTPases. 

By screening a Jurkat cDNA library using a yeast 2-hybrid system with an activated 
form of RAC as bait, followed by screening a placenta cDNA library, Toure et al. (1998) 

10 isolated a cDNA encoding RACGAP1, which they called MGCRACGAP, The predicted 527- 
amino acid RACGAP1 protein has a large N-terminal region containing a protein kinase C- 
like cysteine-rich motif. RACGAP1 shares highest homology with the Drosophila RnRacGAP 
and the chimerins of rat and human. Functional analysis showed that the GAP domain of 
RACGAP1 exhibits strong GAP activity towards CDC42, RAC1, and RAC2. Northern blot 

15 analysis detected an approximately 3.2-kb RACGAP1 transcript that was most abundantly 

expressed in testis, with low expression in most other tissues. Western blot analysis detected a 
RACGAP1 protein of 58 kD in testis extracts. In situ hybridization showed that RACGAP1 
expression is restricted to germ cells in mature testis. Human breakpoint cluster region (bcr) 
gene product is a member of a group of GTPase-activating proteins that act exclusively on 

20 members of the Ras-related Rho subfamily. A complementary DNA was isolated from 
Caenorhabditis elegans that encoded a polypeptide of 1438 amino acid residues, CeGAP, 
which contains a domain with sequence similarity to the COOH-terminal segment (GTPase- 
activating protein region) of Bcr and other known GTPase-activating proteins of the Rho 
subfamily. It also contains a "pleckstrin homology" motif, present in many signaling proteins 
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including GTPase-activating proteins and nucleotide exchange factors. The Bcr-like domain of 
CeGAP exhibited activity not only on members of the C. elegans and human Rho subfamily 
but surprisingly also on C. elegans Ras protein (let-60), human Ras, and Rab3A. 

CeGAP is therefore the first GTPase-activating protein acting on Ras-related proteins 
5 across different subfamilies. Together with the presence of the pleckstrin homology motif, this 
suggests a central and integrative role for CeGAP in a signaling pathway common to Ras and 
related proteins. 

NOV85 is predicted to be expressed in at least the following tissues: pancreas, 
stomach, brain, bone. This information was derived by determining the tissue sources of the 
10 sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV85 is provided in Example 2. 

The NOV85 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Alzheimer's 
15 disease, stroke, tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neuroprotection, 
hypercalcemia, ulcers, diabetes, Von Hippel-Lindau (VHL) syndrome, pancreatitis, obesity as 
well as other diseases, disorders and conditions. NOV85 nucleic acids encoding the CeGAP- " 
20 like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
assessed. 

The novel nucleic acid of the invention encoding a GTPase activating protein-like 
protein includes the nucleic acid whose sequence is provided in Table 85A, or a fragment 

25 thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 85A while still encoding a protein 
that maintains its GTPase activating protein-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 85A, including nucleic acid fragments that are 

30 complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
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10 



15 



in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the GTPase activating protein-like protein 
whose sequence is provided in Table 85B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
85B while still encoding a protein that maintains its GTPase activating protein-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 1% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



20 



NOV86 

The disclosed NOV86 (alternatively referred to herein as CG56957-01) includes the 
3451 nucleotide sequence (SEQ ID NO:279) shown in Table 86A. A NOV86 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 35-37 and ends with a stop codon 
at nucleotides 3442-3445. 



Table 86A. NOV86 Nucleotide Sequence (SEQ ID NO:279) 



CCCGCCCCCTCGGGCTCCCGGCCGGGGCCCC^TCATGTTCTCCAGGAAGAAACGAGAGCTCATGAAAACC 
CCTTCCATCTCGAAAAAGAACCGCGCGGGAAGCCCCAGCCCGCAGCCCTCGGGGGAGCTGCCCAGGAAGG 
ATGGGGCTGACGCGGTGTTCCCCGGACCAAGCCTGGAGCCGCCCGCTGGGTCCTCCGGCGTCAAGGCCAC 
AGGGACCCTCAAGCGGCCCACCAGCCTGAGCCGCCACGCCAGCGCGGCTGGCTTCCCCCTGTCGGGTGCT 
GCCTCCTGGACACTGGGCCGGAGCCACCGGAGCCCACTGACAGCCGCCAGCCCGGGCGAGCTGCCCACCG 
AGGGTGCCGGCCCGGACGTCGTCGAGGACATCTCCCATCTGCTGGCGGACGTGGCCCGCTTCGCTGAGGG 
CCTTGAGAAACTTAAGGAGTGTGTGTTGCGTGACGACCTCCTTGAGGCCCGCCGCCCGCGGGCCCACGAG 
TGCCTGGGTGAGGCTCTGCCTGTCATGCATGAGATCM 

CGCTCACCGCAGCCGGCACCCTCATTGCCAAGGTCAAAGCCTTCCATTATGAGAGCAACAATGATCTGGA 
GAAACAGGAGTTCGAGAAGGCCCTGGAGACGATTGCTGTGGCCTTCAGTAGCACAGTGTCCGAGTTCCTC 
ATGGGTGAAGTGGACAGCAGC^CCCTCCTAGCAGTGCCTCCTGGGGACTCGAGCCAGTCCATGGAAAGCC 
TGTATGGACCGGGCAGTGAGGGCACGCCTCCCAGCCTGGATGACTGTGACGCCGGCTGCCTGCCCGCCGA 
GGAGGTGGACGTGCTGCTACAGCGCTGTGAGGGGGGCGTGGATGCCGCACTGCTGTATGCCAAGAACATG 
GCCAAGTAC^TGAAGGACCTCATCAGCTACCTGGAGAAGCGGACGACGCTGGAGATGGAGTTTGCCAAGG 
GCCTGC^GAAGATCGCT(^CAACTGC^GACAGAGCGTCATGCAGGAGCCCCACATGCCGCTCCTGTCCAT 
CTACTCGCTGGCCCTGGAGCAGGACCTGGAGTTCGGCC^CAGCM 

ACCCAGACCTTCATGCAGCCCCTGACCCTGCGGCGGCTTGAACACGAGAAGCGCAGGAAGGAGATCAAGG 
AGGCCTGGCACCGTGCCCAGAGGAAGCTGCAAGAGGCGGAGTCCAACCTGCGCAAGGCCAAGCAGGGTTA 
CGTGCAGCGCTGCGAGGACCACGACAAGGCTCGCTTCCTCGTGG 

AGCGCGCCGGGAGCAGGCGGCACGGCCACCAAGACCCTGGACAAGCGGCGGCGGCTGGAGGAGGAGGCCA 
AGAACAAGGCGGAGGAAG CTATGGCCACCTAC CG CACCTG CGTGGC CGACG CG AAGACGCAGAAG CAGGA 
GCTGGAGGATACCAAGGTGACGGCG CTGCGG CAGATC CAGGAGGTCATCCGGCAGAG CGAC CAAACCATC 
AAGTCGGCCACGATCTCCTACTACCAGATGATGCATATGCAGACGGCGCCGCTGCCCGTGCACTTCCAGA 
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TGCTGTGTGAGAGCAGCAAGCTGTATGACCCAGGCCAGCAGTACGCCTCCCACGTGCGCCAGCTGCAGCG 
GGACCAGGAGCCCGATGTGCACTACGACTTTGAGCCCGACG^ 

CGTGCCCGGAAGAGCAGCTTCAACGTGAGTGATGTGGCGCGGCCCGAGGCTGCCGGGAGCCCCCCAGAAG 
AAGGCGGGTGCACTGAGGGCACACCTGCCAAGGACCACAGGGCCGGGCGAGGACACCAGGTTCACAAGTC 
ATGGCCGCTCTCGATCTCAGACTCGGACAGTGGGCTGGACCCCGGCCCTGGCGCAGGGGACTTTAAGAAG 
TTCGAGCGGACGTCATCCAGTGGTACCATGTCGTCCACGGAGGAGCTGGTGGACCCAGACGGTGGAGCCG 
GGGCTTCAGCCTTTGAGCAGGCTGACCTCAACGGC^ 

ACCGTTCCGCCACGAGGGGCTGTCCAAGGCGGCCCGTACTCACCGGCTCCGGAAGCTCCGCACGCCCGCC 
AAGTGCCGCGAGTGCAACAGCTACGTCTACTTCCAGGGTGCTGAGTGTGAAGAGTGCTGCCTGGCCTGCC 
ACAAGAAATGTCTGGAGACGCTGG CCATACAGTGCGGG CACAAGAAGCTGCAGGGCCGCCTGCAGCTGTT 
CGGCCAGGACTTCAGCCACGCGGCCCGCAGCGCCCCCGACGGCGTGCCCTTCATCGTCAAGAAGTGCGTC 
TGCGAGATCGAGCGGCGGGCGCTGCX3CACCAAGGGCATCTACCGGGTCAATGGGGTAAAGACACGCGTGG 
AGAAGCTGTGCCAGGCCTTCGAGAACGGCAAGGAGCTGGTCGAGCTGTCGCAGGCCTCGCCCCACGACAT 
CAGCAACGTCCTCAAGCT CTACCTGCGTCAGCTTCCCGAGCCG CTCAT CTC CTTCCGCCTCTACCACGAG 
CTCGTAGGGCTGGCCAAGGACAGCCTGAAGGCAGAGGCCGAGGCCAAGGCGGCGTCCCGGGGCCGGCAGG 
ACGGCTCGGAGAGCGAGGCAGTGGCGGTGGCCCTGGCAGGTCGGCTGCGGGAGCTCCTGCGGGACCTGCC 
GCCTGAGAACCGGGCCTCGCTGCAGTACCTGCTGCGTCACCTACGCAGGATCGTGGAGGTGGAGCAGGAC 
AACAAGATGACCCCCGGGAACCTGGGCATCGTGTTCGGGCCCAOT 

CCACCGTGTCCCTCTCCTCCCTGGTGGATTATCCCCATCAGGCCCGCGTCATCGAGACTCTCATCGTCCA 
CTACGGCCTGGTCTTCGAGGAGGAGCCGGAGGAGACCCCCGG^ 

GCTGAGGTAGTCGTCCAGGTGCCGTACCTGGAGGCGGGCGAGGCGGTGGTCTACCCGCTGCAGGAGGCGG 
CGGCGGACGGGTGCAGAGAATCCCGAGTTGTGTCCAACGATTCGGACTCGGACCTAGAGGAGGCCTCCGA 
GCTGCTGTCCTCATCGGAGGCCAGTGCCCTGGGCCACCTCAGCTTCCTGGAGCAGCAGCAGAGCGAGGCC 
AGCCTAGAGGTGGCTTCTGGCAGCCACAGCGGCAGTGAGGAGCAGCTGGAGGCCACAGCCCGGGAGGACG 
GGGACGGGGACGAGGACGGCCCGGCCCAGCAGCTCTCAGGATTCAAC^^ 

GCAGGCCCCACTGCCCCCCATGAGGCTCCGTGGCGGGCGGATGACACTGGGCTCCTGCAGGGAAAGGCAG 
CCGGAATTCGTGTGAGCTGGG 



A NOV86 polypeptide (SEQ ID NO:280) encoded by SEQ ID NO:279 is 1 136 amino 
acids in length and is presented using the one-letter amino acid code in Table 86B. The Psort 
profile for NOV86 predicts that this sequence is likely to be localized to the nucelus with a 
5 certainty of 0.9800. 



Table 86B. NOV86 Polypeptide Sequence (SEQ ID 
NO:280) 

MF SRKKRELMKTPS I SKKNRAGSPS PQP SGELPRKDGADAVFPGPSLE PPAGS SGVKATG 
TLKRPTSIiSRHASAAGFPLSGAASOTLGRSHRSPLTAASPGELPTEGAGPDWEDISHLL 
ADVARFAEGLEKLKECVLRDDLLEARRPRAHECLGEALRV^ 
GTLIAKVKAFHYESNNDIJSKQEFEKALETIAVAF^ 

QSMESLYGPGSEGTPPSLDIXn^AGajPAEEVDVLLQRCEGGVDAALLYAKMMA 

SYLEKRTTIiEMEFAKGLQKIAHNCRQSVMQEPHMPLLSIYSI^EQDLEPGHSMVQAVGT 

LQTQTFMQPLTLRMiEHEKRRKEIKEAWHRAQRKLQEAESmiRKAKQGYVQRCEDHDKM 

FLVAKAEEEQAGSAPGAGGTATKTLDKRRRLEEEAKNKAEEAMATYRTCVADAKTQKQEL 

EDTKVTALRQIQEVIRQSDQTIKSATISYYQMMHMQTAPLPVHFQMLCESSKLYDPGQQY 

ASHVRQLQRDQEPDVHYDFEPHVS ANAWSPVMRARKS SFNVS DVARPEAAGS P PEEGGCT 

EGTPAKDHRAGRGHQVHKSWPLSISDSDSGLDPGPGAGDFKKFERTSSSGTMSSTEELVD 

PDGGAGASAFEQADLNGMTPELPVAVPSGP FRHEGLS KAARTHRLRKLRTPAKCRECNS Y 

VYFQGAE CEE CCLACHKKCLETLAI QCGHKKLQGRLQLFGQD FSHAARSAPDGVPFI VKK 

CVCE I ERRALRTKGI YRVNGVKTRVEKLCQAFENGKEI.VELS QAS PHDI SNVLKLYLRQL 

PEPLISFRLYHELVGLAKDSLKAEAEAKAASRGRQDGSESEAVAVALAGRI^LLRr>r»PP 

ENRASLQYLLRHLRRIVEVEQDNKMTPGm^IVFGPTIiLRPRPTEATVSLSSLVDYPHQA 

RVIETLIVHYGLVFEEEPEETPGGQDESSNQRAEVWQVPYLEAGEAWYPLQEAAADGC 

RESRWSNDSDSDLEEASELLSSSEASALGHLSFLEQQQSEASLEYASGSHSGSEEQLEA 

TAREDGDGDEDGPAQQLSGFNTNQSNNVLQAPLPPMRLRGGRMTLGSCRERQPEFV 



A BLAST analysis of NOV86 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV86 had high 
10 homology to other proteins as shown in Table 86C. 
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Table 86C BLASTX results from PatP database for NOV86 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAU173l3 Novel signal transduction pathway protein 


2855 


5.7e-299 


patp:AAW75995 GTPase activating protein (GAP) , PARG 


1497 


2.9e-153 


patp:AAY90268 Human GTP-ase activating polypeptide PARG 


1497 


2.9e-153 


patp:AAUl7459 Novel signal transduction pathway protein 


1265 


l.le-128 


patp:AAU17460 Novel signal transduction pathway protein 


1127 


4.7e-114 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 3447 of 3451 bases (99%) identical to a gb:GENBANK- 
5 ID:D86976|acc:D86976. 1 mKNA from Homo, sapiens (Human mRNA for KIAA0223 gene). 
The full amino acid sequence of the protein of the invention was found to have 1 134 of 1 136 
amino acid residues (99%) identical to, and 1 135 of 1 136 amino acid residues (99%) similar 
to, the 1 165 amino acid residue ptnr:SPTREMBL-ACC:Q92619 protein from Homo sapiens 
(Human) (MYELOBLAST KIAA0223). NOV86 also has homology to the other proteins 
1 0 shown in the BLASTP data in Table 86D. 



Table 86D, NOV86 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

<%> 


Positive 

(%> 


Expect 


gi 

ef 
1| 


14765644 |r 
XP_037574. 
!XM 037574) 


minor histocompatibility 
antigen HA-1 
[Homo sapiens] 


1136 


1134/1136 
(99) 


1135/1136 
(99) 


0.0 


gi (1504026 |db 
j |BAA13212.l| 
(D86976) 


similar to C.elegans 
protein (Z37093) [Homo 
sapiens] 


1165 


1134/1136 
(99) 


1135/1136 
(99) 


0.0 


gi| 2896796 |gb 

|AAC03237.l| 

(AC004151) 


D1013901 [Homo sapiens] 


996 


994/996 
(99) 


995/996 
(99) 


0.0 


gi|l2857707|d 
bj |BAB31085.1 
| (AK018130) 


MYELOBLAST KIAA0223 
( FRAGMENT ) -put at ive [Mus 
musculus] 


523 


423/542 
(78) 


460/542 
(84) 


0.0 


gi| 13635768 |r 
ef |XP 017232. 
l| {XM 017232) 


similar to PTPL1- 
associated RhoGAP .1 (H. 
sapiens) [Homo sapiens] 


1261 | 


345/881 
(39) 


509/881 
(57) 


e-160 



This BLASTP data is displayed graphically in the Clustal W in Table 86E. A multiple 
sequence alignment is given, with the NOV86 protein being shown on line 1 in a ClustalW 
1 5 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 86D. 
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Table 86E. ClustalW Alignment of NOV86 



gi 


147G5644 


gi 


1504026 | 


gi 


2896796) 


gi 


12857707 


gi 


13635768 


NOV86 


gi 


14765644 


gi 


1504026) 


gi 


2896796) 


gi 


12857707 


gi 


13635768 



NOV86 



NOV86 



14765644 | 

1504026) 

2896796) 

12857707) 

13635768) 



NOV86 



14765644 | 

1504026) 

2896796) 

12857707| 

13635768 



NOV86 



14765644 | 
1504026 | 
2896796) 
12857707| 
13635768 



NOV86 



14765644) 
1504026 | 
2896796) 
12857707 | 
13635768) 



NOV86 


gi 


14765644 


gi 


1504026| 


gi 


2896796) 


gi 


12857707 


gi 


13635768 



NOV86 

gi | 14765644 | 



(SEQ ID NO;280) 
(SEQ ID NO:729) 
(SEQ ID NO: 730) 
(SEQ ID NO.-731) 
(SEQ ID NO:732) 
(SEQ ID NO: 733) 

10 



20 30 40 

— | — | — | — | — | — | — — ]_ 

. . . mfsrkkreEmktpsis 

mfsrkkreemktpsis 

qcgpeaasagscsaetpsppprapgrgpimfsrkkrejsmktpsis 




- m£ahkqkkS£JEk£S 



60 



gspspqp§gelp: 

G§PSPQP§iGELP; 
GSPS PQPgGELPRS 




70 

ScfrPGPSl 
>A\jjFPGPSI 
&VFPGPSI 



80 



90 



100 



PAGSSGVKATGTLKRPTSLSRH 
PAGSSGVKATGTLKRPTSLSRH 
PAGSSGVKATGTLKRPTSLSRH 



W^SGOLSgDITTSgMgLKSSSSNSIB^ 

110 120 130 



)....). 



140 

|....|..-.|....|....| 
ASAAGFPLSGAASWTLGRSHRSPLTAASPGELPTEGAGPDWSDbSE 
AS AAGFPLSGAASWTLGRSHRS PLTAAS PGELPTEGAGPDW&DHSB 
ASAAGFPLSGAASWTLGRSHRSPLTAASPGELPTEGAGPDVV^DbS£ 



150 



-iYgKEpN 



160 



170 




I. 



180 
. . I . . 



I. 



190 



200 
• • I 



jDLLEARRPRAHECLGEALRVMHQI I S KYPLL 
DLLEARRPRAHECLGEALRVMHQI I SKYPLL 
DLLEARRPRAHECLGEALRVMHQI I SKYPLL 1 
DLLEARRPRAHECLGEALRVMHQI I SKYPLLl 



210 



•s ; 



220 



230 
J" 



240 



250 



JT VE TL T AAGTL I AKVKAFH YE S NNDL E K 
NTVETLTAAGTLIAKVKAFHYESNNDLEK 
NTVETLTAAGTLIAKVKAFHYESNNDLEK 
NTVETLTAAGTLIAKVKAFHYESNNDLEK 



QEFEKALETIAVAFSST 
QEFEKALETIAVAFSST 
QEFEKALETIAVAFSST 
QEFEKALETIAVAFSST 



260 
I.... I.. 



J_ 



270 



280 



290 
..I.. 



300 



I 



vseflmgevdsstllavppgdssqsmeslygpgsegtppslgdcdagclp 
vseflmgevdsstllavppgdssqsmeslygpgsegtppslgdcdagclp 
vseflmgevdsstllavppgdssqsmeslygpgsegtppslSdcdagclp, 
vseflmgevdsstllavppgdssqsmsslygpgsegtppslspcdagclpi 



310 



SRE:' 



320 



ISSS- 



-|kSnfs 



330 



340 

..I.. 



350 
..I 



AEEVDVLLQRCEGGVDAALLYAKNMAKYMKDLISYtEKfcTTLEMEFAKGL 
AEEVDVLLQRCEGGVDAALLYAKNMAKYMKDLI S YLEKRTTLEMEFAKGL 
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CEVDVLLQRCEGGVDAALLYAKtmAKYMKJDLISYLEKRTTLEMEFAKGLj 
SEVDVLLQRCEGGVDAALLYAiaSIMAIOrMKDLISYLEKRTTLEMEFAKGLl 



12857707 
13635768 



NOV86 



gi 
gi 
gi 
gi 



14765644 

1504026 

2896796 

12857707 

13635768 



NOV86 


gi 


14765644) 


gi 


1504026) 


gi 


2896796| 


gi 


12857707-j 


gi 


13635768) 


NOV86 


gi 


14765644) 


gi 


1504026) 


gi 


2896796) 


gi 


12857707) 


gi 


13635768 | 


NOV86 


gi 


14765644) 


gi 


1504026) 


gi 


2896796) 


gi 


12857707) 


gi 


13635768) 



NOV86 

gi 1 14765644 
gi 1 1504026 | 
gi (2896796 | 
gi 1 12857707 
gi 1 13635768 



NOV86 

gi 1 14765644 
gi 1 1504026 | 
gi (2896796 j 
gi 1 12857707 
gi j 13635768 



NOV86 


gi 


14765644) 


gi 


1504026 | 


gi 


2896796| 


gi 


12857707 | 


gi 


13635768) 



360 370 380 390 400 

| | | | | | | j | ) 



QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 
QKI AENCRQSVMQEPHMPLLS I YSLALEQDLEFGHS M VQAVGTLQTQTFM 
QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 
QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 



vJJi^eatQtotg i QEF^^jQ^itraN^LiN^i^s s^l&q^Scaa^ankQv 



410 420 
I. ...).. ..|....|.. 



430 440 



450 



QPLTLRRLEHEKRRKEIKEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD 
QPLTLRRLEHEKRRKEIKEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD 
3PLTLRRLEHEKRRKE I KEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD, 
QPLTLRRLEHEKRRKEIREAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD 



EAEf|SLK<KAK» 



460 



470 
. . I . . ■ 

K 

I 



480 



1 



490 



i. 



500 
••I 



karflvakaeeeqagsapgaggtatktldkrrrleeeaknkaeeamatyr 
karflvakaeeeqagsapgagHtatktldkrrrleeeaknkaeeamatyr 
karflvakaeeeqagsapgagHtatktldkrrrleeeaknkaeeamatyr 
karflvakaeeeqagsapgag@tatktldkrrrleeeaknkaeeamatyr 



510 



520 



530 

..I. ...I, 



540 



550 

I.... I 



tcvadaktqkqeledtkvtalrqiqevirqsdqtiksatisyyqmmhmqt 
tcvadaktqkqeledtkvtalrqiqevirqsdqtiksatisyyqmmhmqt 
tcvadaktqkqeledtkvtalrqiqevirqsdqtiksatisyyqmmhimqt 
tcvadaktqkqeledtkvtalrq i qevi rqs dqtiks at i s yyqmmhmqt 



560 



570 



JL 



580 



590 



I. 



600 
. . | 



PLPVHFQMLCESSKLYDPGQQYASHVRQL.QRDQEPDVHYDFEPHVSANA 
APLPVHFQMLCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPKVSANA 
APLPVHFQMLCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPHVSANA 
APLPVKFQMLCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPHVSANA 



SsJJadslSs 



LCgSSKLYDPGQlY 



610 



620 



63 0 



640 



Hbs 

650 
••I 



WS PVMRARKS S FNVSDVARPE AAGS PPEEGG CTEGT PAKDHRAGRGHQVH 
WS PVMRARKS S FNVSDVARPE AAGS PPEEGGCTEGT PAKDHRAGRGHQVH 

ws pvmrarks sfnvsdvarpe aags ppeegg ctegtpakdhragrghqvh 
ws pvmrarks s fnvs dvarpe aags ppeeggctegtpakdhragrghqvh 

eeggJHeSSpiJJkdhrHgrghqvh. 
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NOV86 


gi 


14765644 | 


gi 


1504026] 


gi 


2896796 j 


gi 


12857707 | 


gi 


13635768) 



NOV 8 6 

gi 
gi 
gi 
gi 
gi 



14765644] 
1504026) 
2896796 | 
12857707 | 
13635768] 



NOV86 

gi 1 14765644 | 
gi 1 1504026 | 
gi|2896796| 
gi (12857707 | 
gi j 13635768 j 



NOV86 



14765644 1 
1504026 | 
2896796 | 
12857707 | 
13635768 j 



NOV86 



14765644 | 
1504026) 
2896796| 
12857707) 
13635768 j 



NOV86 



14765644] 

1504026] 

2896796] 

12857707] 

13635768] 



NOV86 



14765644] 
1504026 | 
2896796 j 
12857707) 



gi 13635768) 



NOV86 



14765644] 
1504026] 
2896796] 
12857707| 



800 



850 




900 



950 



KAASRGRQDG 
KAASRGRQDG 
KAASRGRQDG 
KAASRGRQDG 
KAASRGRQgd, 



gTKKNS LjggKKWPNMCI E INR^CLLl 
960 970 




1000 



1010 



1020 



1030 




).... )....), 



1040 
..)... 



1050 



ISLQPQDVMCSIGWDQGCFPKPLLSPEERDIERSMKSLF 
1060 



1070 
..|. 



1080 
..]... 



I. 



1090 

.-I... 



I 



1100 

. I 



EE PEETPGGQDES S1TQRAE WV<3 VPYLEAG 

eepeetpggqdessnqraevwqvpyleag 
eepeetpggqdessnqraevwqvpyleag 
eepeetpg'gqdess nqraev^ qvpyleagI 

EEPEE5EG^Q^^sHHHSE0SSS LE r 
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gi [13635768 | 



NOV86 



14765644) 
1504026) 
2896796 j 
12857707) 
13635768) 



NOV86 



gi 
gi 
gi 
gi 
gi 



14765644) 
1504026 | 
2896796 | 
12857707) 
13635768) 



NOV86 

gi | 14765644 | 
gi j 1504026 | 
gi|2896796| 
gi | 12857707 | 
gi | 13635768) 



NOV86 

gi 
gi 
gi 
gi 
gi 



14765644 I 
1504026 | 
2896796) 
12857707) 
13635768) 



NOV86 



14765644) 

1504026) 

2896796) 

12857707) 

13635768 



FSSKEDIHTSESESKIFERATSF^§SERK^P^GKCD2C3DSDKAQIi3IiDQ 

1110 1120 1130 1140 1150 




1160 



1170 



iPRTKIRPVSLPVDRLLL 
1180 1190 1200 



|. ...)....)... .|. 



ASPPNEMTGRNMGNVNLDKFCKNPAFEGVNRKDAATTVCSKFNGFDQQTL 

1210 1220 1230 1240 1250 




1300 



1310 



1320 



1330 



1340 




1350 
••I 



TDHDPHGLWKSMPDPDKASACPGQATGQPKEDS 



1360 



1370 



1380 



NOV86 

gi 
gi 
gi 
gi 
gi 



14765644) - - 

1504026) - 

2896796) - 

12857707) - 

13635768 j EELGLPDVNPMCQRPRLKRMQQFEDLEGEIPQFV 



Table 86F lists the domain description from DOMAIN analysis results against NOV86. 
This indicates that the NOV86 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 86F. Domain Analysis of NOV86 


enIISmart|smart00324. RhoGAP. GTPase-activatnr protein for Rho-like GTPases; fiTPaw artivafnr prntoint 


towards Rho/Rac/Cdc42-Hke small GTPases SBQ ID NO: 879 








CD-Length « 175 residues, 98.3% aligned 






Score = 


157 bits (396), Expect = 4e-39 




NOV86: 


774 


VPFIVKKCVCEIERRALRTKGIYRWGVKTRVEKLCQAFENGKELV- ELSQASPHDISNV 


832 






+P IV+KC+ +E+R L T+GIYR +G +RV++L +AF++G + +LS+ HD++ + 




Sbjct: 


3 


IPIIVEKCIEYLEKRGLDTEGIYRKSGSASRVKELREAFDSGPDPDLDLSEyDVHDVAGL 


62 


NOV86: 


833 


LKLYLRQLPEPLISFRLYHELVGXum>SLiCAJSAE^^ 


892 






LKL+LR+LPEPLI+F LYE+A +EE + 




Sbjct: 


63 




101 


NOV86: 


893 


i?LIJU3LPP£2VKASLQYLLRHLRRIVEVEQD 


952 






LL LPP NRA+L+YLL HL R+ E ++NKMT NL IVFGPTLLRP E +S 




Sbjct: 


102 


- LLSLLPPANRATLRYLLAHLNRVAEHSEENI^TARNLAI VFG^ SAS 


156 


N0V86: 


953 


LVDYPHQARVIETLIVHY 970 








L D HQ +V+E LI + 




Sbjct: 


157 


LKDIRHQNKWEFLIENA 174 





Rho GTPases control a variety of cellular processes. There are 3 subtypes of Rho 
GTPases in the Ras superfamily of small G proteins: RHO, RAC, and CDC42. GTPase- 
activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP 
5 hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated 

signals. GAPs may also serve as effector molecules and play a role in signaling downstream of 
Rho and other Ras-like GTPases. 

By screening a Jurkat cDNA library using a yeast 2-hybrid system with an activated 
form of RAC as bait, followed by screening a placenta cDNA library, Toure et al. (1998) 

10 isolated a cDNA encoding RACGAPI , which they called MGCRACGAP. The predicted 527- 
amino acid RACGAPI protein has a large N-terminal region containing a protein kinase C- 
like cysteine-rich motif. RACGAPI shares highest homology with the Drosophila RnRacGAP 
and the chimerins of rat and human. Functional analysis showed that the GAP domain of 
RACGAPI exhibits strong GAP activity towards CDC42, RAC1, and RAC2. Northern blot 

15 analysis detected an approximately 3.2-kb RACGAPI transcript that was most abundantly 

expressed in testis, with low expression in most other tissues. Western blot analysis detected a 
RACGAPI protein of 58 kD in testis extracts. In situ hybridi2ation showed that RACGAPI 
expression is restricted to germ cells in mature testis Human breakpoint cluster region (bcr) 
gene product is a member of a group of GTPase-activating proteins that act exclusively on 

20 members of the Ras-related Rho subfamily. 

A complementary DNA was isolated from Caenorhabditis elegans that encoded a 
polypeptide of 1438 amino acid residues, CeGAP, which contains a domain with sequence 
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similarity to the COOH-terminal segment (GTPase-activating protein region) of Bcr and other 
known GTPase-activating proteins of the Rho subfamily. It also contains a "pleckstrin 
homology" motif, present in many signaling proteins including GTPase-activating proteins 
and nucleotide exchange factors. The Bcr-like domain of CeGAP exhibited activity not only 
5 on members of the C elegans and human Rho subfamily but surprisingly also on C. elegans 
Ras protein (let-60), human Ras, and Rab3A. 

CeGAP is therefore the first GTPase-activating protein acting on Ras-related proteins 
across different subfamilies, studies suggest a central and integrative role for CeGAP in a 
signaling pathway common to Ras and related proteins. 

10 NOV86 is predicted to be expressed in at least the following tissues: pancreas, 

stomach, brain, bone. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV86 is provided in Example 2. 

15 The NOV86 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, Alzheimer's 
disease, stroke, tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neuroprotection, 

20 hypercalcemia, ulcers, diabetes,Von Hippel-Lindau (VHL) syndrome, pancreatitis, obesity as 
well as other diseases, disorders and conditions. NOV86 nucleic acids encoding the CeGAP- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
assessed. 

25 The novel nucleic acid of the invention encoding a GTPase activating protein-like 

protein includes the nucleic acid whose sequence is provided in Table 86A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 86A while still encoding a protein 
that maintains its GTPase activating protein-like activities and physiological functions, or a 

30 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 86A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
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include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
5 In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the GTPase activating protein-like protein 
whose sequence is provided in Table 86B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
1 0 86B while still encoding a protein that maintains its GTPase activating protein-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 1% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
15 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV87 

NOV87 includes two GTPase activating-1 ike proteins, designated herein as NOV87a 
20 and NOV87b. 

NOV87a 

The disclosed NOV87a (alternatively referred to herein as CG56886-01) includes the 
994 nucleotide sequence (SEQ ID NO:281) shown in Table 87A. A NOV87a ORF begins 
25 with a Kozak consensus ATG initiation codon at nucleotides 1 6-1 8 and ends with a TGA 
codon at nucleotides 982-984. The disclosed NOV87a maps to human chromosome 17. 

Table 87A. NOV87a Nucleotide Sequence (SEQ ID NO:281) 

GGGGGCCACTCTCTCATGGCCCCO^VAGACAAATCCAGTAGGAAGAATGTGCTGGA 
TTGGGGAGAAGGAGGGAAGGGGCTTAGTGATGCCTGCCAGCTCT 
CTGGGGGTTTTTGACCCTTAATTCC^ 
TCGGAGGCCATCATCAGCACCTGGCATAAGGCCATTGCT 

AGGGCCTCAGCGACTTGAGCAAGGTCCGGCACAAGCTCCGCAAGTTCCTCCAGAGGCGGCCCACACTGCA 
GTCGCTGCGGGAGAAGGGCTAC&TCAAAGACCAGGTGTTOGGCT 

GAGAGGAGCCGGGTGCCACGCITCGTGCAGCAGTGCATCCGCGCCGTCGAGGCCCGCGGTCTGGACATCG 
ACGGGCTGTACCGCATCAGTGGAAACCTGGCCACCATCC^GAAGCTACGCTATAAGGTGGACCACGGTGA 
GGATGAGCGCCTTGACCTGGATGACGGGCGCTGGGAGGACGTCCACGTTATCACCGGAGCCCTGAAGCTC 1 
TTCTTTCGGGAGCTGCCCGAGCCCCTC^ 

AGCAGGACCAGGCCCGGCGCAGCCGCTGTGTGCGTGACTTGGTGCGCTCGCTGCCCGOTCCCAACCACGA- 
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CACTCTGCGGATGCTCTTCCAGCACCTCTGCCGGAGGGTGATCGAGCACGGCGAGCAGAACCGCATGTCG 
GTGCAGAGCGTGGCCATTGTGTTCGGGCCCACGCTGCTGCGGCCCGAGGTGGAAGAGACCAGCATGCCCA 
TGACCATGGTGTTCCAGAACCAGGTGGTGGAGCTCATCCTGCAGCAGTGCGCGGACATCTTCCCGCCGCA 
CTGACTGCTGGC CT ' 



A NOV87a polypeptide (SEQ ID NO:282) encoded by SEQ ID NO:281 is 322 amino 
acids in length and is presented using the one-letter amino acid code in Table 87B. The Psort 
profile for NOV87a predicts that this sequence is likely to be localized to the cytoplasm with a 
certainty of 0.6500. 



Table 87B. NOV87a Polypeptide Sequence (SEQ ID 
NO:282) 



MAPKDKS S RKNVLEVSGGVGEKEGRGLVMPASSIiTT I DLEMTWGFLTLNSQLRSRDGSE Y 
L IQHDSEAI I,STWHKAIAQGI QELVSRAQGLSDLSKYRHKLRKFLQRRPTLQSLREKGYI 
KDQVFGCALAALCERERSRVPI^QQCIRAVEARGLD^ 

GEDERIjDIjDDGRWEDVHVITGALKLFFI^LPEPLFPFSHFRQFIAAISEQDQARRSRCVR 
DLTOSLPAPNHDTLRMLFQHLCRRVIEHGEQNRMSVQSVAIVFGPTLLRPEVEETSMPMT 
MVFQNQWELILQQCADIFPPH ' 



NOV87b 

The disclosed NOV87b (alternatively referred to herein as CG56886-02) includes the 
10 985 nucleotide sequence (SEQ ID NO:283) shown in Table 87C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 7-9 and ends with a TGA codon at 
nucleotides 973-975. The disclosed NOV87b maps to human chromosome 17. 



Table 87C. NOV87b Nucleotide Sequence (SEQ ID NO:283) 



GGGGGCCACTCTCTCATGGCCCCCAAAGACAAATCCAGTAGGAAGAATGTC 

TTGGGGAGAAGGAGGG AAGGGG CTTAGTGATGC CTG C CAG CTCT CTGACAACTATTGACCTGGAAATGAC 
CTGGG GGT TTTTG ACCCTTAATT C C CAGCTACGG AG C CGAGATGGCT CTGAGT ACCTGAT C CAGCACGAC 
TCGGAGGCCATCATCAGCACCTGGCATAAGGCCATTGCTCAGGGCATCCAGGAGCTGGTAAGCAGAGCCC 
AGGGCCTCAGCGACTTGAGCAAGGTCCGGCACAAGCTCCGCAAGTTCCTC 

GTCGCTGCGGGAGAAGGGCTACATCAAAGACCAGGTGTTCGGCTGCGCGCTGGCCGCGCTGTGTGAGCGC 

GAGAGGAGCCGGGTGCCACGCTTCGTGCAGCAGTGCATCCGCGCCGTCGAGGCCCGCGGTCTGGACATCG 

ACGGGCTGTACCGC^TCAGTGGAAACCTGGCCACCATCCAGAAGCTACGCTATAAGGTGGACCACGGTGA 

GGATGAGCGCCTTGACCTGGATGACGGGCGCTGGGAGGACGTCCACGTTATCACCGGAGCCCTGAAGCTC 

TTCTTTCGGGAGCTGCCCGAGCCCCTCTTCCCCTTCTCGCACTTCCGCCAGTTCATTGCGGCCATCAGTG 

AGCAGGACCAGGCCCGGCGCAGCCGCTGTGTGCGTGACTTGGTGCGCTCGCTGCCCGCTCCCAACCACGA 

CACTCTGCGGATGCTCTTCC^GC^CCTCTGCCGGAGGGTGATCGAGCACGGCGAGCAGAACCGCATGTCG 

GTGCAGAGCGTGGCCATTGTGTTCGGGCCCACGCTGCTGCGGCCCGAGGTGGAAGAGACCAGCATGCCCA 

TGACCATGGTGTTCCAGAACCAGGTGGTGGAGCTCATCCTGC^GC^ 

CTGACTGCTGGCCT 



1 5 The NOV87b polypeptide (SEQ ID NO:284) encoded by SEQ ID NO:283 is 322 

amino acids in length and is presented using the one-letter amino acid code in Table 87D. The 
Psort profile for NOV87b predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.6500. 
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Table 87D. NOV87b Polypeptide Sequence (SEQ ID 
NO:284) 

MAPKDKSSRKNVLEVSGGVGEKEGRGLVMPASSLTTIDLEMTWGFLTLNSQIjRSRDGSEY 
LIQHDSEAIISTWHKAIAQGIQELVSRAQGLSDLSKVRHKLRKFLQRRPTLQSLREKGYI 
KDQ VFGCALAALCERERSRVPRFVQQCIRAVEARGLD IDGLYRI SGNLATIQKLRYKVDH 
GEDERLDLDDGRWEDVHVITGALKLFFRELPEPLFPFSHFRQFIAAISEQDQARRSRCVR 
DLVRSLPAPl^TLRI^FQHLCRRVIEHGEQNRMSVQSVAIVFGPTLLRPEVEETSMPMT 
MVFQNQWELILQQCADIFPPH 



A BLAST analysis of NOV87 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV87 had high 
5 homology to other proteins as shown in Table 87E. 



Table 87E. BLASTX results from PatP database for NOV87 






Smallest 






Sum 


Sequences producing High -scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAUi7449 Novel signal transduction pathway protein 


827 


2.9e-82 


patp;AAB68548 Human GTP-binding associated protein 


779 


3.5e-77 


patp:AAG66505 GTP enzyme Rho family active site 90 


779 


3.5e-77 


patp:AAB64387 Amino acid sequence of human intracellular 


711 


5.6e-70 


patp:AAY94450 Human inflammation associated protein 


589 


3.0e-62 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 276 of 417 bases (66%) identical to a gb:GENBANK- 

10 ID:BC006107|acc:BC006107.1 mRNA from Homo sapiens {Homo sapiens, clone 

MGC: 12959, mRNA). The full amino acid sequence of the protein of the invention was found 
to have 173 of 319 amino acid residues (54%) identical to, and 229 of 319 amino acid residues 
(71%) similar to, the 316 amino acid residue ptnr:SPTREMBL-ACC:Q9NT76 protein from 
Homo sapiens (Human) (HYPOTHETICAL 36.4 KDA PROTEIN). NOV87 also has 

15 homology to the other proteins shown in the BLASTP data in Table 87F. 



Table 87K NOV87 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 
(%) 


Positive 
(%) 


Expect 


gi 

ir 


1136009l|p 
[T46471 


hypothetical protein 
DKFZp434L0130.1 - human 


316 


168/355 
(47) 


221/355 
(61) 


7e-80 


gi 

bj 
| 


13676443 |d 
BAB41146.1 
(AB060206) 


hypothetical protein 
[Macaca fascicularis] 


847 


164/356 
(46) 


220/356 
(61) 


6e-79 
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gi|l4245732|d 
bj |BAB56159.1 
] (AB051853) . 


rho-GTPase activating 
protein [Homo sapiens) 


731 


130/328 
(39) 


188/328 
(56) 


9e-57 


gi|l8l4683l|d 
bj |BAB83128.1 
] (AB030239) 


RGL1 [Homo sapiens] 


547 


130/328 
(39) 


188/328 
(56) 


2e-56 


gi|l508008l|g 
b|AAH11820.l| 
AAH11820 
(BC011820) 


unknown {protein for 

IMAGE : 3619501) 
[Homo sapiens} 


599 


130/328 
(39) 


188/328 
(56) 


2e-56 



This BLASTP data is displayed graphically in the ClustalW in Table 87G. A multiple 
sequence alignment is given, with the NOV87a and b proteins being shown on lines 1 and 2 in 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 87F. 



Table 87G. ClustalW Alignment of NOV87 



NOV87a 
NOV87b 



gi 
gi 
gi 
gi 



11360091 
13676443 
14245732 
18146831 
15080081 



(SEQ ID NO: 282) 
{SEQ ID NO: 284) 
(SEQ ID N0:734) 
(SEQ ID NO: 735) 
(SEQ ID NO: 736) 
(SEQ ID NO: 737) 
(SEQ ID NO:738) 
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NOV87a 
NOV87b 



gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 


NOV87a 


NOV87b 


gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 



NOV87a 


NOV87b 


gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 



NOV87a 
NOV87b 



gi 



11360091| 
13676443 



20 



|....|....|....|, 



30 40 50 

-|....|....|....|..,.| 



MKMADRSGKI IPGQAYIEVT3YDYEYEAKDRKIVI KQGERYILVKKTNDDW 
- M 



60 70 80 90 100 

|....|....|....|....|....|....|....|....|..,.| 



WQVKPDENSKAFYVPAQYVKEVTRKALMPPVKQVAGLPNNSTKIMQSLHL 
LSSRWWPSSWGI LGLGPRS PPRGSQLCALYAFTYTGADGQQVSLAEGDRF 



110 120 130 140 150 

..|....|....|....|....|....|....|....|....| 



QRSTENVNKLPELSSFGKPSSSVQGTGLTRDANQNFGPSYNPGHTVNLSL 
LLLRKTNSDWWLARRLEAPSTSRPI FVPAAYMIEES IPSQS PTTVTPGQL 



160 170 180 190 200 

..|....|....|....|....|....|....|....|....| 



DLTHNNGKFNNDSHSPKVSSQNRTRLFGHFPGPEFLDVEKTSFSQBQSCD 
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NOV87a 


NOV87b 


gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 



14245732 
18146831 
15080081 



NOV87a 

NOV87b 

gi 1 11360091 

gi 1 13676443 

gi j 14245732 

gi j 18146831 

gi (15080081 



NOV87a 
NOV87b 



11360091) 
13676443) 
14245732) 
18146831) 
15080081 



NOV87a 
NOV87b 



gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 


NOV87a 


NOV87b 


gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 



NOV87a 

NOV87b 

gi | 11360091 

gi j 13676443 

gi | 14245732 

gi j 18146831 

gi | 15080081 



NOV87a 
NOV87b 
gi | 11360091 | 



LWTPGPKLFHGSLEELSQALPSRAQASSEQPPPLPRKMCRSVSTDNLSPS 

PPLPRKMCRSVSTDNLSPS 

210 220 230 240 250 

:::; |.-..|....|....|....|....|. .).... |....|....| 

SAGEGSERIHQDSESGDELSSSSTEQIRATTPPNQGRPDSPVYANLQELK 
LLKPFQEGPSGRSLSQEDLPSEAS- - -ASTAGPQPLMSEPPVYCNLVDLR 

- - -MSEPPVYCNLVDLR 

LLKPFQEGPSGRSLSQEDLPSEAS- - - ASTAGPQPLMS EP PVYCNLVDLR 

260 270 280 290 300 

-...|....|.--.|....|....|....|....|....|....|....| 

I SQS ALPPLPGS PAIQINGEWETHKD- SSGRCYYYDRGTQERTWKPPRWT 
RCPRSPPPGPACPLLQRLDAWEQHLDPNSGRCFYINSLTGCKSWKPPRRS 
RCPRSPPPGPACPLLQRLDAWEQHLDPNSGRCFYINSLTGCKSWKPPRRS 
RCPRSPPPGPACPLLQRLDAWEQHLDPNSGRCFYINSLTGCKSWKPPRRS 



310 



320 330 
..|....|....|.. 



340 



350 
I-...I 



RDAS I S KGDFQS PGDQELLS SEENYYSTS YSQSDSQCGSPPRGWSEELDE 

R SE 

R SE 

R SE 



360 



370 380 
..)... .)....).. 



390 



400 



RGHTLYTSDYTNEKWLKHIDDQGRQYyYSADGSRSEWELPKYNASSQQQR 

TNPGSMEGTQTLKRNNDVLQPQA 

- TNPGSMEGTQTLKRNNDVLQPQA 

TNPGSMEGTQTLKRNNDVLQPQA 



410 420 
..|....|....|.. 



430 440 450 
..).... |.... I |....| 



E 1 1 KS RSLDRRLQE PI VLTKWRHST I VLDTNDKES PTAS KP CFPENESS P 

KGFRSDTGTPEPLDPQGSLSLSQRTSQLDPPALQAP- - -RP 

KGFRSDTGTPEPLDPQGSLSLSQRTSQLDPPALQAP RP 

KGFRSDTGTPEPLDPQGSLSLSQRTSQLDP PALQAP RP - 



460 



470 



480 
■|....|.. 



490 



500 



S SPKHQDTASSPKDQEKyGLLNVTKIAENGKKVRKNWLSSWAVLQGSSLL 

LPQLLDDPHEVEKSGLLmTKIAQGGRKLRKNWGPSWVVLTGNSLV 

LPQLLDDPHEVEKSGLIiNMTKI AQGGRKLRKNWGPSWWLTGNSLV 

LPQLLDDPHEVEKSGLLNMTKIAQGGRKLRKNWGPSWVVLTGNSLV 
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NOV87a 


NOV87b 


gi 


11360091 


gi 


13676443 


gi 


14245732 


gi 


18146831 


gi 


15080081 



13676443) 
14245732 | 
1814683l| 
15080081 



NOV87a 
NOV87b 



1136009l| 
13676443) 
14245732) 
18146831) 
1508008l| 



NOV87a 
NOV87b 



11360091 
13676443 
14245732 
18146831 
15080081 



NOV87a 
NOV87b 



11360091 
13676443 
14245732 
18146831 
15080081 



NOV87a 

NOV87b 

gi | 11360091 

gi | 13676443 

gi I 14245732 

gi (18146831 

gi j 15080081 



NOV87a 

NOV87b 

gi | 11360091 

gij 1 13676443 

gi) 14245732 

gij 18146831 

gi|l5080081 



NOVB7a 
NOV87b 



FTKT- -QGSSTSWFGSNQSKPEFTVDLKGATIE] 
FYRE P P PTAPS S GWGPAGSRPE S S VDLRG - - AAI^HG) 
FYREPPPTAPSSGWGPAGSRPESSVDLRG- -AAiBhgS 
FYRE PP PTAPS AGWGPAGSRPE S S VDLRG - -AAsJwGfe 




560 



600 



570 580 590 

|TT I DLEMTWGFLTffl - -NSQBRSRDGSEYLIQH 
~ 3Eg- - SAEgPPEESESSRVDF 

-nqavbtdegieeei - p 
-nqa^Stdegieeeilp 
drenpSIlrlsgsgpael 
drenpS|lrlsgsgpael 
l,sgsgpael 




EpCSMPMTM^ 
EISMPM' 
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11360091 




SgniJ 


gi 


13676443 






gi 


14245732 




SSDP 


gi 


18146831 






gi 


15080081 








Table 87H lists the domain description from DOMAIN analysis results against 
NOV87. This indicates that the NOV87 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 87H. Domain Analysis of NOV87 



gnIlSmartls mart00324 . RhoGAP, GTPase-activator protein for Rho- like GTPases; GTPase activator proteins 

towards Rho/Rac/Cdc42-Iike small GTPases SEQ ID NO: 879 

CD-Length = 175 residues, 99.4% aligned 
Score = 144 bits (364), Expect = 6e-36 



NOV87: 138 SRVPRFVQQCI^VEARGLDIDGLYRISGNIiATIQKLRYKVDHGEDERLDLDDGRWEDVH 197 

+P V++CI +E RGLD +G+YR SG+ + +++LR D G D LDL + DVH 
Sbjct: 1 KPIPIIVEKCIEYLEKRGLDTEGIYRKSGSASRVKELREAFDSGPDPDLDLSE YDVH 57 

o o 

NOV87: 198 VITGALlCLFraELPEPLFPFSHFRQFIAAISEQDQitflRSRCVRDLTO 257 

+ G LKLF RELPEPL F + +FI A +D+ R R +R+L+ LP N TLR L 
Sbjct: 58 DVAGLLKLFLRELPEPLITFELYEEFIEAAKLEDEEERLRALRELLSLLPPANRATLRYL 117 

o 

NOV87: 258 FQHLCRRVIEHGEQNRMSVQSVAIWGPTLLRPEVEETSMPMTMVFQNQVVELILQQC 315 

HL RV EH E+N+M+ +++AIVFGPTLLRP E++ + QN+WE +++ 
Sbjct: 118 LAHL-I^VAEHSEENKMTARNIiAJVFGPTLLRPPDGESASLia3IRHQNKVVEFLIE^ 174 



Rho GTPases control a variety of cellular processes. There are 3 subtypes of Rho 
GTPases in the Ras superfamily of small G proteins: RHO, RAC, and CDC42. GTPase- 
activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP 

1 0 hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated 

signals. GAPs may also serve as effector molecules and play a role in signaling downstream of 
Rho and other Ras-Iike GTPases. 

By screening a Jurkat cDNA library using a yeast 2-hybrid system with an activated 
form of RAC as bait, followed by screening a placenta cDNA library, Toure et al. (1998) 

15 isolated a cDNA encoding RACGAP1, which they called MGCRACGAP. The predicted 527- 
amino acid RACGAP1 protein has a large N-terminal region containing a protein kinase C- 
like cysteine-rich motif. RACGAP1 shares highest homology with the Drosophila RnRacGAP 
and the chimerins of rat and human. Functional analysis showed that the GAP domain of 
RACGAP1 exhibits strong GAP activity towards CDC42, RAC1, and RAC2. Northern blot 

20 analysis detected an approximately 3.2-kb RACGAP1 transcript that was most abundantly 

expressed in testis, with low expression in most other tissues. Western blot analysis detected a 
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RACGAP1 protein of 58 kD in testis extracts. In situ hybridization showed that RACGAP1 
expression is restricted to germ cells in mature testis Human breakpoint cluster region (bcr) 
gene product is a member of a group of GTPase-activating proteins that act exclusively on 
members of the Ras-related Rho subfamily. 
5 A complementary DNA was isolated from Caenorhabditis elegans that encoded a 

polypeptide of 1438 amino acid residues, CeGAP, which contains a domain with sequence 
similarity to the COOH-terminal segment (GTPase-activating protein region) of Bcr and other 
known GTPase-activating proteins of the Rho subfamily. It also contains a "pleckstrin 
homology" motif, present in many signaling proteins including GTPase-activating proteins 

10 and nucleotide exchange factors. The Bcr-like domain of CeGAP exhibited activity not only 
on members of the C. elegans and human Rho subfamily but surprisingly also on C. elegans 
Ras protein (Iet-60), human Ras, and Rab3A. 

CeGAP is therefore the first GTPase-activating protein acting on Ras-related proteins 
across different subfamilies, studies suggest a central and integrative role for CeGAP in a 

1 5 signaling pathway common to Ras and related proteins. 

NOV87 is predicted to be expressed in at least the following tissues: pancreas, 
stomach, brain, bone. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 

20 NOV87 is provided in Example 2. 

The NOV87 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Alzheimer's 
disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 

25 leukodystrophies, behavioral disorders, addiction, anxiety, pain, neuroprotection, 

hypercalceimia, ulcers, diabetes, Von Hippel-Lindau (VHL) syndrome, pancreatitis, obesity as 
well as other diseases, disorders and conditions. NOV87 nucleic acids encoding the CeGAP- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 

30 assessed. 

The novel nucleic acid of the invention encoding a GTPase activating protein-like 
protein includes the nucleic acid whose sequence is provided in Table 87A or 96C, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 87A or 96C while still 
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encoding a protein that maintains its GTPase activating protein-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 87A or 96C, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
5 described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
1 0 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the GTPase activating protein-like protein 
15 whose sequence is provided in Table 87B or 96D. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 87B or 96D while still encoding a protein that maintains its GTPase activating 
protein-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 1% of the amino acid residues may be so changed. 
20 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

25 

NOV88 

The disclosed NOV88 (alternatively referred to herein as CG56394-01) includes the 
1092 nucleotide sequence (SEQ ID NO:285) shown in Table 88 A. A NOV88 ORF begins . 
with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
30 at nucleotides 1033-1035. The disclosed NOV88 maps to human chromosome 2. 



Table 88A. NOV88 Nucleotide Sequence (SEQ ID NO:285) 

G CAC CAGCCACATCCTGAGATAC CATGGTTAAGGTG AAGGCCAGAGT CAACAGATTTGG C CACATTGGG C 
ACCAGATCACCAGGGCTGCTTTTAACTCTGGTAAAGTTC 

CCTCAACTACATGGTCTACGTGTTCCAGTGTGGTTCTACCCATGGCAAATTCCATGGCAOT 
GAGAATGGGAAGCTTGTCATTAACGGAAATCTCATCACC 

AATGGGACAATGTTGACGCTGAGTACATTTGGGTGTCCACCGGTGTCTTCACCACCACAGAGAAGGCTGG 
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GGCT CACTTGCAGCA6GGAGCCAAAAGGGTCAT AAT CTCTACTCCCTCTGCTGACG C CCCCATGTTCATG 
ATGGGCGTGAACCATAAGAAAT ATG AAAACAGC CTC AAGATCATCAGCAATGCCT CCTGTACCACCAACT 
TCTTAGCCTCCCTGGCCAAGCTCATCCATGACAACTTTGGTATTGTGGAAGGACTCATGACCACGACCCA 
C AC CATCACTGCCACCCAGAAGACTGTAGATGGACCCTCCAGGAAACTGTGGTGT GATGGCCACGGGGCT 
CTCCAGATCATCATCCCTGC^TCrACTGGTGCrGCCy^GCTGTAGGCAAGGTCATCCCCGAGATGAATG 

GGAAAATCCTGCCAAATATGATGACA^ 

GGCATCCTGGACTAC^CTGAGCACCACGTTGTCTCCTCCAGCTTXAACAGTGACACCCACTCTTCCACCT 
TCAATGATGGGGCTGGTATTGCCCTCAATGACGATTTTGTCAAGCTCATTTCCTGTTATGACAATGCATT 
TGGCTACAACAACAGGGCAGTGGACCTCATGGCCCACATGGCCTCCAAGAAGTAAGACCCCCAGACCACC 
AGCCTCAGGCCCTCAGCTGCTAGGAATCCCCTATTGCACTAG 



A NOV88 polypeptide (SEQ ID NO:286) encoded by SEQ ID NO:285 is 336 amino 
acids in length and is presented using the one-letter amino acid code in Table 88B. The Psort 
profile for NOV88 predicts that this sequence has no signal peptide and is likely to be 
5 localized to microbodies with a certainty of 0.4804. In alternative embodiments, a NOV88 
polypeptide is located to the mitochondrial matrix space with a certainty of 0.3600. 



Table 88B. NOV88 Polypeptide Sequence (SEQ ID 
NO:286) 

MVKVKARVNRFGHIGHQITRAAFNSGKVDIVAISDPFTGLNYMVYVFQ 

KAENGKLVINGNLITIFQERDPTKIKWDNVDAEYIWSTGVFTTTEKAGAHLQQGAKRVI 

ISTPSADAPMFMMGVNHKKYENSLKIIS 

ITATQKTVDGPSRKLWCDGHGALQIIIPASTGAAKAVGKVIPE^GKITSMAFRVPTTNV 
SVMHLTCHLENPAKYDDIKKVVXQAS 

GI ALNDHFVKLI S CYDNAFGYNNRAVDLMAHMASKK 



A BLAST analysis of NOV88 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV88 had high 
homology to other proteins as shown in Table 88C. 



Table 88C BLASTX results from PatP database for NOV88 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAY07036 Breast cancer associated antigen precursor 


1407 


9.9e-144 


patp:AAY05368 Human HCMV inducible gene protein 


1407 


9.9e-144 


patp:AAG64817 Human G3PDH fragment - Homo sapiens, 327 aa. 


1377 


l.5e-140 


patp:AAE04373 Mouse cancer associated antigen OY-MC-2 


1316 


4.4e-134 


patp:AARl2995 GAP-DH - Aspergillus oryzae (ATCC 42149) 


993 


7.4e-100 



In a search of sequence databases, it was found, for example, that the nucleic acid 
15 sequence of this invention has 928 of 1055 bases (87%) identical to a gbrGENBANK- 
ID:AF261085|acc:AF261085.1 mRNA from Homo sapiens (glyceraldehyde-3 -phosphate 
dehydrogenase (GADPH) mRNA). The full amino acid sequence of the protein of the 
invention was found to have 278 of 336 amino acid residues (82%) identical to, and 294 of 
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336 amino acid residues (87%) similar to, the 335 amino acid residue ptnnTREMBLNEW- 
ACC:AAG01996 protein from Homo sapiens (Human) (CLONE CDABP0047 MRNA 
SEQUENCE). NOV88 also has homology to the other proteins shown in the BLASTP data in 
Table 88D. 



Table 88D. NOV88 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|7669492|ref | 
NP_002037.2| 
(NM 002046) 


glyceraldehyde- 3 - 
phosphate dehydrogenase 
[Homo sapiens] 


335 


278/336 
(82) 


294/336 
(86) 


e-152 


gi| 31645 erab|CA 
A25833.1 
(X01677) 


glyceraldehyde -3 - 
phosphate 
dehydrogenase 
[Homo* sapiens] 


335 


227/336 
(82) 


294/336 
(87) 


e-152 


gi|2407184|gb|A 
AB94053.l| 
(AF017079) 


glyceraldehyde 3- 
phosphate dehydrogenase 
(Sus scrofa] 


333 


272/333 
(81) 


290/333 
(86) 


e-149 


gi| 6983849 |dbj | 
BAA90818.l| 
(AB038241) 


glyceraldehyde - 3 - 
phosphate dehydrogenase 
[Felis catus] 


333 


267/333 
(80) 


285/333 
(85) 


e-145 


gi|250644l|sp|P 
00355 |G3P_PIG 


GLYCERALDEHYDE 3- 
PHOSPHATE DEHYDROGENASE 
(GAPDH) 


333 


265/333 
(79) 


284/333 
(84) 


e-145 



10 



This BLASTP data is displayed graphically in the ClustalW in Table 88E. A multiple 
sequence alignment is given, with the NOV88 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 88D. 



Table 88E. ClustalW Alignment of NOV88 



NOV88 

gi 
gi 
gi 
gi 
gi 



7669492 
31645| 
2407184 
6983849 
2506441 



NOV88 


gi 


76694921 


gi 


31645 | 


gi 


2407184 


gi 


6983849 


gi 


2506441 



N0V88 



7669492 | 
31645 | 
24071B4 1 



(SEQ ID NO: 286) 
(SEQ ID N0:739) 
(SEQ ID NO: 740) 
(SEQ ID NO: 741) 
(SEQ ID NO: 742) 
(SEQ ID NO: 743) 




60 



70 
.1.- 



80 



90 
.1.. 



100 



sthgkfhgtvkaengklvingnpitifqerdp|kikv?gdagaeyvvestg 
sthgkfhgtvkaengklvingnpitifqerdpgkikwgdagaeyvvestg 
sthgkfhgtvkaengklvingnpitifqerdpBkikwgdagaeywestg 
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STHGKFHGTVKAENGKLVINGgPITIFQERDP^jlKWGDAGAEYVVESTG 
STHGKTHGTVKAENGKIiVINGElTIFQERDP™IKWGDAGfiBYVVHSTG 



gi | 6963849 | 
gi | 2506441 j 



NOV88 


gi 


7669492 | 


gi 


31645 | 


gi 


2407184 | 


gi 


6983849 | 


gi 


2506441 | 


NOV88 


gi 


7669492 | 


gi 


31645 | 


gi 


2407184 


gi 


6983849 


gi 


2506441 


NOV88 


gi 


7669492] 


gi 


31645) 


gi 


2407184 


gi 


6983849 


gi 


2506441 


NOV88 


gi 


7669492 | 


gi 


31645 | 


gi 


2407184 


gi 


6983849 


gi 


2506441 



NOV88 


gi 


7669492 | 


gi 


31645 | 


gi 


2407184 | 


gi 


6983849 j 


gi 


2506441 | 




160 170 180 190 200 
,. | 1 1 | | | | | | 

i 



SCTTKSLAgLAKjJlHDNFGIVEGLMTTaHgiTATQKTVDGPSgKLlvgDGj 
SCTTNCLAPLAKVIHDNFGIVEGLMTTVHAITATQKTVDGPSGKLWRDGRj 
SCTTNCLAPLAKVIHDNFGIVEGLMTTVHAITATQKTVDGPSGKLWRDGR 
SCTTNCLAPLAKVIHDNFGIVEGLMTTVHAITATQKTVDGPSGKLWRDGR 
SCTTNCLAPLAXVIHD||FGIVEGLMTTVHAITATQKTVDGPSGKLWRDGR 
SCTTNCLAPLAKVIHDiJjFGIVEGLMTTVHAITATQKTVDGPSGKLWRDGR 



210 



220 



230 



240 



250 



■A 



GALQ0I I PASTGAAKAVGKVIPE-^GKgTgMAFRVPTgNVS V JgjLTCjJjLE 
GALQNII PASTGAAKAVGKVI PELNGKLTGMAFRVPtSnVSWDLTCRLE 
GALQNIIPASTGAAKAVGKVIPEL2GKLTGMAFRVPTgNVSWDLTCRLE 
GALQNIIPASTGAAKAVGKVIPELNGKLTGMAFRVPTgi^VSVVDLTCRLE 
GA^QNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPtB^TVSWDLTCRLE 

gaSqniipastgaakavgkvipe^gkltgmafrvptSstvsvvdltcrle 



260 

..I.. 



270 



280 



290 



300 

-•I 



P AKYDDI KKWKQASEglp 
KP AKYDD I KKWKQ AS EG P 
KPAKYDDIKKWKQASEGP 
KP AKYDD I KKWKQASEG P 
K^AKYDDI KKWKQASEGP 
KPAKYDDIKKWKQASEGP 



LKG I LjjjYTEHijiVVS SgFNSDTHS STF|JJ£ga 
LKGILGYTEHQWSSDFNSDTHSSTFDAGA 
LKGILGYTEHQWSSDFNSDTHSSTFDAGA 
LKGILGYTEHQWSSDFNSDTHSSTFDAGA 
LKGILGYTEHQWSSDFNSDTHSSTFDAGA 

lkgilgyteSqwsSdfnsdths STFDAGA 



310 



320 



330 



GIALNDHFVKLISgYDI\lSFGYgiNR2VDLMAHMASiqj 
GIALNDHFVKLISWYDNEFGYSNRWDLMAHMASKE 
GIALNDHFVKLI S WYDNEFGYSNRWDLMAHMAS KE 

gialndhfvklis|ydnefgysnrwdlm2hmaske 
gi alndhfvkli s wydne fgysnrwdlm ahmas ke 
gialndkfvklis^dnefgysnrwdlmBhmaske 



Table 88F lists the domain description from DOMAIN analysis results against NOV88. 
This indicates that the NOV88 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 88F. Domain Analysis of NOV88 



gnllPfam|pfam02800. gpdh_C, Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a 

tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed 

alpha/antiparallel beta fold. SEQ ID NO: 880 

CD-Length = 163 residues, 100.0% aligned 
Score = 169 bits (427), Expect « 3e-43 



N0V88: 153 TTNFLAS LAKL I HDNFG I VSGLM TTTHTITATQKTVDG'PS RKLW CDGHG ALQ 1 1 1 PASTG 212 
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TTN LA LAK+++DNFGI +GLMTT H TA QK VDGP K G A IIP STG 




Sb j Ct : 


1 


TTNCLAPLAKVLNDNFGIEKGLMTTVHAYTAD^ IPTSTG 


60 


NOV88: 


213 


AAKAVGKVI PEMNGKITSMAFRVPTTNVSVMHLTCHLENP AKYDDI KKVVKQASEAPPLK 


272 






AAKAVG V+PE+NGK+T MAFRVPT NVSV+ LT LE P ++I +K+A+E P LK 




Sbjct: 


€1 


AAKAVGLVLPELNGKLTGl^RVPTPNVSVVDLTVELEKPVTVEEINAALKEAAEGP^K 


.120 


NOV88: 


273 


GlLDYTEWWSSSmSDTHSSTFM^GIIO^HFVKLISCY 3 15 








GIL YTE +VSS F D HSS F+ A I LND+FVXL++ Y 




Sbjct: 


121 


GILGYTEDPLVSSDFIGDPHSS I FDAKATI VLNDNFVKLVAWY 163 





NAD-dependent glycerol-3-phosphate dehydrogenase (EC 1 .1.1 .8) (GPD) catalyzes 
the reversible reduction of dihydroxyacetone phosphate to glycerol-3-phosphate. It is a 
cytoplasmic protein that is active as a homodimer, with each monomer containing an N- 
5 terminal NAD binding site. In insects, it acts in conjunction with a mitochondrial alpha- 
glycerophosphate oxidase in the alpha-glycerophosphate cycle, which is essential for the 
production of energy used in insect flight. 

Glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.2.12) (GAPDH) mRNA levels, 
protein, and enzymatic activity increase in 3T3-F442A adipocytes after exposure to 

1 0 physiological concentrations of insulin (Alexander, M., Curtis, G., Avruch, J., and Goodman, 
H. (1985) J. Biol. Chem. 260, 1 1978-1 1985). In order to understand the mechanism of this 
regulation, researchers isolated and sequenced 5.4 kilobase pairs of a 12-kilobase pair human 
genomic clone encoding a functional GAPDH gene. The gene consists of 9 exons and 8 
introns with eukaryotic signals necessary for the transcription and translation of GAPDH 

1 5 mRNA. The exon sequence* confirms previously published cDNA sequences for human 
GAPDH in muscle, liver, and erythrocytes. The organization of the human and the unique 
chicken GAPDH genes is strikingly similar. Although chicken exons VIII-XI have been fused 
into human exon 8, introns which separate exons encoding the NAD binding, catalytic, and 
helical domains of the GAPDH protein have been retained. Stable transfection of rodent cells 

20 with the intact human GAPDH gene resulted in the expression of a correctly initiated human 
GAPDH mRNA and an enzymatically active human GAPDH polypeptide. Thus, the gene 
contains a functional promoter and intact coding sequences. Although many processed 
GAPDH pseudogenes and GAPDH-like sequences are present in the human genome, Southern 
blot analysis of human genomic DNA using a probe derived from the 3 -untranslated region of 

25 the GAPDH gene detected only two genes, a 1 0-copy processed pseudogene and a single copy 
of the isolated gene. In contrast, a probe derived from an intron segment of the isolated gene 
detected only a single copy of the GAPDH gene. Collectively, these findings strongly suggest 
that the human genome encodes a single functional GAPDH gene. 
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Hopkinson et al. (1974) presented evidence that glycerol-3-phosphate dehydrogenase 
(EC 1 .1 .1.8) is a dimer of dissimilar subunits. Electrophoretic variants at each of two loci, 
designated GPD1 and GPD2, were described. By the method of somatic cell hybridization, 
Kielty and Povey (1982) assigned the presumed structural gene for alpha-glycerophosphate 
5 dehydrogenase to chromosome 12. Since this is a liver-specific enzyme, a rat hepatoma cell 
line was used as one of the 'parents' in the hybridization. 

NOV88 is predicted to be expressed in at least the following tissues: liver. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 

10 sources, and/or RACE sources. Further expression data for NOV88 is provided in Example 2. 

The NOV88 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation as well as other diseases, disorders 
and conditions. NOV88 nucleic acids encoding the Glycerol-3-Phosphate Dehydrogenase-like 

15 protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a Glycerol-3-Phosphate 
Dehydrogenase-like protein includes the nucleic acid whose sequence is provided in Table 
88A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

20 whose bases may be changed from the corresponding base shown in Table 88A while still 

encoding a protein that maintains its GlyceroI-3-Phosphate Dehydrogenase-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids.whose sequences are complementary to the sequence of Table 88A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

25 The invention additionally includes nucleic acids or nucleic acid fragments,- or 

complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 

30 used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 13% of the bases 
may be so changed. 

The novel protein of the invention includes the Glycerol-3-Phosphate Dehydrogenase- 
like protein whose sequence is provided in Table 88B. The invention also includes a mutant or 
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variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 88B while still encoding a protein that maintains its Glycerol-3-Phosphate 
Dehydrogenase-like activities and physiological functions, or a functional fragment thereof. In 
the mutant or variant protein, up to about 18% of the amino acid residues may be so changed. 
5 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

10 

NOV89 

The disclosed NOV89 (alternatively referred to herein as CG56396-01) includes the 
1221 nucleotide sequence (SEQ ID NO:287) shown in Table 89A. A NOV89 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
15 at nucleotides 1033-1035. The disclosed NOV89 maps to human chromosome 6. 



Table 89A. NOV89 Nucleotide Sequence (SEQ ID NO:287), 

CTGCATCTTCTCGTGCATTGCCAGCTGCAT^ 
GATTTGGTTGTATTGGCTGCCTGGTCACCAGGGCT 

CAATGACCCCTTCATTGAC CTCAACAACACT C 
CACGGCAC CGT CAAGG CTG AG AACGGG AAG CT TGT TAT CAATGGAAAT CT CAT CACTATTTTCCAGG GG C 
AAGATCTCACCAAAATCAAATGGGGCAATGCTGGCACTGAGTACA^ 
CACCATGGAGAAGGCTGGGGCTCACTTGGAGGGAGGAGCCAAAACGGT 

GATGC CCCCATGT T CGTGATGGGTGTG AAC CATGAGAAAT ATGACAACAGCTCAAGATT ACTCAAGATT A 
TGAGCAATGCCTCCTGCACCACCAGCTGCTTAACGCCCCTG 

CGTGGAAGGACTCATGACCATCGCTGCCACCCAGAAGACTATGGATGGCTCCTATGGGAAACTGTGGGGT 

GACGGCCATGGGGCTCTCCAGAACATCCTCTCTGCCTCTACTGGTGCTGCCAAGGCTGTGAGGAAGGTCA 

TCCCTGAGCTAAACGGGAAGCTCACTGGCATGGCCTTCCGTGTCCCCACTGCCAACATGTCAGT 

CCTGACCTGCCGTCTGGAAAAACCTACCAAATATGATGACACCAAGAAGGTGGTGAAGCAGGCGTCAGAG 

G AC CCC CT CAAAGG CAT C C TG GGCT ACTC TGAG CACCAGGTGGT CTCCT C CAACTT CAACTCAACAG AC A 

CCGACTCTTCCACCTTCGATGCTGGGGCTGGCATTGCCCTCA^ 

GTATGACAATGAATTTGGC TGCAGCAACAGGGTGG TGGACCTC TG CCCACAGTGTGGCTTCCAAGGAGTA 
AGACCCCCAGACCACCAGCCCCAGCGACAGCACGACGGGAAGAGAGCGGCCCTCACTGCTGGAGAGTCCC 
TGCCACACTCAGTCTCCCACCACACTGAGAATCTCCCCTCCTCM 
GGAGGAGCCGAGGGAGCCCCACCTTTTCATG 



A NOV89 polypeptide (SEQ ID NO:288) encoded by SEQ ID NO:287 is 374 amino 
acids in length and is presented using the one-letter amino acid code in Table 89B. The Psort 
20 profile for NOV89 predicts that this sequence is likely to be localized, to the endoplasmic 
reticulum (membrane) with a certainty of 0.5500, or to lysosomes with a certainty of 0.2630. 



Table 89B. NOV89 Polypeptide Sequence (SEQ ID 
NO:288) 

MVKVKAGVNRFGC I GCLVTRAALNSGLVD IVAINDPFIDL^INTVYMFQYNS AHGKFHGTV 
KAENGKLVTNGNL ITIFQGQDLTKI KWGNAGTEYIMEFTS I FTTMEKAGAHLEGGAKTVI 
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I SAP SADAPMFVMGVNHEKYDNS SRLLKI I SNASCTTS CI/TPLAKVI HDNFGT VEGLMTI 
AATQKTMDGSYGKLWGDGHGMjQNILSASTGAAKAVRKVIPEIJ^GKLTGMAFRVPTANMS 
VVDLTCRLEKPTKYDDTKKVVKQASEDPLKGILGYSEHQWS SNFNSTDTHS STFDAGAG 
IALNDHFVKLISWYDNEFGCSNRVVDLCPQCGFQGVRPPDHQPQRQHDGKRAALTAGESL 
PHSVSHHTENLPSS 



A BLAST analysis of NOV89 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV89 had high 
homology to other proteins as shown in Table 89C. 

5 



Table 89C. BLASTX results from PatP database for NOV89 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAY07036 Breast cancer associated antigen precursor 


1393 


3.0e-142 


patp:AAY05368 Human HCMV inducible gene protein 


13 93 


3.0e-142 


patp:AAG64817 Human G3PDH fragment - Homo sapiens, 327 aa. 


1377 


1.5e-140 


patp:AAE04373 Mouse cancer associated antigen OY-MC-2 


1315 


5.6e-134 


patp:AAR12995 GAP-DH - Aspergillus oryzae (ATCC 42149 


984 


6.6e-99 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1066 of 1225 bases (87%) identical to a gb:GENBANK- 
ID:AF261085|acc:AF261 085.1 mRNA from Homo sapiens (glyceraldehyde-3-phosphate 

10 dehydrogenase (GADPH) mRNA). The full amino acid sequence of the protein of the 

invention was found to have 279 of 327 amino acid residues (85%) identical to, and 293 of 
327 amino acid residues (89%) similar to, the 335 amino acid residue ptnr:TREMBLNEW- 
ACC:AAG01996 protein from Homo sapiens (Human) (CLONE CDABP0047 MRNA 
SEQUENCE). NOV89 also has homology to the other proteins shown in the BLASTP data in 

15 Table 89D. 



Table 89D. NOV89 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 7669492 |ref| 
NP_002037.2| 
(NM_002046) 


glyceraldehyde-3 - 
phosphate 
dehydrogenas e 
[Homo sapiens] 


335 


279/331 
(84) 


293/331 
(88) 


e-153 


gi| 31645 |emb|CA 
A25833.l| 
(X01677) 


glyceraldehyde-3 - 
phosphate 
dehydrogenas e 
[Homo sapiens] 


335 


278/331 
(83) 


293/331 
(87) 


e-152 


gi|2407184|gb|A 
AB94053.l| 
(AF017079) 


glyceraldehyde 3- 
phosphate 
dehydrogenase 
[Sus scrofa] 


333 


274/328 
(83) 


289/328 
(87) 


e-149 
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gi| 6983849 |dbj | 
BAA90818.l| 
(AB038241) 


glyceraldehyde-3 - 
phosphate 
dehydrogenas e 
[Felis catus] 


333 


268/328 
(81) 


283/328 
(85) 


e-146 


gi|250644l|sp|P 
00355 |G3P_PIG 


GLYCERALDEHYDE 3- 
PHOSPHATE 

DEHYDROGENASE (GAPDH) 


333 


267/328 
(81) 


283/328 
(85) 


e~146 



This BLASTP data is displayed graphically in the ClustalW in Table 89E. A multiple 
sequence alignment is given, with the NOV89 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 89D. 



Table 89E. ClustalW Alignment of NOV89 



NOV89 

gi | 7669492 | 
gi j 31645 | 
gi | 2407184 | 
gi | 6963849 | 
gi | 2506441 j 



(SEQ ID NO:288) 

(SEQ ID NO:744) 

(SEQ ID NO: 745) 

(SEQ ID NO: 746) 

(SEQ ID NO: 747) 

(SEQ ID NO: 748) 



NOV89 



gi 
gi 
gi 
gi 
gi 



7669492 
31645 | 
2407184 
6983849 
2506441 



NOV89 

gi | 7669492 | 
gi j 31645 | 
gi [2407184 | 
gi j 6983849 j 
gi j 2506441 j 




60 



70 



80 



90 



100 



sShgkfhgtvkaengklvingi^itifq^5iSkikwg^g2ey^e5tJ 
sthgkfhgtvkaengklvingnpitifqerdpIkikwgdagaeywestg 
sthgkfhgtvkaektgklvingnpitifqerdpSkikwgdagaeywestg 
sthgkfhgtvkaengklvingnpitifqerdpBkikwgdagas.ywestg 

STHGKFHGTVKAENGKLVINGgpiTIFQERDPgjlKWGDAGAEYWESTG 

sthgkfhgtvkaengklvingS!itifqerdp™ikwgdaga5ywestg 



NOV89 

gi I 7669492 I 
gi | 31645 | 
gi (2407184 
gi | 6983849 
gi j 2506441 



NOV89 


gi 


7669492] 


gi 


31645) 


gi 


2407184 


gi 


6963849 


gi 


2506441 



I, 



110 

..I.. 



120 

..I.. 



130 



I . 



140 

..I.. 



150 



fttmekagah: 
vfttmekagah: 
vfttmekagah 
vfttmekagah: 
vfttmekagah 
vfttmekagah 



160 



LgGGAKgVI I SAPS ADAPMF VMG VNHEKYDNS 
T jffiGG AKRVI I S APS ADAPM FVMGVNHEKYDNS 
.SGGAKRVIISAPSADAPMFVMGVNHEKYDNS 
ijjGGAKRVI IS APS ADAPM FVMG'3}NHEKYDNS 
gGGAKRVI ISAPS ADAPMFVMGVNHEKYDNSi 
ilSSGGAKRVII SAPSADAPMFVMGVNHEKYDNS 




170 



180 



190 



200 




210 



220 



230 



240 



I — I — I 



250 

:JL_ 
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NOV89 


gi 


7669492| 


gi 


31645| 


gi 


2407184 




gi 


6983 849 




gi 


2506441 





NOV89 


gi 


7669492| 


gi 


31645| 


gi 


2407184 


gi 


6983849 


gi 


2506441 



NOV89 

gi | 7669492 

gi |31645| 

gi j 2407184 

gi | 6983849 

gi|2506441 



NOV89 


gi 


7669492| 


gi 


31645| 


gi 


2407184 


gi 


6983849 


gi 


2506441 



dgSgalqni^astgaakavSkvipelngkltgmafrvptanJsvvdltc 

DGRGALQNI I PASTGAAKAVGKVIPELNGKLTGMAFRVPTANVS WDLTC 
DGRGALQNI I PAS TGAAKAVGKVIPEI^GKLTGM AFRVPTANVS WDLTC 
DGRGALQNI I PASTGAAKAVGKVIPELNGKLTGMAFRVPTANVSVVDLTC 
DGRGALQNI I PAS TGAAKAVGKVI PELNGKLTGMAFRVPtIJsFVS WDLTC 
DGRGASQNI I PASTGAAKAVGKVIPELNGKLTGMAFRVPTraNVSWDLTC 



260 



270 
J.. 



280 



290 
..I.. 



300 



RLEKPyKYDDjjKKWKQASEjgPLKGILGYgEHQWSSjJjFNSj 
RLE KP AK YDD I KK WKQ AS EG PLKGI LG Y T E HQ WS S DFNS 
RLEKPAKYDDIKKWKQASEGPLKGILGYTEHQWSSDFNS 
RLE KP AK YDD I KK WKQ AS E GPL KGI LG YTE HQ WS S DFNS 
RLEK^AKYDDIKKWKQASEGPLKGILGYTEgQWSgDFNS 

rlekpakyddikkwkqa'segplkgilgyteSqwshdfns 



jDTHSSTFD 
DTHSSTFD 
DTHSSTFD 
DTHSSTFD 
DTHSSTFD 
DTHSSTFD 



310 



320 



330 340 350 

■|....|....|....|....| 
!PQCGF§GVRPPDHQPQRQ 



GAGIALNDHFVKLISWYDNEFGgSNRWDL 
AG AG I ALNDH FVKLI S W YDNE FG Y SNRWD L MgHMA S KE 
AG AGI ALNDHFVKLI S WYDNE FG YSNRWDLMgHMAS KE 
AGAGI ALNDHFVKLI S| YDNE FGYSNRWDLmRhMAS KE 
AGAGIALNDHFVKLISWYDNEFGYSNRWDLMgHMASKE 

agagialndhfvkliswydnefgysnrwdlmBhmaske 



360 370 
HDGKRAALTAGESLPHS VSHHTENLPS S 



Table 89F lists the domain description from DOMAIN analysis results against NOV89. 
This indicates that the NOV89 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 89F. Domain Analysis of NOV89 



gnllPfam|pfam02800, gpdh_C, Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a 

tetrameric NAD~binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed 

alp ha/an ti parallel beta fold. SEQ ID NO: 881 

CD-Length ■ 163 residues, 100.0% aligned 
Score = 209 bits (531), Expect = 3e-55 



NOV8 9 : 15 S TTSCLTPLAKVIHDNFGTVEGLMTI AATQKTMDGSYGKLWGDGHGALQNILSASTG 211 

TT+CL PLAKV+ +DNFG +GLMT A QK +DG + K G A NI+ STG 

Sb j ct : 1 TTNCLAPLAKVLNDNFGIEKGLMTTVHAY^ IPTSTG 6 0 

NOV89: 212 AAKAVRKVIPELNGKLTGMAFRVPTANMSVVDLTCRLEKPTKYDDT^ LK 270 

AAKAV V+PELNGKLTGMAFRVPT N+SWDLT LEKP ++ +K+A+E P LK 

Sb j c t : 61 AAKAVGLVLPELNGKLTGMAFRVPTPNVSVV^ 120 



NOV89 : 



271 



314 



GILGYSEHQWSSNFNSTDTHSSTFDAGAGIALNDHFVKLISWY 
GILGY+E +VSS+F D HSS FDA A I LND+FVKL++WY 
121 GILGYTEDPLVSSDFIG-DPHSSIFDAKATIVLNDNFVKLVAWY 163 
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NAD-dependent glyceroI-3-phosphate dehydrogenase (EC 1.1.1.8) (GPD) catalyzes 
the reversible reduction of dihydroxyacetone phosphate to glycerol-3-phosphate. It is a 
cytoplasmic protein that is active as a homodimer, each monomer containing an N-terminal 
NAD binding site. In insects, it acts in conjunction with a mitochondrial alpha- 
5 glycerophosphate oxidase in the alpha-glycerophosphate cycle, which is essential for the 
production of energy used in insect flight. 

GlyceraIdehyde-3-phosphate dehydrogenase (EC 1.2.2.12) (GAPDH) mRNA levels, 
protein, and enzymatic activity increase in 3T3-F442A adipocytes after exposure to 
physiological concentrations of insulin (Alexander, M., Curtis, G., Avruch, J., and Goodman, 

10 H. (1985) J. Biol. Chem. 260, 1 1978-1 1985). In order to understand the mechanism of this 
regulation, researchers isolated and sequenced 5.4 kilobase pairs of a 12-kilobase pair human 
genomic clone encoding a functional GAPDH gene. The gene consists of 9 exons and 8 
introns with eukaryotic signals necessary for the transcription and translation of GAPDH 
mRNA. The exon sequence confirms previously published cDNA sequences for human 

15 GAPDH in muscle, liver, and erythrocytes. The organization of the human and the unique 

chicken GAPDH genes is strikingly similar. Although chicken exons VIII-XI have been fused 
into human exon 8, introns which separate exons encoding the NAD binding, catalytic, and 
helical domains of the GAPDH protein have been retained. Stable transfection of rodent cells 
with the intact human GAPDH gene resulted in the expression of a correctly initiated human 

20 GAPDH mRNA and an enzymatically active human GAPDH polypeptide. Thus, the gene 
contains a functional promoter and intact coding sequences. Although many processed 
GAPDH pseudogenes and GAPDH-like sequences are present in the human genome, Southern 
blot analysis of human genomic DNA using a probe derived from the 3'-untranslated region of 
the GAPDH gene detected only two genes, a 10-copy processed pseudogene and a single copy 

25 of the isolated gene. In contrast, a probe derived from an intron segment of the isolated gene 
detected only a single copy of the GAPDH gene. Collectively, these findings strongly suggest 
that the human genome encodes a single functional GAPDH gene. 

Hopkinson et al. (1974) presented evidence that glycerol-3 -phosphate dehydrogenase 
(EC 1.1.1 .8) is a dimer of dissimilar subunits. Electrophoretic variants at each of two loci, 

30 designated GPD 1 and GPD2, were described. By the method of somatic cell hybridization, 
Kielty and Povey (1982) assigned the presumed structural gene for alpha-glycerophosphate 
dehydrogenase to chromosome 12. Since this is a liver-specific enzyme, a rat hepatoma cell 
line was used as one of the 'parents 1 in the hybridization. 
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NOV89 is predicted to be expressed in at least the following tissues: liver. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV89 is provided in Example 2. 
5 The NOV89 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation as well as other diseases, disorders 
and conditions. NOV89 nucleic acids encoding the Glycerol-3-Phosphate Dehydrogenase-Iike 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 

1 0 wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a Glycerol-3-Phosphate 
Dehydrogenase-like protein includes the nucleic acid whose sequence is provided in Table 
89 A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 89A while still 

1 5 encoding a protein that maintains its Glycerol-3-Phosphate Dehydrogenase-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 89 A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 

20 thereto, whose structures include chemical modifications. Such modifications include, by way 
of non-limiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or variant 

25 nucleic acids, and their complements, up to about 13% of the bases may be so changed. 

The novel protein of the invention includes the Glycerol-3-Phosphate Dehydrogenase- 
like protein whose sequence is provided in Table 89B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 89B while still encoding a protein that maintains its Glycerol-3-Phosphate 

30 Dehydrogenase-like activities and physiological functions, or a functional fragment thereof. In 
the mutant or variant protein, up to about 15% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
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prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV90 

5 The disclosed NOV90 (alternatively referred to herein as CG56888-01) includes the 

1686 nucleotide sequence (SEQ ID NO:289) shown in Table 90A. A NOV90 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 58-60 and ends with a stop codon 
at nucleotides 1630-1632. The disclosed NOV90 maps to human chromosome 3. 



Table 90A. NOV90 Nucleotide Sequence (SEQ ID NO:289) 

GTGCCTCACTGGGCTGGGGAACTTGCCTCA 

GAGAACTGGAAGACMGCCTCCAGCACCTCCCGTGCGAATGAGCAGGACCATCTTTAGCACTGGGGGCAA 
AGACCCTTTGTCMCCAATCACAGTTTGAAACCTTTGCCTT 

AAAATCATCTC CATATTCTCAGG CACAGAGAAAGGAAGTAAAAAGAAAG AAAAGGAACGG CCAGAAATTT 

CTCCTCCATCTGATTTTGAACACACCATCCATGTTGGCTTTGATGCTGTTACTGGAGAATTCACTGGCAT 

G CCAGAACAGTGGGCTCGATTACT ACAG ACC TC CAATATCACCAAACTACAG CAAAAGAAGAATC CTCAG 

GCTGTGCTGGATGTCTACGACTCCAAOV^GTGAAGCAGAAGTATCTGAGTTTTACTCCT 

ATGGCTTCCCTTCTGGAACACCAGCACTGAATGCCGAGGGAACAGAAGCACCTGCAGTAGTGACAGAGGA 

GGAGGACGATGATGAAGAGACTGCCCCTCCCATTATTGCCCCACCACCGGATCATATGAAATCAATTrAC 

ACACGGTCTGTAATTGACCCTGTTCCTGCACCAGTTGGTGATTCAAATGTTGATGGTGGTGCCAAGTCT 

TAGACAAACAGAAAAAGAAGACTAAGATGACAGATGAAGAGATTATGGAGAAACTAAGAACTATTGTGAG 

CATAGGTGACCCTAAGAAAAAAAGAAAAAAATATACAAGATATGAAAAAATTGGACAAGGGGCTTCTGGT 

ACAGTTT TC ACTG CT ACTGACGTTGCACTGGGACAG AAGGTTGC TAT CAAAC AAATTAATTTACAGAAAC 

AGCCAAAGAAGGAATTGATCATTAATGAGATTCTGGTAATGAAAGAATTAAAAAATCC 

CTTCTTGGACAGTTACCTGGTAGGAGATGAATTGTTTGTGGTCGTGGAATACCTTGCTAGGGGGTCACTC 

ACTG ATGTGGTAACAGAAACC TG CATGGATGAAG CACAG ATTG C CGCTGTATG CAGAGAGAGTTT ACAGG 

CATTGGAGTTTTTACATGCTAATCAAGTGATCCACAGAGACATCAAAAGTC 

GGAAGGATCGGTTAAGCrCACTGACTTTGGTTTCTGTGCCCAGATCACCCCTGAGCAGAG(^^CGGAGT 
ACCGTGGT CAGAACGC CAT ACTGGATGG C AC CAGAAGTGGTTACACGGAAGG CTT ATGGCCCTAAAGT CA 
ATGTATGGT CTCTGGGT ATCATGG CTACTG AGATGGTAGAAGGAGAG CCTCCATACCTCAATGAAAAT C C 
CTTGAGGGCCTTGTGCCTAATAGGAACTAATGGAATCCCAGAACT^ 

ATATTTCG GGATT TCTTAAATCGATGTTTGGAAACAGATGTGGAAAAAAGGGGTTCAGCC^ 
TACAGCATCTUTuCCTGAAACTAGCCAAACTGTTAT CTAGCTTGACACCACTGAT CATGGCAGCTAAAGA 

AGC^TGAAGAGTAACCGTTAACATCACTGCTGTGGCCTC^TATTCTTTTTTCCATTTTCTACAAGAAGC 
CTTTTA 



A NOV90 polypeptide (SEQ ID NO:290) encoded by SEQ ID NO:289 is 524 amino 
acids in length and is presented using the one-letter amino acid code in Table 90B. The Psort 
profile for NOV90 predicts that this sequence is likely to be localized to the nucleus with a 
certainty of 0.7000. 



Table 90B. NOV90 Polypeptide Sequence (SEQ ID 
NO:290) 

MCDNRELEDKPPAPPVRMSRTI FSTGGKDPLSANHSLKPLPS VPEEKKPRHKI IS I FSGT 

EKGS KKKEKERPE I S PPSDFEHT IHVGFD AVTGE FTGMPEQWARLLQTSNT TKLQQKKNP 

QAVLDVYDSNTVKQKYLSFTPPEKDGFPSGTPALNAEGTEAPAVVTEEEDDDEETAPPII 

APPPDHMKS I YTRSVIDPVPAPVGDSNVDGGAKSIJ>KQKKKTKMTDEEIMEICLRTI VS IG 

DPKKKRKKYTRYEKIGQGASGTVFTATDVALGQKVAI KQINLQKQPKKELI INEILVMKE 

LKNPNIVNFIJDSYLVGDELFVVVEYLARGSLTDVVTETCMDEAQIA^ 

ANQVIHRDI KSDS VLLGMEGS VKLTDFGFCAQ I TPEQSKRSTWRTP YWMAPEWTRKAY 

GPKVNWSLGIMATEMVEGEPPYLNENPI^^ 

LETDVEKRGSAKEIiLQHLFLKIiAKLLSSLTPLlMAAKEAMKSNR 
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A BLAST analysis of NOV90 was ran against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV90 had high 
homology to other proteins as shown in Table 90C. 

5 



Table 90C BLASTX results from PatP database for NOV90 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAB03969 p~21 activated protein kinase (PAK2) 


2494 


6.5e-259 


patp:AAW95521 Protease activated protein kinase I (PAK I) 


2459 


3.3e-255 


patp:AAR96296 Human p21-protein activated serine kinase 


2361 


8.0e-245 


patp:AAWl3379 Human p2l activated serine kinase p65 protein 


2361 


8.0e-245 


patp:AAW47H9 Human p2l-activated serine kinase p65 


2361 


8.0e-245 



In a search of public sequence databases, it was found, for example, that the amino 
acid sequence of the NOV90 protein of the present invention was found to have 503 of 524 
amino acid residues (95%) similar to, the 524 amino acid residue ptnnSWISSNEW- 
10 ACC:Q13177 protein from Homo sapiens (Human) (SERINE/THREONINE-PROTEIN 

KINASE PAK 2 (EC 2.7.1.-) (P21 -ACTIVATED KINASE 2) (PAK-2) (PAK65) (GAMMA- 
PAK) (S6/H4 KINASE)). NOV90 also has homology to the other proteins shown in the 
BLASTP data in Table 90D. 



Table 90D, NOV90 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi| 3041712 |sp 
|Q13177|PAK2_ 
HUMAN 


SERINE /THREONINE - PROTEIN 
KINASE PAK 2 (P21- 
ACTIVATED KINASE 2) (PAK- 
2) (PAK65) (GAMMA- PAK) 
(S6/H4 KINASE) 


524 


493/527 
(93) 


503/527 
(94) 


0.0 


gi|4505599|re 
f |NP_002568.l' 
| (NMJJ02577) 


p2i (CDKN1A) -activated 
kinase 2; novel serine 
kinase; hPAK65 
[Homo sapiens] 


525 


493/528 
(93) 


503/528 
(94) 


0.0 


gi | 16758002 |r 
ef |NP_445758. 
1| (NM 053306) 


p2 1 ( CDKN1A) - act ivated 
kinase 2 

[Rat t us norvegicus] 


524 


483/527 
(91) 


498/527 
(93) 


0.0 


gi|2499647|sp 
|Q29502|PAK2_ 
RABIT 


SERINE/ THREONINE - PROTEIN 
KINASE PAK 2 <P2l- 
ACTIVATED KINASE 2) 
(PAK-2) (GAMMA -PAK) 
( P2 1 -ACTIVATED PROTEIN 
KINASE I) (PAKI) 


524 


484/527 
(91) 


498/527 
(93) 


0.0 


gi|6288680|gb 
|AAF06695.l|U 
19967_1 
(U19967) 


PAK2 [Rattus norvegicus] 


524 


481/527 
(91) 


498/527 
(94) 


0.0 



721 



WO 02/068649 PCT/US02/02785 



This BLASTP data is displayed graphically in the Clustal W in Table 90E. A multiple 
sequence alignment is given, with the NOV90 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 90D. 



Table 90E. ClustalW Alignment of NOV90 



NOV90 



3041712 | 
4505599 | 
16758002 | 
2499647) 
6288680) 



NOV90 

gi 
gi 
gi 
gi 
gi 



3041712) 
4505599 I 
16758002) 
2499647) 
6288680) 



NOV90 


gi 


3041712) 


gi 


4505599| 


gi 


16758002) 


gi 


2499647) 


gi 


6288680 j 



NOV90 


gi 


3041712) 


gi 


4505599| 


gi 


16758002| 


gi 


2499647 | 


gi 


6288680) 



NOV90 


gi 


3041712) 


gi 


4505599) 


gi 


16758002 | 


gi 


2499647 | 


gi 


6288680) 



NOV90 



3041712) 
4505599) 
16758002 | 
2499647 | 
6288680 



(SEQ ID NO:290) 
(SEQ ID NO: 749) 
(SEQ ID NO; 750) 
(SEQ ID NO: 751) 
(SEQ ID NO: 752) 
(SEQ ID NO:753) 

10 



20 30 40 50 
. | | | | | 1 | 



MgDNgELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPR 
MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPR 
MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPR 
MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPR 
MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPR| 
MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPR! 





60 


70 


80 


90 


100 




1 











jkiisifsgtekgskkkekerpeisppsdfehtihvgfdavtgeftgmpe 
gkiisifsgtskgskkkekerpeisppsdfehtihvgfdavtgeftgmpe 
™kiisifsgtekgskkkeker?eisppsdfehtihvgfdavtgeftgmpe 
kiisifs|tekgskkkeker?eisppsdfehtihvgfdavtgeftgmpe 
k 1 1 s i fsgtekg s kkkekerpe i s p jpsdfeht x hvg fdavtge ftgmp ej 
^kiisifsgtekgskkkekerpsisppsdfestikvgfdavtgeftgmpe 1 



110 



120 



130 



I 



I. 



140 

..L . 



I 



150 
■ • I 



QWARLLQTSNITKLgQKKNPQAVLDV^H YD SNTVKQKYLSFTPPEKDGF 

QWARLLQTSNITKLEQKKNPQAVLDVLKFYDSNTVKQKYLSFTPPEKDGr 

QWARLLQTSNITKLEQKKNPQAVLDVLKFYDSNTVKQKYLSFTPPEKDGi 

QWARLLQTSNITKLEQKKNPQAVLDVLKFYDSNTVKQKYLSFTPPEKDGF 

QWARLLQTSNITKLEQKKNPQAVLDVLKFYDSNTVKQKYLSFTPPEKDGF 

QWARLLQTSNITKLEQKKNPQAVLDVLKFYDSNTVKQKYL5FTFPEKDGF 



160 



170 



180 



190 



200 




250 



260 



270 



280 
• . 1 ; . 



290 



300 
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gytryekigqgasgtvftatdvalgqjsvaikqinlqkqpkkeliineilv 
□ytryekigqgasgtvftatdvalgqevaikqinlqkqpkkeliineilv 
Iytryekigqgasgtvftatdvalgqevaikqinlqkqpkkeliineilv 
Iytryekigqgasgtvftatdvalgqevaikqinlqkqpkkeliineilv 
Iytryekigqgasgtvftatdvalgqevaikqinlqkqpkkeliineilv 
aytryekigqgasgtvftatdvalgqevaikqinlqkqpkkeliineilv 



mkelknpniwfldsylvgdelfwgeylaggsltdwtetbcmdeaqia 
mkelknpnivnfldsylvgdelfwmeylaggsltdwtetocmdeaqia 
mkelknpnivnfldsylvgdelfwmsylaggsltdwtetgcmdeaqia 
^kelknpnivnfldsylvgdelfwmsylaggsltdwtetIcmdeaqia 
mkelknpnivnfldsylvgdelfwmeylaggsltdwtetIcmdeaqia 
mkelknpnivnfldsylvgdelfwmeylaggsltdwtetbcmdeaqia 



AVCRE^LQ ALE FLHANQ V I HRD I KS D§ VLLGMEG S VKLTD FGF CAQ I TP E 
AVCRECLQALEFLHANQVIHRDIKSDNVLLGMEGSVKLTDFGFCAQITPE 
AVCRECLQALEFLHANQVIHRDIKSDNVLLGMEGSVKLTDFGFCAQITPE 
AVCRECLQALEFLHAL^QVIHRDIKSDNVLLGMEGSVKLTDFGFCAQITPE 
AVCRECLQALEFLHANQVIKRDIKSDNVLLGMEGSVKLTDFGFCAQITPE 
AVCRECLQAIiEFLHANQVIHRDIKSDNVLLGMEGSVKLTDFGFCAQITPE 



NOV90 

gi (3041712 | 
gi|4505599| 
gi j 16758002 | 
gij 2499647 | 
gi j 6288680 j 



NOV90 


gi 


3041712) 


gi 


4505599) 


gi 


16758002 


gi 


2499647 | 


gi 


6288680 | 



NOV90 



gi 
gi 
gi 
gi 



3041712 j 
4505599 j 
16758002) 
2499647) 
6288680 



NOV90 

gi 1 3041712 | 
gi j 4505599 | 
gi j 16758002 | 
gi|2499647 | 
gi | 6288680 



NOV90 



gi 
gi 
gi 
gi 
gi 



3041712 | 

4505599) 

16758002) 

2499647) 

6288680) 



NOV90 


gi 


3041712) 


gi 


4505599) 


gi 


16758002) 


gi 


2499647) 


gi 


6288680) 



410 
. . I . . 



420 
..I.. 



430 



440 



450 
..I 



QSKRST2vjgTPYWMAPEVVTRKAYGPKV{JBwSLGIMA3SMVEGEP?YLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRlAYGPKVDIWSLGIMAIEMVEGEPPYLNE 



460 



470 



480 

1 ■ j 



490 



500 



PLRALgLIATNGUPELQNPEyLSPIFRDFLNRCLEyDVEKRGSAKELLQ 
NPLRALYLIATNGTPELQNPEKLSPIFRDFLNRCLEMDVEKRGSAKELLQ 
NPLRAL YL I ATNGT PELQNPEKLSPI FRDFLNR CLE MD VE KRGS AKELL Q 
NPLRALYLIATNGTPELQNPEKLSPIFRDFLNRCLEMDVEKRGSAKEIiQ 
NPLRALYLIATNGTPELQNPEKLSPIFRDFLNRCLEMDVEKRGSAKELLQ 
NPLRAL YLIATNGT PELQNPEKLSPI FRDFLNR CLEMDVEKRGS AKeLlQ 



510 



520 



HjjFLKLAKjjLSSLTPLIMAAKEAMKSNR 
HPFLKLAKPLSSLTPLIMAAKEAMKSNR 

hpflklakplssltplimaakeamksnr 
hpflklakplssltpli3aakeamksnr 
hpflklakplssltplimaakeamksnr 
hpflklakplssltplimaakeamksnr 



Table 90F lists the domain description from DOMAIN analysis results against NOV90. 
This indicates that the NOV90 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 90F. Domain Analysis of NOV90 
gnllSmartlsmart00220. S TKc, Serine/Threonine protein kinases, catalytic domain; Phosphotransferases. 
Serine or tore on in e-speciflc kinase subfamily. SSQ IB NO: 882 

CD-Length = 256 residues, 94.9% aligned 
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.... ____ 


Score = 


251 bits (640), Expect « le-67 




NOV90: 


255 


I GQGASGTVFTATDVALGQKVAIKQINLQKQPKK- - ELIINEI LVMKEL KNPNI VNFLDS 


312 






+G+GA G V+ A D G+ VAIK I +K KK E 1+ EI ++K+L +PNIV D 




Sb j Ct : 


7 


LGKGAFGKVYLARDKKTGKLVAI KVI KKEKLKKKKRER I LRE I K I LKKLDHPNI VKLYDV 


66 


NOV90 : 


313 


YLVGDELFVWEYIiARGSLTDWTET - CMDEAQI AAVCRESLQALEFLHANQVIHRDI KS 


371 






+ D+L++V+EY G L D++ + + E + R+ L ALE+LH+ +IHRD+K 




Sbjct: 


67 


FEDDDKLYLVMEYCEGGDLFDLLKKRGRLSEDEARFYARQ ILS ALEYLHSQGI IHKDLKP 


126 


NOV90: 


372 


DSVLLGMEGSVKLTDFGFCAQITPEQSKRSTVVRTPYWMAPE 


431 • 






TtTUL) tu VAij Ur \j y+ + +x V IP +MAPEV+- K YG V++WSLG+ 




Sbjct: 


127 


ENILLDSDGHVKLADFGLAKQLDSGGTLLTTFVGTPEYMAPEVIi 


186 


NOV90 : 


432 


MATEMVEGEPPYLNENPLRALCLIATNGI PELQNPE - 1XSPI FRDFLNRCLETDVEKRGS 


490 


Sbjct: 




+ E++ G+PP+ ++ L AL P PE +SP +D + + L D EKR + 


187 


ILYELLTGKPP FPGDDQLIJ^FKKIGKPPPPFPPPEWKI SPEAIOLI KKLLVKDPEKIUjT 


246 


NOV90 : 


491 


AKE 493 








A+E 




Sbjct: 


247 


AEE 249 





Serine/threonine kinases are an extensive family of enzymes that catalyzes the 
phosphorylation of serine or threonine residues on its target protein. Protein kinases share a 
conserved catalytic core common to both serine/ threonine and tyrosine protein kinases. This 
5 domain contains residues, which are specific to the distinct types of protein kinases 

The S6/H4 kinase purified from human placenta catalyzes phosphorylation of the S6 
ribosomal protein, histone H4, and myelin basic protein. In vitro activation of the p60 S6/H4 
kinase requires removal of an autoinhibitory domain by mild trypsin digestion and 
autophosphorylation of the catalytic domain (p40 S6/H4 kinase). The two 

1 0 autophosphoryiation/autoactivation sites contain the sequences SSMVGTPY (site 1 ) and 
SVIDPVPAPVGDSHVDGAAK (site 2). These sequences identify S6H4 kinase as the rac- 
activated PAK65 (Martin, G. A., Bollag, G., McCormick, F. and Abo, A. (1995) EMBO J. 14, 
1971-1978). Site 1 phosphorylation is most rapid, but activation does not occur until site 2 is 
autophosphorylated. The site 1 phosphorylation occurs by an intramolecular mechanism 

1 5 whereas site 2 autophosphorylation occurs by an intermolecular mechanism. A model is 

proposed in which phosphorylation of sites 1 and 2 occurs sequentially. The model proposes 
that trypsin treatment of the inactive holoenzyme removes an inhibitory rac-binding domain 
which blocks MgATP access to the catalytic site. The pseudosubstrate domain at site 1 is 
autophosphorylated and subsequent bimolecular autophosphorylation at site 2 fully opens the 

20 catalytic site. Phosphorylation by a regulatory protein kinase may occur at site 2 in vivo. 

NOV90 is predicted to be expressed in at least the following tissues: brain, cerebellum, 
skeletal muscle, ovary, thymus and spleen. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
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SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV90 is provided in Example 2. 

The NOV90 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
5 Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

10 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV90 nucleic acids encoding the PAK2-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a serine/threonine-protein kinase 

15 PAK 2-like protein includes the nucleic acid whose sequence is provided in Table 90A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 90A while still encoding a 
protein that maintains its serine/threonine-protein kinase PAK 2-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 

20 nucleic acids whose sequences are complementary to the sequence of Table 90A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

The novel protein of the invention includes the serine/threonine-protein kinase PAK 2- 
like protein whose sequence is provided in Table 90B. The invention also includes a mutant or 

30 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 90B while still encoding a protein that maintains its serine/threonine-protein kinase 
PAK 2-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 6% of the amino acid residues may be so changed. 
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These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
5 below. 

NOV91 

The disclosed NOV91 (alternatively referred to herein as CG56779-01) includes the 
404 nucleotide sequence (SEQ ID NO:291) shown in Table 91 A. A NOV91 ORF begins with 
10 a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon at 
nucleotides 373-375. The disclosed NOV91n maps to human chromosome 3. 



20 



Table 91A. NOV91 Nucleotide Sequence (SEQ ED NO:291) 



AATGGGTCTCTTTTrCTACAGCCCATGCCCTTC 
CTGCAGGGTTGGAGAAATGGCTCTGCGCCACAGCCTC^ 

GATGGGTGTAGCGGGCCTGACCATGGTGCTGAGTAGGTCCACTGAGCCCTGGGCGCAGCTGTTCATCTCC 
TCCACCAGCATGATGGACACCACTGAGGAGAACCGCAGCCAC^ 

AGGAGCTGGCCCTGGGCCAGGACCAGATAATTTTCCACTTTTCCCCCCTGGAGCCCTGGCAG 
GAAGGGGATGGTCATAACTTTTTAGTGACTGGCCTCGAGGGATCCAGGGCATCT 



A NOV91 polypeptide (SEQ ID N0.292) encoded by SEQ ID NO:291 is 1 16 amino 
15 acids in length and is presented using the one-letter amino acid code in Table 91B. The Psort 
profile for NOV91 predicts that this sequence has no signal peptide and is likely to be 
localized to microbodies with a certainty of 0.6400. In alternative embodiments, a NOV91 
polypeptide is located to the cytoplasm with a certainty of 0.4500. 



Table 91B. NOV91 Polypeptide Sequence (SEQ ID 
NO:292) 



MPFLELDTNLPANQVPAGLEKWLCATASIIXSKPK^ 
ISSTSMMDTTEENRSHSTHFFEFLTEEI^^ 



A BLAST analysis of NOV91 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV91 had high 
homology to other proteins as shown in Table 91 C 



Table 91C BLASTX results from PatP database for NOV91 



Smallest 
Sum 

High Probabability 
Sequences producing High-scoring Segment Pairs: Score P(N) 
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patp:AAR83048 Human macrophage migration inhibitory factor 


399 


6 


.5e- 


-37 


patp:AAY44997 Human D-dopachrome tautoraerase (DDT) 


369 


9 


.8e- 


•34 


patp:AAB43 733 Human cancer associated protein sequence 


262 


2 


.le- 


-22 


patp:AAM22H0 Peptide #8544 encoded by probe 


183 


5 


.0e- 


-14 


patp:AAM38563 Peptide #12600 encoded by probe 


183 


5 


.0e- 


•14 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 283 of 289 bases (97%) identical to a gb:GENBANK- 
ID:AP000500|acc:AP000500.1 mRNA from Homo sapiens (genomic DNA, chromosome 
5 3p21.3, clone:603 to 320, anti-oncogene region, section 3/3). The full amino acid sequence of 
the protein of the invention was found to have 82 of 1 17 amino acid residues (70%) identical 
to, and 91 of 1 17 amino acid residues (77%) similar to, the 1 18 amino acid residue ptnnpir- 
id:JE0162 protein from human (dopachrome Delta-isomerase (EC 5.3.3.12)). NOV91 also has 
homology to the other proteins shown in the BLASTP data in Table 91D. 

10 



Table 91D. NOV91 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
<aa) 


Identity 

(%) 


Positive 
f%) 


Expect 


gi|450329l|re 
f |NP_001346.1 
| (NM_001355) 


D-dopachrome tautomerase 
[Homo sapiens] 


118 


82/117 
(70) 


91/117 
(77) 


le-38 


gi|4699610|pd ■ 
b|lDPT|A 


Chain A, D- Dopachrome 
Tautomerase 


117 


81/116 
(69) 


90/116 
(76) 


5e-38 


gi 


7512375 |pi 
G02438 


D-dopachrome. tautomerase 
- human 


118 


80/117 
(68) 


89/117 
(75) 


7e-38 


gi 

ef 
l| 


13162287 |r 
NPJ)77045. 

1 024131) 


D-dopachrome tautomerase 
[Rat t us norvegicus] 


118 


71/117 
(60) 


94/117 
(79) 


6e-34 


gi| 6753618 | re 
f |NP_034157.1 
| (NM 010027) 


D-dopachrome tautomerase 
[Mus musculus] 


118 


67/117 
(57) 


92/117 
(78) 


2e-32 



This BLASTP data is displayed graphically in the ClustalW in Table 9 IE. A multiple 
sequence alignment is given, with the NOV91 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 91D. 



Table 91E. ClustalW Alignment of NOV91 


NOV91 


(SEQ ID NO:292) 


gi 


4503291 | 


(SEQ ID NO: 754) 


gi 


4699610) 


(SEQ ID NO: 755) 


gi 


7512375) 


(SEQ ID NO: 756) 


gi 


13162287 


| (SEQ ID NO: 757) 


gi 


6753618 | 


(SEQ ID NO: 758) 






10 20 30 40 50 

I | | | | | | | | | 
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Table 91F lists the domain description from DOMAIN analysis results against NOV91 . 
This indicates that the NOV91 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 91F. Domain Analysis of NOV91 



gn1jPfam|pfam01187. MIF, Macrophage migration inhibitory factor (MIF). SEQ ID NO: 883 
Score - 



CD-Length » 114 residues, 100.0% aligned 
126 bits (316), Expect = 8e-31 



NOV91: 2 PFLELDTNLPANQVPAGLEKWLCA-TASILGKPKDHVNMMGVAGLTMVLSRSTEPWAQLF 60 

P +DTNLPAN VPAG EK L A A LGKP+D + + G MV ST+P A 
Sb j Ct : 1 PMFTIDTNLPANSVPAGFEKRLTAALAKALGKPEDRIAVHIRPGQAMVFGGSTDPCAVCS 6 0 

NOV91: 61 ISSTSMMDTTEENRSHSTHFF^FLTEELALGQDQIIFHFSPLEPWQTGKKGMVIT 115 

I S ++ E+NRSHS F+FL +EL L +D++ F LE Q G G + 
Sbjct: 61 IKS I GW - GAEQNRS HSALLFKFIiAKELGLPKDRVYI RFFDLEAAQVGFNGTTMA 114 



D-Dopachrome tautomerase (DDT) shares a low homologous amino acid sequence 
(33% homology) with the macrophage migration inhibitory factor (MIF) yet possesses similar 
tautomerase activity. MIF is a cytokine involved in inflammatory reactions and immune 
10 responses. While MIF is a secreted protein, it is not processed from a larger precursor. 

Whereas recent studies have identified MIF as a pituitary hormone and immunoregulator, less 
is known about the structural basis of these physiological functions and the real significance, of 
tautomerase activity. D-dopachrome tautomerase, which is related to MIF, is a mammalian 
cytoplasmic enzyme involved in melanin biosynthesis that tautomerizes 2-carboxy-2,3- 
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dihydroindoIe-5, 6-quinone (D-dopachrome) with concomitant decarboxylation to give 5,6- 
dihydroxyindole (DHI). It is a protein of 1 17 residues, and acts as a homotrimer. 

NOV91 is predicted to be expressed in at least the following tissues: largely in the 
liver, and to lesser extent in other organs, including the heart, lung, pancreas; and placenta. 
5 This information was derived by determining the tissue sources of the sequences that were 
included in the invention including but not limited to SeqCalling sources, public EST sources, 
literature sources, and/or RACE sources. Further expression data for NOV91 is provided in 
Example 2. 

The NOV91 nucleic acids and proteins are useful in potential therapeutic applications 

10 implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 

1 5 thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV91 nucleic acids encoding the D-Dopachrome tautomerase-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

20 The novel nucleic acid of the invention encoding a D-dopachrome tautomerase-like 

protein includes the nucleic acid whose sequence is provided in Table 91 A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 91 A while still encoding a protein 
that maintains its D-dopachrome tautomerase-like activities and physiological functions, or a 

25 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

30 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or vaimut muaeic acids, and their complements, up to about 3% of the residues 
may be so changed. 

The novel protein of the invention includes the D-dopachrome tautomerase-like protein 
whose sequence is provided in Table 91B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
9 IB while still encoding a protein that maintains its D-dopachrome tautomerase-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 30% of the bases may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV92 

The disclosed NOV92 (alternatively referred to herein as CG56904-01) includes the 
1311 nucleotide sequence (SEQ ID NO:293) shown in Table 92A. A NOV92 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 19-21 and ends with a stop codon 
at nucleotides 1282-1284. 



Table 92A. NOV92 Nucleotide Sequence (SEQ ID NO:293) 

GGAGCTCCCACACTTTCAM 
GGCCCCCGCCCGTCGCCOT 

CCGGGCCTGTCCCCTGCGCTGCTCCTGCCCCCGAGTCGACACTGTGGACT 

GTGTTCCCGGACAACATCACCAGAGCCGCTCAGCAC 

CCTACAATGAGCTGTCCCGCCTCAGT0H3CCT 

AGGCCTGCCTGACGAGGCCTTCGAGTCCCTCACCCAGCT 

AATCTCATCTCCAAGGTGCCCCGAGGAGCC^ 

ACAACCAGCTGACAGACAGTGGCCTGGATGCCACCACCTTCAGCAAG 

TCTCTCCCACAACCAGCTGACCACAGTGCCC^ 

CGCAACCGCATCCGGCAGGTGGAGGCGGCTCGGCTGCACGGGGCGCGTGGTCTGTO 

AGCACAACCAGCTGGGGAGCTCAGGGCTGCCCGCCGGGGCTCT 

GCACCTCGATGGCAATGGGCTGGACCGCGTGCCT 

CCCCACAACCACGTGGCCGCGCTGGGTGCCCGTGA 

TGGCCTATAACCGCCTGGCCAGCGCCCGTGTGCACCACC^ 

CCTCGACCTGGCAGGGAATCAGCTAACCCGGCTGCCCATGGGCCTGCCCACTGGCCT 

CTGCAACGCAACCAGCTGCGGATGCTCGAGCCCGAGCCTCT 

GCCTGGCGCACAACCGGCTCCGGGTCGGCGACATCGGGCCAGGCACCT 

GGTCAGGCACAGGCTGGTTAGCCACACTX^CCCCAGGGCCCCTCCATCCCC 

CCAAACATTCTAGTTAGCTGGTAAAGCAATCAGAACAAGAAAATGATAAGA 



A NOV92 polypeptide (SEQ ID NO:294) encoded by SEQ ID NO:293 is 421 amino 
acids in length and is presented using the one-letter amino acid code in Table 92B. The Psort 
profile for NOV92 predicts that this sequence has a signal peptide and is likely to be secreted 
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with a certainty of 0.441 9. The Signal P predicts a likely cleavage site for a NOV92 peptide is 
between positions 24 and 25, i.e., at the dash in the sequence VAG-LE. 



Table 92B. NOV92 Polypeptide Sequence (SEQ ID 
NO:294) 



MGRPTQWPSLLIil^I^GPPPVAGLEDAAFPHl^ESLQPLPRACPLRCSCPRVDTVDCDGL 

DLRVFPDNITI^QHLSLQl^QLQELPYiraLSiajSGLRTLOTiHNNLISSEGLPDBAFESLT 

QLQHLCVAHNKNNLISKVPRGALSRQTQLRELYLQHNQLTDSGLDATTFSKLHSLEYLDLS 

HNQLTTVPAGLPRTLAILHLGR1TOIRQVEAARLHG 

LRGLHTLHLDGNGLDRVPPALPRRLRALVLPHNHV 

ARVHHRAFRRLRALRSLDIAGNQLTRLPMGLPTGLRTLQLQRNQLRMLEPEPI^GIJ)QLRE 
LSLAHNRLRVGDIGPGTWHELQALQVRHRLVSHTVPRAPPSPCLPCHVPNIIjVSW 



A BLAST analysis of NOV92 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV92 had high 
homology to other proteins as shown in Table 92C. 



Table 92C. BLASTX results from PatP database for NOV92 






Smallest 






Sum 




High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp:AAY13396 Amino acid sequence of protein PR0332 


690 


4.9e-70 


patp:AAB33425 Human PR0332 protein 


690 


4.9e-70 


patp:AAB80264 Human PR0332 protein - Homo sapiens , 642 aa. 


690 


4.9e-70 


patp:AADl2356 Human PR0332 polypeptide sequence 


690 


4.9e-7Q 


patp:AAM41258 Human polypeptide SEQ ID NO 6189 


334 


5.0e-30 



10 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 1310 of 1312 bases (99%) identical to a gbrGENBANK- 
ID:AK027100|acc:AK027100.1 mRNA from Homo sapiens (cDNA: FLJ23447 fis, clone 
HSI03346). The full amino acid sequence of the protein of the invention was found to have 
290 of 291 amino acid residues (99%) identical to, and 290 of 291 amino acid residues (99%) 

1 5 similar to, the 363 amino acid residue ptnr.TREMBLNE W-ACC:B AB 1 5657 protein from 
Homo sapiens (Human) (CDNA: FU23447 FIS, CLONE HSI03346). NOV92 also has 
homology to the other proteins shown in the BLASTP data in Table 92D. 



Table 92D. NOV92 BLASTP results 



Gene Index / 


Protein / Organism 


Length 


Identity 

(%> 


Positive 


Expect 


Identifier 




(aa) 


(%) 





731 



WO 02/068649 



PCT/US02/02785 



gi|l3376224|r 
ef |NP_079101. 
l| (NnTo24825) 


hypothetical protein 
FLJ23447 [Homo sapiens] 


363 


290/291 
(99) 


290/291 
(99) 


e-136 


gi| 11761721 |g 
b|AAG40157.l| 
AF247 822_1 
(AF247822) 


decor in 

[Oreochromis niloticus] 


359 


87/281 
(30) 


137/281 
(47) 


4e-27 


gi | 6759315 |db 
j |BAA90246.l| 
(AB037269) 


biglycan [Xenopus 
laevis] 


368 


90/289 
(31) 


134/289 
(46) 


le-26 


>gi| 8134605 |s 
p|Q9XSD9|PGS2 
PIG 


BONE PROTEOGLYCAN II 
PRECURSOR (PG-S2) 
(DECORIN) 


360 


94/298 
(31) 


144/298 
(47) 


4e-25 


gi| 129949 |sp| 
P21793|PGS2_B 
OVIN 


BONE PROTEOGLYCAN II 
PRECURSOR (PG-S2) 


360 


88/292 

(30) f 


140/292 
(47) 


6e-25 



This BLASTP data is displayed graphically in the ClustalW in Table 92E. A multiple 
sequence alignment is given, with the NOV92 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 92D. 



Table 92E. ClustalW Alignment of NOV92 



NOV92 (SEQ ID NO: 294) 



gi 
gi 
gi 
gi 
gi 



13376224) (SEQ ID NO: 759) 

11761721) (SEQ ID NO: 760) 

6759315) (SEQ ID NO: 761) 

8134605) (SEQ ID NO:762) 

129949 | (SEQ ID NO:763) 
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NOV92 



13376224) 
11761721 | 
6759315 | 
8134605 I 
129949| 



GRPTQWPST.LI.gLL§PGP] 

.WTFECS RTTS - P! 

!Ad3SLL--r|VTA( 

" »cs---cil^i: 
tikfll---Baq^s 

-liFLL---gA^S 
60 




30 



40 



|....|....|. 



50 



S---TS--PCRTTSSRNSP T-MSC 

SflST 




70 



80 



90 



100 



NOV92 


gi 


13376224 


gi 


11761721 


gi 


6759315) 


gi 


8134605 j 


gi 


129949) 



prvdtvdcdgldlrv: 

P--ASVACE 



-~m- 




110 



--TSTTTSS--P 

SPLPPVfG GPK 

IIPTESIPDVGLPP- - -MDL 

IDRFPEVPEfePL GPW 

EHFPEVPHTEPM GPV 

120 



srHsgHt 

KAjLTRPS — 




NOV92 L 


SjJHNN 


LBS 


S 


gi 


13376224) 




-SPS 


gi 


11761721) 


SDLGL 


KAS? 


P 


gi 


6759315) 


SDLGL 


Tsi 


P 


gi 


8134605) 


SDLGL 




P 


gi 


129949 | 


SDLGL 


E 4? 


P 




130 

|....|....|_.| 
EAFESLTQLQHLCT" 
S--CSTSAW 
E 



150 



DTfjLLDLQNNKITEIKggDFK 
2DT«LLDLQNNKITEIKfflDFKgL 

dtSlldlqnnkiteikSEdfkSlc? 



140 

KVjPRmALSRQTQ 
r $j5?P R @ALSR QTQ 
N0KG 
TN 



dtSlldlqnnkiteikSgdfkSlS 



160 



170 



180 



190 



200 
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NOV92 

gi | 13376224 | 
gi) 11761721 j 
gij 6759315 | 
gi j 8134605 j 
gij 129949 | 




•TOSGLDAT 
ikiXDSGIiDAT' 

.ti--Hhp: 

|XSK--i 
•S^K — MS 
K--RS 




NOV92 B 




gi 


13376224) 




gi 


11761721 j 


LRyHEN 


gi 
gi 


6759315| 




8134605] 


lrSken 


gi 


129949| 


LRgHEN 



NOV92 



13376224 
11761721 
6759315 | 
8134605) 
1299491 



NOV92 

gi | 13376224 J 
gij 11761721 
gi (6759315 
gij 8134605 
gij 129949 




.1- 



360 



NOV92 

gi 
gi 
gi 
gi 
gi 



370 



13376224} 
11761721 j 
6759315j 
8134605] 
129949 | 



380 





410 



420 



NOV92 



gi 
gi 
gi 
gi 



13376224) 
11761721 j 
6759315) 
8134605) 
129949) 



Hl^PRAPPSPCLPCHVPNILVSW 
HT^RAPPSPCLPCHVPNILVSW 




Leucine-rich repeats (LRRs) are relatively short motifs (22-28 residues in length) 
found in a variety of cytoplasmic, membrane and extracellular proteins. Although these 
proteins are associated with widely different functions, a common property involves protein- 
5 protein interaction. Little is known about the 3D structure of LRRs, although it is believed that 
they can form amphipathic structures with hydrophobic surfaces capable of interacting with 
membranes. In vitro studies of a synthetic LRR from Drosophila Toll protein have indicated 
that the peptides form gels by adopting beta-sheet structures that form extended filaments. 
These results are consistent with the idea that LRRs mediate protein-protein interactions and 
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cellular adhesion. Other functions of LRR-containing proteins include, for example, binding to 
enzymes and vascular repair. The 3-D structure of ribonuclease inhibitor, a protein containing 
15 LRRs, has been determined, revealing LRRs to be a new class of alplu^eta fold. LRRs 
form elongated non-globular structures and are often flanked by cysteine rich domains. 
5 NOV92 is predicted to be expressed in at least the following tissues: colon, brain. This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV92 is provided in Example 2. 
The NOV92 nucleic acids and proteins are useful in potential therapeutic applications 

1 0 implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinsons disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 

15 thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions, NOV92 nucleic acids encoding the LRR-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

20 The novel nucleic acid of the invention encoding a secreted leucine-rich repeat (LRR) 

protein-like protein includes the nucleic acid whose sequence is provided in Table 92A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 92A while still encoding a 
protein that maintains its secreted leucine-rich repeat (LRR) protein-like activities and 

25 physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 92A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

30 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the secreted leucine-rich repeat (LRR) 
protein-like protein whose sequence is provided in Table 92B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 92B while still encoding a protein that maintains its Secreted leucine- 
rich repeat (LRR) protein-like activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant protein, up to about 1% of the amino acid residues may be so 
changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV93 

The disclosed NOV93 (alternatively referred to herein as CG56277-01) includes the 
1518 nucleotide sequence (SEQ ID NO:295) shown in Table 93 A. A NOV93 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 2-4 and ends with a stop codon at 
nucleotides 1556-1558. The disclosed NOV93 maps to human chromosome 2. 



Table 93A. NOV93 Nucleotide Sequence (SEQ ID NO:295) 

CATGAGGGGACTACCTATCAGCAGCAGCACCGGCTACGTGGTCGAGGACGGGTTCACTGCGACCGTGCAG 
CAGCrCTTCGCCAGTGACCAGGGACTCACCTAC^ 

TAGCTGATGAGGTGGACCTGACCTCAGCCCTGACCCAACCGGTCACTCTGAAGACGCCGCTGATCTCCTC 
CC CC ATG GACACTGTGAC AGAGGC CGAC CTGGC CAT CGTGATGG CT C TGATGGGAGGTAC TGGTTT CATT 
C ACCACAACTGCAC C C C AG AGTTCCAGG CCAGTGAGGTGCAGAAGGT CAAGAAGTTTGAAC CGGGCTTT A 
TCACACACCCCGTGGTGCTGAGCCCCTTGCACACTGTGGCT 

CTTCTCTGGCATCCC(^TCACTGAGACGGGTACCATGGGCAGCAAGCTGGTGGGCATCGTCACCTCCCGA 

GACATCGACTTTCTTGCTGAGAAGGACCACACCACCCTCCTCM 

TAGTGGTGGCTCC^GCAAGCAGTGTGAGGTTGAAAGAGGCAAATGAGATCCTGCM 

AAAGCTGCCTATCGTCAATGATCGCGATGAGCTGGTGGCCATTATCACCTGCACCGCGCTGAAGAACCGA 

GACTACCCTGTGGCCTCCAAGGATTCCCATGAGCAGCTGCTGGGCGGGGCAGCTGTGGGTACCCATGAGG 

ATGACAAATACCACCTGGACCTGCTCACCCAGGTAGGCGTCAATGTC^TAGGCTTGGACTCGTCCCAAGG 

GAACTCGGTGTATCAGATCGCCATGGTGCATTACATCAAACAAAAGTACCCCCACCTCCAGGTGATTGGG 

GGGAACGTGGTGAGAGCAGCCCAGGCCAACAACCTGATTGATOCTGGTGT 

TGGACTGCGCGGCTGGCTCCATCTACATCAACCAGGAAGTGATAGCCTGCAGTCAGCCCCAGGGCACTGC 

TGTGTACAAGGTGGCCAAGCATACCCAGAACTTTGGTGTGCCCATCATAGCCGATGGTGGCATCCAGACC 

ATGGGGCATGTGGTCAAGGCCCTGGCCCTAGGAGCCTCCACAGTGATGATGGGCTCCCTGCTGGCCGCC^ 

CCATGGAGGCCCCCGGCGAGTGCTTCTTCTCAGACGGAATGCAGCTCAAGAAGTACC^ 

ACTGGATGCCATGGAGAAGAGCAG C AGC AG C CAGAAACAATACTTCAACGACGGGGAT AAGG CGAAGAT C 

ACGCAGGATGTCTTGGGCTCCATCCAGGACAAAGGGTCCATTCAGAAGTTCGTGCCCT 

G CAT CCAG CATGGCTGCCAGGAT ATCGGGGCCCACAGCCTGT CTGT C CT T CGGTCCATGATGT ACT CAGG 

GGAGCTCAAGTTTGAGAAGCAGACCATGTCAGCCCAGATCGACGGTGGCATCCATOT 

GAGAAGTGGCTGTACTGAGGACAGCGGTGCAGGGCGAGATG 
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A NOV93 polypeptide (SEQ ID NO:296) encoded by SEQ ID NO:295 is 518 amino 
acids in length and is presented using the one-letter amino acid code in Table 93B. The Psort 
profile for NOV93 predicts that this sequence is likely to be localized to the cytoplasm with a 
certainty of 0.4500. In alternative embodiments, a NOV93 polypeptide is located to 
5 ly sosomes with a certainty of 0. 1 92 1 , or, to microbodies with a certainty of 0.3346. 



Table 93B. NOV93 Polypeptide Sequence (SEQ ID 
NO:296) 

MRGLPISSSTGYVVEIX5FrATVQQLFASDQGLTYNDFLILPGFIDFIAI>EVDLTSALTQP 

OTLKTPLISSPMDTVTEADIAIVMALMG^ 

VVLSPLHTVGDVLEAKMRHGFSGIPITETGTMG 

MMPRI KLWAPAS S VRLJCEANE I LQLSKKGKLP I VNDRDELVAI I TCTAL KNRDYP VAS K 
DSHEQLLGGAAVGTHEDDKYHLDLLTQVGVNVIGLDSSQGNSVYQIA>WHYIKQKyPHLQ 
VIGGNVVTAAQAN1^IDAGVIX5LGRGMDCAAGSIYINQEVIACSQPQGTAVYKVAKHTQN 
FGVPI IAIXMIQTMGHVVKAIiALGASTVMMGSLLAATMEAPGECFFSI^ 
LDAMEKSSSSQKQYFNDGDKAKITQDVLGSIQDKGSIQKFVPYLIVGIQHGCQDIGAHSL 
SVLRSMMYSGELKFEKQTMSAQIDGGIHGLHSYEKWLY 



A BLAST analysis of NOV93 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV93 had high 
1 0 homology to other proteins as shown in Table 93C. 



Table 93C BLASTX results from PatP database for NOV93 






Smallest 






Sum 


Sequences producing. High- scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAR05431 Chinese hamster IMPDH - Cricetulus sp, 514 aa. 


1874 


3.2e-193 


patp:AAR05432 Human IMPDH - Homo sapiens, 514 aa. 


1872 


5.3e-193 


patp:AAY08965 A. gossypii inosine- monophosphate dehydrogenase 


980 


1.8e-98 


patp:AAG30888 Arabidopsis thaliana protein fragment 


973 


9.7e-98 


patp:AAG43l08 Arabidopsis thaliana protein fragment 


949 


3.4e-95 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1402 of 1567 bases (89%) identical to a gb:GENBANK- 

15 ID:HUMIMPH|acc:J05272.1 mRNA from Homo sapiens (Human IMP dehydrogenase type 1 
mRNA). The full amino acid sequence of the protein of the invention was found to have 438 
of 513 amino acid residues (85%) identical to, and 462 of 513 amino acid residues (90%) 
similar to, the 514 amino acid residue ptnr:SWISSNEW-ACC:P20839 protein from Homo 
sapiens (Human) (INOSINE-S'-MONOPHOSPHATE DEHYDROGENASE 1 (EC 1.1.1.205) 

20 (IMP DEHYDROGENASE 1) (IMPDH-I) (IMPD 1)). NOV93 also has homology to the other 
proteins shown in the BLASTP data in Table 93D. 

736 



WO 02/068649 



PCT/US02/02785 



Table 93D. NOV93 BLASTP results 






Gene Index / 

THpnfifier 

lllVll l IX A 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi|l745330l|r 
ef jXPJ>04 627. 


IMP (inosine 
monophosphate ) 
dehydrogenase 1 
[Homo sapiens] 


514 


440/514 
(85) 


464/514 
(89) 


0.0 


gi | 124417 |sp| 
P20839|IMD1_H 
UMAN 


INOSINE- 5 1 -MONOPHOSPHATE 
DEHYDROGENASE 1 (IMP 
DEHYDROGENASE 1) (IMPDH- 
I) (IMPD 1) 


514 


438/514 
(85) 


462/514 
(89) 


0.0 


gi|4504687|re 
f|NP 000874.1 
| (NMJ)00B83) 


IMP (inosine 
monophosphate) 
dehydrogenase 1; 
SWSS2608 [Homo sapiens] 


514 


434/514 
(84) 


458/514 
(88) 


0.0 


gi | 6754344 | re 
f|NP 035959.1 
| (NM P11829) 


inosine 5' -phosphate 
dehydrogenase 1 
[Mus musculus] 


514 


430/514 
(83) 


462/514 
(89) 


0.0 


gi | 16549223 |d 
bj |BAB70780.1 
) (AK054640) 


unnamed protein product 
[Homo sapiens] 


489 


418/514 
(Bl) 


440/514 
(85) 


0.0 



This BLASTP data is displayed graphically in the ClustalW in Table 93E. A multiple 
sequence alignment is given, with the NOV93 protein being shown on line 1 in a ClustalW 
5 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 93D. 



Table 93E. ClustalW Alignment of NOV93 



NOV93 



(SEQ ID NO: 296) 



gi 
gi 
gi 
gi 
gi 



17453301| (SEQ ID NO: 764) 

124417 | (SEQ ID NO:765) 

4504687) (SEQ ID NO: 766) 

6754344 | (SEQ ID NO:767) 

165492231 (SEQ ID NO:768) 



NOV93 



gi 
gi 
gi 
gi 
gi 



17453301) 
124417 | 
4504687 
6754344 
16549223 



NOV93 



gi 
gi 
gi 
gi 
gi 



17453301| 
124417 | 
4504687 | 
6754344) 
16549223] 



NOV93 



20 



30 



40 



50 




60 



70 



I. 



80 
-I-. 



90 



100 

-I 



VDLTSALTa|BTLKTPLISSPMDTVTEADgAlEMALMGG(jGFIHHNCTPE! 
DLTSALTRKITLKTPLISSPMDTVTEADMAIAMALMGGIGFIHHNCTPE 
VDLTS ALTRKITLKXPL I S S PMDTVTE ADM AIAMALMGGI GF I HHNCT PE 
VDLT S ALTRXITLKT PL IS S PMDTVTEADM AIAMALMGG I GFIHHNCTPE 
\mr.TSALTRKIlTLKTPLIS-5PMDTVTEADMAIAMAL MGGIGFIHHNCTPE 
DLT S ALTRKITLKTPL I S S PMDTVTE ADM AI 



120 
. . I . . 



130 



140 



150 



[tPWLSPr*HTVGDVLEAKMRHGFSGIPITETG 
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fqane vrkvkkfeqgfi tdp wls p shtvgdvle akmrhgfsgi pi tetg 
fqanevrkvkSfeqgfitdpvvlspshtvgdvleakmrhgfsgipitetg 
fqanevrkvk"feqgfitdpwlspshtvgdvleakmrhgfsgipitetg 
fqanevrkvkkfeqgfi tdp wls psht vgdvleak^hgfsgipi t^tg 
ssbbbbbbh kfe q gfi tdp wls p shtvgdvle akmrhgf sgi pitetg 





17453301 


gi 


124417 | 


gi 


4504687) 


gi 


6754344) 


gi 


16549223 


NOV93 


gi 


17453301 


gi 


124417 | 


gi 


4504687] 


gi 


6754344 j 


gi 


16549223 



NOV93 



NOV93 



17453301) 

124417) 

4504687) 

6754344) 

16549223| 



NOV93 

gi 1 17453301 1 
gi 1 124417 | 
gi [4504687 j 
gi | 6754344 j 
gi j 16549223 | 



NOV93 



17453301) 

124417) 

4504687) 

6754344) 

16549223) 



160 170 180 190 
| ....[. . ..)....].. . . | | | | . . 

Ss 



TMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMjJiPRlgLWAP 
"TMGSKLVGI VTS RD IDFLAEKDHTTLLSE VMTPRIEL WAPAG 
TKGSKLVGIVTSRDIDFLAEKDHTTLLSEVMTPRIELWAPAG 
TMGSKLVGXVTSRDIDFLAEKDHTTLLSEVMTPRIELWAPAG 
TMGSKLVGI VTS RDIDFLAEKDHTTLLSE VMTPRIEL WAP AG 
TMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMTPRIELWAPAG 




250 



17453301) 
124417) 
4504687) 
6754344 j 
16549223) 



NOV93 



17453301) 

124417) 

4504687) 

6754344) 

16549223) 



NOV93 


gi 


17453301 


gi 


124417) 


gi 


4504687) 


gi 


6754344 j 


gi 


16549223 




260 



270 



280 



290 



300 



VGTigEDDKYglLDLLTQQGViJjVlgLDSSQGNSVyQIAMVHYIKQKYPHL 
AAVGTREDDKYRLDLLTQAGVDVIVLDSSQGNSVYQIAMVHYIKQKYPHL 
AAVGTREDDKYRLDLLTQAGVDVIVLDSSQGNSVYQIAMVHYIKQKYPHL 



iAVGTREDDKYRLDLLTQAGVDVIVLDSSQGMSVYQIAMVHYIKQKYPHL 



310 

■ J.. 



320 



330 



340 



350 



QVIGGNWTAAQAjJjNLIDAGVDGLggGMjgCj 
QV I G GNWTAAQ AKNL I D AG VDG L R VGMG C 
QVIGGNWTAAQAKNLIDAGVDGLRVGMGC 
QVI GGNWTAAQAKNL IDAG VDGLRVGMGC 
Q VI GGNWT AAQAKNLI DAGVDGLRVGMGC 
QVIGGNWTAAQAKNLI DAGVDGLR VGMGC 



GS I JjliJjQEVQACggPQGT 
GSICITQEVMACGRPQGT 
GS ICITQE VMACGRPQGT 
GSICITQEVMACGRPQGT 
GSICITQEVMACGRPQGT 
GSICITQEVMACGRPQGT 



360 

■I--.-) 



370 



380 



390 

J. 



400 



VYKVA i^g«i5 FGVPIIADGGIQTigGHWKALALGASTVMMGSLT,AATJF. 
AVY K VAE Y ARR FGV P 1 I ADGG I QT VGHWKAL ALGASTVMMGS LL AATTE 

lVYK VAEYARRFG V P 1 1 ADGG IQT VGHWKALALGAST VMMGSLLAATTE 
AVYKVAEYARRFGVPIIADGGIQTVGHWKALALGASTVMMGSLLAATTE 
fWYKVAEYARRFGVPglADGGIQTVGHWKALALGASTVMMGSLLAATTE 

.VYKVAEYARRFGVPIIADGGIQTVGHWKALALGASTVMMGSLLAATTE 1 



410 



420 



430 
■ ■)■. 



440 



450 



PGEgFFSDG^LKKYgGMGSLDAMEKSSSSQKSYF^GDKgKlBQigVjfG 

pgeyffsdgvrlkkyrgmgsldamekssssqkryfsegdkvkiaqgvsg 
pgeyffslgvrlkkyrgmgsldamekssssqkryfsegdkvkiaqgvsg 
pgeyffsdgvrlkkyrgmgsldgmekssssqkryfsegdkvkiaqgvsg 1 
apgeyffsdgvrlkkyrgQgsldamekssssqkryfsegdkvkiaqgvsg 
apgeyffsdgvrlkkyrgmgsldamekssssqkryfsegdkvkiaqgvsg 



500 



IIQDKGSIQKFVPYLIflGIQHGCQDIGA^SLSVLRSMMYSGELKFEKgTMl 
S I QDKG S I QKF V P YL I AGI QHG C QD I G ARSL S VLRS MM Y S GELKFE KRTM 
SIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTH 
SIQDKGSIQKFVPYLIAGIQHGCQDXGARSLSVLRSMMYSGELKFEKRTM 
S I QDKGS I QKFVPYLI AG I QHG CQDIGA^SLS VLRS MMYSGELKFE KRTM 
S I QD KG S I QKFV PY L I AG I QHG C QD I G ARSL S VLR S MM Y 5 GELK FEKRTM 
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510 

^ I > I 



NOV93 


gi 


17453301| 


gi 


124417) 


gi 


4504687| 


gi 


6754344) 


gi 


16549223 | 



SAQlgGGSHGLHSYEKgLY 
SAQIEGGVHGLHSYEKRLY 
SAQIEGGVHGLHSYEKRLY 
S^QIEGGVHGLHSYEKRLY 1 
SAQIEGGVHGLHSYEKRLY 
SAQIEGGVHGLHSYEKRLY 



fable 93F lists the domain description from DOMAIN analysis results against NOV93. 
This indicates that th&NOV93 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 93F. Domain Analysis of NOV93 
pn1tPfamlnfam00478, IMPDH_C, IMP dehydrogenase / GMP reductase C terminus. This family is involved 
in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. 
The alignment does not contain the whole TIM barrel domain. The alignment is truncated after the insert 
domain (2 CBS domains pfam00571) found in the inosine-5'-monophosphate dehydrogenase structure. This 
family should always be associated with pfam01574. This family is a member of the common phosphate 
binding site TIM barrel family. SEQ ID NO: 884 

CD-Length = 222 residues, 100.0% aligned 

Score - 261 bits (666), Expect = 9e-71 

NOV93 : 264 LLTQVGVNVI GLDS SQGNS VYQ I AMVHY IKQKYPHLQVI GGNVVTAAQANNLIDAGVDGL 323 ~~ 

L + GV+VI LDSS G S QI + +I++KYP +QVI GNWT A LIDAG D + 
Sbjct: 1 ALVEAGVDVICIJ)SSNGYSEVQIDFIRWIREKyPTVQVIAGNVVTGEMAEELIDAGADAV 60 

NOV93 : 324 GRGMDCAAGS IYINQEVI ACSQPQGTAVYKVAKHTQNFGVPI I ADGGIQTMGHVVKALAL 383 

G+ GSI I +EV +PQ TAV +VA + +P+I+DGGI GH+ KALA 
Sbjct: 61 KVGI - - GPGS I C I TREVAG IGRPQATAVLEVADASHGLNT PVI SDGGI TNPGHMAKALAG 118 

NOV93: 3 84 GASTVMMGSLLAATMEAPGECFFSDGMQLKKYQG^ 443 

GA VM+GSLLA T EAPGE F DG + K Y+GMGSL AM+K S +YF K + 
Sbjct: 119 GADFVMI GSLLAGTEEAPGEWFKDGKKYKLYRGMGS LTAMKKYQG S VARYFASKQKLSV 178 

NOV93: 444 TQDVLGSIQDKGSIQKFVPYLIVGIQHGCQDIGAHSLSVLRSMM 487 

+ VG + KG + + V L+ G++ C IGA L LR 
Sbjct: 179 EEGVTG YVP Y KGDVS RTVHDLLGGLRSS CTYI GAT KL KQLRKRA 222 . 



Inosine-5-prime-monophosphate dehydrogenase (EC 1.1.1.205) catalyzes the 
formation of xanthine monophosphate (XMP) from IMP. In the purine de novo synthetic 
pathway, IMP dehydrogenase is positioned at the branch point in the synthesis of adenine and 

10 guanine nucleotides and is thus the rate-limiting en2yme in the de novo synthesis of guanine 
nucleotides. Inhibition of cellular IMP dehydrogenase activity results in an abrupt cessation of 
DNA synthesis and a cell-cycle block at the Gl-S interface. Collart and Huberman (1988) 
used a polyclonal antibody directed against the purified protein to isolate human and Chinese 
hamster IMP dehydrogenase cDNA clones. The sequence of these clones demonstrated an 

1 5 open reading frame for a protein containing 5 1 4 amino acids. The molecular mass of the 
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produced protein was 56 kD, which is the observed molecular mass of the purified protein and 
of the immunpprecipitated in vitro translation product. 

A high order of conservation of the IMP dehydrogenase protein was indicated by the 
finding that human and Chinese hamster cDNA clones differed by only 8 amino acids. 
5 Natsumeda et al. (1990) isolated two distinct cDNAs (types I and II) encoding IMP 
dehydrogenase from a human spleen cDNA library. Both clones encode proteins of 514 
residues showing 84% sequence identity. Type I mRNA was found to be the main species in 
normal leukocytes, and type II (146691) predominated in human ovarian tumors. Using PCR 
primers specific for type II IMPDH, Glesne et al. (1993) screened a panel of human/Chinese 

10 hamster cell somatic hybrids and a separate deletion panel of chromosome 3 hybrids and 
localized the gene to 3p24.2-p2 1 .2. 

The gene was also localized on a map of two overlapping YACs and found to span no 
more than 12.5 kb of genomic DNA. From cloning and sequencing IMPD2, Glesne and 
Huberman (1994) determined that the gene spans approximately 5 kb and is interrupted by 12 

15 introns. The transcriptional start sites were determined by SI nuclease mapping to be 

somewhat heterogeneous but the predominant mRNA species showed a 5-prime end at 102 
and 85 nucleotides from the translational initiation codon. Zimmermann et al. (1995) also 
cloned the human gene and noted that it has 14 exons spanning approximately 5.8 kb. They 
also characterized regulatory elements in the 5-prime flanking region of the gene. 

20 NOV93 is predicted to be expressed in at least the following tissues: brain, 

prosencephalon/forebrain, diencephalon, pituitary gland. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, public EST sources, literature sources, and/or RACE 
sources. Further expression data for NOV93 is provided in Example 2. 

25 The NOV93 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 

30 disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV93 nucleic acids encoding the inosine-5-prime-monophosphate 
dehydrogenase-like protein of the invention, or fragments thereof, may further be useful in 



740 



WO 02/068649 



PCT/US02/02785 



diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

The novel nucleic acid of the invention encoding a inosine-5-monophosphate 
dehydrogenase-like protein includes the nucleic acid whose sequence is provided in Table 

5 93 A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 93 A while still 
encoding a protein that maintains its inosine-5-monophosphate dehydrogenase-like activities 
and physiological functions, or a fragment of such a nucleic acid. The invention further 
includes nucleic acids whose sequences are complementary to the sequence of Table 93 A, 

10 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
1 5 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1 1% of the bases 
may be so changed. 

20 The novel protein of the invention includes the inosine-5-monophosphate 

dehydrogenase-like protein whose sequence is provided in Table 93B. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 93B while still encoding a protein that maintains its 
inosine-5-monophosphate dehydrogenase-like activities and physiological functions, or a 

25 functional fragment thereof. In the mutant or variant protein, up to about 1 5% of the amino 
acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 

30 prediction from hydrophobicity charts; as described in the "Anti-NOVX Antibodies" section 
below. 

NOV94 

The disclosed NOV94 (alternatively referred to herein as CG5628 1-01) includes the 
35 1 573 nucleotide sequence (SEQ ID NO:297) shown in Table 94A. A NOV94 ORF begins 
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with a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 1 564-1 566. The disclosed NOV94 maps to human chromosome 3. 



Table 94A. NOV94 Nucleotide Sequence (SEQ ID NO:297) 

ATGGCGCGGAAGCAGGACCCGAAGCCTAAATTCCAGGAGGGTGAGCGAGTGCTGTGCTTTCATGGGCCTC 
TTCTTTATGAAGCAAAGTGTGTAAAGGTTGCCATAAAG 

TGGTTGGAATAAAAATTGGGATGAGTGGGTTCCGGAGAGCAGAGTACTCAAATACGTGGACACCAATGAA 

AATCGTAGATTGGCCAGGGAAATTCGTAGATTACAGCATAAATTGGCAAGAAATGCTGTAGCTCACCTGA 

GGAGCAAGAGAGAAAGAAGCAGCCGCTCCAGGTTGCTTGGTGCTGACTCTGTCTTAAAAGGCCT 

CG AAGAAAAAAATGAAAATGATGAAAACT CATTAAGC AGTT CC T CTGACAGTAGTGAAGACAAGGATGAA 

AAAATAAGTGAAGAATGTGATATTGAAGAAAAGACTGAAGTGAAAGAAGAACCGGAGCTTCAAACAAAAA 

GGGAAATGGAAGAAAG AACAGTAACT CTAGAAATC C CTGAAGTT CTGAAGAGGCAG CTGGAGGATGATTG 

TTACT ACATT AAT CGGAGG AAACGGTT AGTGC AACTT C CATGC CACAC CAACAT CAT AACG ATTTTGGAA 

TCC TATGTGAAG CATTTTGCTATCAGTGCAGCCTTTTCAGCCAATGAGAGG CCTCGTCAC CATCACGCTA 

TGCCACATGCCAGC^TGAACGTGCCTTATATCCCAGCAGAAAAGAATATTG 

GGATGGATTAAGAATAACCTTTGATTACACTCTCCCGTTGGTTTTACT 

T AT AAAAAGGTG ACTG CATCT AAGGTT TTTCTTG CAATT AAGG AAAGTGC CACAAATACT AATAGGAGC C 

AGGAGAAGCTCTCTCCCAGCTTACGTTTGTTGAATCCATCCAGGCCGCAGTCTACAGAGAGTCAGTCGAC 

CAGCGGTGAACCAGCCACCCCTAAAAGGCGCAAAGCCGAGCCGCAAGCA 

TCGCCCCACACCGCCAACTGTGACAGGCTTTCTA^ 

AGGACATGTCCACCAGTGTGCCCAAGCTGTTCCTGCACCTOT 

ATCTTCACCTACTCTGACTCCTAGCCAGGAAGGGAGTCCTGTGTTTGCT^ 

AATGAAATAAATGAGGTCCTCTCCTGGAAGCTCGTACCTGACAATTACCCACCAGGTGACCAGCCACCTC 
CACCCTCTTACATTTACGGGGCGCAACATTTGCTGC^ 

AATGTCCTTTACTGAGAAGAATCTGAAGGCTTTATTGAAGCACTTTGATCTCTTTGTGAGGT^ 

GAATACCACGATGACTTCTTCCCAGAGTCAGCTTACGTCGCTGCC 

ACCCCCAGGCAGTCAATAAGTGTTGATGGTTCT 



A NOV94 polypeptide (SEQ ID NO:298) encoded by SEQ ID NO:297 is 521 amino 
acids in length and is presented using the one-letter amino acid code in Table 94B. The Psort 
profile for NOV94 predicts that this sequence is likely to be localized to the nucleus with a 
certainty of 0.9700. In alternative embodiments, a NOV94 polypeptide is localized to 
microbodies with a certainty of 0.3000. 



Table 94B. NOV94 Polypeptide Sequence (SEQ ID 
NO:298) 

MARKQDPKPKFQEGERVLCFHGPLLYEAKCVKVAIK^ 

RVLKYVDTNENRRLAI^I RRLQHKL IEEK 
NEM)ENSLSSSSDSSEDKDEKISEECDIEEKTEVKEEPELQTKREMEERTOTLEIPEVLK 
RQLEDDCYYINRRKRLVQLPCHTNI ITILES YV^ 
VPYIPAEKNTDLCKE1WDGLRITFDYTLPL 

NRSQEKLSPSLRLLNPSRPQSTESQSTSGEPATPKRRjCAEPQAVQSLRRSSPHTANO)RL 
SKSSTSPQPKRWQQDMSTSVPKLFLHLEKKTPVHSRSSSPTLTPSQEGSPVFAGFEGRRT 
NEINEVLSWKLVPDNYPPGDQPPPPSYIYGAQHLLRLF\nGj^ 

HFDLFVRFLAEYHDDFFPESAYVAASEVHYSTRNPQAVNKC 



A BLAST analysis of NOV94 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV94 had high 
homology to other proteins as shown in Table 94C. 
15 



5 
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Table 94C. BLASTX results from PatP database for NOV94 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAW96184 Senescence protein derived from human 
patp:AAB60 085 Human transport protein TPPT-5 
patp:AAU32295 Novel human secreted protein #2786 
patp:AAM64 801 Human brain expressed single exon probe 
patp:AAM77S58 Human bone marrow expressed probe 


465 
465 
430 
239 
239 


2.9e-51 
2.9e-51 
3.4e-40 
3.5e-19 
3.5e-19 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1381 of 1581 bases (87%) identical to a gb:GENBANK- 
ID:AF117065|acc:AFl 17065.1 mRNA from Homo sapiens (male-specific lethal-3 homolog 1 
5 (MSL3L1) mRNA). The full amino acid sequence of the protein of the invention was found to 
have 414 of 520 amino acid residues (79%) identical to, and 457 of 520 amino acid residues 
(87%) similar to, the 521 amino acid residue ptnr:SPTREMBL-ACC:Q9Y5Z8 protein from 
Homo sapiens (Human) (MALE-SPECIFIC LETHAL-3 HOMOLOG 1). NOV94 also has 
homology to the other proteins shown in the BLASTP data in Table 94D. 

10 



Table 94D. NOV94 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi| 17975757 | re 
f |NP_523353.l| 
(NM_078629) 


male- specific lethal 3- 
like 1 isoform a; 
drosophila MSL3-like 1 
[Homo sapiens] 


521 


413/529 
(78) 


456/529 
(86) 


0.0 


gi| 17975759 | re 
f |NP_5233 54.l| 
(NMJ)7863 0) 


male-specific lethal 3- 
like i isoform b; 
drosophila 
MSL3-like 1 
[Hojho sapiens] 


462 


389/468 
(83) ! 


420/468 
(89) 


0.0 


gi| 14764458 | re 
f |XP_045715.l| 
<XM 045715) 


male- specif ic lethal-3 
(Drosophila) -like 1 
[Homo sapiens] 


496 


387/498 
(77) 


427/498 
(85) 


0.0 


gi| 11545735 | re 
f |NP_034962.2| 
" (NM 010832) 


male-specific lethal-3 
homolog 1 (Drosophila) 
[Mus musculus] 


466 


346/472 
(73) 


396/472 
(83) 


e-168 


gi| 17975761) re 
f |NPJ)06791.2| 
(NM_006800) 


male- specif ic lethal 3- 
like 1 isoform c; 
drosophila MSL3-like 1 
[Homo sapiens] 


355 


312/354 
(88) 


335/354 
(94) 


e-162 



This BLASTP data is displayed graphically in the ClustalW in Table 94E. A multiple 
sequence alignment is given, with the NOV94 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 94D. 
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Table 94E. ClustalW Alignment of NOV94 



NOV94 



gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 


NOV94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 


NOV94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 


NOV94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 



N0V94 



N0V94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 



NOV94 



17975757 
17975759 
14764458 
11545735 
17975761 



NOV94 

gi | 17975757 | 
gi 1 179757591 



(SEQ ID NO: 298) 
(SEQ ID NO:769) 
(SEQ ID NO:770) 
(SEQ ID NO: 771) 
(SEQ ID NO: 772) 
(SEQ ID NO: 773) 



10 20 30 40 50 

....|...-|....|....|....|....|....|....|....|....| 

MARKQDPKPKFQEGERVLCFHG PLLYEAKCVKVAIK-D KQV 

MSASEGMKFKFHSGEKVLCFEPDPTKARVLYDAKIVWIVGKDEKGRKIP 

MSASEGMKFKFHSGEKVLCFEPDPTKARVLYDAKIVDVIVGKDEKGRKIP 



100 




KYFIHYSG1 

eyli: 



110 



120 




200 



17975757 | 
17975759 | 
14764458 | 
11545735 j 
17975761] 




210 



220 



230 240 
..[..„! |.. 



I 



250 



cyyinrrkrlvglpc[gtniitilesyvkhfaigaafsanerprhhh2mpk 
cyyinrrkrlvklpcqtniitilesyvkhfainaafsanerprhhhvmph 
cyyinrrkrlvklpcqtniitilesyvkhfainaafsanerprhhhvi^iph 
cyyii-jrrkrlvklpcqtniitilesyvkhfainaafsanerprhhhvmph 
cyyinrrkrlvklpcqtniitilesyvkhfainaafsanerprhhhMh 
cyyinrrkrlvklpcqtniitilesyvkhfainaafsanerprhhhvmph 



260 


270 


280 


290 


300 













a^mnvgyipaeknsdlckemvdglritfdytlplvllypyeqaqykkvtj 
m^invhyipaeknvd'lckemvdglritfdytlplvllypyeqaqykkvts 
.winvhyipaeknvdlckemvdglritfdytlplvllypyeqaqykkvts 
anmnvhy i p ae knvdlc kem vdglr it fd ytlpl vll y p yeq aq yk kvt s 
^invhyEpaeknvdlckemvdglritfdytlplvllypyeqSqykSvts 
\n mnvh y i p ae knvdl c kem vdglr i t fd ytl pl vll yp yeqaoy k kvt s 



310 



320 
■ •I.. 



I. 



330 



340 
..I.. 



I. 



350 
.1 



SK^FLaiKESAT^TNRSQE2LSPSjS L ^PS|2PQSTESQgTgGEPATPKR^ 
"SKFFLPiKESATSTNRSQEELSPSPPLLNPSTPQSTESQPTTGEPATPKR 

skfflpikesats'tnrsqeelspsppllnpstpqstesqpttgepatpkr 
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SKFELPIKESATSTNRSQEELSPSPPLLNPSTPQSTESQPTTGEPATPKR 

SKFFLPIKES^2 TTOS Q EELSPSPPLLNPSTPQSTESQP i^ GEPATPKR 
SKFFLPIKE5ATSTNRSQEELSPSPPLLNPSTPQSTE5QPTTGEPATPKR 





14764458 


gi 


11545735 


gi 


17975761 


NOV94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 


NOV94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 


NOV94 


gi 


17975757 


gi 


17975759 


gi 


14764458 


gi 


11545735 


gi 


17975761 



NOV94 

gi | 17975757 
gi 1 17975759 
gi 1 14764458 
gi 1 11545735 
gi 1 17975761 



360 



370 



380 



390 



400 




410 



420 



430 



I. 



440 

..]_-• 



450 



LEKXTPVHSRSSS PgflLT PSflE GS&VFAG FEGRRTNE INE VL S WKLVPD 
~LEKKTPVHSRSSSPIPLTPSKEGSAVFAGFEGRRTNEINEVLSWKLVPDN 
LEKKTPVHSRSSSPIPLTPSKEGSAVFAGFEGRRTNEINEVLSWKLVPDN 

lekktpvhsrssspipltpskegsavfagfegrrtneinevlswklvpdn 
lekktpvhsrssspipltpsk<2gsavfagfegrr2neinevlswklvpdn 
lekktpvhsrssspipltpskegsavfagfegrrtneinevlswklvpdn 



460 



I 



470 



480 



I 



490 



500 



YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSF^EKNLKALLKHFDLF] 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGD'QPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 




Table 94F lists the domain description from DOMAIN analysis results against NOV94. 
This indicates that the NOV94 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 94F. Domain Analysis of NOV94 



gnUSmartlsmart00298, CHROMO, Chromatin organization modifier domain SEQ ID NO: 885 

CD-Length = 55 residues, 
Score = 42.0 bits (97), Expect = 9e-05 



N0V94: 36 KDKQVKYFIHYSGWNKNWDEWVPESRVLK 64 

K +++Y + + G++ D W PE +L 
Sbjct; 14 KKGELEYLVKWKGYSYREDTWEPEENLLN 42 



10 



The Drosophila male-specific lethal (msl) genes regulate transcription from the male X 

chromosome in a dosage compensation pathway that equalizes X-linked gene expression in 

males and females. The members of this gene family, including msll, msI2, msl3, mle, and 

mof, encode proteins with no sequence similarity to known proteins. However, mutations in 

745 



WO 02/068649 PCT/US02/02785 

each of these genes produce a similar phenotype: sex-specific lethality of male embryos 
caused by the failure of mutants to increase transcription from the single male X chromosome. 

The MSL gene products assemble into a multiprotein transcriptional activation 
complex at hundreds of sites along the chromatin of the X chromosome. By searching 
5 sequence databases with the sequence of a BAC clone that maps to Xp22.3, Prakash et al. 
(1999) identified a human homolog of Drosophila msl3, MSL3-like-l (MSL3L1). They 
isolated a cDNA containing a complete MSL3L1 coding sequence. The deduced 521-amino 
acid MSL3L1 protein shares 30% overall sequence identity with Drosophila MSL3 and 86% 
identity with mouse Msl311 . Three segments of the Drosophila MSL3 protein are highly 

10 conserved in MSL3L1, including two putative chromodomains, one at the N terminus and the 
other at the C terminus. Chromodomains, which form a characteristic tertiary structure and can 
interact with components of chromatin, have been implicated to play roles in chromatin 
organization and transcriptional regulation. MSL3L1 also contains a putative nuclear 
localization signal, a putative leucine zipper motif within the second chromodomain, and two 

1 5 potential tyrosine kinase phosphorylation sites. 

Prakash et al. (1999) identified human fetal kidney cDNAs representing an 
alternatively spliced MSL3L1 transcript that lacks exon 2. The predicted protein, which is 
referred to as isoform 2, is identical to the first isoform from amino acid 62 to the C terminus 
but does not contain the first 26 amino acids of the N-terminal chromodomain. Northern blot 

20 analysis detected a major 2.4-kb MSL3L1 transcript in all tissues examined, namely liver, 
pancreas, heart, lung, kidney, skeletal muscle, brain, and placenta, with highest expression in 
skeletal muscle and heart. A 2.6-kb transcript unique to skeletal muscle was also found. 
Northern blot analysis of E7, El 1, E15, and E17 mouse embryos detected approximately equal 
levels of Msl311 expression in all embryos. The MSL3L1 gene spans 17 kb and contains 13 

25 exons. It. is transcribed from telomere to centromere. Prakash et al. (1999) showed that the 
MSL3L1 gene undergoes X inactivation. 

NOV94 is predicted to be expressed in at least the following tissues: lung, testis, B- 
cell. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, public EST 

30 sources, literature sources, and/or RACE sources. Further expression data for NOV94 is 
provided in Example 2. 

The NOV94 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
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hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

5 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 

conditions. NOV94 nucleic acids encoding the MSL3Ll-like protein of the invention, or 
fragments thereof, may further be usefiil in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a male-specific lethal 3-like l-like 

10 protein includes the nucleic acid whose sequence is provided in Table 94A, or a fragment 

thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 94A while still encoding a protein 
that maintains its male-specific lethal 3-like l-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 

1 5 are complementary to the sequence of Table 94A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

20 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 13% of the bases 
may be so changed. 

25 The novel protein of the invention includes the male-specific lethal 3-like l-like 

protein whose sequence is provided in Table 94B, The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 94B while still encoding a protein that maintains its male-specific lethal 3-like l-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 

30 variant protein, up to about 21 % of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
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prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV95 

The disclosed NOV95 (alternatively referred to herein as CG56975-01) includes the 
1323 nucleotide sequence (SEQ ID NO:299) shown in Table 95 A. A NOV95 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 35-37 and ends with a stop codon 
at nucleotides 1301-1303. The disclosed NOV95 maps to human chromosome 1. 



10 



Table 95A. NOV95 Nucleotide Sequence (SEQ ID NO:299) 



GCAATTCCTTTTCAATATTTATATATTTCAGAAAATGTCACTC 
GGACTTGATAGTAATGTGTGGATTGAATTTACCAAATTGGCTG^ 

aaggctttccagatatatcccctcctacatatgtaaaagaag 

cctgaatcagtatacacgaggctttggccatccatcacttgtgaaagctctgtcctatctgtatgaaaag 

ctttat caaaag caaattg attcaaataaagaaatccttg tgacagtaggagcatatggat ctcttttta 

acaccattcaagcattaattgatgagggacaggtcatactaat^^ 

catggtgagaatggctggagcaacacctgtttttattcccctga 

tctagttctgactggacattagatcctcaagaactggaaagtaaat^ 

tactaaatactccagataacccacttggcaaggt^^ 

ttgcatcaaatatgacacactctgcatcagcx3atgaggtttatgaatggcttgtatatt 
cacttaaaaatagctacttttccaggtatgtgggagagaaciaataacaataggaact 
tcagtgtaactggctggaaggtaggctggtccattggtccaaatcatttgataaaaca^ 
tcaacaaaacacx5atttatacttgtgcaactccrrtta 

atcaagcgcatggatgacc cagaatgt tactt taatt ctttgccaaaagagtt agaagtaaaaagag at c 

GGATGGTACGTTT ACTTG AAAGTGTTGGCCT AAAAC CCAT AGTT CCTG ATGGAGGATACT T CAT CAT CGC 
TGATGTGTCTATTTTCATTGTGGTTTTAGATCCAGAC CT CT CTGATATGAAGAATAATGAG C CTTATG AC 
TAT AAGTTTGTGAAATGGATGACT AAAC AT CAGAAACTAT CAG CCATCCCCGTTTC AGCATT CTGTAACT 
GAG AGACTAAATC ACAGTT TGAGAAGTTTGTG CGTTTTTGCTT CATTAAAGTAAGT TCCCTGCTCGATGC 
TGCTGAAGAAATCATCAAGGCATGGAGTGTACAGAAGTCTTGATTTGTGCAGAATGGATTAAT 



15 



A NOV95 polypeptide (SEQ ID NO:300) encoded by SEQ ID NO:299 is 422 amino 
acids in length and is presented using the one-letter amino acid code in Table 95B. The Psort 
profile for NOV95 predicts that this sequence is a Type II membrane protein, and is likely to 
be localized at the plasma membrane with a certainty of 0.4400. In alternative embodiments, a 
NOV95 polypeptide is located to microbodies with a certainty of 0.3691. 



Table 95B. NOV95 Polypeptide Sequence (SEQ ID 
NO:300) 



MSLKFTNAKRIEGLDSNVWIEFTKLAADPSVVNLGQGFPDISPPTYVKEELSKIAAIDSL 
NQYTRGFGHPSLVKALSYLYEKLYQKQIDSNKEILVTVGAyGSLFNTIQALIDEGQVILI 
VPFYDCYEPMVRMAGATPVFI PLRSVSLGKRWSS SDWTLDPQELESKFNSKTKAI IIOTP 
HNPLGKVYNREELQVIADLCI KYOTLCI SDEVYE WLVYSGNKHLKIATFPGMWERTITIG 
SAGCTFSOTGWKVGWSIGPNHLIKHLQTOQQOTIYTC^ 

CYFNSLPKELEVKRDRMVRLIiESVGIiKPIVPDGGYFI IADVSIFI WLDPDLSDMKNNEP 
YDYKFVKWMTKHQKLSAI PVSAFCNSETKSQFEKFVRFCFIKVSSLLDAAEEI IKAWSVQ 
KS 
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A BLAST analysis of NOV95 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV95 had high 
homology to other proteins as shown in Table 95C. 



Table 95C BLASTX results from PatP database for NOV95 


Sequences producing High-scoring Segment Pairs: 


High 
Score 


Smallest 

Sum 
Probability 
P(N) 


patp:AAY54591 Amino acid sequence of a human transferase 
patp:AAR89906 Human kynurenine aminotransferase (KAT) 
patp:AAR89896 Rat kynurenine aminotransferase (KAT) 
patp:AAR89897 Rat . kynurenine aminotransferase (KAT) 
patp:AAR89898 Rat kynurenine aminotransferase (KAT) 


2121 
1171 
1130 
1130 
" 1130 


2.2e-219 
1.0e-118 
2.2e-114 
2.2e-114 
2.2e-114 



5 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 806 of 915 bases (88%) identical to a gb:GENBANK- 
ID:AF091090|acc:AF091090.1 mRNA from Homo sapiens (clone 669 unknown mRNA, 
complete sequence). The full amino acid sequence of the protein of the invention was found to 
10 have 229 of 415 amino acid residues (55%) identical to, and 300 of 41 5 amino acid residues 
(72%) similar to, the 419 amino acid residue ptnr:SPTREMBL-ACC:Q9W6U2 protein from 
Fugu rubripes (Japanese pufferfish) (Takifugu rubripes) (CYSTEINE CONJUGATE BETA- 
LYASE). NOV95 also has homology to the other proteins shown in the BIASTP data in 
Table 95D. 

15 



Table 95D. NOV95 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Lengt 
h(aa) 


Identity 
(%) 


Positive 

(%) 


Expect 


gi | 12654031 |g 
b|AAH00819.l| 
AAH00819 
(BC000819) 


Similar to CG6950 gene 
product [Homo sapiens] 


290 


280/294 
(95) 


284/294 
(96) 


e-162 


gi 1 5002565 1 em 
b|CAB44334.l| 
(Y17462) 


cysteine conjugate beta- 
lyase [Takifugu rubripes] 


419 


230/418 
(55) 


301/418 
(71) 


e-134 


gi (4757928 | re 
f|NP 004050.1 
| <NMJ)04059) 


cytoplasmic cysteine 
conjugate-beta lyase; 
glutamine-phenylpyruvate 
aminotransferase 
[Homo sapiens) 


422 


215/421 
(51) 


299/421 
(70) 


e-128 


gi | 15425868 |g 
b|AAK97625.l| 
AF395204_1 
(AF395204) 


kynurenine 

aminotransferase [Aedes 
aegypti] \ 


477 


224/420 
(53) 


284/420 
(67) 


e-126 


gi|7299520|gb 

|AAF54707.l| 

(AE003693) 


' CG6950 gene product [alt 
3] 

[Drosophila melanogaster] 


417 


220/417 
(52) 


289/417 
(68) 


e-125 
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This BLASTP data is displayed graphically in the ClustalW in Table 95E. A multiple 
sequence alignment is given, with the NOV95 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 95D. 



Table 95E. ClustalW Alignment of NOV95 



N0V95 (SEQ ID NO: 3 00) . 

gi 1 12654031 1 (SEQ ID NO: 774) 
(SEQ ID NO: 775) 
(SEQ ID NO: 776) 
(SEQ ID NO: 777) 
(SEQ ID NO:778) 
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gi | 5002565 | 
gi j 4757928 | 
gi | 15425868 | 
gi | 7299520 | 



NOV95 


gi 


12654031) 


gi 


5002565) 


gi 


4757928) 


gi 


15425868) 


gi 


7299520) 



NOV95 


gi 


12654031 


gi 


5002565) 


gi 


4757928 | 


gi 


15425868 


gi 


7299520) 



NOV95 

gi | 12654031 | 
gi | 5002565 | 
gi (4757928 | 
gi j 15425868 | 
gi|7299520| 



NOV95 



12654031| 
5002565) 
4757928 j 
15425868) 
7299520 | 



20 



30 



40 



50 



MMFLRNHNSVGGAIRTAWLQDLQFIVSNKSSALTGAVSSVHRQQIRTMS 



100 



NOV95 


gi 


12654031) 


gi 


5002565) 


gi 


4757928 | 


gi 


15425868) 


gi 


7299520) 




TgVKEELSKIAAIDS - - Sj^Ji^ ' 



150 



^vqeSfchahoegp-! 

DjgAVEjRFQHAVSGDF-! 
KKALlfeiAAAANSPDK 
EgSVTHSLADI, 





180 



190 



200 
1 — 1 



•iVlMglQPFQDCYSPM VgtviAG^PVFI 
M vHmAgJW? VF I 
EGDEVIIgjEPFFDCYgPMVgMAGGHpvJSl 

degdeviiiepffdcyepmJSmaggBpvf^ 
degdeviiiepffdcyepmv^agghpgfi 

dEgDSVIIIEPFFDCYEPIwSmAGgSpSfE 



210 



220 




270 



280 



290 



300 
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NOV95 



12654031 | 
5002565 | 
4757928 j 
15425868 
7299520] 



NOV95 


gi 


12654 031 


gi 


5002565 | 


gi 


4757928 | 


gi 


15425868 


gi 


7299520 | 



NOV95 



12654031 
5002565 | 
4757928) 
15425868 
72995201 



NOV95 



12654031] 
5002565] 
4757928 | 
15425868) 
72995201 



NOV95 



gi 
gi 
gi 
gi 
gi 



12654031 
5002565] 
4757928] 
15425868] 
7299520] 




Table 95F lists the domain description from DOMAIN analysis results against NOV95. 
This indicates that the NOV95 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 95F. Domain Analysis of NOV95 

gnllPfamlpfamOOlSS , aminotran_l_2 , Aminotransferase class I and II SEQ ID NO: 886 



CD-Length = 316 residues, 94.3% aligned 
Score = 117 bits (292), Expect - 2e-27 



NOV95: 93 EILVTVGAYGSLFNTIQALIDEGQVILI-VPFYDCOTPMVRMAGATPVFIPLRSVSLGKR 151 

++L GA I G +L+ .P Y Y ++ AG +LVL 

Sbjct: 17 QVLAG TGAKEVAAL F I S C FAAPGDAVIi VPD P TYP I YS DVLNHAGG I - - - VRLYPVPLRSS 73 

NOV95: 152 WS S SDWTLDPQELE S KFNS KTKAI ILNTPHNPLGKVYNRSELQVI ADLC I KYDTLC I SDE 211 

+ D+ + LE +K +++ PHNP G +L+ + DL +++ I* + DE 

Sb j Ct : 74 NHN- DFKALEEALEEA- PEGS KVVLVANPHNPTGMDGTLADLEKLLDIiAKEHNI IiLLVDE 131 
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NOV95: 212 VYE WL VYSGNKHL KI ATFPGMWERTI T I GS AG KTFS VTGWKVGWS I G - PNHL 262 

Y V+G IA +++ + SKF + G ++G + G + + 

Sbjct: 132 AYAGGVFGGLDG AS I AELIDE YDNLL WQSLSKNFGLAG KELGG AAGG I V AGS AAS FDRV 191 

NOV95: 263 IKHLQTVQQOTIYTCATPLQEAIAQAFWIDIK^ 322 

+ + AT A++D E +L + 

Sbjct: 192 SSQSRALLF - ATS S APPAVGAAI VALILQD KERLERWLKELKK 233 

NOV95 : 323 SVGLKPI VPDGGYFI IADVS I FI VVLDPDLSDMKNNEPYDYKFVKWMTKHQKLSAI PVSA 382 

+GL+ ++ G+ + D S 1+ L + + +P S 

Sbjct: 234 MLGLRVLLSRAGFVLWLDPS -GILPLWTFEDQA GLFSALLLEEHGWVPGSE 264 

NOV95: 383 FCNSETKSQFEKFVRFCFIKVSSL- LDAAEEI IKA 416 

F R ++ LD E I+A 
Sbjct: 285 FPTVPPGW GRISLAGLTDETLDELLEAIRA 314 

Aminotransferases share certain mechanistic features with other pyridoxal-phosphate 
dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine 
residue. On the basis of sequence similarity, these various enzymes can be grouped into 
5 subfamilies, one of which is the class I. Class I aminotransferases include cysteine conjugate 
beta-lyase. Living organisms employ a variety of metabolic pathways when detoxifying 
xenobiotic compounds, including the formation of cysteine S-conjugates via glutathione 
conjugation. Kidney cysteine conjugate beta-lyase (glutamine transaminase K, kyneurenine 
aminotransferase, EC 2.6.1.64) metabolises the cysteine conjugates of certain halogenated 

10 alkenes and alkanes to form reactive metabolites which can produce nephrotoxicity and 
neurotoxicity in experimental animals and man. 

NOV95 is predicted to be expressed in at least the following tissues: kidney, liver, 
colon, gall bladder. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 

15 public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV95 is provided in Example 2. 

The NOV95 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

20 hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 

25 conditions. NOV95 nucleic acids encoding the aminotransferase-Iike protein of the invention, 
or fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 
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The novel nucleic acid of the invention encoding a cysteine conjugate beta-lyase-like 
protein includes the nucleic acid whose sequence is provided in Table 95 A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 95A while still encoding a protein 

5 that maintains its cysteine conjugate beta-lyase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 95 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

10 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

1 5 In the mutant or variant nucleic acids, and their complements, up to about 12% of the bases 
may be so changed. 

The novel protein of the invention includes the cysteine conjugate beta-lyase-like 
protein whose sequence is provided in Table 95B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
20 in Table 95B while still encoding a protein that maintains its cysteine conjugate beta-lyase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 45% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
25 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV96 

NOV96 includes three monocarboxylate transporter-like proteins, designated herein as 
30 NOV96a, NOV96b and NOV96c. 

NOV96a 
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The disclosed NOV96a (alternatively referred to herein as CG5691 8-01) includes the 
1302 nucleotide sequence (SEQ ID NO:301) shown in Table 96A. A NOV96a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 9-1 1 and ends with a stop codon 
at nucleotides 1287-1289. The disclosed NOV96a maps to human chromosome 17. 

5 



Table 96A. NOV96a Nucleotide Sequence (SEQ ID 
NO:301) 

CCGCTTAGATGGCGCGCAGGACAGAGCCCCCCGACGGGGGCTGGGGATGGGTGGTGGTGC 
TCTCAGCGTTCTTCCAGTCGGCGCTTGTGTTTGGGGTGCTCCGCTCCTTTGGGGTCTTCT 
TCGTGGAGTTTGTGGCGGCGTTTGAGGAGCAGGCAGCGCGCGTCTCCTGGATCGCCTCCA 
TAGGAATCGCGGTGCAGCAGTTTGGGAGCCCGGTAGGCAGTGCCCTGAGCACGAAGTTCG 
GGCCCAGGCCCGTGGTGATGACTGGAGGCATCTTGGCTGCGCTGGGGATGCTGCTCGCCT 
CTTTTGCTACTTCCTTGACCCACCTATACCTGAGTATTGGGTTGCTGTCAGGCTCTGGCT 
GGGCTTTGACCTTCGCTCCGACCCTGGCCTGCCTGTCCTGTTATTTTTCTCGCCGACGAT 
CCCTGGCCACCGGGCTGGCACTGACAGGCGTGGGCCTCTCCTCCTTCACATTTGCCCCCT 
TTTTCCAGTGGCTGCTCAGCCACTACGCCTGGAGGGGGTCCCTGCTGCTGGTGTCTGCCC 
TCTCCCTCCACCTAGTGGCCTGTGGTGCTCTCCTCCGCCCACCCTCCCTGGCTGAGGACC 
CTGCTGTGGGTGGTCCCAGGGCCCAACTCACCTCTCTCCTCCATCATGGCCCCTTCCTCC 
GTTACACTGTTGCCCTCACCCTGATCAACACTGGCTACTTCATTCCCTACCTCCACCTGG 
TGGCCCATCTCCAGGACCTGGATTGGGACCCACTACCTGCTGCCTTCCTACTCTCAGTTG 
TTGCTATTTCTGACCTCGTGGGGCGTGTGGTCTCCGGATGGCTGGGAGATGCAGTCCCAG 
GGCCTGTGACACGACTCCTGATGCTCTGGACCACCTTGACT 

TCCCTGTAGCTCAGGCTCCCACAGCCCTGGTGGCTCTGGCTGTGGCCTACGGCTTCACAT 
CAGGGGCTCTGGCCCCACTGGCCTTCTCCGTGCTGCCTGAACTAATAGGGACTAGAAGGA 
TTTACTGTGGCCTGGGACTGTTGCAGATGGTAGAGAGCATCGGGGGGCTGCTGGGGCCTC 
CTCTC TCAGG CTACCT CCGGGATGTGACAGG CAACTACACGGCTT CTTTTGTGGTG GCTG 
GGGCCTTCCTTCTTTCAGGGAGTGGCATTCTCCTCACCCTGCCCCACTTCTTCTGCTCCT 
CAACT ACTAC CTCCGGGCC C CAGGACCTTGTAACAGAAG CACTAG AT ACTAAAGTT CCCC 
TAC CC AAGGAG GGG CTGGAAG AGGACTGAACT C CACAG AGTC 



A NOV96a polypeptide (SEQ ID NO:302) encoded by SEQ ID NO:301 is 426 amino 
acids in length and is presented using the one-letter amino acid code in Table 96B. The Psort 
profile for NOV96a predicts that this sequence is a Type Ilia membrane protein, has a signal 
1 0 peptide, and is likely to be localized to the plasma membrane with a certainty of 0.6400. In 
alternative embodiments, a NOV96a polypeptide is located to the Golgi with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. The Signal P 
predicts a likely cleavage site for a NOV96a peptide is between positions 34 and 35, j.e., at the 
dash in the sequence VAA-FE. 

15 ; 

Table 96B. NOV96a Polypeptide Sequence (SEQ ID 
NO:302) 

MARRTEPPDGGWGWVVVLSAFFQST^LVFGVLRSFGVFFVEFVAAFEEQAARVSWIASIGI 
AVQQFGS PVGSALSTKFGPRPVVMTGGILAALGMLLAS FATSLTHLYLS IGLLSGSGWAL 
TFAPTLACLSCYFSRRRSLATGLALTGVGLSSFTFAPFFQ 
HLVACKSALLRPPSIAEDPAVGGPRAQLTSIiLHH 

LQDLDWDPLPAAFLLSWAISDLVGRWSGWLGDAVPGPVTRLLMLWTTLTGVSLALFPV 
AQAPTALVAIAVAYGFTSGAIAPLAFSVLPELIGTRRIYCGLGLLQMVESIGGLLGPPLS 
GYLRDVTGNYTAS FWAGAFLLSGSGILLTLPHFFCS STTTSGPQDLVTEALDTKVPLPK 
EGLEED 
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NOV96b 

The disclosed NOV96b (alternatively referred to herein as CG5691 8-02) includes the 
1294' nucleotide sequence (SEQ ID NO:303) shown in Table 96C. A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TGA codon at 
5 nucleotides 1279-128L The disclosed NOV96b maps to human chromosome 1 7. 



Table 96C. NOV96b Nucleotide Sequence (SEQ ID 
NO:303) 

ATGGCGCGCAGGACAGAGCCCCCCGACGGGGGCTGGGGATGGGTGGTGGTGCTCTCAGCG 
TTCTTCCAGTCGGCGCTTGTGTTTGGGGTGCTCCGCTCCTTTGGGGTCTTCTTCGTGGAG 
TTTGTGGCGGCGTTTGAGGAGCAGGCAGCGCGCGTCTCCTGGATCGCCTCCATAGGAATC 
GCGGTGCAGCAGTTTGGGAGCCCGGTAGGCAGTGCCCTGAGCACGAAGTTCGGGCCCAGG 
CCCGTGGTGATGACTGGAGGCATCTTGGCTGCGCTGGGGATGCTGCTCGCCTCTTTTGCT 
ACTTCCTTGACCCACCTATACCTGAGTATTGGGTTGCTGTCAGGCTCTGGCTGGGCTTTG 
ACCTTCGCTCCGACCCTGGCCTGCCTGTCCTGTTATTTTTCTCGCCGACGATCCCTGGCC 
ACCGGGCTGGCACTGACAGGCGTGGGCCTCTCCTCCTTCACATTTGCCCCCrrTTTTCCAG 
TGGCTGCTCAGCCACTACGCCTGGAGGGGGTCCCTGCTGCTGGTGTCTGCCCTCTCCCTC 
CACCTAGTGGCCTGTGGTGCTCTCCTCCGCCCACCCTCCCTGGCTGAGGACCCTGCTGTG 
GGTGGTCCCAGGGCCCAACTCACCTCTCTCCTCCATCATGGCGCCTTCCTCCGTTACACT 
GTTGCCCTCACCCTGATCAACACTGGCTACTTCATO 

CT CCAGGAC CTGG ATTGGGACC CACTACCTGCTGC CTTTCCTACTCTCAGTTGTTGC TATT 

TCTGACCTCGTGGGGCGTGTGGTCTCCGGATGGCTGGGAGATGCAGTCCCAGGGCCTGTG 

ACACGACTCCTGATGCTCTGGACCACCTTGACTGGGGTGTCACTAGCCCTGTTCCCTGTA 

GCTCAGGCTCCCACAG CCCTGGTGGCTCTGGGTGTGGCCTACGGCTTCACAT CAGGGGCT 

CTGGC CC CACTGGCCTTCTC C GTGCTGC CTGAACTAATAGGGACTAGAAGG ATTT ACTGT 

GGCCTGGGACTGTTGCAGATGGTAGAGAGCATCGGGGGGCTGCTGGGGCCTCCTCTCTCA 

GGCTACCTCCGGGATGTGACAGGCAACTACACGGCTTCTTTTGTGGTGGCTGGGGCCTTC 

CTTCTTTCAGGGAGTGGCATTCTCCTCACCCTGCCCCACTTCTTCTGCTCCTCAAC 

ACCTCCGGGCCCCAGGACCTTGTAACAGAAGCACTAGATACTAAAGTTCCCCTACCCAAG 

GAGGGACTGGAAGAGGACTGAACTCCACAGAGTC 



A NOV96b polypeptide (SEQ ID NO:304) encoded by SEQ ID NO:303 is 426 amino 
acids in length and is presented using the one-letter amino acid code in Table 96D. The Psort 
profile for NOV96b predicts that this sequence is likely to be a Type Ilia membrane protein, 
has a signal peptide, and is likely to be localized to the plasma membrane with a certainty of 
0.6400. In alternative embodiments, a NOV96b polypeptide is located to the Golgi with a 
certainty of 0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. 
The Signal P predicts a likely cleavage site for a NOV96b peptide is between positions 44 and 
45, /.e., at the dash in the sequence VAA-FE. 



Table 96D. NOV96b Polypeptide Sequence (SEQ ID 
NO:304) 

MARRTEP PDGGWGWVVVLS AFFQS ALVFGVLRS FGVFFVEFVAAFEEQAARVS WI AS IGI 
AVQQFGSPVGSALSTKFGPRPVVMTGGIIAALC^lIiIiASFATSLTHLYLSIGIiliSGSGWAL 
TFAPTIiACLSCYFSRRRSIATGLALTGVGLSSFTFAPFFQWLLSHYAWRGSLLIiVSALSL 
HLVACGALLRP P S LAED P AVGGP RAQLT SU^HHGP FLR YTVALTL INTGY F I P YLHLVAH 
LQDLDWDPLPAAFLLSWAISDLVGRWSGWLGDAVPGPVTRIiMLWTTLTGVSLALFPV 
AQAPTAL VALAVAYGFTSGALAPIiAFSVLPELI GTRRI Y CGLGLLQMVES I GGLLGPPLS 
GYLRDVTGNYTAS FWAGAFLLSGSGILLTLPHFFCS STTTS GFQDLVTEALDTKVPLPK 



10 
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| EGLEED | 

NOV96c 

The disclosed NOV96c (alternatively referred to herein as CG569 18-03) includes the 
1445 nucleotide sequence (SEQ ID NO:305) shown in Table 96E. A SEC2 ORF begins with a 
5 Kozak consensus ATG initiation codon at nucleotides 9-1 1 and ends with a stop codon at 
nucleotides 1287-1289. The disclosed NOV96c maps to human chromosome 17. 



Table 96E. NOV96c Nucleotide Sequence (SEQ ID 
NO:305) 

ccgcttagatggcgcgcaggacagagccccccgacgggggctggggatgggtggtggtgc 

tcgtggagtttgtggcggcgtttgaggagCaggcagcgcgcgtctcctggatcgcctcca 
taggaatcgcggtgcagcagtttgggagcccggtaggcagtgccctgagcacgaagttcg 
ggcccaggcccgtggtgatgactggaggcatcttggctgcgctggggatgctgctcgcct 

CTTTTGCTACTTCCTTGACCCACCTATACCTGAGTATTGGGTTGCTGTCAGGCTCTGGCT 

GGG CTTTGACCTTCGCTCCGAGCCnX3GCCTGCCTGTCCTGTTATTTCTCTCGCCGACGAT 

CCCTGGCCACCGGGCTGGCACTGACAGGCGTGGGCCTCTCCTCCTTCACATTTGCCCCCT 

TTTTCCAGTGGCTGCTCAGCCACTACGCCTGGAGGGGGTCCCTGCTGCTGGTGTCTGCCC 

TCTCCCTCCACCTAGTGGCCTGTGGTGCTCTCCTCCGCCCACCCTCCCTGGCTGAGGACC 

CTGCTGTGGGTGGTCCCAGGGCCCAACTCACCTCTCTCCTCCATCATGGCCCCTTCCTCC . 

GTTACACTGTTGCCCTCACCCTGATCAACACTGGCTACTTCATTCCCTACCTCCACCTGG 

TGGCCCATCTCCAGGACCTGGATTGGGACCCACTACCTGCTGCCTTCCTACTCTCAGTTG 

TTGCTATTTCTGACCTCGTGGGGCGTGTGGTCTCCGGATGGCTGGGAGATGCAGTCCCAG 

GGCCTGTGACACGACTCCTGATGCTCTGGACCACCTTGACTGGGGTGTCACTAGCCCTGT 

TCCCTGTAGCTCAGGCTCCCACAGCCCTGGTGGCTCTGGCTGTGGCCTACGGCTTCACAT 

CAGGGGCTCTGGCCCCACTGGCCTTCTCCGTGCTGCCTGAACTAATAGGGACTAGAAGGA 

TTTACTGTGGCCTGGGACTGTTGCAGATGATAGAGAGCATCGGGGGGCTGCTGGGGCCTC 

CTCTCTCAGGCTACCTCCGGGATGTGTCAGGCAACTACACGGCTTCTTTTGTGGTGGCTG 

GGGCCTTCCTTCTTTCAGGGAGTGGCATTCTCCTGACC 

CAACTACTACCTCCGGGCCTCAGGACCTTGTAACAGAAGCACTAGATACTAAAGTTCCCC 
TACCCAAGGAGGGGCTGGAAGAGGACTGAACTCCACAGAGTCAGGCCCAGAAAGCCAAAG 
CTTGACAGCTCCAGGTCTTCTCTTGCCACGTCTTGGTCTC 

AGA1TCTTGATCTGCCTCCCCCTAGAGCAGGCCTGGGGCTCCTGCAATGTGTGTGCCAAC 
CCxxr 



The NOV96c polypeptide (SEQ ID NO:306) encoded by SEQ ID NO:305 is 426 
10 amino acids in length and is presented using the one-letter amino acid code in Table 96F. The 
Psort profile for NOV96c predicts that this sequence is a Type III a membrane protein, has a 
signal peptide, and is likely to be localized to the plasma membrane with a certainty of 0.6400. 
In alternative embodiments, a NOV96c polypeptide is located to the Golgi with a certainty of 
0,4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. The Signal P 
15 predicts a likely cleavage site for a NOV96c peptide is between positions 34 and 35, ie., at the 
dash in the sequence VAA-FE. 
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Table 96F. NOV96c Polypeptide Sequence (SEQ ID 
NO:306) 

MARRTEP PDGGWGWWVL S AFFQSALVFGVLRS FGVFFVEFVAAFEEQAARVS WI AS I GI 
AVQQFGSPVGSALSTKFGPRPVVMTGGILAAI1GMLI1ASFATSLTHLYLSIGLLSGSGWAL 
TFAPSLACLSCYFSRRRSLATGLALTGVGLSSFTFAPFFQWLLSHYAWRGSLLLVSALSL 
HLVACGALLRPPSIiAEDPAVGGPRAQLTSLLHHGPFLRYTVALTLINTGYFIPYLHLVAH 
LQDLDWDPLPAAFI^SWAISDLVGRWSGWLGDAVPGPVTRLLMLWTTLTGVSIiALFPV 
AQAPTALVAIiAVAYGFTS GALAPLAFS VLPEL IGTRRI YCGLGLLQMIES I GGLLGP PLS 
GYLRDVSGNYTASFWAGAFLLSGSGILLTLPHFFCFSTTTSGPQDLVTEALDTKVPIiPK 

EGLEED ____ — 



A BLAST analysis of NOV96 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV96a had high 
homology to other proteins as shown in Table 96G. 

5 



Table 96G. BLASTX results from PatP database for NOV96a 






Smallest 






Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAU01618 Human secreted protein 


940 


3.0e-94 


patp:AAM93737 Human polypeptide 


940 


3.0e-94 


patp:AAB88570 Human hydrophobic domain containing protein 


620 


2.5e-60 


patp:AAY31642 Human transport-associated protein-4 


602 


2.0e-58 


patp:AAU0l586 Human secreted protein related to gene #26 


357 


1.8e-32 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 420 of 702 bases (59%) identical to a gbrGENBANK- 
ID:RNU87627|acc:U87627.1 mRNA from Rattus norvegicus (Rattus norvegicus putative 

10 monocarboxylate transporter (MCT3) mRNA). The full amino acid sequence of the protein of 
the invention was found to have 89 of 191 amino acid residues (46%) identical to, and 1 19 of 
191 amino acid residues (62%) similar to, the 504 amino acid residue ptnnSPTREMBL- 
ACC:O95907 protein from Homo sapiens (Human) (DJ1039K5.2 (SIMILAR TO 
MONOCARBOXYLATE TRANSPORTER (MCT3)). NOV96 also has homology to the 

1 5 other proteins shown in the BLASTP data in Table 96H. 



Table 96H. NOV96 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi| 17491104 | re 
f |XP_06436B.l| 
(XM_064368) 


similar to solute 
carrier family 16 
(monocarboxylic acid 
transporters) , member 8 
(H. sapiens) 
[Homo sapiens] 


427 


424/427 
(99) 


425/427 
(99) 


0.0 
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gi| 14754801 | re 
f |XP_044738.l| 
(XMJJ4473B) 


solute carrier family 
16 (monocarboxylic acid 
transporters) , member 7 
[Homo sapiens] 


478 


141/410 
(34) 


213/410 
(51) 


2e-57 


gi) 4759120) ref 
|NP_004722.l| 
(NM_Q 04731) 


solute carrier family 
16 (monocarboxylic acid 
transporters) , member 
7; monocarboxylate 
transporter 2 
[Homo sapiens] 


478 


141/410 
(34) 


213/410 
(51) 


2e-57 


gi|2497855|sp| 
Q63344|MOT2_RA 
T 


MONOCARBOXYLATE 
TRANSPORTER 2 (MCT 2) 


489 


140/410 
(34) 


215/410 
(52) 


7e-57 


gi|l432167|gb| 
AAB04023.l| 
(U62316) 


monocarboxylate 
transporter 2 
TRattus norvegicus] 


489 


141/410 
(34) 


215/410 
(52) 


8e-57 



This BLASTP data is displayed graphically in the Clustal W in Table 961. A multiple 
sequence alignment is given, with the NOV96a, b, and c proteins being shown on lines 1, 2, 
and 3 in a ClustalW analysis comparing the protein of the invention with the related protein 
sequences shown in Table 96H. 



Table 96L ClustalW Alignment of NOV96 



NOV96a 

NOV96b 

N0V96C 

gi 1 17491104 | 

gi | 14754801 | 

gi|4759120| 

gi j 2497855 | 

gi j 1432167 | 



NOV96a 


NOV96b 


NOV96C 


gi 


17491104 | 


gi 


14754801 j 


gi 


4759120 | 


gi 


2497855 | 


gi 


1432167) 



NOV96a 
NOV96b 
N0V96c 



17491104 | 
14754801 | 
4759120 
2497855 
1432167 



N0V96a 
N0V96b 
N0V96C 
gi | 17491104 | 



(SEQ ID NO:302) 
(SEQ ID NO:304) 
(SEQ ID NO:306) 
(SEQ ID NO: 779) 
(SEQ ID NO: 780) 
(SEQ ID NO: 781) 
(SEQ ID NO: 782) 
(SEQ ID NO: 783) 

10 

....|....|....|... 

- — MARRTE 

- -MARRTE 

MARRTE 

MARRTE 

MPP MPSAPPVHP 

MPP MPSAPPVHP 

MPSESSVKATAAPPPFPL 
MPSESSVKATAAPPPFPL 



20 




30 


PPDGGWGW 




i/W 


LB 


PPDGGWGW 




\7W 




PPDGGWGW 




VW 


iis 


PPDGGWGW 




7W 


lis 


PPDGGWGW 




si 


PPDGGWGW 




Jw 




PPDGGWGW 








PPDGGWGW 




7W 





40 

|fqsalvegvl| ^ 
|fqsal^gvl&|f( 

|FQSALV£GVLfe£j: 
|FQSALVfeGVLi£sj: 
jlSIGFSYAFPKft 1 
JlSIGFS^PPr 
ilSIGFSXAFP^ 

rsiGFSsiAFP: 
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gi 
gi 
gi 



14754801 
4759120 
2497855 
1432167 



NOV96a 
NOV96b 
NOV96C 



gi 
gi 
gi 
gi 
gi 



17491104 
14754801 
4759120 | 
2497855) 
1432167 | 



NOV96a 
NOV96b 
NOV96C 



gi 
gi 
gi 
gi 
gi 



17491104 | 
14754801) 
4759120' 
2497855 
1432167 



NOV96a 
NOV96b 
NOV96C 
gi | 17491104 | 
gi|l475480l| 
gi|4759120| 
gi|2497855| 
gi 1 1432167 I 



NOV96a 
NOV96b 
NOV96C 



gi 


17491104 


gi 


14754801 


gi 


4759120 


gi 


2497855 


gi 


1432167 


KOV96a 


NOV96b 


NOV96C 


gi 


17491104 


gi 


14754801 


gi 


4759120 


gi 


2497855 


gi 


1432167 



NOV96a 
NOV96b 
NOV96C 



gi 
gi 
gi 



17491104 
14754801 
4759120 
2497855 
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gi (1432167 | S K I^E?iEQ0^ L S A £S IV 5fiE SGI ^E 1 CNAINYRLLEKERKIgSKARR 



NOV96a 
NOV96b 
NOV96C 



gi 
gi 
gi 
gi 



17491104 | 
14754801 j 
4759120 
2497855 
1432167 



A 

A 

A 

A 

K 

K 

KKSASQAS! 
KKSASQAS: 




480 



490 



|....|. 



'SNAQSVTSER- -ETNI 
r SNAQSVTSER- -ETNI 
'SNTHNPPSDRDKESSI 
'SNTHNPPSDRDKESSI 



Monocarboxylates such as lactate and pyruvate play a central role in cellular 
metabolism and metabolic communication between tissues. Essential to these roles is their 
rapid transport across the plasma membrane, which is catalysed by a recently identified family 
5 of proton-linked monocarboxylate transporters (MCTs). Nine MCT-related sequences have so 
far been identified in mammals, each having a different tissue distribution, whereas six related 
proteins can be recognized in Caenorhabditis elegans and four in Saccharomyces cerevisiae. 

Direct demonstration of proton-linked lactate and pyruvate transport has been 
demonstrated for mammalian MCT1-MCT4, but only for MCT1 and MCT2 have detailed 

10 analyses of substrate and inhibitor kinetics been described following heterologous expression 
in Xenopus oocytes. MCT1 is ubiquitously expressed, but is especially prominent in heart and 
red muscle, where it is up-regulated in response to increased work, suggesting a special role in 
lactic acid oxidation. By contrast, MCT4 is most evident in white muscle and other cells with 
a high glycolytic rate, such as tumour cells and white blood cells, suggesting it is expressed 

15 where lactic acid efflux predominates. MCT2 has a ten-fold higher affinity for substrates than 
MCT1 and MCT4 and is found in cells where rapid uptake at low substrate concentrations 
may be required, including the proximal kidney tubules, neurons and sperm tails. MCT3 is 
uniquely expressed in the retinal pigment epithelium. MCT1 and MCT4 have been shown to 
interact specifically with OX-47 (CD147), a member of the immunoglobulin superfamily with 

20 a single transmembrane helix. This interaction appears to assist MCT expression at the cell 
surface (Halestrap and Price, 1999, Biochem. J. vol.343: 281-99). 

NOV96 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, foreskin, 

25 heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
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public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV96 is provided in Example 2. 

The NOV96 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
5 Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
1 0 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 

conditions. NOV96 nucleic acids encoding the monocarboxylate transporter-like protein of 
the invention, or fragments thereof, may further be useful in diagnostic applications, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a monocarboxylate transporter-like 
1 5 protein includes the nucleic acid whose sequence is provided in Table 96A, 1 05C, or 1 05E, or 
a fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 96A, 105C, or 105E while 
still encoding a protein that maintains its monocarboxylate transporter-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
20 nucleic acids whose sequences are complementary to the sequence of Table 96A, 1 05C, or 

105E, including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 41% of the bases 
may be so changed. 

30 The novel protein of the invention includes the monocarboxylate transporter-like 

protein whose sequence is provided in Table 96B, 105D, or 105F. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 96B, 105D, or 105F while still encoding a protein that maintains its 
monocarboxylate transporter-like activities and physiological functions, or a functional 
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fragment thereof. In the mutant or variant protein, up to about 54% of the amino acid residues 
may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



10 



NOV97 

NOV97 includes six carboxypeptidase-like proteins, designated herein as NOV97a, 
NOV97b, NOV97c, NOV97d, NOV97e, and NOV97f. 



15 



NOV97a 

The disclosed NOV97a (alternatively referred to herein as CG57070-01) includes the 
1279 nucleotide sequence (SEQ ID NO:307) shown in Table 97A. A NOV97a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 1258-1260. The disclosed NOV97a maps to human chromosome 7q 31 . 



Table 97 A. NOV97a Nucleotide Sequence (SEQ ID 
NO:307) 



ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCT CTGT AGC CGATG AGG C CC AGGTACAG AAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGCTGGACTTCTGGCGGGGGCCTGCCCACCCTGGC 
TCCCCCATCGACGTCCGAGTGCCCTTCCCCAGCATCCAGGCGGTCAAGATCTTTCTGGAG 
TCCCACGGCATCAGCTATGAGACCATGATCGAGGACGTGCAGTCGCTGCTGGACGAGGAG 
CAGGAGCAGATGTTCGCCTTCCGGTCCCGGGCGCGCTCCACCGACACTTTTAACTACGCC 
ACCTACCACACCCTGGAGGAGGTGTATAGCTGGATTGACAACTTTGTAATGGAGCATTCC 
GATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCCATTCTTGTCCTG 
AAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGACACTGGAATTCACTCC 
CGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATTGTCAGTGATTAT 
GGCAAAGACCGTGTCCTGACAGACATACTGAATG C CATGG ACATCTTCATAGAGCTCGTC 
ACAAACCCTGATGGGTTTGCTTTTACCCACAGC^TGAACCGCTTATGGCGGAAGAACAAG 
TCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAAC^GGAACTGGAAGTCGGGT 
TTTGGAGGAAATGGTTCTAACAGCAACCCCTGCTCAGAAACTTATCACGGGCCCTCCCCT 
CAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCCCATGGCAACTTCAAG 
GCTCTGATCTCCATCCAC^GCTACTCTCAGATGCTTATGTACCCTTACGGCCGATTGCTG 
GAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGATGCGGTGGAGGCCTTG 
TATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACCACCCTCTATGTGGCC 
AGTGGGATCACCGTCGACTGGGCCTATGACMTGGCATCAAGTACGCCTTCAGCTTTGAG 
CTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAGATCATCCCCACGGCC 
CAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTGAATCACCCCTACTAG 
CAGCACGACTGAGGGCAGG 



A NOV97a polypeptide (SEQ ID NO:308) encoded by SEQ ID NO:307 is 41 9 amino 
20 acids in length and is presented using the one-letter amino acid code in Table 97B. The Psort 
profile for NOV97a predicts that this sequence has a signal peptide and is likely to be 
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localized outside the cell with a certainty of 0.3703. In alternative embodiments, a NOV97a 
polypeptide is located to lysosomes with a certainty of 0.46200. The Signal P predicts a likely 
cleavage site for a NOV97a peptide is between positions 16 and 17, Le., at the dash in the 
sequence VFG-KE. 

5 _ . 

Table 97B. NOV97a Polypeptide Sequence (SEQ ID 

NO:308) 

^GLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQ 

SPIDVRVPFPSIQAVKIFIiESHGISYETMIEDVQSLLDEEQEQMFAFRSRARSTDTFNYA 
TYHTLEEVYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFSTGGSRHPAIWIDTGIHS 
RE WITHATGI WTANKI VSDYGKDRVLTDI LNAMDIFI ELVTNPIX3FAFTHSMNRLWRKNK 
S IRPGI FCIGVDLNRNWKSGFGGNGSNSNPCSETYHGPS PQSEPEVAAIVNFITAHGNFK 
ALI S I HSY SQMLMYPYGRLLEPVSKQRELYDLAKDAVEALYKVHGI EY IFGS I STTLYVA 
SGI TVD WAYDSGI KYAFS FELRDTGQYGFLL PATQI I PTAQETWMALRTIMEHTLNHP Y 



NOV97b 

The disclosed NOV97b (alternatively referred to herein as CG57070-02) includes the 
1291 nucleotide sequence (SEQ ID NO:309) shown in Table 97C. A SEC2 ORF begins with 
10 a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 1270-1272. The disclosed NOV97b maps to human chromosome 7q31. 



Table 97C NOV97b Nucleotide Sequence (SEQ ID 
NO:309) 

ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGGTGGACTTCTGGCGTGGCCCAGCCAGGCCCAGC 
CTCCCTGTGGATATGAGAGTTCCTTTCTCTGAACTGAAAGACATCAAAGCTTATCTGGAG 
TCTCATGG ACTTGCTTACAG CATCATGATAAAGGACATCCAGGTGAAG CC CCAGGTGCTG 
CTGG ATG AGGAAAGACAGG C CATG GCG AAAT CCCGCCGGCTGG AG CG CAGCAC CAACAG C 
TTCAGTTACTCATCATACCACACCCTGGAGGAGGTATATAGCTGGATTGACAACTTTGTA 
ATGG AGC ATTC CG ATATTGT CTCAAAAATT CAGATTGGCAACAGCTTTG AAAAC CAGTC C 
ATTCTTGT CCTGAAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGAT CGACACT 
GGAATTCACTCCCGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATT 
GTCAGTGATTATGGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTC 
ATAGAGCTCGTCACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGG 
CGGAAGAACAAGTCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAAC 
TGGAAGTCGGGTTTTGGAGGAAATGGTTCTAACAGCAACCCCTGCTCAGAAACTTATCAC 
GGGCCCTCCCCTCAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCCCAT 
GGCAACTTCAAGGCTCTGATCTCCATCCACAGCTACTCTCAGATGCTTATGTACCCrTAC 
GGCCGATTGCTGGAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGATGCG 
GTGGAGGCCTTGTATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACCACC 
CTCTATGTGGCCAGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTACGCC 
TT CAG CTTTGAGCTCCGGGACACTGGGCAGTATGG CTT C CTGCTGC CGGC CAC ACAGAT C 
ATCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTGAAT 
CACC C CTACTAG C AG C ACG ACTGAGGGCAGG 



A NOV97b polypeptide (SEQ ID NO:310) encoded by SEQ ID NO:309 is 423 amino 
15 acids in length and is presented using the one-letter amino acid code in Table 97D. The Psort 
profile for NOV97b predicts that this sequence has a signal peptide and is likely to be 
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localized outside the cell with a certainty of 0.3703. In alternative embodiments, a NOV97b 
polypeptide is located to lysosomes with a certainty of 0.1900. The Signal P predicts a likely 
cleavage site for 'a NOV97b peptide is between positions 16 and 17, Le. 9 at the dash in the 
sequence VFG-KE. 

5 



Table 97D. NOV97b Polypeptide Sequence (SEQ ID 
NO:310) 

MRGLLVLSVLLGAVFGKEDFVGHQVLRISV 

LPVDMRVPFSELKDIKAYLESHGIJVYSIMIKDIQVKPQVLIJDEERQAMAKSRRLERSTNS 

FSYSSYHTLEEVYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFSTGGSRHPAIWIOT 

GIHSREWITHATGI WTANKI VSDYGKDRVLTDILNAMDI FIELVTNPDGFAFTHSMNRLW 

RK2TKSIRPGIFCIGVDLNRNWKSGFGGNGSNSNPCSETYHGPSPQSEPEVAAIVNFITAH 

GNFKALISIHSYSQMLW^PYGRLI^PVSNQRELYDIAKDAVEALYKVHGIEYIFGSIS 

LYVASGITVDWAYDSGIKYAFSFELROTGQYGFLLPATQIIPTAQETWMALRTIMEHTLN 

HPY 



NOV97c 

The disclosed NOV97c (alternatively referred to herein as CG57070-03) includes the 
1344 nucleotide sequence (SEQ ID NO:31 1) shown in Table 97E. A NOV97a ORF begins 
1 0 with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
at nucleotides 1327-1329. 



Table 97E. NOV97c Nucleotide Sequence (SEQ ID 
NO:311) 

TGAAGCTCACCAGGAGGAAGAAGCATGCAGGGCACCCCTGGAGGCGGGACGCGCCCTGGG 
CC^TCCCCCGTGGACAGGCGGACACrrCCTGGTCTTCAGCTTTATCCTGGCAGCAGCTTTG 
GGC CAAATGAATTT CACAGGG CAGGTT CTTCG AGTCC TGGCCAAAGATGAGAAG CAGCT T 
TCACTTCTCGGGGATCTGGAGGGCCTGAAACCCCAGAAGGTGGACTTCTGGCGTGGCCCA 
GCCAGGCCCAGCCTCCCrGTGGATATGAGAGTTCCTTTCTCTGAACTGAAAGACATCAAA 
G CTT AT CTGGAGT CTC ATGGACTTG CTT ACAGCATCATGATAAAGG ACATC CAGGTGCTG 
CTGGATGAGGAAAGACAGGCCATGGCGAAATCCCGCCGGCTGGAGCGCAGCACCAACAGC 
TT CAGTTACTCAT CATACCACA CC CTGGAGGAGATATATAG CTGGATTGACAA CTTTGTA 
ATGGAGCATTCCGATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCC 
ATTCTTGTCCTGAAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATTGACACT 
GGAATTCACTCCCGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATT 
GTCAGTGATTATGGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTC 
ATAGAGCTCGTCACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGG 
CGGAAGAACAAGTCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAAC 
TGGAAGTCGGGTTTTGGAAATGGTTCTAACAGCAACCCCTGCTCAGAAACTTATCACGGG 
CCCTCCCCTCAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCCCATGGC 
AACTTCAAGGCTCTGATCTCCATCCACAGCTACTCTCAGATGCTTATGTACCCTTACGGC 
CGATTGCTGGAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGATGCGGTG 
GAGGCCTTGTATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACCACCCTC 
GATGTGGCCAGTGGGATC^CCGTCGACTGGGCCTATGACAGTGGCATCAAGTACGCCTTC 
AGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAGATCATC 
CCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTGAATCAC 
CCCTACTAGCAGCACGACTGAGGG 



A NOV97c polypeptide (SEQ ID NO:312) encoded by SEQ ID NO:31 1 is 434 amino 
acids in length and is presented using the one-letter amino acid code in Table 97F. The Psort 
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profile for NOV97c predicts that this sequence has a signal peptide and is likely to be 
localized outside the cell with a certainty of 0.585 1 . In alternative embodiments, a NOV97c 
polypeptide is located to lysosomes with a certainty of 0.4366. The Signal P predicts a likely 
cleavage site for a NOV97c peptide is between positions 33 and 34, i.e., at the dash in the 
5 sequence ALG-QM. 



Table 97F. NOV97c Polypeptide Sequence (SEQ ID 
NO:312) 

MQGTPGGGTRPGPSPVDRRTLLVFSFILAAALGQMNFTGQVLRVLAKDEKQLSLIiGDLEG 
LKPQKVDFWRGPARPSLPVDMRVPFSELKDIKAYLES 

AKSRRLERSTNSFSYSSYHTLEEIYSWIDNFVMEHSDIVSKIQIGNSPENQSILVLKFST 
GGSRHPAIWIDTGIHSREWITHATGIWTANKIVSDYGKDRVLTDIMAMDIFIELVTNPD 
GFAFTHSMNRLWRKNKSIRPGIFCIGVDLNRNWKSGFGNGSNSNPCSETYHGPSPQSEPE 
VA^VNFITAHGNFKALISIHSYSQMLMYPYGRLLEP 

IEYIFGS I STTLDVASGITVDWAYDSGIKYAFSFELRDTGQYGFLIiPATQI IPTAQETWM 
ALRTIMEHTLNHPY 



NOV97d 

The disclosed NOV97d (alternatively referred to herein as CG57070-04) includes the 
10 988 nucleotide sequence (SEQ ID NO:313) shown in Table 97G. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TGA codon at 
nucleotides 973-975. The disclosed NOV97d maps to human chromosomes 7q31 . 



Table 97G. NOV97d Nucleotide Sequence (SEQ ID 
NO:313) 

ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 

GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 

GAGCTGGAGGACCTGGAGCACCTGCAGCTGGACTTCTGGCGGGGGCCTGCCCACCCTGGC 

TCCCCCATCGACGTCCGAGTGCCCTTCCCCAGCATCCAGGCGGTCAAGATCTTTCTGGAG 

TCCCACGGCATCAGCTATGAGACCATGATCGAGGACGTGCAGTCGCTGCTGGACGAGGAG 

C^GGAGCAGATGTTCGCCTTCCGGTCCCGGGCGCGCTCCACCG^CACTTTTAACTACGCC 

ACCTACCACACCCTGGAGGAGGTGTATAGCTGGATTGACAACTTTGTAATGGAGCATTCC 

GATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCCATTCTTGTCCTG 

AAGTT CAG CACTGGAGGTT C T CGG CAC CC AGC CAT CTGGATTGACACTGGAATTCACT C C 

CGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATTGTCAGTGATTAT 

GG CAAAGACCGTGTCCTG ACAGAC AT ACTGAAT GC CATGGACAT CT TC ATAGAG CT CGT C 

ACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGGCGGAAGAACAAG 

T C CAT CAG ACCTGGAAT CTTCTGCAT CGG CGT GG ATCT CAACAGG AACTGGAAGT CGGGT 

TTTGGAG ATGTGGCCAGTGGGATC AC C GT CGACTGGG C CT ACGACAGT GGC AT CAAGT AC 

GCCTTCAGCn^TGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAG 

ATC^TCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGA 

AATCACCCCTACTAGCAGCACGACTGAG 



15 A NOV97d polypeptide (SEQ ID NO:314) encoded by SEQ ID NO:313 is 324 amino 

acids in length and is presented using the one-letter amino acid code in Table 97H. The Psort 
profile for NOV97d predicts that this sequence has a signal peptide and is likely to be 
localized to lysosomes with a certainty of 0.4757, or outside the cell with a certainty of 
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0.3703. The Signal P predicts a likely cleavage site for a NOV97d peptide is between 
positions 16 and 17, i.e., at the dash in the sequence VFG-KE. 



Table 97H. NOV97d Polypeptide Sequence (SEQ ID 
NO:314) 

MRGLLVLS VLLGAVFGKEDFVGHQVLRI S VADEAQ VQKVKELEDL EHLQLDFWRGPAHPG 
SPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDEEQEQMFAFRSRARSTDTFNYA 
TYHTLEEVYS WIDNFVMEHSDI VSKIQIGNS PENQS ILVLKFSTGGSRHPAIWIDTGIHS 
RE WI THATGI WTANKI VSDYGKDRVLTD I LNAMD I F I ELVTNPDG FAFTHSMNRLWRKNK 
SI RPGI FCIGVDLNRNWKSGFGDVASGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQ 
1 1 PTAQETWMALRTIMEHTLNHPY 



NOV97e 

The disclosed NOV97e (alternatively referred to herein as CG57070-05) includes the 
1348 nucleotide sequence (SEQ IDNO:315) shown in Table 97L A NOV97e ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
at nucleotides 1333-1335. The disclosed NOV97e maps to human chromosome 7. 



Table 97L NOV97e Nucleotide Sequence (SEQ ID 
NO:315) 

TGAAG CT CAC CAGGAGGAAGAAGC ATGCAGGGTACTCCTGGAGG CGGG ACGCG CCCTGGG 
CCAT'CCCCCGTGGACAGGCGGACACTCCTGGTCTTCAGCTTTATCCTGGCAGCAGCTTTG 
GGC CAAATGAATT TCACAGGGGAC CAGGTT CTTCGAGT C CTGG CCAAAG AT GAGAAGCAG 
CTTTCACTTCTCGGGGATCTGGAGGGCCTGAAACCCCAGAAGGTGGACTTCTGGCGTGGC 
CCAGCCAGGCCCAGCCTCCCTGTGGATATGAGAGTTCCTTTCTCCGAACTGAAAGACATC 
AAAGCTTATCTGGAGTCTCATGGACTTGCTTACAGCATCATGATAAAGGACATCCAGGTG 
CTGCTGGATGAGGAAAGACAGGCCATGGCGAAATCCCGCCGGCTGGAGCGCAGCACCAAC 
AG CTT CAGTTACT CAT CAT AC CACAC C CTGGAGGAG AT AT AT AGCTGGATTGACAACTT T 
GTAATGGAGCATTCCGATATTGTCTGAAAAATTCAGATTC 

TCCATTCTTGTCCTGAAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGAC 

ACTGGAATTCACTCCCGGGAGTGGATC^CCCATGCCACCGGCATCTGGACTGCCAATAAG 

ATTGTCAGTGATTATGGCAAAGACCGTGTC CTGACAGACATACTGAATGCCATGGACAT C 

TTCATAGAGCTCGTCACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTA 

TGGCGGAAGAACAAGTCCATCAGACCTGGAATCITCTGCATCGGCGTGGATCTCAACAGG 

AACTGGAAGT CGGGT TT TGGAGGAAATGGTT C TAACAG CAAC C C CTGCTCAGAAACTT AT 

C^CGGGCCCTCCCCTCAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCC 

CATGGCAACTTCAAGGCTCTGATCTCCATCCACAGCTACTCTCAGATGCra 

TACGGCCGATTGCTGGAGCCCGTTTC^AATCAGAGGGAGTTGTACGATCTTGCCAAGGAT 

GCGGTGGAGGCCTTGTATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACC 

ACCCTCTATGTGGCCAGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTAC 

GCCITCAGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTG(^ 

ATC7VTCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACAACCTG 

AATCACCCCTACTAGCAGCACGACTGAG 



A NOV97e polypeptide (SEQ ID NO:316) encoded by SEQ ID NO:315 is 436 amino 
acids in length and is presented using the one-letter amino acid code in Table 97J. The Psort 
profile for NOV97e predicts that this sequence has a signal peptide and is likely to be 
15 localized outside the cell with a certainty of 0.5851 . In alternative embodiments, a NOV97e 
polypeptide is located to lysosomes with a certainty of 0.442i . The Signal P predicts a likely 
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cleavage site for a NOV97e peptide is between positions 33 and 34, i.e., at the dash in the 
sequence ALG-QM. 



Table 97J. NOV97e Polypeptide Sequence (SEQ ID 
NO:316) 

MQGTPGGGTRPGPSPVDRRTLLVFSFIIJ^AALGQMNFTGDQVLRVLAKDEKQLSLIjGDLE 
GL K P Q KVD FWRGP ARP SL P VDMR VP F S ELKD I KAYLESHGLAYS I M I KDIQVLLDEERQA 
MAKSRRLERSTNSFSYSSYHTLEEIYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFS 
TGGSRHPAIWIDTGI HSREWITHATG I WTANKI VSDYGKDRVLTDI LNAMDI F I ELVTNP 
DGFAFTHSMNRLWRKNKS IRPGIFCIGVDLNRNWKSGFGGNGSNSNPCSETYHGP SPQSE 
PE VAAI VNFI TAHGNFKAL I S IHS YSQMLMYP YGRLLEP VSNQREL YDI»AKDAVEAL YK V 
HGIEYIFGSI STTLYVASGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQI I PTAQET 
WMALRTIMEHNLNHPY 



NOV97f 

The disclosed NO V97f (alternatively referred to herein as CG57070-06) includes the 
975 nucleotide sequence (SEQ ID NO:317) shown in Table 97K. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TAG codon at 
nucleotides 973-975 . The disclosed NOV97f maps to human chromosome 7q3 1 . 



Table 97IC NOV97f Nucleotide Sequence (SEQ ID 
NO:317) 

ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGCTGGACTTCTGGCGGGGGCCTGCCCACCCTGGC 
TCCCCCATCGACGTCCGAGTGCC CTTC CC CAGCATCCAGG CGGTCAAGATCTTTCTGGAG 
TCCCACGGCATCAGCTATGAGACCATGATCGAGGACGTGCAGTCGCTGCTGGACGAGGAG 
CAGGAGCAGATGTTCGCCTTCCGGTCCCGGGCGCGCTCCACCGACACTTTTAACTACGCC 
ACCTACCACACCCTGGAGGAGGTGTATAGCTGGATTGACAACTTTGTAATGGAGCATTCC 
GATATTGTCTCAAATATTCAGATTGGCAACAGCTTTGAAAACCAGTCCATTCTTGTCCTG 
AAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGACACTGGAATTCACTCC 
CGGGAGTGGATCACCCGTGCCACCGGCATCTGGACTGCCAATAAGATTGTCAGTGATTAT 
GGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTCATAGGGCTCGTC 
ACAAACCCTGATGGGTTTGCTTTTACCCACAGC^ITGAACCGCTTATGGCGGAAGAACAAG 
TCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAACTGGAAGTCGGGT 
TTTGGAGATGTGGCCAGTGGGAT»CCGTCGACTGGGCCTATGACAGTGGCATCAAGTAC 
GCCTTCAGCTTTGAGCTCCGGGACACTGGGC^GTATGGC^ 

ATCATCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACATCCTG 
AATGACCCCTACTAG 



A NOV97f polypeptide (SEQ ID NO:318) encoded by SEQ ID NO:317 is 324 amino 
acids in length and is presented using the one-letter amino acid code in Table 97L. The Psort 
profile for NOV97f predicts that this sequence has a signal peptide and is likely to be localized 
1 5 outside the cell with a certainty of 03989. In alternative embodiments, a NOV97f polypeptide 
is located to lysosomes with a certainty of 0.506 L The Signal P predicts a likely cleavage site 
for a NOV97f peptide is between positions 16 and 17, i.e., at the dash in the sequence VFG- 
KE. 
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Table 97L. NO V97f Polypeptide Sequence (SEQ ID 
NO:3l8) 

MRGLLVL S VLLGAVFGKEDFVGHQVLRI S VADEAQVQKVKELEDLEHLQLDFWRGPAHPG 
SPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDEEQEQMFAFRSRARSTDTFNYA 
TYHTLEEVYSWIDNFVMEHSDIVSNIQIGNSFENQSILVLKFSTGGSRHPAIWIDTGIHS 
REWITRATGIWTANKIVSDYGKDRVLTDII^AMDIFIGLVTNPDGFAFraSMNRLWRKI^ 
SIRPGIFCIGVDUjnWWKSGFGDVASGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQ 
IIPTAQETWMALRTIMEHILNHPY 



A BLAST analysis of NOV97 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV97 had high 
5 homology to other proteins as shown in Table 97M. 



Table 97M. BLASTX results from PatP database for NOV97 






Smallest 






Sum 


Sequences producing High-scoring Segment Pairs: 


High 


Probability 


Score 


P (N> 


patp:AAE01663 Novel human protease #2 


1898 


9.3e-196 


patp:AAB47565 Protease PRTS-7 


1898 


9.3e-196 


patp:AAE0l664 Novel human protease #3 


1013 


1.8e-174 


patp:AAR97618 Human carboxypeptidase Al 


1682 


7.2e-173 


patp:AAY289l5 Human regulatory protein HRGP-l 


1682 


7.2e-173 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1017 of 1264 bases (80%) identical to a gb:GENBANK- 

10 ID:HSPCBXAl|acc:X67318.1 mRNA from Homo sapiens (H.sapiens mRNA for 

procarboxypeptidase Al). The full amino acid sequence of the protein of the invention was 
found to have 315 of 419 amino acid residues (75%) identical to, and 357 of 419 amino acid 
residues (85%) similar to, the 419 amino acid residue ptnr:SWISSNEW-ACC:P15085 protein 
from Homo sapiens (Human) (CARBOXYPEPTIDASE A 1 PRECURSOR (EC 3.4.17.1)). 

1 5 NOV97 also has homology to the other proteins shown in the BLASTP data in Table 97N. 



Table 97N. NOV97 BLASTP results 


Gene Index/ 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi| 17939400 | ref 
|NP_525124.l| 
(NM 080385) 


carboxypeptidase A5 
[Homo sapiens] 


436 


354/418 
(84) 


382/418 
(90) 


0.0 


gi|4502997|ref | 
NP_001859.l| 
(NM_001868) 


pancreatic 
carboxypeptidase Al 
precursor; 
Carboxypeptidase A 
[Homo sapiens] 


419 


315/419 
(75) 


357/419 
(85) 


0.0 
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gi|l3528975|gb| 
AAH05279.l|AAH0 
5279 (BC005279) 


carboxypeptidase Al 
(pancreatic) 
[Homo sapiens] 


419 


314/419 
(74) 


356/419 
(84) 


0.0 


gi|4336196|gb|A 
AD17690.l| 
(AF076222) 


carboxypeptidase Al 
precursor [Sus scrofa] 


419 


295/419 
(68) 


345/419 
(81) 


e-177 


gi|ll5878|sp|P0 
0730|CBPA BOVIN 


CARBOXYPEPTIDASE A 
PRECURSOR 


419 


289/419 
(68) 


345/419 
(81) 


e-176 



This BLAST? data is displayed graphically in the ClustalW in Table 970. A multiple 
sequence alignment is given, with the NOV97a-f proteins being shown on lines 1-6 in a 
ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 97R 



Table 970. ClustalW Alignment of NOV97 



NOV97a 
NOV97b 
NOV97C 
NOV97d 
NOV97e 
NOV97f 

gi 
gi 
gi 
gi 
gi 



(SEQ 
(SEQ 



17939400) 
4502997) 
13528975) 
4336196) 
115878) 



NOV97a 
NOV97b 
NOV97C 
NOV97d 
N0V97e 
N0V97f 



17939400) 

4502997) 

13528975.) 

4336196) 

115878) 



N0V97a 
NOV97b 
NOV97C 
NOV97d 
N0V97e 
NOV97f 

gi 
gi 
gi 
gi 
gi 



NOV97a 
NOV97b 
NOV97C 
NOV97d 
NOV97e 



(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 



ID NO 
ID NO 
ID NO 
ID NO 
ID NO 
ID NO 
ID NO 
ID NO 
ID NO 
ID NO 
ID NO 



:308) 
:310) 
:312) 
:314) 
:316) 
:318) 
:784) 
:785) 
:786) 
:787) 
:788) 



17939400) 
4502997 | 
13528975 | 
4336196) 
115878) ' 




100 



110 




120 



130 
. . 1 . . 



140 



I 



150 
•-I 



IiLdeeqeqmfaSrSrarst^fnyatyhtleeSy 



3^STXg Ffi^gg YHTLEEQY 
RSTtfl fSyHs YHTLEEI Y 



LLDEEgggM 

lldee SB m * _ 
llde5qeqmfa§r§rarst^fnyatyhtleeSy 

LLDEEi^M^^Ri^RST^F^Y^YHTLEEIY 
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NOV97f 

gi | 17939400 | 

gi j 4502997 | 

gi j 13528975 | 

gi|4336196| 

gi | 115878 | 



NOV97a 
NOV97b 
NOV97C 
NOV97d 
NOV97e 
NOV97f 

gi 
gi 
gi 
gi 
gi 



17939400 I 
4502997| 
13528975 | 
4336196 | 
115878 | 



NOV97a 
NOV97b 
NOV97C 
NOV97d 
NOV97e 
NOV97f 



gi 
gi 
gi 
gi 
gi 



17939400). 
4502997 | 
13528975 | 
4336196 | 
115878 | 



NOV97a 
NOV97b 
NOV97C 
NOV97d 
NOV97e 
NOV97f 



gi 
gi 
gi 
gi 
gi 



17939400) 

4502997) 

13528975) 

4336196) 

115878) 



NOV97a 

NOV97b 

NOV97C 

NOV97d 

NOV97e 

NOV97f 

gi |17939400| 

gi (4502997 | 

gi j 13528975 | 

gi |4336196| 

gi j 115878 | 



NOV97a 
NOV97b 
NOV97c 
NOV97d 




PQi^p[5gAiftBBi^:rc^ 




400 



ttgyvasgit vdwaydsgi kyafsf 
ttSyvasgitvdwaydsgikyaf.sf 

Tt33vAS6ITVDWAYDSGI KYAFSF 
VASGITVD^AYDSG I KYAFSF 
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NOV97e 
NOV97f 

gi 
gi 
gi 
gi 



17939400 | 
4502997 | 
13528975) 
4336196 | 
115878 | 



NOV97a 
NOV97b 
NOV97C 
N0V97d 
N0V97e 
NOV97f 

gi 
gi 
gi 
gi 
gi 



17939400] 
4502997| 
13528975 | 
4336196) 
1158781 




I. 



410 



420 
..| 



430 



I 



I 



440 



elrdtgqygfllpatqiiptaqetwmalrtimehtlnhpy 
elrdtgqygfllpatqiiptaqetwmalrtimehtlnhpy 
3lrdtgqygfllpatqiiptaqetwmalrtimehtlnhpy 
slrdtgqyg fllpatq 1 1 pt aqetwmalrt i me htlnhpy 
elrdtgqygfllpatqiiptaqetwmalrtimehJJlnhpy 
e lrdtgqyg fllpatq 1 1 pt aqetwmalrt i me hylnhpy 
elrdtgqygfllpatqiiptaqetwmalrtimehtlnhpy; 

SLRDTG gYGFLLPAgQI I PTASeTWJalB-T IMEHTLNHP Y 

elrdtgBygfllpaSqiipta[2ETwSalHtipoehtlnhpy 
elrdtgBygfllpaBqiiptaqetwSmjB.timehtlisjhpy 

ELRDTgSyGFLLPaBqI IPTAQETWp^^" " " 



timehtlnSHy 



Table 97P lists the domain description from DOMAIN analysis results against NOV97. 
This indicates that the NO V97 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 97P. Domain Analysis of NOV97 


2nIISmartlsmart0063KZn pept. Znjiept domain SEO ID NO s 8 87 








CD-Length « 286 residues, ' 100 . 0% aligned 






Score = 


301 bits (770), Expect = 6e-83 




NOV97 : 


122 


yhtleevyswidnfvmehsdivskiqignsfenqsilvIiKFSTGGSR--hpaiwidtgih 


179 






YH+ EE+ +W+ + D+V + IG S E + I VLK S G R PA+WID GIH 




Sbjct: 


1 


YHS YEE IEAWLKKLAARYPDLVRLVS IGKSVEGRP I WVLKI SNGPGRDGKPAVWI DAG I H 


60 


NOV97: 


180 


SREWITHATGI^ANKrVSDYGKDRVLTDILNAMDIFIELVTNPDGFAFTHSMNI^WR^ 


239 






+REWI AT ++ N+++ +YG D +T +L+ D +1 V NPDG+ +TH+ +RLWRKN 




Sb j Ct : 


61 


AREWIGPATALYLINQLLENYGSDPRWKLLDKTDWYIVPVLNPDGYEYTHTSDR^ 


120 


NOV97: 


240 


KSIRPGI FCIGVDLNRNWK5GFGGWGSWSNPCSETYHGPS PQSEPEVAAIVNFITAHGNF 


299 






+S G C GVDLNRN+ +G- G++SNPCSETY GPSP SEPE A+ +F+ ++ 




Sb j Ct : 


121 


RS PNSGSNCRGVDLNRNFPFHWGETGAS SNP CSETYAGPSP FSEPETKAVRDFLRSNRKI 


180 


NOV97 : 


300 


KAL I S IHS YSQMLM YPYGRLLE P V - SNQRELYDLAKDAVEALYKVHG - 1 E YI FGS I STTL 


357 






K I +HSYSQ+++YPYG + N +L ++AK +AL VHG Y +G + L 




Sb j Ct : 


181 


KLYIDLHSYSQLILYPYGYTKNDIjPPNVEDLPEVAKALADALASVHGGTRYTYGISNGAL 


240 


NOV97: 


358 


YVASGITVDWAYD- SGIKYAFSFELRDTGQYGFLLPATQI I PTAQE 4 02 








Y ASG + DWAY G+ ++++ ELRD G+YGFLLP +QIIPT E 




Sbjct: 


241 


YPASGGSDDWAYGTLGVPFSYTLELRDKGRYGFLLPPSQI IPTGWE 286 





10 



Carboxypeptidase A (EC 3.4.2.1 ) is a pancreatic exopeptidase. Three different forms 

of human pancreatic procarboxypeptidase A have been isolated. The Al and A2 (600688) 

forms are monomeric proteins with different biochemical properties. Honey et al. (1984, 1986) 

found that an 8.6-kb human DNA fragment (detected by means of a rat cDNA probe for CPA) 

cosegregated with chromosome 7. The assignment was narrowed by demonstration of absence 
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of the human DNA fragment in cells with a deletion of 7q22-qter. By studying mouse-hamster 
hybrid cells, Honey et al. (1986) assigned the CPA gene to mouse chromosome 6. Trypsin 
(276000) is also on human 7q22-qter and on mouse 6. Stewart et al. (1990) concluded from 
multipoint linkage analysis with established chromosome 7 markers that the most likely 
5 location of carboxypeptidase is 7q31-qter. It lies distal to cystic fibrosis at a distance of 
approximately 12 cM. 

NOV97 is predicted to be expressed in at least the following tissues: pancreas. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 

10 sources, and/or RACE sources. Further expression data for NOV97 is provided in Example 2. 

The NOV97 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 

15 Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 

disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV97 nucleic acids encoding the carboxypeptidase-like protein of the invention, 

20 or fragments-thereof, may further be useful.in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a carboxypeptidase Al-like protein 
includes the nucleic acid whose sequence is provided in Table 97A, 97C, 97E, 97G, 971, or 
97K or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

25 whose bases may be changed from the corresponding base shown in Table 97A, 97C, 97E, 
97G, 971, or 97K while still encoding a protein that maintains its carboxypeptidase Al-like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to the sequences of Table 
97A, 97C, 97E, 97G, 971, or 97K, including nucleic acid fragments that are complementary to 

30 any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
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in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 20% of the bases 
may be so changed. 

The novel protein of the invention includes the carboxypeptidase Al-like protein 
whose sequence is provided in Table 97B, 97D, 97F, 97H, 97J, or 97L. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 97B, 97D, 97F, 97H, 97J, or 97L while still encoding 
protein that maintains its carboxypeptidase Al-like activities and physiological functions, or 
functional fragment thereof. In the mutant or variant protein, up to about 25% of the amino 
acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV98 

The disclosed NOV98 (alternatively referred to herein as CG56939-01) includes the 
5583 nucleotide sequence (SEQ ID NO:319) shown in Table 98A. A NOV98 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TGA codon 
at nucleotides 4630-4632. The disclosed NOV98 maps to human chromosome 1. 



Table 98A. NOV98 Nucleotide Sequence (SEQ ID 
NO:319) 

GCCGCGTGCGTCCTGCCCGGAGCCGGCGGGACATGCCCC^AGCGCGCGCTGGAGCGGCGC 
GAGGAGGAGGCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCG 
GTGCAGCACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTG 
GTGGCCCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGAC 
CCCCTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCT 
GCACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCATG 
CGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGGACC 
CACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCGGCTTC 
GGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTGCAAGAAG 
AGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACCTACAGCAAC 
GAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCCTGCTCAGCCGC 
GGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTTCGGCAGCACCTGT 
GCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCGACCTGCCGTGGCGCC 
CCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCGGCGAGTGCCAGCTCCTG 

TGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGTGTGAACCCGCGCACGCGGCGC 
CCTGAGATGCGCCTACGGCCCGAGAGCTGCCCTGCCCGGCAGGCGCCAGTGTGTGGGGAC 
GACGGAGTCACCTACGAAAACGACTGTGTCATGGGCCGATCGGGGGCCGCCCGGGGTCTC 
CTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAGGGTCGAGACCAGTGCCCGGAGCCCTGC 



773 



CGGTTCAATGCCGTGTGCCTGTCCCGCCGTGGCCGTCCCCGCTGCTCCTGCGACCGCGTC 
ACCTGTGACGGGGCCTACAGGCCCGTGTGTGCCCAGGACGGGCGCACGTATGACAGTGAT 
TGCTGGCGGCAGC AGGCTGAGTGCCGGCAGCAG CGTGCCATCCCCAGC AAGCACCAGGGC 
CCGTGTGACCAGGCCCCGTCC CCATGCCTCGGGGTGCAGTGTG CATTTGGGGCGACGTGT 
GCTGTGAAGAACGGGCAGGCAGCGTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGAT 
CCTGTGTGCGGCAGCGACGGCGTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCC 
TGTACCCTCGGGCGGGAGATCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAG 
TGCCGCTTTGGAGCCCTGTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGC 
GTGGCTTTGGCCCAGCCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATG 
CTGCACGTGCACGCCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGC 
GAGACCTGTGGAGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTG 
TGTCCCCGGTGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTAC 
-GGCAGTGCCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCC 
CGGGCAGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGAC 
GGTGACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAG 
GACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGGC 
TCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAG 
CGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCACCTTCGCCCCGCTGCCG 
CCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAATATCACC 
GCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAACCCCCATGGC 
TCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCCCAGGTGTGGGG 
GGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGGCATCGTCACCGAT 
GGCCGGAGTGGCTGTACACCCTGCAGC3X3TGATCCCCAAGGCGCCGTGCGGGATGACTGT 
GAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTGGACCCAAGTGTGGGCAG 
TGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGCTGACGCTTCTGCGCCTGCG 
ACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGCGTGGAGGAGTCTGGCTCAGCC 
CACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCAACGCTACCAAGGTCTGTGGGTCA 
GATGGAGTCACATACGGCAACGAGTGTCAGCTGAAGACCATCGCCTGCCGACGGTGTCAC 
CTACGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAGGAGGCTGTTGCT 
CCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCC^GGGCTCCTCCTGAGC 
CAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAGCAGTACCGCACACAGC 
CAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCCAGCGTCCCCAGGACCACC 
GTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCCCTGCACCCAGCCTGGTGGCG 
TC CG CCTTTGGTGAATC TGGC AGCACTGATGGAAG CAG CG ATG AGGAACTG AG CGGGGAC 
CAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCCCGAGCCCTTGGAGGGCAGCAGCGTGGCC 
ACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCTACAACCCCTGCCATGGGGCGGCGCCC 
TGCCGTGTGCTGCCCGAGGGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACC 
TTCTGCCAGACAGCCTCGGGGCAGGACGGCTCTGGGCCCrTCCTGGCTGACTTCAACGGC 
TTCT C CCACCTGGAGCTGAGAGGC CTGCACAC CTTTG CACGGG AC CTG GGGG AGAAGATG 
GCGCTGGAGGTCGTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAG 
AAGACGGACGGCAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTC 
CGCTACGACCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGA 
GCCTGGACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGAC 
GGCCCCCGTGTGTTGGGGGAGTCCCCGGTTCCGCACACCGTCCTCAACCTGAAGGAGCCG 
CTCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTCT 
GGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGCTGCTGACCCCGGAG 
CACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGGCCTCA 
GGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGTGTGCCTG 
TGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCAGCGGGG 
GACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCAACGCTGTGACC 
GAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCGCACTGAGGCCACG 
CAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGACTATGTGGCACTGGCC 
ATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCrCCCAGCCCGTGGTGCTGCGT 
TCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGCACATAGGGAGCAGAGG 
GAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGCTCCTCCCCGCTGGGCGCC 
ACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGC 
CCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTG 
GGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGC 
CCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCTGTAATTATTTTCTATTTTTGTA 
AACTTGTTGCTTTTTGATATGATTTTCTTGCCTGAGTGTTGGCCGGAGGGACTGCTGGCC 
CGGCCTCCCTTCCGTCO^GCAGCCGTGCTGCAGACAGACCTAGTGCTGAGGGATGGACA 
GGCGAGGTGGCAGCGTGGAGGGCTCGGCGTGGATGGCAGCCTCAGGACACACACCCCTGC 
CTCAAGGTGCTGAGCCCCCGCCTTGCACTGCGCCTGCCCCACGGTGTCCCCGCCGGGAAG 
CAGCCCCGGCTCCTGAATCACCCTCGCTCCGTCAGGCGGGACTCGTGTCCCAAAAAGGAA 
GGGGCTGCTGAGGTCTGATGGGGCCCTTCCTCCGGGTGACCCCACAGGGCCTTTCCAAGC 
CCCTATTTGAGCTGCTCCTTCCrGTGTGTGCTCTGGACCCTGCCTCGGCCTCCTGCGCCA 
ATACTGTGACTTCCA^CAATGTTACTGCTGGGCACAGCTCTGCGTTGCTCCCGTGCTGC 
CTGCGCCAGCCCCAGGCTGCTGAGGAGCAGAGGCCAGACCAGGGCCGATCTGGGTGTCCT 
GACCCTCAGCTGGCCCTGCCCAGCCACCCTGGACATGACCGTATCCCTCTGCCACACCCC 
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AGGCCCTGCGAGGGGCTATCGAGAGGAGCTCACTGTGGGATGGGGTTGACCTCTGCCGCC 
TGCCTGGGTAT CTGGGCCTGGCCATGGCTGTGTTCTTCATGTGTTGATTTTATTTGAC C C 
CTGGAGTGGTGGGTCTCATCTTTCCCATCTCGCCTGAGAGCGGCTGAGGGCTGCCTCACT 
GCAAATCCTCCCCACAGCGTCAGTGAAAGTCGTCCTTGTCTCAGAATGACCAGGGGCCAG 
CCAGTGTCTGACCAAGGTCAAGGGGCAGGTGCAGAGGTGGCAGGGATGGCTCCGAAGCCA 
GAA 



A NOV98 polypeptide (SEQ ID NO:320) encoded by SEQ ID NO:319 is 1543 amino 
acids in length and is presented using the one-letter amino acid code in Table 98B. The Psort 
profile for NOV98 predicts that this sequence has no signal peptide and is likely to be 
5 localized to the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV98 
polypeptide is located to m icrobodies with a certainty of 0.3000. 



AACVL PGAGGT CPERALERREEEANVVLTGTVEE ILNTOP VQHT YS CKVRVWR YLKGKDL 
VARESLI»DGGNKWISGFGDPLI CDNQVSTGDTRI FFVNPAPP YLWPAHKNELMLNSSLM 
RI TLRNLEE VEFCVEDKPGTHFTPVP PTPPDACRGMLCGFGAVCEPNAEGPGRAS CVCKK 
SPCPSWAPVCGSDASTYSNECTLQRAQCSQQRRIRLLSRGPCGSRDPCSNVTCSFGSTC 
ARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPCDP 
CQGALPDP SRS CR VNPRTRRPEMRLRPES CPARQAPVCGDDGVT YENDCVMGRSGAARGL 
LLQKVRSGQCQGRDQCPEPCRFNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSD 
CWRQQAECRQQRAIPSKHQGPCDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYD 
P VCGS DGVTYGS ACELEATACTLGRE I QVARKGPCDRCGQCRFGAL CEAETGRCVCPSEC 
VALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCV 
CPRCEHPPPGPVCGSDGVTYGSACEIiREAACLQQTQIEEARAGPCEQAECGSGGSGSGED 
GDCTQELCRQRGGIWDEDSEDGPCTCDFSCQSVPGSPVCGSIXJVTYSTECEIjKKARCESQ 
RGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGIiAGCPSACQCNPHG 
SYGGTCDPATGQCSCRPGVGGIiRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDC 
EQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSA 
HCVCPMLTCPEANATKVCGSDGVTYGNECQLKTI ACRRCHLRQGLQI S IQSLGPCQEAVA 
PSTHPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTT 
VWPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGPEPLEGSSVA 
TPGPPVERASCYNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNG 
F SHLELRGIiHTF ARDLGEJCMALE WFIARGP S GXiLLYNGQKTDGKGDFVS LALRDRRLE F 
RYDLGKGAAVI RS RE PVTLGAWTRVSLERNGRKGALRVGDGPRVLGES P VPHTVLNLKEP 
LYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSFAGHPCTRAS 
GHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTIAFDGRTFVEYLNAVT 
ESEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLR 
STVPVmmWLRWAHREQREGSLQVGNEAPVTGSSPI/3ATQLDTIX3ALWLGGLPELPVG 
PALPKAYGTGFVGCLRDVWGRHPLHLLEDAVTKPELRPCPTP 



A BLAST analysis of NOV98 was run against the proprietary PatP GENESEQ Protein 



10 Patent database. It was found, for example, that the amino acid sequence of NOV98 had high 
homology to other proteins as shown in Table 98C. ' 



Table 98B.NOV98 Polypeptide Sequence (SEQ ID 
NO:320) 



Table 98C. BLASTX results from PatP database for NOV98 



Sequences producing High -scoring Segment Pairs: 



Smallest 
Sum 

High Probability 
Score P (N) 



patp:AAW26609 Human agrin - Homo sapiens, 492 aa. 
patp:AAB93 754 Human protein sequence 



2349 2.2e-246 
2179 1.5e-225 
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patp:AAY73993 Human prostate tumor EST fragment 


2177 


2 


5e- 


225 


patp:AAB31889 Amino acid sequence of a human protein 


380 


1 


.2e- 


■62 


patp:AAU16938 Human novel secreted protein 


551 


1 


2e- 


•51 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 3158 of 3416 bases (92%) identical to a gb:GENBANK- 
ID:AF016903|acc:AF016903.1 mRNA from Homo sapiens (agrin precursor mRNA). The full 
5 amino acid sequence of the protein of the invention was found to have 1092 of 1 1 14 amino 
acid residues (98%) identical to, and 1093 of 1 1 14 amino acid residues (98%) similar to, the 
2026 amino acid residue ptnr:SPTREMBL-ACC:O00468 protein from Homo sapiens 
(Human) (AGRIN PRECURSOR). NOV98 also has homology to the other proteins shown in 
the BLASTP data in Table 98D. 
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Table 98D. NOV98 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi|2988422|gb 

|AAC39776.l| 

(AF016903) 


agrin precursor 
[Homo sapiens] 


2026 


1088/1097 
(99) 


1088/1097 
(99) 


0.0 


gi|39902l(sp( 
P25304|AGRI R 
AT 


Agrin precursor 


1959 


772/958 
(80) f 


825/958 
(85) 


0.0 


gi|202800|gb| 
AAA40703.l| 
(M64780) 


agrin [Rattus 
norvegicus] 


1940 


772/958 
(80) 


825/958 
(83) 


0.0 


gi|202799)gb| 
AAA40702.l| 
(M64780) 


agrin [Rattus 
norvegicus] 


1937 


769/958 
(80) 


822/958 
(85) 


0.0 


gi|399020|sp| 
P31696|AGRI C 
HICK 


Agrin precursor 


1169 


637/1017 
(62) 


758/1017 
(73) 


0.0 



This BLASTP data is displayed graphically in the ClustalW in Table 98E. A multiple 
sequence alignment is given, with the NOV98 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 98D. 



Table 98E. ClustalW Alignment of NOV98 


N0V98 


(SEQ ID NO:320) 


gi 


29884221 (SEQ IDN0:789) 


gi 


399021 


(SEQ ID NO: 790) 


gi 


202600 


(SEQ ID NO: 791) 


gi 


202799 


• (SEQ ID NO: 792) 


gi 


399020 


(SEQ ID NO: 793) 
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TYSCKVR 



2988422 

399021 

202800 

202799 

399020 



AACVLPGAGGT 



NOV98 



2988422 

399021 

202800 

202799 

399020 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 


gi 


2988422 | 


gi 


399021 




gi 


202800 




gi 


202799 




gi 


399020 





NOV98 



2988422 

399021 

202800 

202799 

399020 



NOV98 



gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 


NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 



gi 
gi 
gi 
gi 
gi 



2988422 

399021 

202800 

202799 

399020 




tEEEANWtiTGT VEE Y LNVDP VQllT Y S CKVR 

IpRpE 

'RQE r 



60 



70 



80 



90 



100 




| | | I I I I I I 

4»vaBeslldggnkvvi sgfgdplicdnqvstgdtri ffvnp 

Ja 5 ESLLDGGNKWI SGFGDPLI CDNQVSTGDTRI FFVNP 




120 130 

.ITjERNLEEftE 
ITiURNLEE^E 

'^FA^ 
iGFAV 



140 



150 




160 
. . I . . 



170 



180 



190 



200 



■I- 



pffiPPTPPDgCRGMLCGFGAVCEPL 

pSpptppdScrgmlcgfgavceps 



EEPGRASCVCKK|| CPg_ 

SeSpgrascvckkSHcpI? 



pIpptppdScrgmlcgfgavcepsSedpgrascvckkSacp| 

^PPTPigpBcRGMLCGFGAVCESsBSDP^ASCVCKKgACPe 



240 



VAPVCGS 
VAPVCGS 
k/APVCGS 
kfAPVCGS 
pAPVCGS 
iVAPVCGS 



250 




300 



310 320 330 340 350 




TYENDCVMgR \ GAgRGLLLQKVRSGQCQSRDQCPEgCgFNgyCLSRRGRP 
TYENDCVMgR ' GAgRGLLLQKTOSGQCQgRDQCPE|c|FNgVCi:SR 

tyendcvmBr * gaRrglllqkvrsgqcqBrdqcpeHcSfn^clsrrgrp 

TYENDCVmSr J GARRGLLLQKTOSGQCQKRDQCPsRcgFNOTCLSRRGRP 
TYENDCVmSr j GARRGiLLQKVRSGQCQ^DQCP^cgFi^CLSRRG^ 
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410 



420 



430 



440 



450 



NOV98 



gi 
gi 
gi 
gi 



2988422) 

399021 

202800 

202799 

399020 



NOV98 

gi [2988422 | 
gi |39902l| 
gi |202800| 
gi |202799| 
gi|399020 j 



NOV98 

gi|2988422| 
gi (399021 
gi |202800 
gi (202799 
gi (399020 



NOV98 



NOV98 



2988422 | 

399021 

202800 

202799 

399020 



NOV98 



2988422 

399021 

202800 

202799 

399020 



NOV9 8 

gi | 2988422 | 
gi (399021 
gi (202800 
gi j 202799 
gi|399020 



NOV98 

gi | 2988422 | 
gi( 399021 | 



500 



550 



gi 


2988422) ] 


gi 


399021 




gi 


202800 




gi 


202799 




gi 


399020 






600 



650 



660 



I 



670 



680 



I. 



I 



690 



700 
..I 



gpcegaecgsggsgsgedgceqelcrqrggiwdedsedgpcvcdfscqs 
gpceSaecgsggsgsgedSceqelcrqrggiwdedsedgpcvcdfscqs 
gpce|aecgsggsgsged^ceqelcrqrggiwdedsedgpcvcdfscqs 
gpcejaecgsggsgsgedgeceqelcrqrggiwdedsedgpcvcdfscqs 
gpcejaecgsggsgsgedgeceqelcrqrggiwdedsedgpcvcdfscqs: 
gpceSecgsggsgsgi^eceq!Scr!IHggEwdedSed!ScvcdfSc! 
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gi 


202800 


gi 


202799 


gi 


399020 


NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 



2988422 | 

399021 

202800 

202799 

399020 



NOV98 



2988422 

399021 

202800 

202799 

399020 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 



gi 
gi 
gi 
gi 
gi 



2988422 | 

399021 

202800 

202799 

399020 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 




810 



820 . 



83 0 



840 



850 



CSCRPGVGGLRCDRCEPGFWNFRGIVTDGgSGCTPCSCDPgGAVRDDCEQ 
CSCRPGVGGLRCDRCEPGFWWFRGIVTDGBsGCTPCSCDPgGAVRDDCEq 

cscrpgvgglrcdrcepgfwnfrgivtdq5sgctpcscdp|gavrddceq' 
cscrpgvgglrcdrcepgfwnfrgivtdgSsgctpcscdp|gavrddceq 
cscrpgvgglrcdrcepgfwnfrgivtdg?isgctpcscdp|gavrddceq 
cscSpgvgglScdrcepgfwnfrgivtd^sgctpciScdpSgBvrddceq 



860 



870 



880 



890 



900 




1000 



1010 



aTSHTMPP|PLPTLPLDKl|| 
1020 1030 1040 1050 




1160 



1170 



1180 1190 



1200 



I — I — I 
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gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfgsvrl 
gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfSsvrl 
gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfSsvrl 
gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfRsvrl 

aVEGQELFYTPEMADPKSELFGETARSIESgLDgLFRNSDVKSjDFBsgRS 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 


gi 


2988422 


gi 


39902l| 


gi 


202800 | 


gi 


202799 | 


gi 


399020) 



NOV98 

gi | 2988422 | 
gi [399021 
gi | 202800 
gi | 202799 
gi j 399020 



NOV98 

gi | 2988422 | 
gi | 399021 
gi | 202800 
gi (202799 
gi | 399020 



NOV98 



2988422 | 

399021 

202800 

202799 

399020 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 



2988422| 
39902l| 
202800 | 
202799) 
399020) 



NOV98 


gi 


29884221 


gi 


399021 




gi 


202800 




gi 


202799 




gi 


399020 
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1300 
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NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 



2988422 

399021 

202800 

202799 

399020 



NOV98 

gi (2988422 | 
gi |39902l| 
gi | 2G2800 | 
gi |202799| 
gi|399020| 



NOV98 

gi | 2988422 | 
gi|399021 
gi (202800 
gi |202799 
gi (399020 



NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 



NOV98 



gi 
gi 
gi 
gi 



2988422 

399021 

202800 

202799 

399020 



NOV98 
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gi 
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2988422 

399021 

202800 

202799 

399020 
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gi|202800| 
gi |202799| 
gi|399020| 



NOV98 


gi 


2988422 


gi 


399021 | 


gi 


202800 j 


gi 


202799| 


gi 


399020) 


NOV98 


gi 


2988422 


gi 


399021 


gi 


202600 


gi 


202799 


gi 


399020 


NOV98 


gi 


2988422 


gi 


399021 


gi 


202800 


gi 


202799 


gi 


399020 




Table 98F lists the domain description from DOMAIN analysis results against NOV98. 
This indicates that the NOV98 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 98F. Domain Analysis of NOV98 


enIIPfamlDfam00054. lam in in G. Laminin G domain SEO ID NO;8fifi 






Score = 


CD-Length - 134 residues, 99.3% aligned 
152 bits (385), Expect = le-37 




NOV98: 
Sbjct: 


1396 
2 


RTEATQGLVLWSGKATERADYVAIAIVDGHLQ^ 
RT GL+L+ G T+R D++AL + DG L++SY+LGS P V+RS +N +W RV 
RTTEPS GLLL YGGTNTDR- DFLALELRDGRLEVS YDLGSGPAWRS GDRLNDGKWHRVEL 


1455 
60 


NOV98: 
SbjCt: 


1456 
61 


HREQREGSLQVGNEAPVTGSSPLGATQ LDTDGALWLGGLPELPVGPALPKAYGTGF 

R R+G+L VEVGSPG I*DD L++GGLPE L A T F 
ERNGRKGTLSVDGEESVDGESPSGPDVPHENLDLDTPLYVGGLPE - LSVKRLLAAISTSF 


1511 
119 


NOV98 : 
Sb j Ct : 


1512 
120 


VGCLRDVWGRHPLH 1526 
GC+RDV+V PL 
KG C I RDVI VNGKPLD 134 





10 



Synapses are essential relay stations for the transmission of information between 
neurones and other cells. An ordered and tightly regulated formation of these structures is 
crucial for the functioning of the nervous system. The synapse is also involved in perception, 
learning and memory. Understanding the sequence of steps that is involved in establishing 
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synapses during development might also help to understand mechanisms that cause changes in 
synapses during learning and memory. 

For practical reasons, most of the current knowledge of synapse development is 
derived from studies of the vertebrate neuromuscular junction. Upon arrival of a motor axon at 
5 the muscle fiber, signals released from its growth cone initiate the formation of a synapse. This 
process consists of two stages: arrest of axon growth at the target area and differentiation of 
pre- and postsynaptic cells at the site of nerve-muscle contact. 

Studies of regenerating neuromuscular junctions in vertebrates have revealed that 
important signals for the formation of this synapse are located in the synaptic basal lamina, 

10 and attempts to identify these signals have led to the isolation of agrin and other components. 
The induction of the intensively studied synapse between nerve and muscle is initiated by the 
binding of neuron-specific isoforms of the basal membrane protein agrin to receptors on the 
surface of myotubes. Agrin activates a receptor complex that includes the muscle-specific 
kinase and most likely additional, yet to be identified, components. Receptor activation leads 

1 5 to the aggregation of acetylcholine receptors (AChR) and other proteins of the postsynaptic 
apparatus. This activation process has unique features which distinguish it from other receptor 
tyrosine kinases. In particular, the autophosphorylation of the kinase domain, which usually 
induces the recruitment of adaptor and signalling molecules, is not sufficient for AChR 
aggregation. Apparently, interactions of the extracellular domain with unknown components 

20 are also required for this process. 

Agrin binds to a second protein complex on the muscle surface known as the 
dystrophin-associated glycoprotein complex. This binding forms one end of a molecular link 
between the extracellular matrix and the cytoskeleton. 

While many components of the machinery triggering postsynaptic differentiation have 

25 now been identified, the picture of the molecular pathway causing the redistribution of 

synaptic proteins is still incomplete. Recent advances implicate proteins such as dystroglycan, 
MuSK, and rapsyn in the transduction of agrin signals. Additional functions of agrin have 
been discovered, including the upregulation of gene transcription in myonuclei and the control 
of presynaptic differentiation. 

30 Agrin therefore appears to play a unique role in controlling synaptic differentiation on 

both sides of the neuromuscular junction. 

NOV98 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
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kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus, bone, cerebral medulla/ cerebral white matter, cervix, colon, epidermis, 
foreskin, hair follicles, liver, lung, lymphoid tissue, ovary, parathyroid gland, parietal lobe, 
5 retina, skin, vein, whole organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV98 is provided in Example 2. 

The NOV98 nucleic acids and proteins are useful in potential therapeutic applications 

1 0 implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 

1 5 thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV98 nucleic acids encoding the agrin-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

20 The novel nucleic acid of the invention encoding a agrin-like protein includes the 

nucleic acid whose sequence is provided in Table 98B, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 98B while still encoding a protein that maintains its agrin- 
like activities and physiological functions, or a fragment of such a nucleic acid. The invention 

25 further includes nucleic acids whose sequences are complementary to the sequence of Table 
98B, including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
30 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 8% of the bases may 
be so changed. 

The novel protein of the invention includes the agrin-like protein whose sequence is 
provided in Table 98B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 98B while still 
encoding a protein that maintains its agrin-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 2% of the amino acid 
residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the " Anti-NOVX Antibodies" section 
below. 



15 NOV99 

The disclosed NOV99 (alternatively referred to herein as CG570 10-01) includes the 
1513 nucleotide sequence (SEQ ID NO:321) shown in Table 99A. A NOV99 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 396-398 and ends with a TGA 
codon at nucleotides 1410-1412. The disclosed NOV99 maps to human chromosome 

20 14q32.33. 



Table 99 A. NOV99 Nucleotide Sequence (SEQ ID 
NO:321) 



GGAGCCCCCGCCCTGGGATTCCCAGGTGTTTTCATTTGGTGATCAGCACTGAACACAGAA 
GAGT CATG ACGGAGTTTGGGCTGAGCTGGGTTTTC CT TGTTG CTATTTTTAAAGGTGT CC 
AGTGTGAGGTGCAGCTGGTGGAGTCTGGGGGAGACTTGGTCCAGCCTGGGGGGTCCCTGA 
GACTCTCCTGTGCAGCCTCTGGATTCACCTTCAGTAGTTATGCTATGCACTGGGTCCGCC 
AGGCTCCAGGGAAGGGACTGAAATATGTTTCAGGTATTAGTAGTAATGGGCGTAGAACAT 
ATTATGCAAATTGTGTGAAGGGCAGATTCACCATCTCCAGAGACAATTCCAAGAACACGT 
TGTATCTTCAAATGGGCAGCCTGAGAGCTGAGGACATGGCTGTGTATTACTGTGTGTCCG 
GGGGAATCTATGATAGTAGTGGTCCCTTTGACTACTGGGGCCAGGGAACCCTGGTCACCG 
TCTCCTCAGCATCCCCGACCAGCCCCAAGGTCTTCCCGCTGAGCCTCTGCAGCACCCAGC 
CAGATGGGAACGTGGTCATCGCCTGCCTGGTCCAGGGCTTCTTCCCCCAGGAGCCACTCA 
GTGTGACCTGGAGCGAAAGCGGACAGGGCGTGACCGCCAGAAACTTCCCACCCAGCCAGG 
ATGCCTCCGGGGACCTGTACACCACGAGCAGCCAGCTGACCCTGCCGGCCACACAGTGCC 
TAGCCGGCAAGTCCGTGACATGCCACGTGAAGCACTACACGAATCCCAGCCAGGATGTGA 
CTGTGCCCTGCCCAGTTCCCTCAACTCCACCTACCCCATCTCCCTCAACTCCACCTACCC 
CATCTCCCTCATGCTGCCACCCCCGACTGTCACTGCACCGACCGGCCCTCGAGGACCTGC 
TCTTAGGTTCAGAAGCGAACCTCACGTGCACACTGACCGGCCTGAGAGATGCCTCAGGTG 
TCACCTTCACCTGGACGCCCTCAAGTGGGAAGAGCGCTGTTCAAGGACCACCTGAGCGTG 
ACCTCTGTGGCTGCTACAGCGTGTCCAGTGTCCTGCCTGGCTGTGCCCAGCCATGGAACC 
ATGGGGAGACCTTCACCTGCACTGCTGCCCACCCCGAGTTGAAGACCCCACTAGTTCGCT 
GGCTGCAGGGGTCACAGGAGCTGCCCCGCGAGAAGTACCTGACTTGGGCATCCCGGCAGG 
AGCCCAGCCAGGGCACCACCACCTTCGCTGTGACCAGCATACTGCGCGTGGCAGCCGAGG 
ACTGGAAGAAGGGGGACACCTTCTCCTGCATGGTGGGCCACGAGGCCCTGCCGCTGGCCT 
TCACACAGAAGACCATCGACCGCTTGGCGGGTAAACCCACCCATGTCAATGTGTCTGTTG 
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TCATGGTGGAGGTGGACGGCACCTGCTACTGAGCCGCCCGCCTGTCCCCACCCCTGAATA 
AACTCC^TGCTCCCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA - 



A NOV99 polypeptide (SEQ ID NO:322) encoded by SEQ ID NO:321 is 338 amino 
acids in length and is presented using the one-letter amino acid code in Table 99B. The Psort 
profile for NOV99 predicts that this sequence has no signal peptide and is likely to be 
5 localized to the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV99 
polypeptide is located to microbodies with a certainty of 0.1315. 



Table 99B. NOV99 Polypeptide Sequence (SEQ ID 
NO:322) 

MAVYYCVSGGIYDSSGPFDYWGQGTLVTVSSASPTSPKVFPLSLCSTQPDGNWIACLVQ 
GFFPQEPLSVTWSESGQGVTARNPPPSQDASGDLYTTSSQLTLPATQCLAGKSVTCHVKH 
YTNPSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHPRLSLHRPALEDLLLGSEANLTCTL 
TGLi^ASGVTFTWTPSSGKSAVQGPPERDLCGCYSVSSVLPGCAQPWNHGETFTCTAAHP 
ELKTPLVRWLQGSQELPREKYLTWASRQEPSQGTTTFAWSILRVAAEDWKKGDTFSCMV 
GHEALPLAFTQKTIDRLAGKPTHVNVSWMVEVDGTCY 



A BLAST analysis of NOV99 was run against the proprietary PatP GENESEQ Protein 
1 0 Patent database. It was found, for example, that the amino acid sequence of NOV99 had high 
homology to other proteins as shown in Table 99C. 



Table 99C BLASTX results from PatP database for NOV99 






Smallest 






Sum 


Sequences producing High-scoring Segment Pairs: 


High 


Probability 


Score 


P(N) 


patp:AAY88483 Cancer suppressor gene product 


1335 


9.8e-187 


patp:AAB829l4 Human immune response protein HIRP3 


1268 


1.2e-179 


patp:AAM93283 Human polypeptide, 


1266 


l.9e-179 


patp:AAY44723 Human immune system molecule, ISMO-4 


1262 


5.1e-179 


patp:AAY96304 Human IGFAM-16 immunoglobulin - Homo sapiens 


1254 


3.6e-178 



In a search of sequence databases, it was found, for example, that the nucleic acid 
15 sequence of this invention has 1 122 of 1 133 bases (99%) identical to a gbrGENBANK- 

ID:AF067420|acc:AF067420.1 mRNA from Homo sapiens (SNC73 protein (SNC73) mRNA). 
The full amino acid sequence of the protein of the invention was found to have 244 of 253 
amino acid residues (96%) identical to, and 247 of 253 amino acid residues (97%) similar to, 
the 384 amino acid residue ptnr:SPTREMBL-ACC:Q9UP60 protein from Homo sapiens 
20 (Human) (SNC73 PROTEDSf)(Fig. 3B). In addition to smaller changes, the sequence of this 
invention lacks 46 internal amino acids, when compared to ptnr:SPTREMBL-ACC:Q9UP60 
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protein from Homo sapiens (Human) (SNC73 PROTEIN). NOV99 also has homology to the 
other proteins shown in the BLASTP data in Table 99D. 



Table 99D. NOV99 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(«) 


Identity 

(%> 


Positive 

(%) 


Expect 


gi| 229537 |prf 
| (752400A 


IgA H 

[Homo sapiens] 


475 


292/384 
(76) 


325/384 
(84) 


e-168 


gi| 229585 |prf 
| |763134A 


IgAl Bur 
[Homo sapiens] 


686 


298/383 
(77) 


323/383 
(83) 


e-165 


gi| 223099 |prf 
| (0506249A 


igA-alphal Bur 
[Homo sapiens] 


472 


294/383 
(76) 


323/383 
(83) 


e-165 


gi|3201900|gb 

|AAC19365.l| 

(AF067420) 


SNC73 protein 
[Homo sapiens] 


384 


333/384 
(86) 


336/384 
(86) 


e-155 


gi 14042015 |d 
bj BAB55072.1 
| (AK027379) 


unnamed protean product 
[Homo sapiens] 


494 


321/387 
(82) 


325/387 
(83) 


e-150 



5 This BLASTP data is displayed graphically in the ClustalW in Table 99E. A multiple 

sequence alignment is given, with the NOV99 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 99D. 



Table 99E. ClustalW Alignment of NOV99 



NOV99 




(SEQ ID NO: 


322) 


gi 


229537 




{SEQ 


ID NO 


794) 


gi 


229585 




(SEQ 


ID NO 


795) 


gi 


223099 




(SEQ 


ID NO 


796) 


gi 


3201900) 


(SEQ 


ID NO 


797) 


gi 


14042015 | 


(SEQ 


ID NO 


798) 



10 20 30 40 50 

....|....|....|....|....|....|....|....|....|....| 

NOV99 

gi 229537| - - 

gi 229585) ESALTZPRSVSGSPGHSVTISCIGTSSNVGDYKYVSWYZZHPGKAPJKLII 

gi 223099| - -- ~ 

gi 3201900 | 

gi 14042015 | 



NOV99 


gi 


229537) 


gi 


229585 | 


gi 


223099 | 


gi 


3201900) 


gi 


14042015 



60 70 80 90 100 
)....) |....| |....| | | |....| 



YEVSSRPSGVPDRFSGSKSGBTASLTISGLQAEDEABYYCCSYIGSYVFG 



110 120 130 140 150 



NOV99 
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NOV99 




gi 


229537 




gi 


229585 




gi 


223099 




gi 


3201900) 


gi 


14042015 



gi 
gi 
gi 
gi 
gi 



229537| 
229585| 
223099) 
3201900 | 
140420151 



NOV99 
gi|229537 
gi [229585 
gi|223099 
gi | 3201900 | 
gi 1 14042015 | 



NOV99 




gi 


229537 




gi 


229585 




gi 


223099 




gi 


3201900] 


gi 


14042015 



NOV99 
gi | 229537 
gi (229585 
gi j 223099 
gi j 3201900 | 
gi (14042015 j 



NOV99 




gi 


229537 




gi 


229585 




gi 


223099 




gi 


3201900) 


gi 


14042015 



NOV99 


gi 


229537) 


gi 


229585) 


gi 


223099) 


gi 


3201900) 


gi 


14042015) 



NOV99 




gi 


229537 




gi 


229585 




gi 


223099 




gi 


3201900) 


gi 


14042015 



TGTKVLVIGZPKANPTVTLFPPSSZZLZABKATLVCLISBFYPGAVTVAW 



160 170 

— )....(.. 



180 



190 200 
• • I I I 



KADGSPVKAGVZTTKPSRQSBBKYAASSYLSLTPZZWKSHRSYSCQVTHZ 



- MELGLR WVFLVAFL 

230 240 250 




EGVQ- 



EVQLVgSGGGgVKSGgSLRLSqaAS 



3l|fst||aSJn|S 



260 



270 
|....|....| 



280 



290 
)....).. 



300 
-•I 




'ISRig^^LYL 
' ISR |gf||y LYL 

•isrSj^jjlyl 



310 



SSRSDYII 
320 




'SGGlYDSlSGP-Ftix 
.TBBFBWgTFSLBX 

I AVAGjERB f~ 

IAVAG32RB 

SGGIYDSgGP-FD^ 
•SCNGAICYGFSF 



330 



340 



350 
..) 



WGQGTLVTVSSASPTSPKVFPLSLC 

wgJ2g[[1lvtvssasptspkvfplslc 



WGQGTLVTVSSASPTSPKVFPLSLC 
WGQGTLVTVS S AS PTS PKV FPLSLC 



400 




410 420 430 440 450 
I I I I I 1 I I I I 



ttssqltlpatfficlagksvtchvkhytnpsqgvtvpcpvpstpptpspst 
ttssqltlpatsclagksvtchvkhyti])pssgvtvpcpvpstpptpspst 
ttssqltlpat|clagksvtchvkhytnpsq3vtvpcpvpstpptpspst 
ttssqltlpat|clagksvtchvkhytnpsqgvtvpcpvpstpptpspst 
ttssqltlpatsclagksvtchvkhytnpsqgvtvpcpvpstpptpspst 
ttssqltlpatHclagksvtchvkhytnpsqSvtvpcpvpstpptpspst 



460 



470 

..)....) 



480 



490 

J_ I . • 



500 
••I 



PPTPSPSCCHPRLSLHRPALggLLLGSK 

pptpspscchprlslhrpalB3lllgs| 
pptpspscchprlslhrpalB9lllgs| 
pptpspscchprlslhrpalB|lllgs| 
pptpspscchprlslhrpal^lllgsg 



\NLTCTLTGLRDASGVTFTWTP 
^jLTCTLTGLRDASGVTFTWT P 
tfJLTCTLTGLRDASGVTFTW 
tfJLTCTLTGLRDASGVTFT 
iNLTCTLTGLRDASGVTFTWTP 



PPgPSPSCCHPRLSLHRPALiaBLtLGSaANLTCTLTGLRDASGVTFTWTP 
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NOV99 

gi 
gi 
gi 
gi 
gi 



229537 | 
229585| 
223099| 
3201900 | 
14042015 | 



NOV99 




gi 


229537 




gi 


229585 




gi 


223099 




gi 


3201900) 


gi 


14042015 



NOV99 


gi 


229537 | 


gi 


229585 | 


gi 


223099 j 


gi 


3201900) 


gi 


14042015 



510 



| 



I 



520 



530 



540 



| 



550 



ssgksavsgpperdlcgcysvs svlpgci^pwj5|hgl2tftctaa!j)pej5kt 
ssgksavSgpp?|rdlcgcysvssvlSgca§pw|hgktftctaaypesktp 
sgksavggpperdlcgcysvssvlpgcaepwghgktftctaaypesktp 
sgksavSgpper[?Jlcgcysvssvlpgcaepw|hgktftctaaypesktp 
ssgksavSgpperdlcgcysvssvlpgcaepwShgktftctaaypesktp 
ssgksavSgpperdlcgcysvssvlpgcaepwShgktftctaaypesktp 



560 



600 




610 



620 



630 



640 



I 



650 

..I 



LQGSQELPREKYLTWASRQEPSQGTTTFAVTSILRVAAEDWKKGDTFSCM 

lqgsqelprekyltwasrqgpsqgt^tfavtsilrvaaslwkkgdtfscm 
lqgsqelprekyltwasrqepsqgtttfavtsilrvaaedwkkgdtfscm 
lqgsqelprekyltwasrqeps|gtttfavtsilrvaae[|wkkgdtfscm 
lqgsqelprekyltwasrqepsqgtttfavtsilrvaaedwkkgdtfscm 
lqgsqelprekyltwasrqepsqgtttfavtsilrvaaedwkkgdtfscm 



N0V99 




gi 


229537 




gi 


229585 




gi 


223099 




gi 


3201900 | 


gi 


14042015 | 



660 



I . 



670 



I. 



680 
..I .. 



VGHEALPLAFTQKTIDRLAGKPTHVNVSWMflEVDGTCY 
VGHEALPLAFTQKTIDRLAGKPTHVi^VSWMAEVDGTCY 
VGHgALPLAFTQKTIDRLAGKPTHVNVSWMAEVDGTCY 
VGH@AL PLAFTQKT IDRLAG KPTHVNVS WMA§VDGTC Y 
VGHEALPLAFTQKTIDRLAGKPTHVNVSWMAEVDGTCY 
VGHEAL PLAFTQKT IDRLAGKPTHVNVSWMAEVDGTCY 



Table 99F lists the domain description from DOMAIN analysis results against NOV99. 
This indicates that the NOV99 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 99F. Domain Analysis of NOV99 



gnllSmartl smart00407. IGcl. Immunoglobulin r-T y p«» atcq id NO:889 

CD-Length = 75 residues, 96.0% aligned 
Score = 46.6 bits (109), Expect = 2e-06 



NOV99: 
Sb j Ct : 



53 



WIACLVQGFFPQEPLSVTWSESGQGVT- - ARNFPPSQDASGDLYTTSSQLTLPATQCLA 110 
+ CLV GF+P ++VTW ++GQ VT + P +D G Y SS LT+ A+ + 
2 ATLVCLVTGFYP- PDITVTWLKNGQEVTSGVKTTDPLKDKDG-TYFLSSYLTVSASTWES 5 9 



NOV99: HI GKSVTCHVKHYTNP 124 

G TC V H 
Sbjct: 60 GDVYTCQVTHEGLT 73 



10 



SNC73. was identified by subtractive hybridization between normal mucosa and 
colorectal cancer tissue as a gene which is down-regulated in colorectal cancer. It is highly 
homologous to the constant region of immunoglobulin alpha- 1 chain. In higher vertebrates 
there are five classes of antibodies, IgA, IgD, IgE, IgG, and IgM, each with its own class of 
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heavy chain - alpha, delta, epsilon, gamma, and mu, respectively. IgA molecules have alpha 
chains, IgG molecules have gamma chains, and so on. In addition, there are a number of 
subclasses of IgG and IgA immunoglobulins; for example, there are four human IgG 
subclasses (IgGl, IgG2, IgG3, and IgG4) having gammal, gamma2, gamma3, and gamma4 
5 heavy chains, respectively. 

The various heavy chains impart a distinctive conformation to the hinge and tail 
regions of antibodies and give each class (and subclass) characteristic properties of its own. 
IgA is the principal class of antibody in secretions (saliva, tears, milk, and respiratory and 
intestinal secretions). It is transported through secretory epithelial cells from the extracellular 

10 fluid into the secreted fluid by the Poly Ig receptor, another type of Fc receptor that is unique 
to secretory epithelia. IgA serves both to defend against local infection and to prevent access 
of foreign antigens to the general immunologic system. This function is in accord with the 
potential role of SNC73 in colorectal cancer. 

NOV99 is predicted to be expressed in at least the following tissues: adrenal gland, 

1 5 bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. This information was derived by determining the tissue sources of the 

20 sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV99 is provided in Example 2, 

The NOV99 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 

25 Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

30 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV99 nucleic acids encoding the SNC73-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 
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The novel nucleic acid of the invention encoding a SNC73-like protein includes the 
nucleic acid whose sequence is provided in Table 99A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 99 A while still encoding a protein that maintains its 
SNC73-like activities and physiological functions, or a fragment of such a nucleic acid. The 
invention further includes nucleic acids whose sequences are complementary to the sequence 
of Table 99A, including nucleic acid fragments that are complementary to any of the nucleic 
acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the SNC73-like protein whose sequence is 
provided in Table 99B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 99B while still 
encoding a protein that maintains its SNC73-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 4% of the amino acid 
residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOVX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof. Also included in the invention are 
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nucleic acid fragments sufficient for use as hybridization probes to identify NOVX-encoding 
nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR primers for the 
amplification and/or mutation of NOVX nucleic acid molecules. As used herein, the term 
"nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic 
5 DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated using 
nucleotide analogs, and derivatives, fragments and homologs thereof. The nucleic acid 
molecule may be single-stranded or double-stranded, but preferably is comprised double- 
stranded DNA. 

An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, a 

10 "mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length 
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 

1 5 "mature" form arises, again by way of nonlimiting example, as a result of one or more 
naturally occurring processing steps as they may take place within the ceil, or host cell, in 
which the gene product arises. Examples of such processing steps leading to a "mature" form 
of a polypeptide or protein include the cleavage of the N-terminal methionine residue encoded 
by the initiation codon of an ORF, or the proteolytic cleavage of a signal peptide or leader 

20 sequence. Thus a mature form arising from a precursor polypeptide or protein that has 

residues I to N, where residue 1 is the N-terminal methionine, would have residues 2 through 
N remaining after removal of the N-terminal methionine. Alternatively, a mature form arising 
from a precursor polypeptide or protein having residues 1 to N, in which an N-terminal signal 
sequence from residue 1 to residue M is cleaved, would have the residues from residue M+l to 

25 residue N remaining. Further as used herein, a "mature" form of a polypeptide or protein may 
arise from a step of post-translational modification other than a proteolytic cleavage event. 
Such additional processes include, by way of non-limiting example, glycosylation, 
myristoylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 
„ 30 The term "probes", as utilized herein, refers to nucleic acid sequences of variable 

length, preferably between at least about 10 nucleotides (nt), 100 nt, or as many as 
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the 
detection of identical, similar, or complementary nucleic acid sequences. Longer length 
probes are generally obtained from a natural or recombinant source, are highly specific, and 
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much slower to hybridize than shorter-length oligomer probes. Probes may be single- or 
double-stranded and designed to have specificity in PCR, membrane-based hybridization 
technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is separated 
5 from other nucleic acid molecules which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic 
acid (i.e., sequences located at the 5 - and 3'-termini of the nucleic acid) in the genomic DNA 
of the organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated NOVX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 

10 kb, 0.5 kb or 0. 1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, 
spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can 
be substantially free of other cellular material or culture medium when produced by 
recombinant techniques, or of chemical precursors or other chemicals when chemically 

15 synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or a 
complement of this aforementioned nucleotide sequence, can be isolated using standard 
molecular biology techniques and the sequence information provided herein. Using all or a 

20 portion of the nucleic acid sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 
and 162 as a hybridization probe, NOVX molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, et al, (eds.), 
Molecular Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989; and Ausubel, etal, (eds.), Current Protocols in 

25 Molecular Biology, John Wiley & Sons, New York, NY, 1993.) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. Furthermore, 

30 oligonucleotides corresponding to NOVX nucleotide sequences can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a 
PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 
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genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or 
100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the 
5 invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length 
would further comprise at least 6 contiguous nucleotides SEQ ID NOS:2n-l, wherein n is an 
integer between 1 and 162, or a complement thereof Oligonucleotides may be chemically 
synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention comprises a 

10 nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID 

NOS:2n-l, wherein n is an integer between 1 and 162, or a portion of this nucleotide sequence 
(e.g., a fragment that can be used as a probe or primer or a fragment encoding a biologically- 
active portion of an NOVX polypeptide). A nucleic acid molecule that is complementary to 
the nucleotide sequence shown SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162 

15 is one that is sufficiently complementary to the nucleotide sequence shown SEQ ID NOS:2n- 
1, wherein n is an integer between 1 and 162 that it can hydrogen bond with little or no 
mismatches to the nucleotide sequence shown SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 

20 pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means 
the physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van 
der Waals, hydrophobic interactions, and the like. A physical interaction can be either direct 
or indirect. Indirect interactions may be through or due to the effects of another polypeptide or 

25 compound. Direct binding refers to interactions that do not take place through, or due to, the 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific 
30 hybridization in the case of nucleic acids or for specific recognition of an epitope in the case of 
amino acids, respectively, and are at most some portion less than a full length sequence. 
Fragments may be derived from any contiguous portion of a nucleic acid or amino acid 
sequence of choice. Derivatives are nucleic acid sequences or amino acid sequences formed 
from the native compounds either directly or by modification orpartial substitution. Analogs 
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are nucleic acid sequences or amino acid sequences that have a structure similar to, but not 
identical to, the native compound but differs from it in respect to certain components or side 
chains. Analogs may be synthetic or from a different evolutionary origin and may have a 
similar or opposite metabolic activity compared to wild type. Homologs are nucleic acid 
5 sequences or amino acid sequences of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, 
molecules comprising regions that are substantially homologous to the nucleic acids or 

10 proteins of the invention, in various embodiments, by at least about 70%, 80%, or 95% 

identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid sequence of 
identical size or when compared to an aligned sequence in which the alignment is done by a 
computer homology program known in the art, or whose encoding nucleic acid is capable of 
hybridizing to the complement of a sequence encoding the aforementioned proteins under 

15 stringent, moderately stringent, or low stringent conditions. See e.g. Ausubel, et aL 9 CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 

20 sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in different 
tissues of the same organism as a result of, for example, alternative splicing of RNA. 
Alternatively, isoforms can be encoded by different genes. In the invention, homologous 
nucleotide sequences include nucleotide sequences encoding for an NOVX polypeptide of 
species other than humans, including, but not limited to: vertebrates, and thus can include, e.g., 

25 frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide 
sequences also include, but are not limited to, naturally occurring allelic variations and 
mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence 
does not, however, include the exact nucleotide sequence encoding human NOVX protein. 
Homologous nucleic acid sequences include those nucleic acid sequences that encode 

30 conservative amino acid substitutions (see below) in SEQ ID NOS:2n-l, wherein n is an 
integer between 1 and 162, as well as a polypeptide possessing NOVX biological activity. 
Various biological activities of the NOVX proteins are described below. 

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an NOVX 
nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
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into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or 
TGA. For the purposes of this invention, an ORF may be any part of a coding sequence, with 
5 or without a start codon, a stop codon, or both. For an ORF to be considered as a good 

candidate for coding for a bona fide cellular protein, a minimum size requirement is often set, 
e.g., a stretch of DNA that would encode a protein of 50 amino acids or more. 

The nucleotide sequences determined from the cloning of the human NOVX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 

10 NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 
or 400 consecutive sense strand nucleotide sequence SEQ ID NOS:2n-l, wherein n is an 

15 integer between 1 and 162; or an anti-sense strand nucleotide sequence of SEQ ID NOS:2n-l, 
wherein n is an integer between 1 and 162; or of a naturally occurring mutant of SEQ ID 
NOS:2n-l, wherein n is an integer between 1 and 162. 

Probes based on the human NOVX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 

20 embodiments, the probe further comprises a label group attached thereto, e.g. the label group 
can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such 
probes can be used as a part of a diagnostic test kit for identifying cells or tissues which mis- 
express an NOVX protein, such as by measuring a level of an NOVX-encoding nucleic acid in 
a sample of cells from a subject e.g., detecting NOVX mRNA levels or determining whether a 

25 genomic NOVX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an NOVX polypeptide" refers 
to polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 

30 active portion of NOVX" can be prepared by isolating a portion SEQ ID NOS:2n- 1, wherein n 
is an integer between 1 and 162, that encodes a polypeptide having an NOVX biological 
activity (the biological activities of the NOVX proteins are described below), expressing the 
encoded portion of NOVX protein (e.g., by recombinant expression in vitro) and assessing the 
activity of the encoded portion of NOVX. 
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NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162 
due to degeneracy of the genetic code and thus encode the same NOVX proteins as that 
5 encoded by the nucleotide sequences shown in SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162. In another embodiment, an isolated nucleic acid molecule of the invention 
has a nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ 
ID NOS:2n, wherein n is an integer between 1 and 162. 

In addition to the human NOVX nucleotide sequences shown in SEQ ID NOS:2n-l, 

10 . wherein n is an integer between 1 and 162, it will be appreciated by those skilled in the art that 
DNA sequence polymorphisms that lead to changes in the amino acid sequences of the NOVX 
polypeptides may exist within a population (e.g., the human population). Such genetic 
polymorphism in the NOVX genes may exist among individuals within a population due to 
natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 

1 5 nucleic acid molecules comprising an open reading frame (ORF) encoding an NOVX protein, 
preferably a vertebrate NOVX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the NOVX genes. Any and all such nucleotide . 
variations and resulting amino acid polymorphisms in the NOVX polypeptides, which are the 
result of natural allelic variation and that do not alter the functional activity of the NOVX 

20 polypeptides, are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding NOVX proteins from other species, and 
thus that have a nucleotide sequence that differs from the human SEQ ID NOS:2n-l, wherein 
n is an integer between 1 and 162 are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the NOVX 

25 cDNAs of the invention can be isolated based on their homology to the human NOVX nucleic 
acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 

30 nucleic acid molecule comprising the nucleotide sequence of SEQ ID NOS:2n-l, wherein n is 
an integer between 1 and 162. In another embodiment, the nucleic acid is at least 10, 25, 50 ? 
100, 250, 500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet another 
embodiment, an isolated nucleic acid molecule of the invention hybridizes to the coding 
region. As used herein, the term "hybridizes under stringent conditions" is intended to 
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describe conditions for hybridization and washing under which nucleotide sequences at least 
60% homologous to each other typically remain hybridized to each other. 

Homologs (Le., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or 
5 high stringency hybridization with all or a portion of the particular human sequence as a probe 
using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no 
other sequences. Stringent conditions are sequence-dependent and will be different in 

1 0 different circumstances. Longer sequences hybridize specifically at higher temperatures than 
shorter sequences. Generally, stringent conditions are selected to be about 5 °C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The 
Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 

1 5 sequence at equilibrium. Since the target sequences are generally present at excess, at Tm, 

50% of the probes are occupied at equilibrium. Typically, stringent conditions will be those in 
which the salt concentration is less than about 1 .0 M sodium ion, typically about 0.01 to 1.0 M 
sodium ion (or other salts) at 

pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
20 oligonucleotides (e.g;, 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 
agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, 
et ah, (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. 
25 (1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain 
hybridized to each other. A non-limiting example of stringent hybridization conditions are 
hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM 
EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA 
30 at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated 
nucleic acid molecule of the invention that hybridizes under stringent conditions to the 
sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, corresponds to a 
naturally-occurring nucleic acid molecule. As used herein, a "naturally-occurring" nucleic 
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acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in 
nature (e.g. , encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic 
acid molecule comprising the nucleotide sequence of SEQ ID NOS:2n-l, wherein n is an 
5 integer between 1 and 162, or fragments, analogs or derivatives thereof, under conditions of 
moderate stringency is provided. A non-limiting example of moderate stringency 
hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS and 
100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in 
IX SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may be used are 

1 0 well-known within the art. See, e.g. , Ausubel, et al (eds.), 1 993, CURRENT PROTOCOLS IN 
Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990; Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 
comprising the nucleotide sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 

15 and 1 62, or fragments, analogs or derivatives thereof, under conditions of low stringency, is 
provided. A non-limiting example of low stringency hybridization conditions are 
hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) 
dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 

20 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low stringency that may be 
used are well known in the art (e.g., as employed for cross-species hybridizations). See, e.g., 
Ausubel, et al (eds.), 1993, Current Protocols in Molecular Biology, John Wiley & 
Sons, NY, and Kriegler, 1990, Gene Transfer and Expression, A Laboratory Manual, 
Stockton Press, NY; Shilo and Weinberg, 1981. Proc Natl Acad Sci USA 78: 6789-6792. 

25 

Conservative Mutations 

In addition to naturally-occurring allelic variants of NOVX sequences that may exist in 
the population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into the nucleotide sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 
30 and 1 62, thereby leading to changes in the amino acid sequences of the encoded NOVX 
proteins, without altering the functional ability of said NOVX proteins. For example, 
nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid 
residues can be made in the sequence SEQ ID NOS:2n, wherein n is an integer between 1 and 
162. A "non-essential" amino acid residue is a residue that can be altered from the wild-type 
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sequences of the NOVX proteins without altering their biological activity, whereas an 
"essential" amino acid residue is required for such biological activity. For example, amino 
acid residues that are conserved among the NOVX proteins of the invention are predicted to be 
particularly non-amenable to alteration. Amino acids for which conservative substitutions can 
5 be made are well-known within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding NOVX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
NOVX proteins differ in amino acid sequence from SEQ ID NOS:2n-l, wherein n is an 
integer between 1 and 162 yet retain biological activity. In one embodiment, the isolated 

10 nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequences SEQ ID NOS:2n, wherein n is an integer between 1 and 162. Preferably, the 
protein encoded by the nucleic acid molecule is at least about 60% homologous to SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162; more preferably at least about 70% 

1 5 homologous SEQ ID NOS:2n, wherein n is an integer between 1 and 162; still more preferably 
at least about 80% homologous to SEQ ID NOS:2n, wherein n is an integer between 1 and 
162; even more preferably at least about 90% homologous to SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162; and most preferably at least about 95% homologous to SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162. 

20 An isolated nucleic acid molecule encoding an NOVX protein homologous to the 

protein of SEQ ID NOS:2n, wherein n is an integer between 1 and 162 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 
sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 1 62, such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded protein. 

25 Mutations can be introduced into SEQ ID NOS:2n-l, wherein n is an integer between 1 

and 162 by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 
predicted, non-essential amino acid residues. A "conservative amino acid substitution" is one 
in which the amino acid residue is replaced with an amino acid residue having a similar side 

30 chain. Families of amino acid residues having similar side chains have been defined within 
the art. These families include amino acids with basic side chains {e.g., lysine, arginine, 
histidine), acidic side chains {e.g., aspartic acid, glutamic acid), uncharged polar side chains 
(e.g.* glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side 
chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
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tryptophan Vbeta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side 
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted non-essential 
amino acid residue in the NOVX protein is replaced with another amino acid residue from the 
same side chain family. Alternatively, in another embodiment, mutations can be introduced 
5 randomly along all or part of an NOVX coding sequence, such as by saturation mutagenesis, 
and the resultant mutants can be screened for NOVX biological activity to identify mutants 
that retain activity. Following mutagenesis SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162, the encoded protein can be expressed by any recombinant technology 
known in the art and the activity of the protein can be determined. 

10 The relatedness of amino acid families may also be determined based on side chain 

interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 

15 substituted for each other. Likewise, the "weak" group of conserved residues may be any one 
of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, 
VLIM, HFY, wherein the letters within each group represent the single letter amino acid code. 

In one embodiment, a mutant NOVX protein can be assayed for (/) the ability to form 
protein:protein interactions with other NOVX proteins, other cell-surface proteins, or 

20 biologically-active portions thereof, (if) complex formation between a mutant NOVX protein 
and an NOVX ligand; or (iif) the ability of a mutant NOVX protein to bind to an intracellular 
target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant NOVX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

25 Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or 
fragments, analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide 
30 sequence that is complementary to a "sense" nucleic acid encoding a protein (e.g., 

complementary to the coding strand of a double-stranded cDNA molecule or complementary 
to an mRNA sequence). In specific aspects, antisense nucleic acid molecules are provided that 
comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides 
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or an entire NOVX coding strand, or to only a portion thereof. Nucleic acid molecules 
encoding fragments, homologs, derivatives and analogs of an NOVX protein of SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162, or antisense nucleic acids complementary 
to an NOVX nucleic acid sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 
5 and 162, are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 

10 molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 

encoding the NOVX protein. The term "noncoding region" refers to 5' and 3 f sequences which 
flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 
3' untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein, 

1 5 antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary 
to the entire coding region of NOVX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of NOVX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 

20 start site of NOVX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) 
can be chemically synthesized using naturally-occurring nucleotides or variously modified 

25 nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids (e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 

30 xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methyIcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
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beta-D-mannosylqueosine, 5 -methoxycarboxymethyluracil, 5-methoxyuracil, 
2-me%lthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
5 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 
1 0 The antisense nucleic acid molecules of the invention are typically administered to a 

subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding an NOVX protein to thereby inhibit expression of the protein (e.g., by 
inhibiting transcription and/or translation). The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, for example, in the case of an 

15 antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 

20 such that they specifically bind to receptors or antigens expressed on a selected cell surface 
(e g., by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell 
surface receptors or antigens). The antisense nucleic acid molecules can also be delivered to 
cells using the vectors described herein. To achieve sufficient nucleic acid molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a strong 

25 pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual |}-units, the 
strands run parallel to each other. See, e.g., Gaultier, et ah, 1987. Nucl Acids Res. 15: 

30 6625-664 1 . The antisense nucleic acid molecule can also comprise a 

T-o-methylribonucleotide (See, e.g., Inoue, et al 1987. Nucl. Acids Res. 15: 6131-6148) or a 
chimeric RNA-DNA analogue {See, e.g., Inoue, et al, 1987. FEBSLett. 215: 327-330. 
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Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, 
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
5 nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 

10 complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in 

Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave NOVX 
mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme having 
specificity for an NOVX-encoding nucleic acid can be designed based upon the nucleotide 
sequence of an NOVX cDNA disclosed herein (i.e., SEQ ID NOS:2n-l, wherein n is an 

15 integer between 1 and 162). For example, a derivative of a Tetrahymena L-19 IVS RNA can 
be constructed in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in an NOVX-encoding mRNA. See, e.g., U.S. Patent 
4,987,071 to Cech, et al and U.S. Patent 5,1 16,742 to Cech, et al NOVX mRNA can also be 
used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA 

20 molecules. See, e.g., Bartel et al, (1993) Science 261:141 1-1418. 

Alternatively, NOVX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the NOVX 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
NOVX gene in target cells. See, e.g., Helene, 1 991. Anticancer Drug Des. 6: 569-84; Helene, 

25 et al 1992. Ann. N.Y. Acad Sci. 660: 27-36; Maher, 1992. Bioassays 14: 807-15. 

In various embodiments, the NOVX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility 
of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can 
be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al, 1996. BioorgMed 

30 Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic 
acid mimics (e.g., DNA mimics) in which the deoxyribose phosphate backbone is replaced by 
a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
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standard solid phase peptide synthesis protocols as described in Hyrup, et al. 9 1996. supra; 
. Perry-O'Keefe, et aU 1996. Proc. Natl Acad. Sci. USA 93: 14670-14675. 

PNAs of NOVX can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 

5 expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs 
of NOVX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., Si nucleases (See, Hyrup, et al, 1996. supra); or as probes or primers 
for DNA sequence and hybridization (See, Hyrup, et al 9 1996, supra; Perry-O'Keefe, et al, 

10 1996. supra). 

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras of NOVX can be generated that 

1*5 may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes (e.g., Cleavage signal-1 protein H and DNA polymerases) to interact 
with the DNA portion while the PNA portion would provide high binding affinity and 
specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in 
terms of base stacking, number of bonds between the nucleobases, and orientation (.see, Hyrup, 

20 et al., 1 996. supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup, etaU 1 996. supra and Finn, etaU 1996. Nucl Acids Res 24: 3357-3363. For example, 
a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling 
chemistry, and modified nucleoside analogs, e.g., 

5 ! -(4-methoxytrityl)amino-5 f -deoxy-thymidine phosphoramidite, can be used between the 
25 PNA and the 5' end of DNA. See, e.g., Mag, et al, 1989. Nucl Acid Res 17: 5973-5988. PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5 ! 
PNA segment and a 3' DNA segment. See, e.g., Finn, et a/., 1996. supra. Alternatively, 
chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, 
e.g., Petersen, et al, 1975. Bioorg. Med Chem. Lett 5: 1 1 19-1 1 124. 
30 In other embodiments, the oligonucleotide may include other appended groups such as 

peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across 
the cell membrane (see, e.g., Letsinger, et aL, 1989. Proc. Natl Acad. Sci. U.S.A. 86: 
6553-6556; Lemaitre, et al, 1987. Proc. Natl Acad. Set 84: 648-652; PCT Publication No. 
WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In 
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addition, oligonucleotides can be modified with hybridization triggered cleavage agents (see, 
e.g., Krol, et al., 1988. BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988. 
Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to another 
molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a 
5 hybridization-triggered cleavage agent, and the like. 

NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino 
acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID NOS:2n, 
wherein n is an integer between 1 and 162. The invention also includes a mutant or variant 

1 0 protein any of whose residues may be changed from the corresponding residues shown in SEQ 
ID NOS:2n, wherein n is an integer between 1 and 1 62 while still encoding a protein that 
maintains its NOVX activities and physiological functions, or a functional fragment thereof 
In general, an NOVX variant that preserves NOVX-like function includes any variant 
in which residues at a particular position in the sequence have been substituted by other amino 

1 5 acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues 
from the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed 
by the invention. In favorable circumstances, the substitution is a conservative substitution as 
defined above. 

20 One aspect of the invention pertains to isolated NOVX proteins, and biologically- 

active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided 
are polypeptide fragments suitable for use as immunogens to raise anti-NOVX antibodies. In 
one embodiment, native NOVX proteins can be isolated from cells or tissue sources by an. 
appropriate purification scheme using standard protein purification techniques. In another 

25 embodiment, NOVX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an NOVX protein or polypeptide can be synthesized chemically 
using standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof 
is substantially free of cellular material or other contaminating proteins from the cell or tissue 

30 source from which the NOVX protein is derived, or substantially free from chemical 

precursors or other chemicals when chemically synthesized. The language "substantially free 
of cellular material" includes preparations of NOVX proteins in which the protein is separated 
from cellular components of the cells from which it is isolated or recombinantly-produced. In 
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one embodiment, the language "substantially free of cellular material" includes preparations of 
NOVX proteins having less than about 30% (by dry weight) of non-NOVX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-NOVX proteins, still more preferably less than about 10% of non-NOVX proteins, and 
5 most preferably less than about 5% of non-NOVX proteins. When the NOVX protein or 
biologically-active portion thereof is recombinantly-produced, it is also preferably 
substantially free of culture medium, /.<?., culture medium represents less than about 20%, 
more preferably less than about 10%, and most preferably less than about 5% of the volume of 
the NOVX protein preparation. 

10 The language "substantially free of chemical precursors or other chemicals" includes 

preparations of NOVX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations 
of NOVX proteins having less than about 30% (by dry weight) of chemical precursors or 

1 5 non-NOVX chemicals, more preferably less than about 20% chemical precursors or 

non-NOVX chemicals, still more preferably less than about 10% chemical precursors or 
non-NOVX chemicals, and most preferably less than about 5% chemical precursors or 
non-NOVX chemicals. 

Biologically-active portions of NOVX proteins include peptides comprising amino 

20 acid sequences sufficiently homologous to or derived from the amino acid sequences of the 
NOVX proteins (e.g., the amino acid sequence shown in SEQ ID NOS:2n, wherein n is an 
integer between 1 and 1 62) that include fewer amino acids than the full-length NOVX 
proteins, and exhibit at least one activity of an NOVX protein. Typically, biologically-active 
portions comprise a domain or motif with at least one activity of the NOVX protein. A 

25 biologically-active portion of an NOVX protein can be a polypeptide which is, for example, 
10, 25, 50, 100 or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
functional activities of a native NOVX protein. 

30 In an embodiment, the NOVX protein has an amino acid sequence shown SEQ ID 

NOS:2n, wherein n is an integer between 1 and 162. In other embodiments, the NOVX 
protein is substantially homologous to SEQ ID NOS:2n, wherein n is an integer between 1 and 
162, and retains the functional activity of the protein of SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162, yet differs in amino acid sequence due to natural allelic variation 
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or mutagenesis, as described in detail, below. Accordingly, in another embodiment, the 
NOVX protein is a protein that comprises an amino acid sequence at least about 45% 
homologous to the amino acid sequence SEQ ID NOS:2n, wherein n is an integer between 1 
and 162, and retains the functional activity of the NOVX proteins of SEQ ID NOS:2n, wherein 
5 n is an integer between 1 and 162. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced 
10 in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence, then the molecules are homologous at that 
15 position (/.e., as used herein amino acid or nucleic acid "homology" is equivalent to amino 
acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences. The homology may be determined using computer programs known 
in the art, such as GAP software provided in the GCG program package. See, Needleman and 
20 Wunsch, 1 970. JMol Biol 48: 443-453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension 
penalty of 0.3, the coding region of the analogous nucleic acid sequences referred to above 
exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99%, with the CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:2n-l, 
25 wherein n is an integer between 1 and 162. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
30 positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of 

nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the region of comparison (Le., 
the window size), and multiplying the result by 100 to yield the percentage of sequence 
identity. The term "substantial identity" as used herein denotes a characteristic of a 
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polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 
percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent 
sequence identity, more usually at least 99 percent sequence identity as compared to a 
reference sequence over a comparison region. 

5 

Chimeric and Fusion Proteins 

The invention also provides NOVX chimeric or fusion proteins. As used herein, an 
NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide operatively- 
linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a polypeptide having 

1 0 an amino acid sequence corresponding to an NOVX protein SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162, whereas a "non-NOVX polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to a protein that is not substantially homologous to the 
NOVX protein, e.g., a protein that is different from the NOVX protein and that is derived from 
the same or a different organism. Within an NOVX fusion protein the NOVX polypeptide can 

1 5 correspond to all or a portion of an NOVX protein. In one embodiment, an NOVX fusion 
protein comprises at least one biologically-active portion of an NOVX protein. In another 
embodiment, an NOVX fusion protein comprises at least two biologically-active portions of 
an NOVX protein. In yet another embodiment, an NOVX fusion protein comprises at least 
three biologically-active portions of an NOVX protein. Within the fusion protein, the term 

20 "operatively-linked" is intended to indicate that the NOVX polypeptide and the non-NOVX 
polypeptide are fused in-frame with one another. The non-NOVX polypeptide can be fused to 
the N-terminus or C-terminus of the NOVX polypeptide. 

In one embodiment, the fusion protein is a GST-NOVX fusion protein in which the 
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 

25 sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 
polypeptides. 

In another embodiment, the fusion protein is an NOVX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host 
cells), expression and/or secretion of NOVX can be increased through use of a heterologous 
30 signal sequence. 

In yet another embodiment, the fusion protein is an NOVX-immunoglobulin fusion 
protein in which the NOVX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit 
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an interaction between an NOVX ligand and an NOVX protein on the surface of a cell, to 
thereby suppress NOVX-mediated signal transduction in vivo. The NOVX-immunoglobulin 
fusion proteins can be used to affect the bioavailability of an NOVX cognate ligand. 
Inhibition of the NOVX ligand/NOVX interaction may be useful therapeutically for both the 
5 treatment of proliferative and differentiative disorders, as well as modulating (e.g. promoting 
or inhibiting) cell survival. Moreover, the NOVX-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NOVX antibodies in a subject, to purify 
NOVX ligands, and in screening assays to identify molecules that inhibit the interaction of 
NOVX with an NOVX ligand. 

10 An NOVX chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, 

15 alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In 

another embodiment, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 

20 chimeric gene sequence (see, e.g., Ausubel, et ah (eds.) CURRENT PROTOCOLS IN MOLECULAR 
Biology, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). An NOVX-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the NOVX protein. 

25 

NOVX Agonists and Antagonists 

The invention also pertains to variants of the NOVX proteins that function as either 
NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein can 
be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX protein). 
30 An agonist of the NOVX protein can retain substantially the same, or a subset of, the 

biological activities of the naturally occurring form of the NOVX protein. An antagonist of 
the NOVX protein can inhibit one or more of the activities of the naturally occurring form of 
the NOVX protein by, for example, competitively binding to a downstream or upstream 
member of a cellular signaling cascade which includes the NOVX protein. Thus, specific 
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biological effects can be elicited by treatment with a variant of limited function. In one 
embodiment, treatment of a subject with a variant having a subset of the biological activities 
of the naturally occurring form of the protein has fewer side effects in a subject relative to 
treatment with the naturally occurring form of the NOVX proteins. 
5 Variants of the NOVX proteins that function as either NOVX agonists {i.e., mimetics) 

or as NOVX antagonists can be identified by screening combinatorial libraries of mutants 
(e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or antagonist 
activity. In one embodiment, a variegated library of NOVX variants is generated by 
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene 
10 library. A variegated library of NOVX variants can be produced by, for example, 

enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential NOVX sequences is expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins {e.g., for phage display) containing the set of 
NOVX sequences therein. There are a variety of methods which can be used to produce 
15 libraries of potential NOVX variants from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer, 
and the synthetic gene then ligated into an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding the 
desired set of potential NOVX sequences. Methods for synthesizing degenerate 
20 oligonucleotides are well-known within the art. See, e.g., Narang, 1983. Tetrahedron 39: 3; 
Itakura,e/a/., \9M.Annu. Rev. Biochem. 53: 323; Itakura, etaL, 1984. Science 198: 1056; 
Ike, et aL, 1983. Nucl. Acids Res. 1 1 : 477. 

Polypeptide Libraries 

In addition, libraries of fragments of the NOVX protein coding sequences can be used 
to generate a variegated population of NOVX fragments for screening and subsequent 
selection of variants of an NOVX protein. In one embodiment, a library of coding sequence 
fragments can be generated by treating a double stranded PCR fragment of an NOVX coding 
sequence with a nuclease under conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing the DNA to form double-stranded 
DNA that can include sense/antisense pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment with Si nuclease, and ligating the 
resulting fragment library into an expression vector. By this method, expression libraries can 
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be derived which encodes N-terminal and internal fragments of various sizes of the NOVX 
proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
5 products having a selected property. Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis of NOVX proteins. The most 
widely used techniques, which are amenable to high throughput analysis, for screening large 
gene libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 

10 combinatorial genes under conditions in which detection of a desired activity facilitates 

isolation of the vector encoding the gene whose product was detected. Recursive ensemble 
mutagenesis (REM), a new technique that enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify NOVX variants. 
See, e.g., Arkin and Yourvan, 1992. Proc. Natl Acad. Set USA 89: 781 1-7815; Delgrave, et 

1 5 a/., 1 993. Protein Engineering 6:327-33 1 . 

NOVX Antibodies 

The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen binding site that specifically binds (immunoreacts with) an antigen. Such 

20 antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F a b, 
Fab- and Yj&fi fragments, and an F a b expression library. In general, antibody molecules 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 
from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as Igd, IgG 2 , and others. Furthermore, in humans, the light 

25 chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or 
fragment thereof, can be used as an immunogen to generate antibodies that 
immunospecifically bind the antigen, using standard techniques for polyclonal and monoclonal 
30 antibody preparation. The full-length protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of the antigen for use as immunogens. An antigenic 
peptide fragment comprises at least 6 amino acid residues of the amino acid sequence of the 
full length protein, such as an amino acid sequence shown in SEQ ID NOs: 2n, wherein n is an 
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integer between 1 and 162, and encompasses an epitope thereof such that an antibody raised 
against the peptide forms a specific immune complex with the full length protein or with any 
fragment that contains the epitope. Preferably, the antigenic peptide comprises at least 10 
amino acid residues, or at least 15 amino acid residues, or at least 20 amino acid residues, or at 
5 least 30 amino acid residues. Preferred epitopes encompassed by the antigenic peptide are 
regions of the protein that are located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of NOVX that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human NOVX protein sequence will 

1 0 indicate which regions of a NOVX polypeptide are particularly hydrophilic and, therefore, are 
likely to encode surface residues useful for targeting antibody production. As a means for 
targeting antibody production, hydropathy plots showing regions of hydrophilicity and 
hydrophobicity may be generated by any method well known in the art, including, for 
example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier 

15 transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; 
Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each incorporated herein by reference in 
their entirety. Antibodies that are specific for one or more domains within an antigenic protein, or 
derivatives, fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 

20 thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal 
or monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 

25 Manual, Harlow E, and Lane D, 1 988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 
Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
30 protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 

immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples of 
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such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response include, 
but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum 
5 hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille 
Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents. 
Additional examples of adjuvants which can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

10 The polyclonal antibody molecules directed against the immunogenic protein can be 

isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 

1 5 immobilized on a column to purify the immune specific antibody by immunoaffinity 

chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson 
(The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 
2000), pp. 25-28). 

20 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) of 
25 the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
contain an antigen binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, 
30 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 
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The immunizing agent will typically include the protein antigen, a fragment thereof or 
a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of 
human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 

5 line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
[Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59- 
103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells can be cultured in a suitable culture medium that preferably 

0 contains one or more substances that inhibit the growth or survival of the unfused, 

immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

5 Preferred immortalized cell lines are those that fuse efficiently, support stable high 

level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San 
Diego, California and the American Type Culture Collection, Manassas, Virginia. Human 

) myeloma and mouse-human heteromyeloma cell lines also have been described for the 

production of human monoclonal antibodies rKozbor. J. Immunol.. 133:3001 (1984); Brodeur 
et al., Monoclonal Antibody Pro duction Techniques and Ap plications Marcel Dekker, Inc., 
New York, (1 987) pp. 5 1 -63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for 

> the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 

> the Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). It is an 
objective, especially important in therapeutic applications of monoclonal antibodies, to 
identify antibodies having a high degree of specificity and a high binding affinity for the target 
antigen. 



815 



WO 02/068649 PCT/US02/02785 

After the desired hybrid oma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods (Goding,1986). Suitable culture 
media for this purpose include, for example, Dulbecco f s Modified Eaglet Medium and RPMI- 
1640 medium- Alternatively, the hybridoma cells can be grown in vivo as ascites in a 
5 mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from 
the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

10 The monoclonal antibodies can also be made by recombinant DNA methods, such as 

those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 

15 preferred source of such DNA. Once isolated, the DNA can be placed into expression vectors, 
which are then transfected into host cells such as simian COS cells, Chinese hamster ovary 
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also 
can be modified, for example, by substituting the coding sequence for human heavy and light 

20 chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the 
immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant 
domains of an antibody of the invention, or can be substituted for the variable domains of one 

25 antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody. 

Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
30 administration to humans without engendering an immune response by the human against the 
administered immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab ! )2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
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Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature, 321:522-525 (1986); Riechmann et al. Nature. 332:323-327 (1988); Verhoeyen et al., 
Science, 239: 1 534-1 536 (1 988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
5 instances, Fv framework residues of the human immunoglobulin are replaced by 

corresponding non-human residues. Humanized antibodies can also comprise residues which 
are found neither in the recipient antibody nor in the imported CDR or framework sequences. 
In general, the humanized antibody will comprise substantially all of at least one, and typically 
two, variable domains, in which all or substantially all of the CDR regions correspond to those 
10 of a non-human immunoglobulin and all or substantially all of the framework regions are 
those of a human immunoglobulin consensus sequence. The humanized antibody optimally 
also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that 
of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. 
Struct. Biol.. 2:593-596 (1992)). 



15 



Human Antibodies 



Fully human antibodies essentially relate to antibody molecules in which the entire 
sequence of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 

20 Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 

hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 
antibodies may be utilized in the practice of the present invention and may be produced by 

25 using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or 
by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol.. 227:381 (1991); 
30 Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This 
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approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 
5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Biotechnology 10. 779-783 (1992)); 
Lonberg et al. (Nature 368 856-859 (1994)); Morrison ( Nature 368, 812-13 (1994)); Fishwild 
et al,( Nature Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology J4, 826 
5 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 (1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains in 

10 the nonhuman host have been incapacitated, and active loci encoding human heavy and light 
chain immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite human 
DNA segments. An animal which provides all the desired modifications is then obtained as 
progeny by crossbreeding intermediate transgenic animals containing fewer than the full 

15 complement of the modifications. The preferred embodiment of such a nonhuman animal is a 
mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and 
WO 96/34096. This animal produces B cells which secrete fully human immunoglobulins. 
The antibodies can be obtained directly from the animal after immunization with an 
immunogen of interest, as,' for example, a preparation of a polyclonal antibody, or alternatively 

20 from immortalized B cells derived from the animal, such as hybridomas producing 

monoclonal antibodies. Additionally, the genes encoding the immunoglobulins with human 
variable regions can be recovered and expressed to obtain the antibodies directly, or can be 
further modified to obtain analogs of antibodies such as, for example, single chain Fv 
molecules. 

25 An example of a method of producing a nonhuman host, exemplified as a mouse, 

lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent 
No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at 
least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of 
the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain 

30 locus, the deletion being effected by a targeting vector containing a gene encoding a selectable 
marker, and producing from the embryonic stem cell a transgenic mouse whose somatic and 
germ cells contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed 
in U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 
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nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, 
introducing an expression vector containing a nucleotide sequence encoding a light chain into 
another mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell 
expresses an antibody containing the heavy chain and the light chain. 
5 In a further improvement on this procedure, a method for identifying a clinically 

relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

10 Fab Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of F ab expression 
libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective 

1 5 identification of monoclonal F a b fragments with the desired specificity for a protein or 

derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 
idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F (ab . )2 fragment produced by pepsin digestion of an antibody molecule; (ii) 
an F ab fragment generated by reducing the disulfide bridges of an F w fragment; (iii) an F^ 

20 fragment generated by the treatment of the antibody molecule with papain and a reducing 
agent and (iv) F v fragments. 



Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is 
any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature. 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce 
a potential mixture of ten different antibody molecules, of which only one has the correct 
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bispecific structure. The purification of the correct molecule is usually accomplished by 
affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 
13 May 1993, and in Traunecker et al., EMBOJL 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen 
5 combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant 
region (CHI) containing the site necessary for light-chain binding present in at least one of the 
fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 

10 immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et ah, Methods in Enzymology, 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which 

15 are recovered from recombinant cell culture. The preferred interface comprises at least a part 
of the CH3 region of an antibody constant domain. In this method, one or more small amino 
acid side chains from the interface of the first antibody molecule are replaced with larger side 
chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the 
large side chain(s) are created on the interface of the second antibody molecule by replacing 

20 large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a 

mechanism for increasing the yield of the heterodimer over other unwanted end-products such 
as homodimers. 

Bispecific antibodies can be prepared as fiill length antibodies or antibody fragments 
(e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from 

25 antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. 
These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite 
to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 

30 fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the 
Fab'-TNB derivatives is then reconverted to the Fab' -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative 
to form the bispecific antibody. The bispecific antibodies produced can be used as agents for 
the selective immobilization of enzymes. 
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Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al. 5 J. Exp. Med. 175:217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab' 
fragment was separately secreted from E. coli and subjected to directed chemical coupling in 
5 vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to 
cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly 
from recombinant cell culture have also been described. For example, bispecific antibodies 

1 0 have been produced using leucine zippers. Kostelny et ah, J. Immunol. 148(5): 1 547-1 553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were reduced 
at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. 
This method can also be utilized for the production of antibody homodimers. The "diabody" 

1 5 technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has 
provided an alternative mechanism for making bispecific antibody fragments. The fragments 
comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) 
by a linker which is too short to allow pairing between the two domains on the same chain. 
Accordingly, the V H and V L domains of one fragment are forced to pair with the 

20 complementary V L and V H domains of another fragment, thereby forming two antigen-binding 
sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv 
(sFv) dimers has also been reported. See, Gruber et ah, J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al, J. Immunol. 1 47:60 ( 1 99 1 ). 

25 Exemplary bispecific antibodies can bind to two different epitopes, at least one of 

which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm 
of an immunoglobulin molecule can be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or 
Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD16) so as 

30 to focus cellular defense mechanisms to the cell expressing the particular antigen. Bispecific 
antibodies can also be used to direct cytotoxic agents to cells which express a particular 
antigen. These antibodies possess an antigen-binding arm and an arm which binds a cytotoxic 
agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another 
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bispecific antibody of interest binds the protein antigen described herein and further binds 
tissue factor (TF). 

Heteroconjugate Antibodies 

5 Heteroconjugate antibodies are also within the scope of the present invention. 

Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
antibodies have, for example, been proposed to target immune system cells to unwanted cells 
(U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
10 known methods in synthetic protein chemistry, including those involving crosslinking agents. 
For example, immunotoxins can be constructed using a disulfide exchange reaction or by 
forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate 
and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 
4,676,980. 

15 . 
Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain 

20 disulfide bond formation in this region. The homodimeric antibody thus generated can have 
improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron et ah, J. Exp Med .. 176 : 1191- 
1 1 95 ( 1 992) and Shopes, J. Immunol .. 148 : 291 8-2922 (1 992). Homodimeric antibodies with 
enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as 

25 described in Wolff et al. Cancer Research. 53: 2560-2565 (1 993). Alternatively, an antibody 
can be engineered that has dual Fc regions and can thereby have enhanced complement lysis 
and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design. 3: 219-230 (1989). 

Immunoconjugates 

30 The invention also pertains to immunoconjugates comprising an antibody conjugated 

to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active 

toxin of bacterial,, fungal, plant, or animal origin, or fragments thereof), or a radioactive 

isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 

35 been described above. Enzymatically active toxins and fragments thereof that can be used 
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include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain 
(from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, 
5 gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples 
include 2l2 Bi, ,3, I, 13I In, 90 Y, and ,86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate 

10 (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates 
(such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro- 

15 2,4-dmitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
VitettaetaL Science. 238: 1098(1987). Carbon- 1 4-labeled l-isothiocyanatobenzyl-3- 
methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for 
conjugation of radionucleotide to the antibody. See W094/1 1026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 

20 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 

25 Immunoliposomes 

The antibodies disclosed herein can also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as 
described in Epstein et ah, Proc. Natl. Acad. Sci. USA. 82: 3688 (1985); Hwang et aL, Proc. 
Natl Acad. Sci.USA, 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. 
30 Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation 
method with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG- 
derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters of 
defined pore size to yield liposomes with the desired diameter. Fab 1 fragments of the antibody 
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of the present invention can be conjugated to the liposomes as described in Martin et al 
Biol. Chem.. 257 : 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic 
agent (such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al., J. 
National Cancer Inst. 81(191: 1484 (1989). 

5 

Diagnostic Applications of Antibodies Directed Against the Proteins of the Invention 

Antibodies directed against a protein of the invention may be used in methods known 
within the art relating to the localization and/or quantitation of the protein (e.g., for use in 
measuring levels of the protein within appropriate physiological samples, for use in diagnostic 
10 methods, for use in imaging the protein, and the like). In a given embodiment, antibodies 
against the proteins, or derivatives, fragments, analogs or homologs thereof, that contain the 
antigen binding domain, are utilized as pharmacologically-active compounds (see below). 

An antibody specific for a protein of the invention can be used to isolate the protein by 
standard techniques, such as immunoaffinity chromatography or immunoprecipitation. Such 

15 an antibody can facilitate the purification of the natural protein antigen from cells and of 

recombinantly produced antigen expressed in host cells. Moreover, such an antibody can be 
used to detect the antigenic protein (e.g., in a cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the antigenic protein. Antibodies directed 
against the protein can be used diagnostically to monitor protein levels in tissue as part of a 

20 clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (i.e., physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 

25 phosphatase, p-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent 
materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
. luminescent material includes luminol; examples of bioluminescent materials include 

30 luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 125 I, 
,3 V 5 Sor 3 H. 

Antibody Therapeutics 
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Antibodies of the invention, including polyclonal, monoclonal, humanized and fully 
human antibodies, may used as therapeutic agents. Such agents will generally be employed to 
treat or prevent a disease or pathology in a subject An antibody preparation, preferably one 
having high specificity and high affinity for its target antigen, is administered to the subject 
5 and will generally have an effect due to its binding with the target. Such an effect may be one 
of two kinds, depending on the specific nature of the interaction between the given antibody 
molecule and the target antigen in question. In the first instance, administration of the 
antibody may abrogate or inhibit the binding of the target with an endogenous Hgand to which 
it naturally binds. In this case, the antibody binds to the target and masks a binding site of the 

10 naturally occurring ligand, wherein the ligand serves as an effector molecule. Thus the 
receptor mediates a signal transduction pathway for which ligand is responsible. 

Alternatively, the effect may be one in which the antibody elicits a physiological result 
by virtue of binding to an effector binding site on the target molecule. In this case the target, a 
receptor having an endogenous ligand which may be absent or defective in the disease or 

1 5 pathology, binds the antibody as a surrogate effector ligand, initiating a receptor-based signal 
transduction event by the receptor. 

A therapeutically effective amount of an antibody of the invention relates generally to 
the amount needed to achieve a therapeutic objective. As noted above, this may be a binding 
interaction between the antibody and its target antigen that, in certain cases, interferes with the 

20 functioning of the target, and in other cases, promotes a physiological response. The amount 
required to be administered will furthermore depend on the binding affinity of the antibody for 
its specific antigen, and will also depend on the rate at which an administered antibody is 
depleted from the free volume other subject to which it is administered. Common ranges for 
therapeutically effective dosing of an antibody or antibody fragment of the invention may be, 

25 by way of nonlimiting example, from about 0.1 mg/kg body weight to about 50 mg/kg body 
weight. Common dosing frequencies may range, for example,, from twice daily to once a 
week. 

Pharmaceutical Compositions of Antibodies 
30 Antibodies specifically binding a protein of the invention, as well as other molecules 

identified by the screening assays disclosed herein, can be administered for the treatment of 
various disorders in the form of pharmaceutical compositions. Principles and considerations 
involved in preparing such compositions, as well as guidance in the choice of components are 
provided, for example, in Remington : The Science And Practice Of Pharmacy 19th ed. 
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(Alfonso R. Gennaro, et a!., editors) Mack Pub. Co., Easton, Pa. : 1995; Drug Absorption 
Enhancement : Concepts, Possibilities, Limitations, And Trends, Harwood Academic 
Publishers, Langhome, Pa., 1994; and Peptide And Protein Drug Delivery (Advances In 
Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York. 
5 If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 

internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the smallest 
inhibitory fragment that specifically binds to the binding domain of the target protein is 
preferred. For example, based upon the variable-region sequences of an antibody, peptide 

10 molecules can be designed that retain the ability to bind the target protein sequence. Such 
peptides can be synthesized chemically and/or produced by recombinant DNA technology. 
See, e.g., Marasco et ah, Proc. Natl. Acad. ScL USA, 90: 7889-7893 (1 993). The formulation 
herein can also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely 

15 affect each other. Alternatively, or in addition, the composition can comprise an agent that 
enhances its function, such as, for example, a cytotoxic agent, cytokine, chemotherapeutic 
agent, or growth-inhibitory agent. Such molecules are suitably present in combination in 
amounts that are effective for the purpose intended. 

The active ingredients can also be entrapped in microcapsules prepared, for example, 

20 by coacervation techniques or by interfacial polymerization, for example, 

hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

25 The formulations to be used for in vivo administration must be sterile. This is readily 

accomplished by filtration through sterile filtration membranes. 

Sustained-release preparations can be prepared. Suitable examples of sustained- 
release preparations include semipermeable matrices of solid hydrophobic polymers 
containing the antibody, which matrices are in the form of shaped articles, e.g., films, or 

30 microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for 
example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S. Pat. 
No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable 
ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON 
DEPOT ™ (injectable microspheres composed of lactic acid-glycolic acid copolymer and 
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leuprolide acetate), and poly-D-(-)~3-hydroxybutyric acid. While polymers such as ethylene- 
vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, 
certain hydrogels release proteins for shorter time periods. 

5 ELISA Assay 

An agent for detecting an analyte protein is an antibody capable of binding to an 
analyte protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, 
or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., F ab or F (ab)2 ) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 

10 encompass direct labeling of the probe or antibody by coupling (z.e., physically linking) a 
detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 
labeling include detection of a primary antibody using a fluorescently-labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 

15 fluorescently-labeled streptavidin. The term "biological sample" is intended to include tissues, 
cells and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. Included within the usage of the term "biological sample", therefore, is 
blood and a fraction or component of blood including blood serum, blood plasma, or lymph. 
That is, the detection method of the invention can be used to detect an analyte mRNA, protein, 

20 or genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of an analyte mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of an analyte protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of an analyte genomic DNA include 

25 Southern hybridizations. Procedures for conducting immunoassays are described, for example 
in "ELISA: Theory and Practice: Methods in Molecular Biology", Vol. 42, J. R. Crowther 
(Ed.) Human Press, Totowa, NJ, 1995; "Immunoassay", E. Diamandis and T. Christopoulus, 
Academic Press, Inc., San Diego, CA, 1996; and "Practice and Thory of Enzyme 
Immunoassays", P. Tijssen, Elsevier Science Publishers, Amsterdam, 1985. Furthermore, in 

30 vivo techniques for detection of an analyte protein include introducing into a subject a labeled 
anti-an analyte protein antibody. For example, the antibody can be labeled with a radioactive 
marker whose presence and location in a subject can be detected by standard imaging 
techniques. 
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NOVX Recombinant Expression Vectors and Host Cells 
Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an NOVX protein, or derivatives, fragments, analogs or 
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable 
5 of transporting another nucleic acid to which it has been linked. One type of vector is a 

"plasmid", which refers to a circular double stranded DNA loop into which additional DNA 
segments can be Iigated. Another type of vector is a viral vector, wherein additional DNA 
segments can be Iigated into the viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced (e.g., bacterial vectors having a 

10 bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., 
non-episomal mammalian vectors) are integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are 
operatively-linked. Such vectors are referred to herein as "expression vectors". In general, 

1 5 expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. 
In the present specification, "plasmid" and "vector" can be used interchangeably as the 
plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors, such as viral vectors (e.g., replication defective 
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions. 

20 The recombinant expression vectors of the invention comprise a nucleic acid of the 

invention in a form suitable for expression of the nucleic acid in a host cell, which means that 
the recombinant expression vectors include one or more regulatory sequences, selected on the 
basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid 
sequence to be expressed. Within a recombinant expression vector, "operably-linked" is 

25 intended to mean that the nucleotide sequence of interest is linked to the regulatory 

sequence(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell when the vector is introduced into the 
host cell). 

The term "regulatory sequence" is intended to includes promoters, enhancers and other 
30 expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell 
and those that direct expression of the nucleotide sequence only in certain host cells (e.g., 
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tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 
5 fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., NOVX 
proteins, mutant forms of NOVX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX proteins can be 
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 

10 vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 1 85, Academic Press, San 
Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 
Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 

1 5 vectors containing constitutive or inducible promoters directing the expression of either fusion 
or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors 
typically serve three purposes: (/) to increase expression of recombinant protein; (if) to 
increase the solubility of the recombinant protein; and (Hi) to aid in the purification of the 

20 recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression 
vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the recombinant protein from the fusion moiety 
subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors 

25 include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1 988. Gene 67: 3 1 -40), pMAL 
(New England Biolabs, Beverly, Mass.) and pFUT5 (Pharmacia, Piscataway, N.J.) that fuse 
glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the 
target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 

30 (Amrann etaL, (1988) Gene 69:301-315) andpET lid (Studier etaL, Gene EXPRESSION 
TECHNOLOGY: Methods IN ENZYMOLOGY 1 85, Academic Press, San Diego, Calif. (1990) 
60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express the 
protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant 
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protein. See, e.g., Gottesman, Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, Calif. (1990) 1 19-128. Another strategy is to alter the 
nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in E. coli (see, e.g., 
5 Wada, et aL 9 1 992. NucL Acids Res. 20: 2 1 1 1 -2 1 1 8). Such alteration of nucleic acid 
sequences of the invention can be carried out by standard DNA synthesis techniques. 

In another embodiment, the NOVX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
(Baldari, et aL, 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30: 

10 933-943), pJRY88 (Schultz et aL, 1987. Gene 54: 1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 

Alternatively, NOVX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 
SF9 cells) include the pAc series (Smith, et aL, 1983. MoL Cell. Biol. 3: 2156-2165) and the 

15 pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors 
include pCDM8 (Seed, 1 987. Nature 329: 840) and pMT2PC (Kaufman, et aL, 1 987. EMBO 
J. 6: 187-195). When used in mammalian cells, the expression vector's control functions are 

20 often provided by viral Fegulatory elements. For example, commonly used promoters are 

derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of 
Sambrook, et aL, MOLECULAR CLONING: A LABORATORY MANUAL. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

25 In another embodiment, the recombinant mammalian expression vector is capable of 

directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; Pinkert, et aL, 1987. Genes Dev. 1: 

30 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43: 

235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBOJ. 
8: 729-733) and immunoglobulins (Banerji, etaL, 1983. Ce//33: 729-740; Queen and 
Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament 
promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad Sci. USA 86: 5473-5477), 
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pancreas-specific promoters (Edlund, et al, 1985. Science 230: 912-916), and mammary , 
gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are also 
encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379) 
5 and the a-fetoprotein promoter (Campes and Tilghman, 1989. Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That 
is, the DNA molecule is operatively-Iinked to a regulatory sequence in a manner that allows 
for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 

10 NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 

antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression 
of antisense RNA. The antisense expression vector can be in the form of a recombinant 

15 plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined by the 
cell type into which the vector is introduced. For a discussion of the regulation of gene 
expression using antisense genes see, e.g., Weintraub, et al, "Antisense RNA as a molecular 
tool for genetic analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986. 

20 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms refer 
not only to the particular subject cell but also to the progeny or potential progeny of such a 
cell. Because certain modifications may occur in succeeding generations due to either 

25 mutation or environmental influences, such progeny may not, in fact, be identical to the parent 
cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein can 
be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to 

30 those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
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chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Suitable methods for transforming or transfecting host cells can be found in 
Sambrook, et at (MOLECULAR CLONING: A Laboratory Manual. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1 989), 
5 and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the 

1 0 host cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding 
NOVX or can be introduced on a separate vector. Cells stably transfected with the introduced 
nucleic acid can be identified by drug selection {e.g., cells that have incorporated the 

1 5 selectable marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
be used to produce (/.e., express) NOVX protein. Accordingly, the invention further provides 
methods for producing NOVX protein using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of invention (into which a 

20 recombinant expression vector encoding NOVX protein has been introduced) in a suitable 
medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 

Transgenic NOVX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
25 animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or 
an embryonic stem cell into which NOVX protein-coding sequences have been introduced. 
Such host cells can then be used to create non-human transgenic animals in which exogenous 
NOVX sequences have been introduced into their genome or homologous recombinant 
animals in which endogenous NOVX sequences have been altered. Such animals are useful 
30 for studying the function and/or activity of NOVX protein and for identifying and/or 

evaluating modulators of NOVX protein activity. As used herein, a "transgenic animal" is a 
non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in 
which one or more of the cells of the animal includes a transgene. Other examples of 
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transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, 
amphibians, etc. A transgene is exogenous DNA that is integrated into the genome of a cell 
from which a transgenic animal develops and that remains in the genome of the mature 
animal, thereby directing the expression of an encoded gene product in one or more cell types 
5 or tissues of the transgenic animal. As used herein, a "homologous recombinant animal" is a 
non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
NOVX gene has been altered by homologous recombination between the endogenous gene 
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell 
of the animal, prior to development of the animal. 

10 A transgenic animal of the invention can be created by introducing NOVX-encoding 

nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
human NOVX cDNA sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 and 
162 can be introduced as a transgene into the genome of a non-human animal. Alternatively, a 

1 5 non-human homologue of the human NOVX gene, such as a mouse NOVX gene, can be 

isolated based on hybridization to the human NOVX cDNA (described further supra) and used 
as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific 
regulatory sequence(s) can be operably-linked to the NOVX transgene to direct expression of 

20 NOVX protein to particular cells. Methods for generating transgenic animals via embryo 
manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866; 
4,870,009; and 4,873,191 ; and Hogan, 1986. In: Manipulating the Mouse Embryo, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. Similar methods are used for 

25 production of other transgenic animals. A transgenic founder animal can be identified based 
upon the presence of the NOVX transgene in its genome and/or expression of NOVX mRNA 
in tissues or cells of the animals. A transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, transgenic animals carrying a transgene- 
encoding NOVX protein can further be bred to other transgenic animals carrying other 

30 transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at 
least a portion of an NOVX gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene can 
be a human gene (e.g., the cDNA of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 
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162), but more preferably, is a non-human homologue of a human NOVX gene. For example, 
a mouse homologue of human NOVX gene of SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162 can be used to construct a homologous recombination vector suitable for 
altering an endogenous NOVX gene in the mouse genome. In one embodiment, the vector is 
5 designed such that, upon homologous recombination, the endogenous NOVX gene is 

functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a "knock 
out" vector). 

Alternatively, the vector can be designed such that, upon homologous recombination, 
the endogenous NOVX gene is mutated or otherwise altered but still encodes functional 

10 protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of 
the endogenous NOVX protein). In the homologous recombination vector, the altered portion 
of the NOVX gene is flanked at its 5 - and 3-termini by additional nucleic acid of the NOVX 
gene to allow for homologous recombination to occur between the exogenous NOVX gene 
earned by the vector and an endogenous NOVX gene in an embryonic stem cell. The 

1 5 additional flanking NOVX nucleic acid is of sufficient length for successful homologous 

recombination with the endogenous gene. Typically, several kilobases of flanking DNA (both 
at the 5 - and 3 -termini) are included in the vector. See, e.g., Thomas, et al, 1987. Cell 5 1 : 
503 for a description of homologous recombination vectors. The vector is ten introduced into 
an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced NOVX 

20 gene has homologously-recombined with the endogenous NOVX gene are selected. See, e.g., 
Li, et al, 1992. Cell 69: 915. 

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to 
form aggregation chimeras. See, e.g., Bradley, 1987. In: TERATOCARCINOMAS and 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 1 13-152. 

25 A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal 
and the embryo brought to term. Progeny harboring the homologously-recombined DNA in 
their germ cells can be used to breed animals in which all cells of the animal contain the 
homologously-recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 

30 described further in Bradley, 1 991. Curr. Opin. Biotechnol 2: 823-829; PCT International 
Publication Nos.: WO 90/1 1354; WO 91/01 140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI . For a description of the 
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cre/loxP recombinase system, See, e.g., Lakso, et aL, 1992. Proc. Natl Acad Set USA 89: 
6232-6236. Another example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae. See, O'Gorman, et al 9 1991. Science 251:1351-1355. Ifacre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
5 transgenes encoding both the Cre recombinase and a selected protein are required. Such 
animals can be provided through the construction of "double" transgenic animals, e.g., by 
mating two transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 

10 according to the methods described in Wilmut, et a/., 1997. Nature 385: 810-813. In brief, a 
cell (e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through the use of 
electrical pulses, to an enucleated oocyte from an animal of the same species from which the 
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to 

1 5 morula or blastocyte and then transferred to pseudopregnant female foster animal. The 

offspring borne of this female foster animal will be a clone of the animal from which the cell 
(e.g., the somatic cell) is isolated. 

Pharmaceutical Compositions 

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies (also 
20 referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
and homologs thereof, can be incorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutically acceptable carrier. As used herein, "pharmaceutical^ 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 
25 antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Suitable carriers are described in the most 
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% 
30 human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be 
used. The use of such media and agents for pharmaceutically active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible with the active 
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compound, use thereof in the compositions is contemplated. Supplementary active 
compounds can also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include parenteral, 
5 e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical), 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile 
diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, 
propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or 

10 methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such 
as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 
adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 

15 glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 

20 Parsippany, NJ.) or phosphate buffered saline (PBS). In all cases, the composition must be 
sterile and should be fluid to the extent that easy syringeability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 

25 propylene glyctil, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. . 
The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by 
the maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic 

30 acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about by 
including in the composition an agent which delays absorption, for example, aluminuifc 
monostearate and gelatin. 
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Sterile injectable solutions can be prepared by incorporating the active compound (e.g., 
an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
5 sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

10 Oral compositions generally include an inert diluent or an edible carrier. They can be 

enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form 
of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and 

15 swished and expectorated or swallowed. Pharmaceutically compatible binding agents, and/or 
adjuvant materials can be included as part of the composition. The tablets, pills, capsules, 
troches and the like can contain any of the following ingredients, or compounds of a similar 
nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient 
such as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a 

20 lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, e.g., 

25 a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, detergents, bile salts, and fosidic acid 

30 derivatives. Transmucosal administration can be accomplished through the use of nasal sprays 
or suppositories. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in the art. 
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The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention enemas 
for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will protect 
5 the compound against rapid elimination from the body, such as a controlled release 

formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of 
such formulations will be apparent to those skilled in the art. The materials can also be 
10 obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 

suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral 
antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. 
Patent No. 4,522,811. 

15 It is especially advantageous to formulate oral or parenteral compositions in dosage 

unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active compound calculated to produce the 
desired therapeutic effect in association with the required pharmaceutical carrier. The 

20 specification for the dosage unit forms of the invention are dictated by and directly dependent 
on the unique characteristics of the active compound and the particular therapeutic effect to be 
achieved, and the limitations inherent in the art of compounding such an active compound for 
the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used as 

25 gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, 
intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) or by 
stereotactic injection (see, e.g., Chen, etal, 1994. Proc. Natl Acad. Sci. USA 91: 3054-3057). 
The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery 

30 vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced 
intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can 
include one or more cells that produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 
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Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications), 
to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in an NOVX gene, 
5 and to modulate NOVX activity, as described further, below. In addition, the NOVX proteins 
can be used to screen drugs or compounds that modulate the NOVX protein activity or 
expression as well as to treat disorders characterized by insufficient or excessive production of 
NOVX protein or production of NOVX protein forms that have decreased or aberrant activity 
compared to NOVX wild-type protein (e.g.; diabetes (regulates insulin release); obesity (binds 
1 0 and transport lipids); metabolic disturbances associated with obesity, the metabolic syndrome 
X as well as anorexia and wasting disorders associated with chronic diseases and various 
cancers, and infectious disease(possesses anti-microbial activity) and the various 
dyslipidemias. In addition, the anti-NOVX antibodies of the invention can be used to detect 
and isolate NOVX proteins and modulate NOVX activity. In yet a further aspect, the invention 
1 5 can be used in methods to influence appetite, absorption of nutrients and the disposition of 
metabolic substrates in both a positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 

20 Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, Le. 9 candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein activity. 
25 The invention also includes compounds identified in the screening assays described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an NOVX 
- protein or polypeptide or biologically-active portion thereof. The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 
30 methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide libraries, 



839 



WO 02/068649 



PCT/US02/02785 



while the other four approaches are applicable to peptide, non-peptide oligomer or small 
molecule libraries of compounds. See, e.g., Lam, 1997 '. Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
5 molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, 
lipids or other organic or inorganic molecules. Libraries of chemical and/or biological 
mixtures, such as fungal, bacterial, or algal extracts, are known in the art and can be screened 
with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, 
10 for example in: DeWitt, et al, 1993. Proc. Natl. Acad Sci. USA. 90: 6909; Erb, et aL, 1994. 
Proc. Natl Acad ScL U.S.A. 91: 11422;Zuckermann, etaL, 1994. J. Med Chem. 37: 2678; 
Cho, etaL, 1993. Science 261: 1303; Carrell, etaL, l994.Angew. Chem. Int. Ed Engl 33: 
2059; Carell, etaL, l994.Angew. Chem.Int.Ed. Engl 33:2061;and Gallop,^ a/., 1994. J. 
Med Chem. 37: 1233. 

1 5 Libraries of compounds may be presented in solution (e.g. , Houghten, 1 992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 
1 993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner, 
U.S. Patent 5,233,409), plasmids (Cull, etal, 1992. Proc. Natl Acad ScL USA 89: 
1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 

20 249: 404-406; Cwirla, et aL, 1990. Proa Natl Acad ScL USA. 87: 6378-6382; Felici, 1991. 
J. Mol Biol 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the cell 
surface is contacted with a test compound and the ability of the test compound to bind to an 

25 NOVX protein determined. The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the NOVX protein can be 
accomplished, for example, by coupling the test compound with a radioisotope or enzymatic 
label such that binding of the test compound to the NOVX protein or biologically-active 
portion thereof can be determined by detecting the labeled compound in a complex. For 

30 example, test compounds can be labeled with 125 1, 35 S, ,4 C, or 3 H, either directly or indirectly, 
and the radioisotope detected by direct counting of radioemission or by scintillation counting. 
Alternatively, test compounds can be enzymatically-labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 
determination of conversion of an appropriate substrate to product. In one embodiment, the 
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assay comprises contacting a cell which expresses a membrane-bound form of NOVX protein, 
or a biologically-active portion thereof, on the cell surface with a known compound which 
binds NOVX to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with an NOVX protein, wherein 
5 determining the ability of the test compound to interact with an NOVX protein comprises 
determining the ability of the test compound to preferentially bind to NOVX protein or a 
biologically-active portion thereof as compared to the known compound. 

In another embodiment,.an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of NOVX protein, or a biologically-active portion thereof, 
10 on the cell surface with a test compound and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or biologically-active 
portion thereof. Determining the ability of the test compound to modulate the activity of 
NOVX or a biologically-active portion thereof can be accomplished, for example, by 
determining the ability of the NOVX protein to bind to or interact with an NOVX target 
1 5 molecule. As used herein, a "target molecule" is a molecule with which an NOVX protein 
binds or interacts in nature, for example, a molecule on the surface of a cell which expresses 
an NOVX interacting protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane or a 
cytoplasmic molecule. An NOVX target molecule can be a non-NOVX molecule or an 
20 NOVX protein or polypeptide of the invention. In one embodiment, an NOVX target 

molecule is a component of a signal transduction pathway that facilitates transduction of an 
extracellular signal (e.g. a signal generated by binding of a compound to a membrane-bound 
NOVX molecule) through the cell membrane and into the cell. The target, for example, can be 
a second intercellular protein that has catalytic activity or a protein that facilitates the 
25 association of downstream signaling molecules with NOVX. 

Determining the ability of the NOVX protein to bind to or interact with an NOVX 
target molecule can be accomplished by one of the methods described above for determining 
direct binding. In one embodiment, determining the ability of the NOVX protein to bind to or 
interact with an NOVX target molecule can be accomplished by determining the activity of the 
30 target molecule. For example, the activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the target (i.e. intracellular Ca 2+ , 
diacylglycerol, IP 3 , etc.), detecting catalytic/enzymatic activity of the target an appropriate 
substrate, detecting the induction of a reporter gene (comprising an NOVX-responsive 
regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g., 

841 



WO 02/068649 PCT/US02/02785 

luciferase), or detecting a cellular response, for example, cell survival, cellular differentiation, 
or cell proliferation. . 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an NOVX protein or biologically-active portion thereof with a test compound and 
5 determining the ability of the test compound to bind to the NOVX protein or biologically- 
active portion thereof. Binding of the test compound to the NOVX protein can be determined 
either directly or indirectly as described above. In one such embodiment, the assay comprises 
contacting the NOVX protein or biologically-active portion thereof with a known compound 
which binds NOVX to form an assay mixture, contacting the assay mixture with a test 

1 0 compound, and determining the ability of the test compound to interact with an NOVX 
protein, wherein determining the ability of the test compound to interact with an NOVX 
protein comprises determining the ability of the test compound to preferentially bind to NOVX 
or biologically-active portion thereof as compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting NOVX 

1 5 protein or biologically-active portion thereof with a test compound and determining the ability 
of the test compound to modulate (e.g. stimulate or inhibit) the activity of the NOVX protein 
or biologically-active portion thereof. Determining the ability of the test compound to 
modulate the activity of NOVX can be accomplished, for example, by determining the ability 
of the NOVX protein to bind to an NOVX target molecule by one of the methods described 

20 above for determining direct binding. In an alternative embodiment, determining the ability of 
the test compound to modulate the activity of NOVX protein can be accomplished by 
determining the ability of the NOVX protein further modulate an NOVX target molecule. For 
example, the catalytic/enzymatic activity of the target molecule on an appropriate substrate 
can be determined as described, supra. 

25 In yet another embodiment, the cell-free assay comprises contacting the NOVX protein 

or biologically-active portion thereof with a known compound which binds NOVX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with an NOVX protein, wherein determining the 
ability of the test compound to interact with an NOVX protein comprises determining the 

30 ability of the NOVX protein to preferentially bind to or modulate the activity of an NOVX 
target molecule. 

The cell-free assays of the invention are amenable to use of both the soluble form or 
the membrane-bound form of NOVX protein. In the case of cell-free assays comprising the 
membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing agent 
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such that the membrane-bound form of NOVX protein is maintained in solution. Examples of 
such solubilizing agents include non-ionic detergents such as n-octylglucoside, 
n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 
5 Isotridecypoly(ethylene glycol ether) n , N-dodecyl-N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol- 1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may be 
desirable to immobilize either NOVX protein or its target molecule to facilitate separation of 
1 0 complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to NOVX protein, or interaction of 
NOVX protein with a target molecule in the presence and absence of a candidate compound, 
can be accomplished in any vessel suitable for containing the reactants. Examples of such 
vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a 
1 5 fusion protein can be provided that adds a domain that allows one or both of the proteins to be 
bound to a matrix. For example, GST-NOVX fusion proteins or GST-target fusion proteins 
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or NOVX protein, and the mixture is 
20 incubated under conditions conducive to complex formation (e.g., at physiological conditions 
for salt and pH). Following incubation, the beads or microtiter plate wells are washed to 
remove any unbound components, the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as described, supra. Alternatively, the 
complexes can be dissociated from the matrix, and the level of NOVX protein binding or 
25 activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
NOVX protein or target molecules can be prepared from biotin-NHS 
30 (N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation kit, 
Pierce Chemicals, Rockford, 111.), and immobilized in the wells of streptavidin-coated 96 well 
plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein or target 
molecules, but which do not interfere with binding of the NOVX protein to its target molecule, 
can be derivatized to the wells of the plate, and unbound target or NOVX protein trapped in 
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the wells by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes 
using antibodies reactive with the NOVX protein or target molecule, as well as enzyme-linked 
assays that rely on detecting an enzymatic activity associated with the NOVX protein or target 
5 molecule. 

In another embodiment, modulators of NOVX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of NOVX 
mRNA or protein in the cell is determined. The level of expression of NOVX mRNA or 
protein in the presence of the candidate compound is compared to the level of expression of 

10 NOVX mRNA or protein in the absence of the candidate compound. The candidate 

compound can then be identified as a modulator of NOVX mRNA or protein expression based 
upon this comparison. For example, when expression of NOVX mRNA or protein is greater 
(/.*., statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of NOVX mRNA or protein 

15 expression. Alternatively, when expression of NOVX mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of NOVX mRNA or protein expression. The level of 
NOVX mRNA or protein expression in the cells can be determined by methods described 
herein for detecting NOVX mRNA or protein. 

20 In yet another aspect of the invention, the NOVX proteins can be used as "bait 

proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; 
Zervos, etaL, 1993. Cellll: 223-232; Madura, etal, 1993. J. Biol Chem. 268: 12046-12054; 
Bartel, et a/., 1993. Biotechniques 14: 920-924; Iwabuchi, et al. 9 1993. Oncogene 8: 
1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact with 

25 NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX activity. Such 
NOVX-binding proteins are also likely to be involved in the propagation of signals by the 
NOVX proteins as, for example, upstream or downstream elements of the NOVX pathway. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 

30 two different DNA constructs. In one construct, the gene that codes for NOVX is fused to a 
gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the 
other construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified protein ("prey" or "sample") is fused to a gene that codes for the activation 
domain of the known transcription factor. If the "bait" and the "prey" proteins are able to 
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interact, in vivo, forming an NOVX-dependent complex, the DNA-binding and activation 
domains of the transcription factor are brought into close proximity. This proximity allows 
transcription of a reporter gene {e.g., LacZ) that is operably linked to a transcriptional 
regulatory site responsive to the transcription factor. Expression of the reporter gene can be 
5 detected and cell colonies containing the functional transcription factor can be isolated and 
used to obtain the cloned gene that encodes the protein which interacts with NOVX. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein. 

Detection Assays 

1 0 Portions or fragments of the cDNA sequences identified herein (and the corresponding 

complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way 
of example, and not of limitation, these sequences can be used to: (i) map their respective 
genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (it) 
identify an individual from a minute biological sample (tissue typing); and (Hi) aid in forensic 

15 identification of a biological sample. Some of these applications are described in the 
subsections, below. 



Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the NOVX sequences, SEQ ID 
NOS:2n-l, wherein n is an integer between 1 and 162, or fragments or derivatives thereof, can 
be used to map the location of the NOVX genes, respectively,, on a chromosome. The 
mapping of the NOVX sequences to chromosomes is an important first step in correlating 
these sequences with genes associated with disease. 

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the NOVX, 
sequences can be used to rapidly select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing individual human chromosomes. Only 
those hybrids containing the human gene corresponding to the NOVX sequences will yield an 
amplified fragment. 
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Somatic cell hybrids are prepared by fusing somatic cells from different mammals 
(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, they 
gradually lose human chromosomes in random order, but retain the mouse chromosomes. By 
using media in which mouse cells cannot grow, because they lack a particular enzyme, but in 
5 which human cells can, the one human chromosome that contains the gene encoding the 
needed enzyme will be retained. By using various media, panels of hybrid cell lines can be 
established. Each cell line in a panel contains either a single human chromosome or a small 
number of human chromosomes, and a full set of mouse chromosomes, allowing easy 
mapping of individual genes to specific human chromosomes. See, e.g., DEustachio, et al. 9 
10 1983. Science 220: 919-924. Somatic cell hybrids containing only fragments of human 
chromosomes can also be produced by using human chromosomes with translocations and 
deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day using 

15 a single thermal cycler. Using the NOVX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose division has been blocked in 

20 metaphase by a chemical like colcemid that disrupts the mitotic spindle. The chromosomes 
can be treated briefly with trypsin, and then stained with Giemsa. A pattern of light and dark 
bands develops on each chromosome, so that the chromosomes can be identified individually. 
The FISH technique can be used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher likelihood of binding to a unique 

25 chromosomal location with sufficient signal intensity for simple detection. Preferably 1,000 
bases, and more preferably 2,000 bases, will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see, Verma, et ah, HUMAN CHROMOSOMES: A 
Manual of Basic Techniques (Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 

30 chromosome or a single site on that chromosome, or panels of reagents can be used for 

marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding 
regions of the genes actually are preferred for mapping purposes. Coding sequences are more 
likely to be conserved within gene families, thus increasing the chance of cross hybridizations 
during chromosomal mapping. 
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Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, e.g., in McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library). The relationship between genes 
5 and disease, mapped to the same chromosomal region, can then be identified through linkage 
analysis (co-inheritance of physically adjacent genes), described in, e.g., Egeland, et aL, 1987. 
Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a mutation is 
10 observed in some or all of the affected individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves first looking for structural alterations in 
the chromosomes, such as deletions or translocations that are visible from chromosome 
spreads or detectable using PCR based on that DNA sequence. Ultimately, complete 
1 5 sequencing of genes from several individuals can be performed to confirm the presence of a 
mutation and to distinguish mutations from polymorphisms. 

Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals from 
20 minute biological samples. In this technique, an individual's genomic DNA is digested with 
one or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 
Furthermore, the sequences of the invention can be used to provide an alternative 
25 technique that determines the actual base-by-base DNA sequence of selected portions of an 
individual's genome. Thus, the NOVX sequences described herein can be used to prepare two 
PCR primers from the 5'- and 3 f -termini of the sequences. These primers can then be used to 
amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this manner, 
30 can provide unique individual identifications, as each individual will have a unique set of such 
DNA sequences due to allelic differences. The sequences of the invention can be used to 
obtain such identification sequences from individuals and from tissue. The NOVX sequences 
of the invention uniquely represent portions of the human genome. Allelic variation occurs to 
some degree in the coding regions of these sequences, and to a greater degree in the noncoding 
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regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include restriction fragment length polymorphisms 
(RFLPs). 

5 Each of the sequences described herein can, to some degree, be used as a standard 

against which DNA from an individual can be compared for identification purposes. Because 
greater numbers of polymorphisms occur in the noncoding regions, fewer sequences are 
necessary to differentiate individuals. The noncoding sequences can comfortably provide 
positive individual identification with a panel of perhaps 10 to 1,000 primers that each yield a 
10 noncoding amplified sequence of 100 bases. If predicted coding sequences, such as those in 
SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162 are used, a more appropriate 
number of primers for positive individual identification would be 500-2,000. 

Predictive Medicine 

15 The invention also pertains to the field of predictive medicine in which diagnostic 

assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. Accordingly, 
one aspect of the invention relates to diagnostic assays for determining NOVX protein and/or 
nucleic acid expression as well as NOVX activity, in the context of a biological sample (e.g., 

20 blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant NOVX 
expression or activity. The disorders include metabolic disorders, diabetes, obesity, infectious 
disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, 
Alzheimer's Disease, Parkinson's Disorder, immune disorders, and hematopoietic disorders, 

25 and the various dyslipidemias, metabolic disturbances associated with obesity, the metabolic 
syndrome X and wasting disorders associated with chronic diseases and various cancers. The 
invention also provides for prognostic (or predictive) assays for determining whether an 
individual is at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. For example, mutations in an NOVX gene can be assayed in a 

30 biological sample. Such assays can be used for prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset of a disorder characterized by or 
associated with NOVX protein, nucleic acid expression, or biological activity. 

Another aspect of the invention provides methods for determining NOVX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
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prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents {e.g., drugs) for therapeutic or 
prophylactic treatment of an individual based on the genotype of the individual (e.g., the 
genotype of the individual examined to determine the ability of the individual to respond to a 
5 particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of NOVX in clinical trials. 

These and other agents are described in further detail in the following sections. 

10 Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting NOVX protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes NOVX protein such that the presence of NOVX is 

1 5 detected in the biological sample. An agent for detecting NOVX mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to NOVX mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length NOVX nucleic acid, such as the nucleic 
acid of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or a portion thereof, 
such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 

20 sufficient to specifically hybridize under stringent conditions to NOVX mRNA or genomic 
DNA. Other suitable probes for use in the diagnostic assays of the invention are described 
herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 

25 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab f ) 2 ) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass 
direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or antibody by 
reactivity with another reagent that is directly labeled. Examples of indirect labeling include 

30 detection of a primary antibody using a fluorescently-labeled secondary antibody and 
end-labeling of a DNA probe with biotin such that it can be detected with fluorescently- 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells and 
biological fluids isolated from a subject, as well as tissues, cells and fluids present within a 
subject. That is, the detection method of the invention can be used to detect NOVX mRNA, 
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protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of NOVX mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of NOVX protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
5 immunofluorescence. In vitro techniques for detection of NOVX genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of NOVX protein 
include introducing into a subject a labeled anti-NOVX antibody. For example, the antibody 
can be labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

10 In one embodiment, the biological sample contains protein molecules from the test 

subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject A preferred biological sample is a 
peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 

1 5 sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and comparing 
the presence of NOVX protein, mRNA or genomic DNA in the control sample with the 
presence of NOVX protein, mRNA or genomic DNA in the test sample. 

20 The invention also encompasses kits for detecting the presence of NOVX in a 

biological sample. For example, the kit can comprise: a labeled compound or agent capable of 
detecting NOVX protein or mRNA in a biological sample; means for deteimining the amount 
of NOVX in the sample; and means for comparing the amount of NOVX in the sample with a 
standard. The compound or agent can be packaged in a suitable container. The kit can further 

25 comprise instructions for using the kit to detect NOVX protein or nucleic acid. 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant NOVX 
30 expression or activity. For example, the assays described herein, such as the preceding 

diagnostic assays or the following assays, can be utilized to identify a subject having or at risk 
of developing a disorder associated with NOVX protein, nucleic acid expression or activity. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for 
developing a disease or disorder. Thus, the invention provides a method for identifying a 
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disease or disorder associated with aberrant NOVX expression or activity in which a test 
sample is obtained from a subject and NOVX protein or nucleic acid (e.g., mRNA, genomic 
DNA) is detected, wherein the presence of NOVX protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or disorder associated with aberrant NOVX 
5 expression or activity. As used herein, a "test sample" refers to a biological sample obtained 
from a subject of interest. For example, a test sample can be a biological fluid (e.g., serum), 
cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether 
a subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 

10 peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant NOVX expression or activity. For example, such methods can be 
used to determine whether a subject can be effectively treated with an agent for a disorder. 
Thus, the invention provides methods for determining whether a subject can be effectively 
treated with an agent for a disorder associated with aberrant NOVX expression or activity in 

1 5 which a test sample is obtained and NOVX protein or nucleic acid is detected (e.g. , wherein 
the presence of NOVX protein or nucleic acid is diagnostic for a subject that can be 
administered the agent to treat a disorder associated with aberrant NOVX expression or 
activity). 

The methods of the invention can also be used to detect genetic lesions in an NOVX 
20 gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 

characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 
genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an NOVX-protein, or the misexpression of the NOVX gene. For example, such 
25 genetic lesions can be detected by ascertaining the existence of at least one of: (/) a deletion of 
one or more nucleotides from an NOVX gene; (if) an addition of one or more nucleotides to an 
NOVX gene; (Hi) a substitution of one or more nucleotides of an NOVX gene, (fv) a 
chromosomal rearrangement of an NOVX gene; (v) an alteration in the level of a messenger 
RNA transcript of an NOVX gene, (vz) aberrant modification of an NOVX gene, such as of the 
30 methylation pattern of the genomic DNA, (v/i) the presence of a non-wild-type splicing pattern 
of a messenger RNA transcript of an NOVX gene, (vw) a non-wild-type level of an NOVX 
protein, (ix) allelic loss of an NOVX gene, and (x) inappropriate post-translational 
modification of an NOVX protein. As described herein, there are a large number of assay 
techniques known in the art which can be used for detecting lesions in an NOVX gene. A 
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preferred biological sample is a peripheral blood leukocyte sample isolated by conventional 
means from a subject. However, any biological sample containing nucleated cells may be 
used, including, for example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
5 polymerase chain reaction (PCR) {see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such - 
as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., 
Landegran, et aL, 1988. Science 241: 1077-1080; and Nakazawa, et aL, 1994. Proc. Natl. 
Acad. Set USA 91: 360-364), the latter of which can be particularly useful for detecting point 
mutations in the NOVX-gene (see, Abravaya, et aL, 1995. Nucl. Acids Res. 23: 675-682). 

10 This method can include the steps of collecting a sample of cells from a patient, isolating 
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the 
nucleic acid sample with one or more primers that specifically hybridize to an NOVX gene 
under conditions such that hybridization and amplification of the NOVX gene (if present) 
occurs, and detecting the presence or absence of an amplification product, or detecting the size 

15 of the amplification product and comparing the length to a control sample. It is anticipated 
that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication (see, 
Guatelli, et aL, 1990. Proc. Natl Acad. Sci. USA 87: 1874-1878), transcriptional amplification 

20 system (see, Kwoh, et aL, 1989. Proc. Natl. Acad Sci. USA 86: 1 173-1 177); Qp Replicase 
(see, Lizardi, et al, 1 988. BioTechnology 6: 1 197), or any other nucleic acid amplification 
method, followed by the detection of the amplified molecules using techniques well known to 
those of skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

25 In an alternative embodiment, mutations in an NOVX gene from a sample cell can be 

identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in 

30 the sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,493,531) can be used to score for the presence of specific mutations by development or 
loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations in NOVX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing 
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hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et aU 1996. Human 
Mutation 7: 244-255; Kozal, et a!., 1 996. Nat. Med. 2: 753-759. For example, genetic 
mutations in NOVX can be identified in two dimensional arrays containing light-generated 
DNA probes as described in Cronin, et al, supra. Briefly, a first hybridization array of probes 
5 can be used to scan through long stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of sequential overlapping probes. 
This step allows the identification of point mutations. This is followed by a second 
hybridization array that allows the characterization of specific mutations by using smaller, 
specialized probe arrays complementary to all variants or mutations detected. Each mutation 

1 0 array is composed of parallel probe sets, one complementary to the wild-type gene and the 
other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the art 
can be used to directly sequence the NOVX gene and detect mutations by comparing the 
sequence of the sample NOVX with the corresponding wild-type (control) sequence. 

15 Examples of sequencing reactions include those based on techniques developed by Maxim and 
Gilbert, 1977. Proc. Natl Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl. Acad. Sci. USA 
74: 5463. It is also contemplated that any of a variety of automated sequencing procedures 
can be utilized when performing the diagnostic assays (see, e.g., Naeve, et al 9 1 995. 
Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 

20 International Publication No. WO 94/1 61 01 ; Cohen, et al 9 1 996. Adv. Chromatography 36: 
127-162; and Griffin, etal, 1993. Appl. Biochem. Biotechnol. 38: 147-159). 

Other methods for detecting mutations in the NOVX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes. See, e.g., Myers, et ai 9 1985. Science 230: 1242. In general, the 

25 art technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 

hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence with potentially 
mutant RNA or DNA obtained from a tissue sample. The double-stranded duplexes are 
treated with an agent that cleaves single-stranded regions of the duplex such as which will 
exist due to basepair mismatches between the control and sample strands. For instance, 

30 RNA/DNA duplexes can be treated with Cleavage signaM protein and DNA/DNA hybrids 
treated with Si nuclease to enzymatically digesting the mismatched regions. In other 
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or 
osmium tetroxide and with piperidine in order to digest mismatched regions. After digestion 
of the mismatched regions, the resulting material is then separated by size on denaturing 



WO 02/068649 PCT/US02/02785 

polyacrylamide gels to determine the site of mutation. See, e.g., Cotton, et al, 1988. Proc. 
Natl Acad Sci. USA 85: 4397; Saleeba, etal, 1992. Methods EnzymoL 217: 286-295. In an 
embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
5 proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T 
at G/T mismatches. See, e.g., Hsu, etal, 1994. Carcinogenesis 15: 1657-1662. According to 

1 0 an exemplary embodiment, a probe based on an NOVX sequence, e.g., a wild-type NOVX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to identify 

15 mutations in NOVX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids. See, e.g., Orita, et al, 1989. Proc. Natl Acad. Sci. USA: 86: 2766; Cotton, 
1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal. Tech. Appl 9: 73-79. 
Single-stranded DNA fragments of sample and control NOVX nucleic acids will be denatured 

20 and allowed to renature. . The secondary structure of single-stranded nucleic acids varies 

according to sequence, the resulting alteration in electrophoretic mobility enables the detection 
of even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in 
which the secondary structure is more sensitive to a change in sequence. In one embodiment, 

25 the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic mobility. See, e.g., Keen, et cd. 9 1991. 
Trends Genet. 7:5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient 
30 gel electrophoresis (DGGE). See, e.g., Myers, et al, 1985. Nature 313: 495. When DGGE is 
used as the method of analysis, DNA will be modified to insure that it does not completely 
denature,'for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich 
DNA by PCR. In a further embodiment, a temperature gradient is used in place of a 
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denaturing gradient to identify differences in the mobility of control and sample DNA. See, 
e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer 
5 extension. For example, oligonucleotide primers may be prepared in which the known 

mutation is placed centrally and then hybridized to target DNA under conditions that permit 
hybridization only if a perfect match is found. See, e.g., Saiki, et al., 1986. Nature 324: 163; 
Saiki, et ah, \989.' Proc. Natl. Acad, Sci. USA 86: 6230. Such allele specific oligonucleotides 
are hybridized to PCR amplified target DNA or a number of different mutations when the 
1 0 oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target 
DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the 

1 5 molecule (so that amplification depends on differential hybridization; see, e.g., Gibbs, et al., 
1 989. Nucl. Acids Res. 1 7: 2437-2448) or at the extreme 3'-terminus of one primer where, 
under appropriate conditions, mismatch can prevent, or reduce polymerase extension {see, e.g., 
Prossner, 1 993. Tibtech. 1 1 : 238). In addition it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection. See, e.g., 

20 Gasparini, et al, 1992. Mol Cell Probes 6: 1 . It is anticipated that in certain embodiments 
amplification may also be performed using Tag ligase for amplification. See, e.g., Barany, 
1991. Proc. Natl. Acad. Sci. USA 88: 189. In such cases, ligation will occur only if there is a 
perfect match at the 3*-terminus of the 5' sequence, making it possible to detect the presence of 
a known mutation at a specific site by looking for the presence or absence of amplification. 

25 The methods described herein may be performed, for example, by utilizing 

pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving an NOVX 
gene. 

30 Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 

NOVX is expressed may be utilized in the prognostic assays described herein. However, any 
biological sample containing nucleated cells may be used, including, for example, buccal 
mucosal cells. 
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Pharmacogenetics 

Agents, or modulators that have a stimulatory or inhibitory effect on NOVX activity 
(e.g., NOVX gene expression), as identified by a screening assay described herein can be 
5 administered to individuals to treat (prophylactically or therapeutically) disorders (The 

disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 
associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias, 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 

10 disorders associated with chronic diseases and various cancers.) In conjunction with such 
treatment, the pharmacogenomics (i.e. 9 the study of the relationship between an individual's 
genotype and that individual's response to a foreign compound or drug) of the individual may 
be considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 

1 5 pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the 

selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such pharmacogenomics can further be used to 
determine appropriate dosages and therapeutic regimens. Accordingly, the activity of NOVX 
protein, expression of NOVX nucleic acid, or mutation content of NOVX genes in an 

20 individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. See 
e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol Physiol., 23: 983-985; Under, 1997. Clin. 

25 Chem., 43: 254-266. In general, two types of pharmacogenetic conditions can be 

differentiated. Genetic conditions transmitted as a single factor altering the way drugs act on 
the body (altered drug action) or genetic conditions transmitted as single factors altering the 
way the body acts on drugs (altered drug metabolism). These pharmacogenetic conditions can 
occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate 

30 dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofiirans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. The discovery of genetic 
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polymorphisms of drug metabolizing enzymes (e.g., N-acetyitransferase 2 (NAT 2) and 
cytochrome NEUROPEPTIDE Y/PEPTIDE YY RECEPTOR enzymes CYP2D6 and 
. CYP2C19) has provided an explanation as to why some patients do not obtain the expected 
drug effects or show exaggerated drug response and serious toxicity after taking the standard 
5 and safe dose of a drug. These polymorphisms are expressed in two phenotypes in the 

population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of 
PM is different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to the 
absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite 

1 0 frequently experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed metabolite 
morphine. At the other extreme are the so called ultra-rapid metabolizers who do not respond 
to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified 

15 to be due to CYP2D6 gene amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation 
content of NOVX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 

20 drug-metabolizing enzymes to the identification of an individual's drug responsiveness 
phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when 
treating a subject with an NOVX modulator, such as a modulator identified by one of the 
exemplary screening assays described herein. 

25 

Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of NOVX (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical trials. For 
30 example, the effectiveness of an agent determined by a screening assay as described herein to 
increase NOVX gene expression, protein levels, or upregulate NOVX activity, can be 
monitored in clinical trails of subjects exhibiting decreased NOVX gene expression, protein 
levels, or downregulated NOVX activity. Alternatively, the effectiveness of an agent 
determined by a screening assay to decrease NOVX gene expression, protein levels, or 
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downregulate NOVX activity, can be monitored in clinical trails of subjects exhibiting 
increased NOVX gene expression, protein levels, or upregulated NOVX activity. In such 
clinical trials, the expression or activity of NOVX and, preferably, other genes that have been 
implicated in, for example, a cellular proliferation or immune disorder can be used as a "read 
5 out" or markers of the immune responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including NOVX, that are modulated 
in cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates 
NOVX activity (e.g., identified in a screening assay as described herein) can be identified. 
Thus, to study the effect of agents on cellular proliferation disorders, for example, in a clinical 

10 trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of 

NOVX and other genes implicated in the disorder. The levels of gene expression (i.e., a gene 
expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced, by one of the methods 
as described herein, or by measuring the levels of activity of NOVX or other genes. In this 

1 5 manner, the gene expression pattern can serve as a marker, indicative of the physiological 

response of the cells to the agent. Accordingly, this response state may be determined before, 
and at various points during, treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness 
of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, 

20 peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the 

screening assays described herein) comprising the steps of (/) obtaining a pre-administration 
sample from a subject prior to administration of the agent; (//) detecting the level of expression 
of an NOVX protein, mRNA, or genomic DNA in the preadministration sample; (Hi) obtaining 
one or more post-administration samples from the subject; (rv) detecting the level of 

25 expression or activity of the NOVX protein, mRNA, or genomic DNA in the 

post-administration samples; (v) comparing the level of expression or activity of the NOVX 
protein, mRNA, or genomic DNA in the pre-administration sample with the NOVX protein, 
mRNA, or genomic DNA in the post administration sample or samples; and (vz) altering the 
administration of the agent to the subject accordingly. For example, increased administration 

30 of the agent may be desirable to increase the expression or activity of NOVX to higher levels 
than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease expression or activity of NOVX to 
lower levels than detected, Le. 9 to decrease the effectiveness of the agent. 



858 



WO 02/068649 



PCT/US02/02785 



Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
NOVX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
5 hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate cancer, 
neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 
10 idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS, 
bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 
Osteodystrophy, and other diseases, disorders and conditions of the like. 
These methods of treatment will be discussed more fully, below. 

15 Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that antagonize {i.e., reduce or inhibit) activity. Therapeutics that antagonize 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may 

20 be utilized include, but are not limited to: (i) an aforementioned peptide, or analogs, 

derivatives, fragments or homologs thereof; (ii) antibodies to an aforementioned peptide; (Hi) 
nucleic acids encoding an aforementioned peptide; (rv) administration of antisense nucleic acid 
and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the 
coding sequences of coding sequences to an aforementioned peptide) that are utilized to 

25 "knockout" endogenous function of an aforementioned peptide by homologous recombination 
(see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or antibodies 
specific to a peptide of the invention) that alter the interaction between an aforementioned 
peptide and its binding partner. 

30 Diseases and disorders that are characterized by decreased (relative to a subject not 

suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be 
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utilized include, but are not limited to, an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide and/or 
RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for 
5 RNA or peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not 
limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by 
sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) 
and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in 
10 situ hybridization, and the like). 

Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant NOVX expression or activity, by administering to the 

15 subject an agent that modulates NOVX expression or at least one NOVX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant NOVX expression or activity can 
be identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the manifestation 
of symptoms characteristic of the NOVX aberrancy, such that a disease or disorder is 

20 prevented or, alternatively, delayed in its progression. Depending upon the type of NOVX 

aberrancy, for example, an NOVX agonist or NOVX antagonist agent can be used for treating 
the subject. The appropriate agent can be determined based on screening assays described 
herein. The prophylactic methods of the invention are further discussed in the following 
subsections. 

25 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating NOVX expression 
or activity for therapeutic purposes. The modulatory method of the invention involves 
contacting a cell with an agent that modulates one or more of the activities of NOVX protein 
30 activity associated with the cell. An agent that modulates NOVX protein activity can be an 
agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate 
ligand of an NOVX protein, a peptide, an NOVX peptidomimetic, or other small molecule. In 
one embodiment, the agent stimulates one or more NOVX protein activity. Examples of such 
stimulatory agents include active NOVX protein and a nucleic acid molecule encoding NOVX 
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that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
NOVX protein activity. Examples of such inhibitory agents include antisense NOVX nucleic 
acid molecules and anti-NOVX antibodies. These modulatory methods can be performed in 
vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering 
5 the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an 
NOVX protein or nucleic acid molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described herein), or 
combination of agents that modulates (e.g., up-regulates or down-regulates) NOVX expression 
1 0 or activity. In another embodiment, the method involves administering an NOVX protein or 
nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression or 
activity. 

Stimulation of NOVX activity is desirable in situations in which NOVX is abnormally 
downregulated and/or in which increased NOVX activity is likely to have a beneficial effect. 
1 5 One example of such a situation is where a subject has a disorder characterized by aberrant 

cell proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease (e.g., preclampsia). 

Determination of the Biological Effect of the Therapeutic 

In various embodiments of the invention, suitable in vitro or in vivo assays are 
20 performed to determine the effect of a specific Therapeutic and whether its administration is 
indicated for treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 
the desired effect upon the cell type(s). Compounds for use in therapy may be tested in 
15 suitable animal model systems including, but not limited to rats, mice, chicken, cows, 
monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo 
testing, any of the animal model system known in the art may be used prior to administration 
to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

10 The NOVX nucleic acids and proteins of the invention are useful in potential 

prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 
associated cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, 
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immune disorders, hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 
associated with chronic diseases and various cancers. 

As an example, a cDNA encoding the NOVX protein of the invention may be useful in 
5 gene therapy, and the protein may be useful when administered to a subject in need thereof. 
By way of non-limiting example, the compositions of the invention will have efficacy for 
treatment of patients suffering from; metabolic disorders, diabetes, obesity, infectious disease, 
anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's 
Disease, Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various 
10 dyslipidemias. 

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of the 
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. A further use could 
be as an anti-bacterial molecule (i.e., some peptides have been found to possess anti-bacterial 
1 5 properties). These materials are further useful in the generation of antibodies, which 

immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

The invention will be further described in the following examples, which do not limit 
the scope of the invention described in the claims. 

20 

Examples 

Example 1. Identification of NOVX clones 

The novel NOVX target sequences identified in the present invention were subjected to 
the exon linking process to confirm the sequence. PCR primers were designed by starting at 

25 the most upstream sequence available, forfhe forward primer, and at the most downstream 
sequence available for the reverse primer. Table 100A shows the sequences of the PCR 
primers used for obtaining different clones. In each case, the sequence was examined, walking 
inward from the respective termini toward the coding sequence, until a suitable sequence that 
is either unique or highly selective was encountered, or, in the case of the reverse primer, until 

30 the stop codon was reached. Such primers were designed based on in silico predictions for the 
. full length cDNA, part (one or more exons) of the DNA or protein sequence of the target 
sequence, or by translated homology of the predicted exons to closely related human 
sequences from other species. These primers were then employed in PCR amplification based 
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on the following pool of human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain 
- cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, 
fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary 
gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small 
5 intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. Usually the resulting 
amplicons were gel purified, cloned and sequenced to high redundancy. The PCR product 
derived from exon linking was cloned into the pCR2.1 vector from Invitrogen. The resulting 
bacterial clone has an insert covering the entire open reading frame cloned into the pCR2. 1 
vector. Table 100B shows a list of these bacterial clones. The resulting sequences from all 
10 clones were assembled with themselves, with other fragments in CuraGen Corporation's 
database and with public ESTs. Fragments and ESTs were included as components for an 
assembly when the extent of their identity with another component of the assembly was at 
least 95% over 50 bp. In addition, sequence traces were evaluated manually and edited for 
corrections if appropriate. These procedures provide the sequence reported herein. 



Table 100A. PCR Primers for Exon Linking 



NOVX 
Clone 


Primer 1 <5' - 3') 


SEQ 

ID 

NO 


Primer 2 <5' - 3') 


SEQ 

ID 

NO 


NOVld 


TGACTATGGCCTGGAGTTTCCGT 


890 


TTATTCCAGAGATCCTTGGCAGAAGC 


891 


NOV3c 


ACCCTGGCCCAGCTCTGAGT 


892 


GAGCTCAGACGTACTCTCGGGCAG 


893 


NOV4c . 


ACCTGTCGCAATGGCTTTAATCTTTAG 


894 


ATCTCTGGCCTTGTCGGAGTCTTAC 


895 


NOVSa 


GTTTCCCCACCCCGCAGA 


896 


GAGAG GATCAAAG AA CCAGA CAGGAG 


897 


NOV5b 


ACGGCGATGACCCCCCAG 


898 


GGGGGAGGGGCTCAAACAAGA 


899 


NOV5c 


ACGGCGATGACCCCCCAG 


900 


GGGGGAGGGGCT CAAACAAGA 


901 


NOV5d 


ACGGCGATGACCCCCCAG 


902 


GGGGGAGGGGCTCAAACAAGA 


903 


NOVSe 


ACGGCGATGACCCCCCAG 


904 


GGGGGAGGGGCTCAAACAAGA 


905 


NOV6 


GTCGAACCGGGGGACCTG 


906 


TGGGGAAGGTGCTCAGCCC 


907 


NOV7a 


CAGCCAAACCCACCTCCACCAT 


908 


TTTGGCTGGCTTATATAGCGAGCTCCT 


909 


NOV7b 


CATTGCCAATTCTAAATCCATCATTTG 


910 


TCTTCTAAAGCACAAATAACACCTCCA 


911 


NOV7c 


ATGGCCGAGGCCGCGGAG 


912 


AGCTTCATTTCATTCTTTTTGCAACATCTG 


913 


N0V8 


GGGGACCATGGGGAGCGATC 


914 


CGAGAGGTTTTCTAGGCAGATTTGGAGC 


915 


NO VI 5b 


ATGGCAGCAGAAAACCATTCTTTT 


916 


TCTGTTCATAAGAATGTT CTTTTCCCTAGC 


917 I 


N0V18b 


AGATGGATGGAACCAATGGCAG 


'918 


AGTGGCCCTGGATGGAAGTGA 


919 


NOV22C 


CACTCCAACAGTTAATGCTTCCCTTG 


920 


GATGGTCTCGATCTCCTGACCTCTT 


921 


NOV24b 


ATGGCCGAGGCCGCGGAG 


922 


AGCTTCATTTCATTCTTTTTGCAACATCTG 


923 


NOV26b 


GTTCCTGCTGTCTGGACTTTTTCTGT 


924 


TTTTTGG AG AAAG CTG CAAAAGTTTAT 


925 


NOV27b 


CTCAGTCCCTCGGGCTCATACCTA 


926 


GGACT CAGAGCTCCTG C CTTTCTGT 


927 


NOV27C 


CTCAGTCCCTCGGGCTCATACCTA 


928 


CTCAGAGCTCCTGCCTTTCTGTCC 


929 


N0V29b 


GTAGCCACAAGACCGGGTCCG 


93 0 


CCCTGGCCTCTTGGAACTGCTTGAT 


931 


NOV32 


CTTGGAGGCTGCAGGTCCTGGAC 


932 


AGGAGCATCCTTCATCCCACTAGAGGT 


933 


N0V33 


GCTGGATTGGGTGATCTCTCAGAGC 


934 


CTCTTACTCCTGGCAAGCCCTGC 


935 


N0V36b 


AT GCAAGAAACAATTTTTTTTTTTG AGA 
CG 


936 


TTTTACCCATTCACCAGTTTAAGGACG 


937 


N0V38 


CAATGACCTCCTCATTGCTTCTGG 


938 


CAAAGCCCCAGGTCCTCTTGCTAG 


939 


NOV39a 


CCATCCGAGGCTCCTGAACC 


940 


CAAAGCCCCAGGTCCTCTTGC 


941 


N0V39b 


GTTCTTGAAC CAGGGCCATTCAC 


942 


CCTGGGAAACTTCATCTTGGTCTCTT 


943 


NOV42b 


ATGGA'CGGTGAGGCAGTCC 


944 


CCACACCCTGGCCCATG 


945 


NOV42C 


ATGGACGGTGAGGCAGTCC 


946 


CCACACCCTGGCCCATG 


947 


N0V42d 


CTGGAGGATGAAGGAAGCAGAGATG 


948 


CCAGAGAACAAGCAGAGCTCAGAGG 


949 


N0V46b 


GTTTCTGAGCATGGATCCAACCA 


950 


AAAGGGCAGAGGCTCTTCCTCAC 


951 


N0V48b 


GCTACCCTCT CTGCTGG CTACCTAAC 


952 


TTGATTTTCACCACCTCCATTTGTTCT 


953 


NOVSOb 


AAGATGTCGCAGCCCAAGAAAAG 


954 


TG CTTTG GGAGGT AGCTGGGA 


955 
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NOV51 


CTGGP CT A ATGA ATGT CTfTG AGC 


956 


TGATPATnnafinA ATA ATPTAftTATfiPPTT A 
Q 


OCT 


NOV52 


TCCCTGATGTCCAGCTCTGGCT 


958 


ATAGACTAACTGCACCCACAGGCCTCT 


959 


NOV5 6 b 


AT GG CGAAGATTG AGAAAAACG CT 


960 


CTPTPAGATPTPPAOGPAOAAnTTPAn 


961 


NOV58b 


AGTCTTGCCTTCTTTTGAGCCTAAGTC 


962 


CACATTCAACATATCTGAGGCTGTGG 


963 






964 


x ml x i /iHHbu J, LiuiobL L i. LbiiAu 1 w 




NOV61 


PAGPTGTfSPPPTPATPPTTftTriPPTflPT 
ACGTCC 


QCC 


PTPfia flP TTf3f2TP A PTf3T rZATTPPP A PPl^ TP 

at atpp»t htppp 


OCT 


NOV63 


GGATCCAGCTACCCGATCTGGTGGCTGA 
CGGGCAGC 


968 


GTPGAPPPPPTTPPARPTPTAPAPPTPPTPA 

CG 




NOV68 


GGCACGCTCCCTCTGGCT 


970 


TTP-APTGTGGGPrGPPTPiG 


971 


NOV$9b 


ATGGAAAAAGCATTGAAAATTGACACA 


972 


TTAAAACAGCATAGTTAACCXIAAAGTCAGTA 
GTG 


973 


NOV7 0a 


TATGCTGT CTATGCTGAATCTTCTAATC 
TTGTCT 


974 


TTTAAGAATGTTGAATATTGGCCCCCAC 


975 


NOV76b 


AC CATTACAT CAT CGT GG C AAATT AAA 


976 


GAAGT CACAAGTGTCTTT C TU'CTCAGGA 


977 


NOV81b 


GTCATGCGCTGCCCCAAGT 


978 


CCAATGAGAGTCAGCACTGGAGC 


979 


NOV87b 


TCTCTCATGGCCCCCAAAGAC 


980 


AGTCAGTGCGGCGGGAAGA 


981 


NOV96a 


GGTGCTCTCAGCGTTCTTCCAGTC 


982 


CTAGTGCTTCTGTTACAAGGTCCTGGG 


983 


NOV96& 


AGCTGGATTGACAACTTTGTAATGGAG 


984 


CTCAGTCG TG CTG CTAGTAGGGGT 


985 


N0V96e 


TG AAGCTCAC CAGGAG GAAGAAG 


986 


CTCAGTCGTGCTGCTAGTAGGGGT 


987 


NOV96f 


TCCCATGACCTGCCACTTCC 


988 1 


CGCTACCTGCAGCCGCA 


989 


NOV98 


CCGGCCCCGTGTGTGGCA 


990 


GGGGCTCTGGTGCCAGCTCATG 


991 


NOV99 


CAGCCAAACCCACCTCCACCAT 


992 


TTTGGCTGGCTTATATAGCGAGCTCCT 


993 



Physical clone: Exons were predicted by homology and the intron/exon boundaries 
were determined using standard genetic rules. Exons were further selected and refined by 
means of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and 
5 BlastN) searches, and, in some instances, GeneScan and Grail. Expressed sequences from both 
public and proprietary databases were also added when available to further define and 
complete the gene sequence. The Dl^A sequence was then manually corrected for apparent 
inconsistencies thereby obtaining the sequences encoding the full-length protein. 



Table 100B. Physical Clones for PCR products 



NOVX Clone 


Bacterial Clone 


NOVla 


Genomic clone: AGO 8 013 7 


NOVlb 


Genomic clone: GMAC036188 


NOVld 


Physical clone: GMAC080137 A.6985 89.M14 


NOV2 


Physical clone: AC023194 


N0V3a 


Genomic clone: AL1363 83 


N0V3C 


Physical clone: GMAL136383 A. 698589. C6 


N0V4a 


Genomic clone: AC046164 


N0V4C 


Physical clone: GMAC046164 A. 698589. A2 


NOV5C 


Physical clone: 155643499 


NOV5d 


Physical clone: 155643499 


NOV5e 


Physical clone: 127939 : :153729589 .698590 ,P9 


NOV7a 


Physical clone: 153 634 912 


NOV7b 


Physical clone: 127994 :: 138896306 . 698587 .E16 


N0V7C 


Physical clone: 127557 : :CG564550-01 . 698587 .F19 


N0V9a 


Physical clone: 152568459, AC005961.1, AC068256.2, AI308124, 
AI307658 


NOV10 


Physical Clone: 126218 : :CG55964-01. 698509. G10 


N0V13b 


Physical clone: 126388 : :CG56021-01. 698539. G15 


NOVlSb 


Physical clone: 126694 : :CG56065-01. 698561. C17 


N0V16a 


Physical clone: 126696 : :CG56067-01 . 698561. All 


NOV16b 


Physical clone: 128769 : :GMAC084434 A. 698655. G14 


NO VI 7b 


Physical clone: 128198 : :GMba430il5 A. 698589. F10 


N0V17d 


Physical clone: 105889: :sggc draft ba430il5 20000823 . 698368 .G10 


NOV18b 


Physical clone: 128205 : :GMba4 30 il5 D. 698590. E4 
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NO VI 9b 


Physical clone: 128207 ; : GMba430il5 E. 698590. C4 


NOV2la 


Physical clone: 153485226, 145329583 


NOV2lb 


Physical Clone: 50119 :: 86375666 . 244185 . F7 


NOV22a 


Genomic clone: AC011522.6 


NOV22C 


Physical clone: 161710747. 698893 .K3 


NOV23a 


Physical clone: 140117553 146712128 146712112 148412737 


NOV23C 


Physical clone: 14 0117553 ~~ 


NOV24b 


Physical clone: 127557 : :CG564550-0l. 698591. Nil 


NOV25 


Physical clone: 151537975 and 128978463 


NOV26a 


Physical clone: 127998335 


NOV26b 


Physical Clone: 127561 : :CG5646l-01 . 698589 .G7 


NOV27a 


Physical clone: 111787393, HSAJ9617 


NOV27b 


Physical clone: 111787393 EXT. 698587 . K18 


NOV27C 


Physical clone: 167695055 170842341 170842333 


NOV28 


Physical clone: AC004832 


NOV29a 


Physical clone: AC004832 


NOV29b 


Physical clone: 112824 : :COR100399281 . 698230 .B12 


NOV29C 


Physical clone: AC004832 


NOV30 


Genomic clone: AC004832 


NOV31 


Genomic clone: 94329210 


NOV33 


Physical clone: 153778095, 138978176, 146713055, 105100551, 
153777948 


NOV34 


Physical clone: 125858 : :GMAC026083 E. 698508. J15 


NOV35 


Physical clone: 114740 : :AC0117ll . 698329 . 110 


NOV36a 


Physical clone: 152568436, AL132780 


NOV36b 


Physical clone: 152568436 134511756 


NOV37 


Physical clone: 151557368, 138195002, 152762569, 152768078 


NOV36 


Physical clone: 107207 : :AC061707 . 698315 .F14 


NOV39a 


Physical clone: 127119: :AC061707. 698564 .H20 


NOV3 9b 


Physical clone: 128110: : ADENOSINE A3 RECEPTOR. 698657 H22 


NOV40 


Physical clone: AC068471, AC068471, AV655524, T67857 


NOV41a J 


Physical clone: AC007278, AW242630.1 xn01f05.xl 


NOV42a 


Physical clone: AC007395, 153103275, 153103263, 153103260 


NOV42b 


Physical clone: 153103275 153103263 153103260 153103539 153103266 
152189065 


NOV42C 


Physical clone: 54701683 EXT . 698433 . J23 


NOV42d 


Physical clone: AC0073 95 A. 6 98587 _M17 


NOV43 


Genomic clone: AC021773 


NOV44 


Physical clone: AC023654, 78743598 


NOV45 


Genomic clone: AC023078 


NOV46a 


Genomic clone: AC023 078 


NOV4 6b 


Physical clone: 128292 : :AC023078 A. 698657. G13 


NOV46C 


Genomic clone: AC023654 


NOV46d 


Physical clone; 151667972 


NOV47 


Genomic clone: AF152363 


NOV48a 


Genomic clone: AC012510.5 


NOV46b 


Physical clone: 128669 : :AC012510 5 final . 698656 . J20 


NOV49 


Genomic clone: AC011492 


NOV50a 


Physical clone: 153778754, 122656699 


NOVSOb 


Physical clone: AK001421 A. 698657 . J10 


NOV51 


Physical clone: 126131 : :CG55922-01 . 698509 .09 




Physical clone: 153623113, 152186811, 148441423, 148441418, 
152186815, 152209564, 126066491, 129293170, 126630256, 124459512 


NOV53 


Physical clone: 151222559 


NOV54a 


Physical clone: 153512063 


NOV55 


Genomic clone: AL138816 . 12 , AL158192.12 


NOV56a 


Genomic clone: AC019100.4 


NOV57 


Physical clone: 50222151, 150222148 


NOV54b 


Physical clone: 164698940 


NOV56b 


Physical clone: 55048 : :nh0443k08 A. 698002. M9 



Example 2. Quantitative expression analysis of clones in various cells and tissues 
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The quantitative expression of various clones was assessed using microliter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an Applied 
Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence Detection System. 
5 Various collections of samples are assembled on the plates, and referred to as Panel 1 

(containing normal tissues and cancer cell lines), Panel 2 (containing samples derived from 
tissues from normal and cancer sources), Panel 3 (containing cancer cell lines), Panel 4 
(containing cells and cell lines from normal tissues and cells related to inflammatory 
conditions), Panel 5D/5I (containing human tissues and cell lines with an emphasis on 

10 metabolic diseases), AI_comprehensive_panel (containing normal tissue and samples from 
autoimmune diseases), Panel CNSD.01 (containing central nervous system samples from 
normal and diseased brains) and CNS_neurodegeneration_panel (containing samples from 
normal and Alzheimer's diseased brains). 

RNA integrity from all samples is controlled for quality by visual assessment of 

15 agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio as a 
guide (2:1 to 2.5:1 28s: 18s) and the absence of low molecular weight RNAs that would be 
indicative of degradation products. Samples are controlled against genomic DNA 
contamination by RTQ PCR reactions run in the absence of reverse transcriptase using probe 
and primer sets designed to amplify across the span of a single exon. 

20 First, the RNA samples were normalized to reference nucleic acids such as 

constitutively expressed genes (for example, p-actin and GAPDH). Normalized RNA (5 ul) 
was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master Mix 
Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers according to 
the manufacturer's instructions. 

25 In other cases, non-normalized RNA samples were converted to single strand cDNA 

(sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 1 8064-147) and random 
hexamers according to the manufacturer's instructions. Reactions containing up to 10 fig of 
total RNA were performed in a volume of 20 ul and incubated for 60 minutes at 42°C. This 
reaction can be scaled up to 50 ng of total RNA in a final volume of 100 pi. sscDNA samples 

30 are then normalized to reference nucleic acids as described previously, using IX TaqMan® 
Universal Master mix (Applied Biosystems; catalog No. 4324020), following the 
manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 
Primer Express Software package (version I for Apple Computer's Macintosh Power PC) or a 
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similar algorithm using the target sequence as input. Default settings were used for reaction 
conditions and the following parameters were set before selecting primers: primer 
concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer optimal 
Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe Tm must be 
5 10°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and primers selected 
(see below) were synthesized by Synthegen (Houston, TX, USA). Probes were double piurified 
by HPLC to remove uncoupled dye and evaluated by mass spectroscopy to verify coupling of 
reporter and quencher dyes to the 5' and 3' ends of the probe, respectively. Their final ' 
concentrations were: forward and reverse primers, 900nM each, and probe, 200nM. 

10 PCR conditions: When working with RNA samples, normalized RNA from each tissue 

and each cell line was spotted in each well of either a 96 well or a 384-well PCR plate 
(Applied Biosystems). PCR cocktails included either a single gene specific probe and primers 
set, or two multiplexed probe and primers sets (a set specific for the target clone and another 
gene-specific set multiplexed with the target probe). PCR reactions were set up using 

1 5 TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 43 1 3803) 
following manufacturer's instructions. Reverse transcription was performed at 48°C for 30 
minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 40 cycles of 
95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values (cycle at which a 
given sample crosses a threshold level of fluorescence) using a log scale, with the difference in 

20 RNA concentration between a given sample and the sample with the lowest CT value being 
represented as 2 to the power of delta CT. The percent relative expression is then obtained by 
taking the reciprocal of this RNA difference and multiplying by 1 00. 

When working with sscDNA samples, normalized sscDNA was used as described 
previously for RNA samples. PCR reactions containing one or two sets of probe and primers 

25 were set up as described previously, using IX TaqMan® Universal Master mix (Applied 
Biosystems; catalog No. 4324020), following the manufacturer's instructions. PCR 
amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 seconds, 
60°C for 1 minute. Results were analyzed and processed as described previously. 
Panels 1, LI, L2, and 13D 

30 The plates for Panels 1, 1.1, 1 .2 and 1. 3D include 2 control wells (genomic DNA 

control and chemistry control) and 94 wells containing cDNA from various samples. The 
samples in these panels are broken into 2 classes: samples derived from cultured cell lines and 
samples derived from primary normal tissues. The cell lines are derived from cancers of the 
following types: lung cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS 
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cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and 
pancreatic cancer. Cell lines used in these panels are widely available through the American 
Type Culture Collection (ATCC), a repository for cultured cell lines, and were cultured using 
the conditions recommended by the ATCC. The normal tissues found on these panels are 
5 comprised of samples derived from all major organ systems from single adult individuals or 
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal 
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult 
lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas, 
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, 
10 colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. 

j 

In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used: 
ca. = carcinoma, 

* = established from metastasis, 

met = metastasis, 
15 s cell var = small cell variant, 

non-s = non-sm = non-small, 

squam = squamous, 

pi. eff = pi effusion = pleural effusion, 

glio = glioma, 
20 astro = astrocytoma, and 

neuro = neuroblastoma. 

General screening panel v 1.4 

The plates for Panel 1 .4 include 2 control wells (genomic DNA control and chemistry 
control) and 94 wells containing cDNA from various samples. The samples in Panel 1 .4 are 

25 broken into 2 classes: samples derived from cultured cell lines and samples derived from 
primary normal tissues. The cell lines are derived from cancers of the following types: lung 
cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS cancer, squamous cell 
carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and pancreatic cancer. 
Cell lines used in Panel 1.4 are widely available through the American Type Culture 

30 Collection (ATCC), a repository for cultured cell lines, and were cultured using the conditions 
recommended by the ATCC. The normal tissues found on Panel 1.4 are comprised of pools of 
samples derived from all major organ systems from 2 to 5 different adult individuals or 
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal 
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney; adult liver, fetal liver, adult 
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lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas, 
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, 
colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. 
Abbreviations are as described for Panels 1, 1.1, 1.2, and 1.3D. 
5 Panels 2D and 2.2 

The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test samples 
composed of RNA or cDNA isolated from human tissue procured by surgeons working in 
close cooperation with the National Cancer Institute's Cooperative Human Tissue Network 
(CHTN) or the National Disease Research Initiative (NDRI). The tissues are derived from 

10 human malignancies and in cases where indicated many malignant tissues have "matched 
margins" obtained from noncancerous tissue just adjacent to the tumor. These are termed 
normal adjacent tissues and are denoted "NAT" in the results below. The tumor tissue and the 
"matched margins" are evaluated by two independent pathologists (the surgical pathologists 
and again by a pathologist at NDRI or CHTN). This analysis provides a gross 

1 5 histopathological assessment of tumor differentiation grade. Moreover, most samples include 
the original surgical pathology report that provides information regarding the clinical stage of 
the patient. These matched margins are taken from the tissue surrounding (i.e. immediately 
proximal) to the zone of surgery (designated "NAT", for normal adjacent tissue, in Table RR). 
In addition, RNA and cDNA samples were obtained from various human tissues derived from 

20 autopsies performed on elderly people or sudden death victims (accidents, etc.). These tissues 
were ascertained to be free of disease and were purchased from various commercial sources 
such as Clontech (Palo Alto, CA), Research Genetics, and Invitrogen. 
Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA saimples and two control samples. 

25 Specifically, 92 of these samples are derived from cultured human cancer cell lines, 2 samples 
of human primary cerebellar tissue and 2 controls. The human cell lines are generally obtained 
from ATCC (American Type Culture Collection), NCI or the German tumor cell bank and fall 
into the following tissue groups: Squamous cell carcinoma of the tongue, breast cancer, 
prostate cancer, melanoma, epidermoid carcinoma, sarcomas, bladder carcinomas, pancreatic 

30 cancers, kidney cancers, leukemias/lymphomas, ovarian/uterine/cervical, gastric, colon, lung 
and CNS cancer cell lines. In addition, there are two independent samples of cerebellum. 
These cells are all cultured under standard recommended conditions and RNA extracted using 
the standard procedures. The cell lines in panel 3D and 1.3D are of the most common cell lines 
used in the scientific literature. 
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Panels 4D, 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human cell 
lines or tissues related to inflammatory conditions. Total RNA from control normal tissues 
5 such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney (Clontech) was 

employed. Total RNA from liver tissue from cirrhosis patients and kidney from lupus patients 
was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). Intestinal tissue for 
RNA preparation from patients diagnosed as having Crohn's disease and ulcerative colitis was 
obtained from the National Disease Research Interchange (NDRI) (Philadelphia, PA). 

10 Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle cells, 

small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human 
umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and. 
grown in the media supplied for these cell types by Clonetics. These primary cell types were 

1 5 activated with various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as 
indicated. The following cytokines were used; IL-1 beta at approximately l-5ng/ml, TNF 
alpha at approximately 5-10ng/ml, IFN gamma at approximately 20-50ng/ml, IL-4 at 
approximately 5-10ng/ml, IL-9 at approximately 5-10ng/ml, IL-13 at approximately 5- 
lOng/ml. Endothelial cells were sometimes starved for various times by culture in the basal 

20 media from Clonetics with 0. 1 % serum. 

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
using Ficoll. LAK cells were prepared from these cells by culture in DMEM 5% FCS 
(Hyclone), 100^M non essential amino acids (Gibco/Life Technologies, Rockville, MD), 
ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO* 5 M (Gibco), and lOmM Hepes 

25 (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 10-20ng/ml PMA 
and l-2ng/ml ionomycin, IL-1 2 at 5-lOng/ml, IFN gamma at 20-50ng/ml and IL-1 8 at 5- 
lOng/ml for 6 hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 
5% FCS (Hyclone), IOOjiM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco) with PHA 

30 (phytohemagglutinin) or PWM (pokeweed mitogen) at approximately 5ng/ml. Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll 
and mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 
2xl0 6 cells/ml in DMEM 5% FCS (Hyclone), IOOjiM non essential amino acids (Gibco), ImM 
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sodium pyruvate (Gibco), mercaptoethanol (5.5x1 0* 5 M) (Gibco), and lOmM Hepes (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1-7 days for 
RNA preparation. 

Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve VS 
5 selection columns and a Vario Magnet according to the manufacturer's instructions. 

Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum 
(FCS) (Hyclone, Logan, UT), lOOuM non essential amino acids (Gibco), ImM sodium 
pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco), 50ng/ml 
GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of monocytes 

10 for 5-7 days in DMEM 5% FCS (Hyclone), 1 OO^M non essential amino acids (Gibco), 1 mM 
sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), lOmM Hepes (Gibco) and 
10% AB Human Serum or MCSF at approximately 50ng/ml. Monocytes, macrophages and 
dendritic cells were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 
1 OOng/ml. Dendritic cells were also stimulated with anti-CD40 monoclonal antibody 

15 (Pharmingen) at lO^g/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns 
and a Vario Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 
lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, CD14 and CD1 9 

20 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive selection. CD45RO 

beads were then used to isolate the CD45RO CD4 lymphocytes with the remaining cells being 
CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and CD8 lymphocytes were 
placed in DMEM 5% FCS (Hyclone), IOOjiM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), and lOmM Hepes (Gibco) and 

25 plated at 10 6 cells/ml onto Falcon 6 well tissue culture plates that had been coated overnight 
with O.Sjig/ml anti-CD28 (Pharmingen) and 3ug/ml anti-CD3 (OKT3, ATCC) in PBS. After 6 
and 24 hours, the cells were harvested for RNA preparation. To prepare chronically activated 
CD8 lymphocytes, we activated the isolated CD8 lymphocytes for 4 days on anti-CD28 and 
anti-CD3 coated plates and then harvested the cells and expanded them in DMEM 5% FCS 

30 (Hyclone), lOOjiM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 

mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2. The expanded CD8 
cells were then activated again with plate bound anti-CD3 and anti-CD28 for 4 days and 
expanded as before. RNA was isolated 6 and 24 hours after the second activation and after 4 
days of the second expansion culture. The isolated NK cells were cultured in DMEM 5% FCS 
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(Hyclone), 100(iM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol 5.5x1 0~ 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2 for 4-6 days before 
RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with sterile 
5 dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), lOOfiM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM 
Hepes (Gibco). To activate the cells, we used PWM at Sjig/ml or anti-CD40 (Pharmingen) at 
approximately lOjig/ml and IL-4 at 5-10ng/ml. Cells were harvested for RNA preparation at 

10 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates 
were coated overnight with lOjig/ml anti-CD28 (Pharmingen) and 2|ug/ml OKT3 (ATCC), and 
then washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, 
German Town, MD) were cultured at 10 5 -10 6 ceIIs/mI in DMEM 5% FCS (Hyclone), 100|iM 

1 5 non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). IL-12 (5ng/ml) and anti-IL4 (1 ng/ml) 
were used to direct to Thl, while IL-4 (5ng/ml) and anti-IFN gamma (1 ng/ml) were used to 
direct to Th2 and IL-10 at 5ng/ml was used to direct to TrL After 4-5 days, the activated Thl, 
Th2 and Trl lymphocytes were washed once in DMEM and expanded for 4-7 days in DMEM 

20 5% FCS (Hyclone), 1 OOjiM non essential amino acids (Gibco), 1 mM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (Ing/ml). 
Following this, the activated Thl, Th2 and Trl lymphocytes were re-stimulated for 5 days 
with anti-CD28/OKT3 and cytokines as described above, but with the addition of anti-CD95L 
(1 \xg/ml) to prevent apoptosis. After 4-5 days, the Thl, TH2 and Trl lymphocytes were 

25 washed and then expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes 
were maintained in this way for a maximum of three cycles. RNA was prepared from primary 
and secondary Thl, Th2 and Trl after 6 and 24 hours following the second and third 
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second and 
third expansion cultures in Interleukin 2. 

30 The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, 

KU-812. EOL cells were further differentiated by culture in O.lmM dbcAMP at 5xl0 5 cells/ml 
for 8 days, changing the media every 3 days and adjusting the cell concentration to 
5xl0 5 cells/ml. For the culture of these cells, we used DMEM or RPMI (as recommended by 
the ATCC), with the addition of 5% FCS (Hyclone), lOO^M non essential amino acids 
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(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xI0" 5 M (Gibco), lOmM Hepes 
(Gibco). RNA was either prepared from resting cells or cells activated with PMA at lOng/ml 
and ionomycin at 1 ng/ml for 6 and 14 hours. Keratinocyte line CCD1 06 and an airway 
epithelial tumor line NCI-H292 were also obtained from the ATCC. Both were cultured in 
5 DMEM 5% FCS (Hyclone), IOOjiM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco). CCD1 106 cells were 
activated for 6 and 14 hours with approximately 5 ng/ml TNF alpha and lng/ml IL-1 beta, 
while NCI-H292 cells were activated for 6 and 14 hours with the following cytokines: 5ng/ml 
IL-4, 5ng/ml IL-9, 5ng/ml IL-13 and 25ng/ml IFN gamma. 

10 For these cell lines and blood cells, RNA was prepared by lysing approximately 

10 7 cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane 
(Molecular Research Corporation) was added to the RNA sample, vortexed and after 10 
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The 
aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 

15 isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down at 
9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet was 
redissolved in 300jil of RNAse-free water and 35^1 buffer (Promega) 5fil DTT, 7\xl RNAsin 
and 8nl DNAse were added. The tube was incubated at 37°C for 30 minutes to remove 
contaminating genomic DNA, extracted once with phenol. chloroform and re-precipitated with 

20 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The RNA was spun down 
and placed in RNAse free water. RNA was stored at -80°C. 
Alcomprehensive panel_vl.0 

The plates for Al^comprehensive panel_vl.O include two control wells and 89 test 

samples comprised of cDNA isolated from surgical and postmortem human tissues obtained 
25 from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was extracted from 

tissue samples from the Backus Hospital in the Facility at CuraGen. Total RNA from other 

tissues was obtained from Clinomics. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained from 

patients undergoing total knee or hip replacement surgery at the Backus Hospital. Tissue 
30 samples were immediately snap frozen in liquid nitrogen to ensure that isolated RNA was of 

optimal quality and not degraded. Additional samples of osteoarthritis and rheumatoid arthritis 

joint tissues were obtained from Clinomics. Normal control tissues were supplied by 

Clinomics and were obtained during autopsy of trauma victims. 
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Surgical specimens of psoriatic tissues and adjacent matched tissues were provided as 
total RNA by Clinomics. Two male and two female patients were selected between the ages of 
25 and 47. None of the patients were taking prescription drugs at the time samples were 
isolated. 

5 Surgical specimens of diseased colon from patients with ulcerative colitis and Crohns 

disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue from three 
female and three male Crohn's patients between the ages of 41-69 were used. Two patients 
were not on prescription medication while the others were taking dexamethasone, 
phenobarbital, or tylenol. Ulcerative colitis tissue was from three male and four female 
10 patients. Four of the patients were taking lebvid and two were on phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or with 
emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients ranged in 
age from 40-70 and all were smokers, this age range was chosen to focus on patients with 
cigarette-linked emphysema and to avoid those patients with alpha- lanti-trypsin deficiencies. 
1 5 Asthma patients ranged in age from 36-75, and excluded smokers to prevent those patients that 
could also have COPD. COPD patients ranged in age from 35-80 and included both smokers 
and non-smokers. Most patients were taking corticosteroids, and bronchodilators. 

In the labels employed to identify tissues in the AI_comprehensive panel jvl .0 panel, 
the following abbreviations are used: 
20 ~Al = Autoimmunity 

Syn = Synovial 
Normal = No apparent disease 
Rep22 /Rep20 = individual patients 
RA = Rheumatoid arthritis 
25 Backus = From Backus Hospital 

OA = Osteoarthritis 
(SS) (BA) (MF) = Individual patients 
Adj = Adjacent tissue 
Match control = adjacent tissues 
30 -M-Male 
-F = Female 

COPD = Chronic obstructive pulmonary disease 
Panels 5D and SI 
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The plates for Panel 5D and 51 include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. Metabolic 
tissues were obtained from patients enrolled in the Gestational Diabetes study. Cells were 
obtained during different stages in the differentiation of adipocytes from human mesenchymal 
5 stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18-40 years), otherwise healthy 
women with and without gestational diabetes undergoing routine (elective) Caesarean section. 
After delivery of the infant, when the surgical incisions were being repaired/closed, the 
obstetrician removed a small sample (<1 cc) of the exposed metabolic tissues during the 

10 closure of each surgical level. The biopsy material was rinsed in sterile saline, blotted and fast 
frozen within 5 minutes from the time of removal. The tissue was then flash frozen in liquid 
nitrogen and stored, individually, in sterile screw-top tubes and kept on dry ice for shipment to 
or to be picked up by CuraGen. The metabolic tissues of interest include uterine wall (smooth 
muscle), visceral adipose, skeletal muscle (rectus) and subcutaneous adipose. Patient 

1 5 descriptions are as follows : 



Patient 2 Diabetic Hispanic, overweight, not on insulin 

Patient 7-9 Nondiabetic Caucasian and obese (BMI>30) 

Patient 10 Diabetic Hispanic, overweight, on insulin 

20 Patient 1 1 Nondiabetic African American and overweight 

Patient 12 Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from Osirus 
(a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had only two 
25 replicates. Scientists at Clonetics isolated, grew and differentiated human mesenchymal stem 
cells (HuMSCs) for CuraGen based on the published protocol found in Mark F. Pittenger, et 
ah, Multilineage Potential of Adult Human Mesenchymal Stem Cells Science Apr 2 1999: 
143-147. Clonetics provided Trizol lysates or frozen pellets suitable for mRNA isolation and 
ds cDNA production. A general description of each donor is as follows: 
30 Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 

Donor 2 and 3 AM: Adipose, AdiposeMidway Differentiated 

Donor 2 and 3 AD: Adipose, Adipose Differentiated 

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
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kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver HepG2 
cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These cells are all 
cultured under standard recommended conditions and RNA extracted using the standard 
procedures. All samples were processed at CuraGen to produce single stranded cDNA. 
5 Panel 51 contains all samples previously described with the addition of pancreatic islets 

from a 58 year old female patient obtained from the Diabetes Research Institute at the 
University of Miami School of Medicine. Islet tissue was processed to total RNA at an outside 
source and delivered to CuraGen for addition to panel 51. 

In the labels employed to identify tissues in the 5D and 51 panels, the following 
1 0 abbreviations are used: 

GO Adipose = Greater Omentum Adipose 
SK = Skeletal Muscle 
UT = Uterus 
PL = Placenta 
1 5 AD = Adipose Differentiated 

AM = Adipose Midway Differentiated 
U = Undifferentiated Stem Cells 
Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
20 comprised of cDNA isolated from postmortem human brain tissue obtained from the Harvard 
Brain Tissue Resource Center, Brains are removed from calvaria of donors between 4 and 24 
hours after death, sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. 
All brains are sectioned and examined by neuropathologists to confirm diagnoses with clear 
associated neuropathology. 
25 Disease diagnoses are taken from patient records. The panel contains two brains from 

each of the following diagnoses: Alzheimer's disease, Parkinson's disease, Huntington's 
disease, Progressive Supernuclear Palsy, Depression, and "Normal controls". Within each of 
these brains, the following regions are represented: cingulate gyrus, temporal pole, globus 
palladus, substantia nigra, Brodman Area 4 (primary motor strip), Brodman Area 7 (parietal 
30 cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 1 7 (occipital cortex). Not all 
brain regions are represented in all cases; e.g., Huntington's disease is characterized in part by 
neurodegeneration in the globus palladus, thus this region is impossible to obtain from 
confirmed Huntington's cases. Likewise Parkinson's disease is characterized by degeneration 
of the substantia nigra making this region more difficult to obtain. Normal control brains were 
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examined for neuropathology and found to be free of any pathology consistent with 
neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following abbreviations 
are used: 

5 PSP = Progressive supranuclear palsy 

Sub Nigra = Substantia nigra 
Glob Palladus= Globus palladus 
Temp Pole = Temporal pole 
Cing Gyr = Cingulate gyrus 
10 B A 4 = Brodman Area 4 

Panel CNS_Neurodegeneration_V1.0 

The plates for Panel CNS_Neurodegeneration_V L0 include two control wells and 47 
test samples comprised of cDNA isolated from postmortem human brain tissue obtained from 
the Harvard Brain Tissue Resource Center (McLean Hospital) and the Human Brain and 

15 Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare System). Brains are 
removed from calvaria of donors between 4 and 24 hours after death, sectioned by 
neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are sectioned and 
examined by neuropathologists to confirm diagnoses with clear associated neuropathology. 
Disease diagnoses are taken from patient records. The panel contains six brains from 

20 Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who showed no 
evidence of dementia prior to death. The eight normal control brains are divided into two 
categories: Controls with no dementia and no Alzheimer's like pathology (Controls) and 
controls with no dementia but evidence of severe Alzheimer's like pathology, (specifically 
senile plaque load rated as level 3 on a scale of 0-3; 0 = no evidence of plaques, 3 = severe AD 

25 senile plaque load). Within each of these brains, the following regions are represented: 
hippocampus, temporal cortex (Brodman Area 21), parietal cortex (Brodman area 7), and 
occipital cortex (Brodman area 17). These regions were chosen to encompass all levels of 
neurodegeneration in AD. The hippocampus is a region of early and severe neuronal loss in 
AD; the temporal cortex is known to show neurodegeneration in AD after the hippocampus; 

30 the parietal cortex shows moderate neuronal death in the late stages of the disease; the 

occipital cortex is spared in AD and therefore acts as a "control" region within AD patients. 
Not all brain regions are represented in all cases. 

In the labels employed to identify tissues in the CNSJNeurodegeneration_V1.0 panel, 
the following abbreviations are used: 
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AD = Alzheimer's disease brain; patient was demented and showed AD-like 
pathology upon autopsy 

Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; pateint not demented but showing sever AD-like 
5 pathology 

SupTemporal Ctx = Superior Temporal Cortex 
Inf Temporal Ctx = Inferior Temporal Cortex 

NOV9a and NOV9b 

Expression of gene NOV9a and variant NOV9b was assessed using the primer-probe 
10 sets Ag2930, Ag4297 and Ag573, described in Tables AA, AB and AC. Results of the RTQ- 
PCR runs are shown in Tables AD 9 AE, AF, AG, and AH. Please note that the probe and 
primer set Ag4297 do not match the NOV9b variant. 

Table AA. Probe Name Ag2930 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -attggattttccaactccatct-3 ' 


22 


1213 


994 


Probe 


TET-5 1 -tcccattgtctatgcatttatgaatga- 
3 1 -TAMRA 


27 


1239 


995 


Reverse 


5 1 -tgcaataacaaactgcagacaa-3 ' 


22 


1285 


996 



1 5 Table AB. Probe Name Ag4297 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tctacaccaccttcatccttgt-3 1 


22 


1007 


997 


Probe 


TET-5 ' -ctgcctcttatggagaagaaacgagctg- 
3 ■ -TAMRA 


28 


1042 


998 


Reverse 


5 » -caccactgtcaccatcataatg-3 ' 


22 


1071 


999 



Table AC. Probe Name Ag573 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tgtccagtctaccgctgttgtg-3 1 


22 


738 


1000 


Probe 


TET-5 1 - 

agaaatcctcactatgacctgcattgctgtg-3 ' - 
TAMRA 


31 


762 


1001 


Reverse 


5 1 -cacaagtccctggtgccttt-3 1 


20. 


794 


1002 



Table AD. Panel 1.1 



Tissue Name j Rel. Exp.(%) j Rel. Exp.(%) | Tissue Name j Rel. Exp.(%) j Rel. Exp.(%) 
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Ag573, Run 
109566077 


Ag573, Run 
1111 S6622 




Ag573, Run 


Ag573, Run 

111 13O0ZZ 


Adrenal gland 


0.0 


0.8 


Renal ca. UO- 
j i 


0.1 


6.0 


Bladder 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 


Brain 

(amygdala) 


0.0 


0.6 


Liver 


0.0 


0.0 


Brain 

( cerebellum^ 


0.1 


0.1 


Liver (fetal) 


0.0 


0.0 


Brain 

(hippocampus) 


0 0 




Liver ca. 
^nepaiODiaSi ) 
HepG2 




o.u 


Brain ( substantia 

-l— » 1 Ul 11 1 JUl/J IU1 1 LIU 

nigra) 


10.7 


13.7 


Lung 


0.0 


0.0 


Brain (thalamus) 


0.1 


4.9 


Lung (fetal) 


0.0 


1.5 


Cerebral Cortex 


9.3 


11.2 


Lung ca. (non- 
s.cell) HOP-62 


4.5 


0.0 


Brain (fetal) 


0.0 


0.8 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


Brain (whole) 


1.3 


6.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


0.0 


0.0 


glio/astroU-118- 
MG 


0.0 


0.0 


Lung ca. (non- 

s.cI)NCI- 

H522 


2.2 


0.0 


astrocytoma SF- 
539 


0.0 


1.3 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.0 


astrocytoma 

CXTD 
OiNJtWj 


0.0 


0.0 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. 
(small cell) ] 
LX-1 


0.0 


0.0 


glioma U251 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


43.5 


47.0 


glioma or-zvD 


1.3 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


glioma SNB-19 


0.0 


0.5 


Lung ca. 
(squam.) NCI- 
H596 


100.0 


98.6 


glio/astro U87- 
MG 


0.0 


1.5 


Lymph node 


0.0 


0.0 


neuro*; met SK- 
N-AS 


2.6 


0.1 


Spleen 


0.0 


0.0 


Mammary gland 


0.0 


3.8 


Thymus 


0.0 


0.0 



879 



WO 02/068649 



PCT/US02/02785 



Breast ca. BT- 
549 


0.0 


0.3 


Ovary 


0.0 


0.6 


Breast ca. MDA- 
N 


28.3 


39.8 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


DicaSL Ca. 

(pl.ef)T47D 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.2 


1.8 


jDreasi ca. 
(pl.ef) MCF-7 


0.3 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Breast ca.* 

ipi.el; MJ-//\- 

MB-231 


U.U 


U.U 


Ovarian ca. 
OVCAR-5 


0.0 


0.1 


Small intestine 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


2.5 


16.3 


Colorectal 


0.0 


0.0 


Ovarian ca.* 
fascites> SK- 
OV-3 


0 8 


4 ft 


Colon ca. HT29 


0.0 


00 


Pjmprpnc 
x all VI wad 




n n 


Colon ca. CaCo- 

2 


0.0 


p.o 


Pancreatic ca. 
r A PAN 2 


0.0 


0.0 


Colon ca. HCT- 
15 


0.0 


0.0 


Pituitary gland 


1.5 


2.7 


coion ca. ri\^ i - 
116 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon ca. HCC- 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. 
SW480 


0.0 . - 


0.0 


Prostate ca.* 

(bone met) PC- 
'S 


0.1 


0.0 


Colon ca.* 
SW620 (SW480 
met) 


1.9 


0.0 


Salivary gland 


0.1 


0.0 


Stomach 


0.0 


0.0 


Trachea 


0.0 


0.0 


Gastric ca. (liver 
met) NCI-N87 


0.1 


0.0 


Spinal cord 


0.0 


2.4 


Heart 


3.8 


19.6 


Testis 


0.0 


2.0 


Skeletal muscle 
(Fetal) 


0.0 


0.3 


Thyroid 


0.0 


2.3 


Skeletal muscle 


0.0 


0.0 


Uterus 


0 0 




Endothelial cells 


0.0 


0.0 


Melanoma 
M14 


0.0 


0.5 


Heart (Fetal) 


11.0 


31.0 


Melanoma 
LOXIMVI 


0.0 


0.0 


Kidney 


1.3 j 


12.2 


Melanoma 
UACC-62 


0.0 


0.0 


Kidney (fetal) 


0.0 


3.9 


Melanoma SK- 
MEL-28 j 


0.0 


0.4 
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Kenai ca. /oo-u 


U.U 


5.4 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Renal ca. A498 


10.5 


16.4 


Melanoma 
Hs688(A).T 


0.0 


0.2 


Renal ca. ACHN 


85.3 


100.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Renal ca. TK-10 


60.7 


59.0 




1 
1 



Table AE. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2930, Run 
158090377 


Rel. Exp.(%) 
Ag2930, Run 
165701939 


Tissue Name 


Rel. Exp.(%) 
Ag2930, Run 
158090377 


Rel. Exp.(%) 
Ag2930, Run 
165701939 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


10.1 


0.0 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


2.3 


7.8 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
A498 


61.6 


44.8 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 


Thyroid 


1.6 


0 0 


Renal ca. 
ACHN 


1 (\ R 
lO.o 


1UU.U 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


10.1 


15.0 


Pituitary gland 


3.1 


0.0 


Renal ca. TK- 
10 


22.5 


52.5 


Brain (fetal) 


9.7 


3.6 


Liver 


0.0 


0.0 


Brain (whole) 


11.0 


24.3 


Liver (feta|) 


1.3 


0.0 


Brain (amygdala) 


23.2 


9.3 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


100.0 


3.4 


Lung (fetal) 


0.0 


15.0. 


Brain (substantia 
nigra) 


2.7 


19.5 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


Brain (thalamus) 


23.3 


6.8 


Lung ca. 
(small cell) 
NCI-H69 


39.2 


16.0 


Cerebral Cortex 


33.4 


0.0 


Lung ca. 
(s.cell van) 
SHP-77 


0.0 


0.0 
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Spinal cord 


6.4 


7.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


2.6 


4.9 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.0 


glio/astroU-118- 
MG 


0.0 


A A 
0.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


A t\ 

0.0 


A A 

0.0 


astrocytoma 

owl lo5 


0.0 


0.0 


Lung ca. (non- 
s.celJ) riUr-62 


0.0 


0.0 


neuro*; met SK-N- 
AS 


4.2 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


5.0 


A A 
0.0 


Lung ca. 
(squam.) SW 
900 


A A 

0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


8.4 


6.7 


glioma SNB-19 


2.1 


0.0 


Mammary 
gland 


18.3 


0.0 


glioma U251 


0.0 


0.0 


Breast ca. 
(pl.ef)MCF-7 


1.4 


4.6 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MR-231 


0.0 


0.0 


Heart (fetal) 


18.4 


0.0 


Rrf*aQt pa * 

(pl.ef)T47D 


0.0 


0.0 


Heart 


2.4 


4.1 


Rreast ra RT- 
549 


10.7 


5.8 


Skeletal muscle 
(fetal) 


1.5 


0.0 


Rreast ca 
MDA-N 


35.1 


4.7 


Skeletal muscle 


0.0 


0.0 


Ovary 


0 0 


0 0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


1.8 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.0 


11.6 


Ovarian ca. 
OVCAR-8 


12.9 


4.9 


Colorectal 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


1.3 


3.3 


Ovarian ca.* 
(ascites) SK- 
OV-3 


2.9 


6.5 


Small intestine 


1.7 


. o.o | 


Uterus 


0.0 j 0.0 
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L/Oion ca. bw4oU 


A A 
0.0 


A A 

0.0 


Placenta 


A A 

0.0 


A A 

0.0 


Colon ca.* 

CU/AOn/CW/y) OA 
o W OzU(o W 4oU 

met) 


A A 
0.0 


A A 

0.0 


Prostate 


a a 

0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


v^oion ca. riC 1 - 
116 


0.0 


0.0 


Testis 


2.4 


0.0 


Colon ca. CaCo-2 


0.0 


0.0 


Ik X 1 

Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


0.0 


\/ 1 a. 

Melanoma* 
(met) 

nbuoo^D ). 1 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


0.0 


0.0 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.0 


0.0 


Melanoma 
LOX IM VI 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney, 


0.0 


8.5 


Adipose 


1.3 


0.0 



Table AF. Panel 2D 



Tissue Name 


Rel.Exp.(%) 
Ag2930,Run 
158090382 


Tissue Name 


Rel. Exp.(%) 
Ag2930,Run 
158090382 


Normal Colon 


12 


Kidney Margin 
8120608 


0.6 


CC Well to Mod Diff 
(OD03866) 


0.4 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.3 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


3.2 ! 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


1.7 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.5 


CC Margin (OD03921) 


2.2 


Thyroid Cancer 
064010 


0.3 
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lrom ramai 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.2 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451 - 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.4 


r rosiaie cancer 
(OD04410) 


0.4 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 
Metastasis 


0.0 


Prostate Cancer 
(OD04720-01} 


1.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1024 


0.2 


Normal Lung 061010 


0.2 


Breast Cancer 


3.5 


Lunf? Met to Muscle 
(OD04286) 


0.5 


Rf*psici" \ActToir\ 
DlCdM l vxal gill 

9100265 


0.3 


Muscle Margin 
(OD04286) 


0.0 


Breast Cancer 
A209073 


1.0 


Lung Malignant Cancer 
(OD03126) 




Breast Margin 
A2090734 


fi n 


Lung Margin (OD03126) 


0.4 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


1.9 


Liver Cancer 064003 


0.2 


Lung Margin (OD04404) 


0.0 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.4 


Lung Margin (OD045651 


0.0 


T iver Canrer fift04-T 


ft ft 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (0DO43 1 0) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.2 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


5.0 


Normal Kidney 


1.6 


Bladder Cancer 
(OD04718-01) 


0.0 
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Kidney Ca, Nuclear grade 
2 (OD04338) 


0.0 


Bladder Normal 
Adjacent (OD04718- 
03^ 


0.0 


Kidney Margin 
(OD04338) 


2.7 


Normal Ovary 


0.9 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


45.1 


Ovarian Cancer 

t/Utv/UO 


0.1 


Kidney Margin 
(OD04339) 


2.4 


Ovarian Cancer 

[KJUV^ I UO~U / ) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


100.0 


Ovary Margin 

^WUUt /00"uO ) 


0.0 


Kidney Margin 
(OD04340) 


3.1 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 


0.0 


Kidney Margin 
(OD04348) 


1.3 ' 


Stomach Margin 

7UOUJJ7 


0.0 


Kidney Cancer 
(OD04622-01) 


20.9 


Gastric Cancer 
006mos 


0.0 


Kidney Margin 
(OD04622-03) 


0.3 


Stomach Margin 
9060394 


0.4 


Kidney Cancer 
(OD04450-01) 


33.4 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


2.4 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table AG. Panel 3D 



Tissue Name 


Rel. 

Exp.(%) Ag2930, 
Run 162374504 


Tissue Name 


Rel. 
Exp.(%) 
Ag2930, Run 
162374504 


Daoy- 
Medulloblastoma 


1.9 


Ca Ski- Cervical 
epidermoid carcinoma 
(metastasis) 


0.0 


TE671- 
Medulloblastoma 


0.0 


ES-2- Ovarian clear 
cell carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


2.0 


Ramos- Stimulated 
with PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated 
with PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 
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SNB-78- Glioma 


0.0 


Raji- Burkitt's 
lymphoma 


0.0 


SF-268- 
Glioblastoma 


0.0 


Daudi- Burkitt s 
lymphoma 


0.0 


T98G- 
Glioblastoma 


0.0 


U266- B-cell 
plasmacytoma 


0.0 


pi/ XT CT T 

SK-N-SH- 
Neuroblastoma 

( m pta Qt a qi 


0.0 


CA46- Burkitt's 
lymphoma 


0.0 


SF-295- 
Glioblastoma 


0.0 


RL- non-Hodgkin's 
B-cell lymphoma 


0.0 


Cerebellum 


2.7 


T\ iff Tpk 11 

JM1- pre-B-cell 
lymphoma 


0.0 


Cerebellum 


7.3 


Jurkat- T cell 
leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 

Pfl YC* 1 TintTIJI 


0.0 


TF-1- 
Erythroleukemia 


0.0 


DMS-1 14- Small 

ppll Inner pnnppr 


0.0 


HUT 78-T-cell 
jyrnpnornd 


0.0 


DMS-79- Small 

ppll Inner pnnp-Pt" 


0.0 


U937- Histiocytic 
lympnoiua 


0.0 


NCI-H146- Small 

cell Inner panppr 


0.0 


KU-812- 

iviyciogcuuub lcuKcmia 


0.0 


NCI-H526- Small 
cell lung cancer 


0.0 


769-P- Clear cell 
renal carcinoma 


39.0 


NCI-N417- Small 
cell lung cancer 


100.0 


Caki-2- Clear cell 
renal carcinoma 


4.6 


NCI-H82- Small 
cell lung cancer 


0.0 


SW 839- Clear cell 
renal carcinoma 


16.0 


NCI-H157- 
Squamous cell lung 
cancer (metastasis) 


A A 

0.0 


G401- Wilms tumor 


0.0 


xt/^t t_ji i <c 

Large cell lung cancer 


0.0 


Hs7ooT- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- 
Large cell lung cancer 


0.0 


/~i A n A XT 1 

CAPAN-1- 
Pancreatic adenocarcinoma 

livpr mptaQfjiQiQ^ 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


5.0 


NCI-UMC-11- 
Lung carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell 
lung cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon 

cancer 


0.0 


MIA PaCa-2- 
Pancreatic carcinoma 


0.0 
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KM12- Colon 

cancer 


0.0 


CFPAC-1- 
Pancreatic ductal 
aaenocar c inurna 


0.0 


KM20L2- Colon 

CaUCCl 


0.0 


PANC-1- Pancreatic 
epiinciiuiu uucLdi carcinoind 


0.0 


NCI-H716- Colon 

cancer 


0.0 


T24- Bladder 
carcinma ^uansiiionai ceiij 


0.0 


SW-48- Colon 

auenocarcinorna 


0.0 


5637- Bladder 
carcinoma 


0.0 


SWl 116- Colon 
aaenocarcinonia 


"0.0 


HT-l 197- Bladder 
carcinoma 


0.0 


LS 174T- Colon 
auenuLarwinoTiid 


0.0 


UM-UC-3- Bladder 
carcinma ^iransuionai ceiij 


0.0 


SW-948- Colon 

J* n pn r\c si it* i n om s* 

ClUCl lUCdl Lf illUlll£l 


0.0 


A204- 
ivnauuoniyubdrcunia 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- 
Fibrosarcoma 


0.0 


NCI-SNU-5- 
Gastric carcinoma 


0.0 


MG-63- 
Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


0.0 


SK-LMS-1- 
Leiomyosarcoma (vulva) 


0.0 


NCI-SNU-16- 
Gastric carcinoma 


0.0 


SJRH30- 
Rhabdomyosarcoma (met to 
bone marrow) 


0.0 


NCI-SNU-1- 
vjabtnc care l noma 


0.0 


A431- Epidermoid 
carcinoma 


0.0 


RF-l- Gastric 

aUCHUCal uillUlIla 


0.0 


WM266-4- 
Melanoma 


0.0 


RF-48- Gastric 

auci iu wal u JilUilla 


0.0 


DU 145- Prostate 
carcinoma (pram meiasiosisj 


0.0 


MKN-45- Gastric 

f*jirf*innmji 

wul wlll\JlliCl 


8.7 


MDA-MB-468- 
oreabi adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous 
cell carcinoma of tongue 


0.0 


OVCAR-5- 
Ovarian carcinoma 


0.0 


SCC-9- Squamous 
cell carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous 
cell carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous 
cell carcinoma of tongue 


0.0 



Table AH. Panel 4D 





Rel. Exp.(%) 




ReLExp.(%) 


Tissue Name 


Ag2930,Run 


Tissue Name 


Ag2930,Run 




158090383 




158090383 
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Secondary Thl act 


5.9 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 ! 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


^ppnnHflrv TTi'? rp<:t 

JtUUHUOJ J JL IJLLr IWjL 


0 0 


HUVEC TT -1 1 


0 0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 


9.9 


Primary Thl act 


0.0 


Lung Microvascular EC 

TNTFalnha + TT -1 heta 

l i>x cxi una > lut/ia 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 

ilVJllw 


15.2 


Primary Trl act 


0.0 


Microsvasular Dermal EC 

TMFalnha -*- TT -IViPta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


onid.ii airway cpiincuuiii 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


\-/Uroncry artery oivi^ 

TNFalpha + IL-lbeta 


0.0 


CDS lvmnhocvte act 


0 0 


AMI \J\sy IC3 ICaL 




Secondary CD8 
lvmnhocvte re^t 


0.0 


Astrocytes TNFalpha + 

TT-lheta 


0.0 


Secondary CD8 
lvmnhocvte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-R 1 2 fBasonhiH 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


22.1 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


8.2 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


10.7 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL- 18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells EL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 JHPAEC none 


0.0 
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iwo Way MLK 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMCPWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0 0 


T iincr "fiKri^V*lQc+ TT Q 
LrUIlg llurUDIdbl l±j-y 


ft ft 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 

PCT) 10,70 r^ct 


0.0 


EOL-1 dbcAMP 


0.0 


L/CiHial llurODlaSl 

CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


T'iprmjal fihtrvKlact 
JL/wlllldJ llL/lUDldol 

CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


7.9 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 




idu colitis Z 


6.3 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 j 







CNS_neurodegeneration_vl.O Summary: Ag2930 Expression of the NOV9A gene is 
low/undetectable in all samples on this panel. (CTs>35). (Data not shown.) 



General jscreeningjpaneI_vL5 Summary: Ag4297 Expression of the NOV9A gene 
is low/undetectable in all samples on this panel. (CTs>35). (Data not shown.) 

.5 Panel LI Summary: Ag573 Two experiments with the same probe and primer set 

produce results that are in excellent agreement, with highest expression of the NOV9A gene, a 
putative neuropeptide Y receptor, in lung cancer and renal cancer cell lines (CTs=23-26). 
Significant expression is also seen in a cluster of breast cancer cell lines. Thus, expression of 
this gene could be used to differentiate between these samples and other samples on this panel 

10 and as a marker to detect the presence of these cancers. Neuropeptide Y, which controls 
vasoconstriction and feeding behavior, is expressed in breast cancer (see ref. below). 
Furthermore, peptide receptors in human tumors represent clinically relevant targets for both 
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cancer diagnosis and treatment. Therefore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast, lung and renal cancers. 

This molecule, which encodes a neuropeptide Y receptor homolog, is also expressed in 
the brain. Neuropeptide Y and its receptors have been implicated in feeding behavior, learning 
5 and memory, and seizure. This gene would therefore be an excellent small molecule target for 
the treatment of epilepsy or any seizure disorder. 

Among tissues with metabolic function, this gene has low-to-moderate levels of 
expression in adrenal, heart, fetal skeletal muscle and pancreas. This gene product is highly 
expressed in fetal and adult heart. Since neuropeptide Y and its receptor are associated with 

10 appetite regulation, this gene product may be a small molecule target for the treatment of 
metabolic and endocrine disease, including obesity and Types 1 and 2 diabetes. In addition, 
the expression in heart and the suggested role of neuropeptide Y in vasoconstriction, 
cardiovascular signaling, and development of the heart suggest that this gene product may be 
useful in treating disorders that affect the heart. 

15 References: 

Reubi JC, Gugger M, Waser B, Schaer JC. Y(l)-mediated effect of neuropeptide Y in 
cancer: breast carcinomas as targets. Cancer Res 2001 Jun 1 ;61(1 1):4636-41 

Overexpression of selected peptide receptors in human tumors has been shown to 
represent clinically relevant targets for cancer diagnosis and therapy. Neuropeptide Y (NPY) is 

20 a peptide neurotransmitter mediating feeding behavior and vasoconstriction. It has never been 
shown to be involved iq human cancer. We show here, using in vitro receptor 
autoradiography, a NPY receptor incidence of 85% in primary human breast carcinomas (n = 
95) and of 100% in lymph node metastases of receptor-positive primaries (n = 27), 
predominantly as Y(l) subtype, whereas nonneoplastic human breast expressed Y(2) 

25 preferentially. Y(l) mRNA was detected in Y(l)-expressing tumors by in situ hybridization, 
whereas Y(2) mRNA was found in normal breast tissue. The strong predominance of Y(l) in 
breast carcinomas compared with Y(2) in normal breast suggests that neoplastic 
transformation can switch the NPY receptor expression from Y(2) to Y(l) subtype. Moreover, 
in Y(l)-expressing human SK-N-MC tumor cells, an NPY-induced, dose-dependent inhibition 

30 of tumor cell growth of >40% was observed, suggesting a functional role of NPY in cancer, 
mediated by Y(l). NPY should therefore be added to the list of small regulatory peptides 
related to cancer. The high incidence of Y(l) in in situ, invasive, and metastatic breast cancers 
allows for the possibility to target them for diagnosis and therapy with NPY analogues. 
PMID: 11389101 
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Furtinger S, Pirker S, Czech T, Baumgartner C, Ransmayr G, Sperk G. Plasticity of Yl 
and Y2 receptors and neuropeptide Y fibers in patients with temporal lobe epilepsy. J Neurosci 
2001 Aug 1;21(15):5804-12 

Marked expression of neuropeptide Y (NPY) and its Y2 receptors in hippocampal 
5 mossy fibers has been reported in animal models of epilepsy. Because NPY can suppress 

glutamate release by activating presynaptic Y2 receptors, these changes have been proposed as 
an endogenous protective mechanism. Therefore, we investigated whether similar changes in 
the NPY system may also take place in human epilepsy. We investigated Yl and Y2 receptor 
binding and NPY immunoreactivity in hippocampal specimens that were obtained at surgery 

1 0 from patients with temporal lobe epilepsy and in autopsy controls. Significant increases in Y2 
receptor binding (by 43-48%) were observed in the dentate hilus, sectors CA1 to CA3, and 
subiculum of specimens with, but not in those without, hippocampal sclerosis. On the other 
hand, Yl receptor binding was significantly reduced (by 62%) in the dentate molecular layer 
of sclerotic specimens. In the same patients, the total lengths of NPY immunoreactive (NPY- 

15 IR) fibers was markedly increased (by 1 15-958%) in the dentate molecular layer and hilus, in 
the stratum lucidum of CA3, and throughout sectors CA1 to CA3 and the subiculum, as 
compared with autopsies. In nonsclerotic specimens, increases in lengths of NPY-IR fibers 
were more moderate and statistically not significant. NPY mRNA was increased threefold in 
hilar interneurons of sclerotic and nonsclerotic specimens. It is suggested that abundant 

20 sprouting of NPY fibers, concomitant upregulation of Y2 receptors, and downregulation of Yl 
receptors in the hippocampus of patients with Amnion's horn sclerosis may be endogenous 
anticonvulsant mechanisms. 
PMID: 1 1466452 

Rahmouni K, Haynes WG. Leptin signaling pathways in the central nervous system: 
25 interactions between neuropeptide Y and melanocortins. Bioessays. 200 1 Dec;23( 1 2): 1 095-9. 

No other hormone has drawn more attention than leptin in recent studies on the control 
of appetite, body weight and obesity. This hormone is produced by adipose tissue and enters 
the brain via a saturable specific transport mechanism. Leptin acts in the hypothalamus to 
modulate food intake and heat production as well as several other neuroendocrine pathways. 
30 The mechanisms through which leptin exerts its central nervous effects are now better 
understood. Proopiomelanocortin- and neuropeptide Y-containing neurons in the 
hypothalamus have emerged as potent candidate mediators of leptin action. These two 
neuropeptides have been shown to exert opposing effects using different pathways. Recently, 
Cowley et al. (2001) described a new circuit in the regulation of neuronal activity by leptin 
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with an interaction between these two pathways. These data add complexity to the 
mechanisms by which leptin achieves its effects in the central nervous system, but they also 
offer potential mechanisms to explain the phenomenon of leptin resistance observed in 
obesity. Copyright 2001 John Wiley & Sons, Inc. 
5 PMID: 11746228 

Michalkiewicz M, Michalkiewicz T, Kreulen DL, McDougall SJ. 
Increased blood pressure responses in neuropeptide Y transgenic rats. Am J Physiol 
Regul Integr Comp Physiol 2001 Aug;281(2):R417-26 

Considering the coexistence of neuropeptide Y (NPY) and norepinephrine in 

10 perivascular sympathetic nerves and the known vasoconstrictor cooperation of NPY with 
norepinephrine, we investigated the involvement of NPY in long-term control of 
cardiovascular functions using NPY transgenic (NPY-tg) rats. These rats were developed by 
injection of the rat (Sprague-Dawley) pronuclei with a 14.5-kb clone of the rat structural NPY 
gene. When compared with nontransgenic littermates, NPY concentrations were significantly 

15 increased in a number of cardiovascular tissues of NPY-tg hemizygotes. Direct basal mean 
arterial pressure and heart rate were not changed, but calculated total vascular resistance was 
significantly increased in NPY-tg subjects. Arterial pressure increases, in response to 
norepinephrine injection, were greater in the NPY-tg rats. Also, the hypotension and 
bradycardia in response to hemorrhage were significantly reduced in NPY-tg subjects. These 

20 results indicate that NPY, when expressed in increased amounts, potentiates the pressor effects 
of norepinephrine and contributes to maintaining blood pressure during hemorrhage, but it 
does not alter resting blood pressure. These transgenic rats will facilitate studies of the role of 
NPY signaling in cardiovascular regulation, particularly regarding its functional cooperation 
with norepinephrine. 

25 PMID: 11448843 

Horackova M, Slavikova J, Byczko Z. Postnatal development of the rat intrinsic 
cardiac nervous system: a confocal laser scanning microscopy study in whole-mount atria. 
Tissue Cell 2000 Oct;32(5):377-88 

We used confocal laser scanning microscopy and fluorescent immunohistochemistry to 

30 study the developmental pattern and distribution of specific neuronal phenotypes within the 
intrinsic cardiac nervous system in whole-mount atrial preparations from newborn to 5 week 
old rats. Individual ganglia and neuronal cell bodies were localized by means of two general 
neuronal markers: protein gene product 9.5 (PGP) and microtubule-associated protein two 
(MAP). In rats < or =2 weeks old there were two main subpopulations of intrinsic neurons 

892 



WO 02/068649 PCT/US02/02785 



located in the intraatrial septum and around the origin of the superior vena cava. The more 
abundant was a population of strongly tyrosine hydroxylase (TH) immunoreactive (IR) 
neurons (10-40 microm in diameter) most of which were also PGP-IR. The second, less 
numerous (approximately 60-70% than the TH-IR group) type of neurons exhibited ChAT-IR 
5 which colocalized with MAP-IR. Towards the end of the second postnatal week and during the 
third, the ganglia containing these neurons became more numerous and their localization also 
included tissues around the origins of the inferior vena cava and the pulmonary veins, as well 
as both atrial walls close to the A V junction. During the second and third postnatal weeks, 
when the extrinsic innervation of the adrenergic and cholinergic phenotypes largely increases, 

10 the intrinsic innervation also changed greatly, and around the 21st postnatal day it appeared to 
acquire mature characteristics. The TH-IR neurons changed their characteristics and formed 
two types of ganglia. The larger ganglia containing large cells (20-40 microm in diameter) 
expressed TH-IR mostly close to their inner body surface (approximately 80-90% of identified 
neurons). Most of these neurons also expressed neuropeptide Y (NPY)-IR, specifically around 

1 5 their nuclei. The second type of small strongly TH-IR neurons (approximately 10% of all 
identified neurons) were contained in smaller groups (20-50 cells) which were usually 
embedded into much larger ganglia (100-400 cells), containing large (20-50 microm) neurons. 
Unlike all other intrinsic neurons, these small TH-IR cells did not exhibit any PGP-IR or 
MAP-IR. The number of ChAT-IR neurons increased at this stage, reaching approximately 

20 90% of the neurons identified by the general neuronal markers. These neurons were 

surrounded by a rich network of cholinergic varicose nerve fibers, some of which were likely 
of an extrinsic origin. We have also identified relatively small ganglia expressing 
immunoreactivity to vasoactive intestinal polypeptide (VIP), and to substance P (SP). The 
presented data indicate that the phenotypes of intrinsic neurons in the rat heart change greatly 

25 during the first month of postnatal development. This may be at least partially related to the 
development and maturation of functional extrinsic nervous control of the heart. 
PMID: 11201277 

Panel L3D Summary: Ag2930 The expression of the NOV9a gene was assessed in 
two independent runs on this panel. Low but significant levels of expression are seen in kidney 
30 cancer cells and a lung cancer cell, consistent with Panel 1.1 . Please see the previous panel for 
discussion of utility of this gene in cancer. 

The expression in this panel also confirms expression of this gene product in the brain. 
Please see Panel 1.1 for discussion of utility of this gene in the central nervous system. 
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Panel 2D Summary: Ag2930 The NOV9a gene is expressed at low but significant 
levels in kidney cancer samples in this panel but not in the adjacent normal tissue samples 
(CTs=30-32). This expression is consistent with results in the preceding panels. This suggests 
that expression of this gene can be used as a diagnostic marker for the presence of kidney 
5 cancer. Furthermore, therapeutic inhibition of the gene product could potentially be used in the 
treatment of kidney cancer. 

Panel 3D Summary: Ag2930 The NOV9a gene expression is restricted to NCI-N41 7, 
a small cell lung cancer cell line (CT=33.81). Expression of this gene can therefore be used for 
the diagnosis and treatment of this cancer. 
10 Panel 4D Summary: Ag2930 Expression of the NOV9a gene is low/undetectable in 

all samples on this panel. (CTs>35). (Data not shown.) 

NOV4b 

Expression of gene NOV4b was assessed using the primer-probe set Ag2955, 
described in Table BA. Results of the RTQ-PCR runs are shown in Tables BB, BC and BD. 

1 5 Table BA. Probe Name Ag2955 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID HO: 


Forward 


5 1 -caatgcaatctgttggactttt-3 ' 


22 


33 


1003 


Probe 


TET-5 ' -tctctcctgggatggattttatccat-3 1 - 
TAMRA 


26 


59 


1004 


Reverse 


5 • -gttcttccagtgtggcaaataa-3 1 


22 


94 


1005 



Table BB. General screening panel vl.4 



Tissue Name 


ReI.Exp.(%)Ag2955, 
Run 216860933 


Tissue Name 


Rel. Exp.(%) Ag2955, 
Run 216860933 


Adipose 


8.1 


Renal ca.TK-10 


8.0 


Melanoma* 
Hs688(A).T 


8.5 


Bladder 


18.9 


Melanoma* 
Hs688(B).T 


5.8 


Gastric ca. (liver met.) 
NCI-N87 


35:6 


Melanoma* M14 


1.1 


Gastric ca. KATO III 


2.5 


Melanoma* 
LOXIMVI 


31.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


3.1 


Colon ca. SW480 


4.4 


Squamous cell 
carcinoma SCC-4 


1.3 


Colon ca.* (SW480 
met) SW620 


1.7 


Testis Pool 


0.0 


Colon ca.HT29 


3.9 
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Prostate ca.* (bone 
met) PC-3 


36.9 


Colon ca. HCT-116 


5.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


5.9 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


00 


fnlnn ra <JWl 1 16 

\-/tJlUIl va. OWlllO 




Ovarian ca. 
OVCAR-3 


10.5 


Colon ca. Colo-205 


3.3 


Ovarian ca. SK-OV- 
3 


47.3 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


4.5 


Colon Pool 


1.6 


Ovarian ca 
OVCAR-5 


6.5 


Small Intestine Pool 


2.4 


Ovarian ca. IGROV- 
1 


21.0 


Stomach Pool 


4.3 


Ovarian ca. 
OVCAR-8 


3.1 


Bone Marrow Pool 


5.3 


Ovary 


4.3 


Fetal Heart 


2.9 


Breast ca. MCF-7 


0.0 


Heart Pool 


3.8 


Breast ca. MDA- 
MB-231 


29.7 


Lymph Node Pool 

* 


8.0 


Breast ca. BT 549 


3.0 


Fetal Skeletal Muscle 


1.6 


Breast ca. T47D 


6.3 


Skeletal Muscle Pool 


7.6 


Breast ca. MDA-N 


1.6 


Spleen Pool 


2.4 


Breast Pool 


0 7 


i nymus r ooi 


1.2 


Trachea 


2.5 


CNS cancer (glio/astro) 


2.0 


Lung 


1.5 


CNS cancer (glio/astro) 


1.2 


Fetal Lung 


9.6 


CNS cancer 


5.3 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 


2.6 


Lung ca. LX-1 


2.4 


v»liu \*<UI\AZ1 \a.all\jj 

SNB-75 


6.7 


.Lung ca. fsd-H14o 


0.0 


CNS cancer (glio) 
SNB-19 


9.3 


T a OTTT4 T7 

Lung ca. DrLP-77 


1.7 


CNS cancer (glio) SF- 
295 


62.9 


Lung ca. A549 


3.0 


Brain (Amygdala) Pool 


1.3 


Lungca.NCI-H526 


0.0 


Brain (cerebellum) 


4.0 


Lung ca. NCI-H23 


59.0 


Brain (fetal) 


7.2 


Lung ca. NCI-H460 


2.1 


Brain (Hippocampus) 
Pool 


2.1 


Lung ca. HOP-62 


3.0 


Cerebral Cortex Pool 


3.0 . 
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I una no "MPT 


Oj.l 


Brain (Substantia nigra) 
Pool 


0 (\ 

v/.u 


Liver 


0.0 


Brain (Thalamus) Pool 


2.6 


Fetal Liver 


3.1 


Brain (whole) 


0.0 


Liver ca. HepG2 


4.1 


Spinal Cord Pool 


0.0 


Kidney Pool 


4.4 


Adrenal Gland 


0.0 


Fetal Kidney 


23.2 


Pituitary gland Pool 


1.0 


Renal ca. 786-0 


2.5 


Salivary Gland 


0.0 


Renal ca. A498 


1.9 | 


Thyroid (female) 


1.7 


Renal ca. ACHN 


2.7 


Pancreatic ca. CAPAN2 


100.0 


Renal ca.UO-31 


1.7 


Pancreas Pool 


6.5 



Table BC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag29S5, 
Klin Iblywooo 


Tissue Name 


Rel. Exp.(%) Ag2955, 
Kun lo/fUoJoJ 


Liver adenocarcinoma 


8.5 


Kidney (fetal) 


4.4 


Pancreas 


0.0 


Renal ca. 786-0 


3.2 


Pancreatic ca. CAPAN 

2 


64.6 


Renal ca. A498 


. 15.1 


Adrenal gland 


0.0 


Renal ca. RXF 393 


10.7 


Thyroid 


1.5 


Renal ca. ACHN 


6.5 


Salivary gland 


1.1 


Renal ca.UO-31 


2.3 


Pituitary gland . 


0.0 


Renal ca.TK-10 


13.2 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


2.7 


Liver (fetal) 


0.0 


Brain (amygdala) 


. 12.1 


Liver ca. 

(hepatoblast) HepG2 


8.5 


Brain (cerebellum) 


1.0 


Lung 


6.4 


Brain (hippocampus) 


0.0 


Lung (fetal) 


2.9 


Brain (substantia nigra) 


6.2 


Lung ca. (small cell) 
LX-1 


5.5 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


3.9 


Lung ca. (s.cell var.) 
SHP-77 


4.1 


Spinal cord 


1.9 


Lung ca. (large 
cell)NCI-H460 


1.1 


glio/astroU87-MG 


0.0 


Lung ca. (non-sm. 
cell)A549 


19.2 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


26.1 


astrocytoma SW1783 


7.6 


Lung ca. (non-s.cell) 
HOP-62 


12.3 
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neuro*; met SK-N-AS 


3.5 


T imp ca fnon-s cl\ 
NCI-H522 


46.3 


astrocytoma SF-539 


13.2 


Tjino - ca fcfiiiam ^ 

J— 'Will ^ vui yOlJUCllll. J 

SW900 


16.7 


astrocytoma SNB-75 


3.6 


NCI-H596 


2.8 


glioma SNB-49 - 


~ - 16 5 


Mamtnarv otanH 
ivxciiiiiiicu. jf gianu 


l/.U 


glioma U251 


26.6 


RrpflQt ca * fnl pf\ 

MCF-7 


3.0 


glioma br-zio 


6.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


7.1 


Heart (ietai) 


0.0 


Breast ca.* (pl.ef) 
T47D 


11.5 


Heart 


1.1 


Breast ca. BT-549 


1.1 


Skeletal muscle (fetal) 


3.3 


Breast ca. MDA-N 


5.7 


Skeletal muscle 


0.0 


Ovarv 
v u.i y 


n n 
u.u 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


21.2 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


7.4 


Spleen 


0.0 


Ovarian ca. OVCAR- 


14.3 


Lymph node 


0.0 


Ovarian on nvrAD. 

8 


4.7 


Colorectal 




Ovarian ca. IGROV- 
1 


38.4 


Stomach 


0.9 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. S W480 


6.8 

•t nm. ,- ,. l .. l ,r n n lll n, -,i T, M, l „.-„„.r» l im l , nmi r, 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


7.6 


Prostate 


0.0 


Colon ca. HT29 


6.1 


Prostate ca * fhnnp 

met)PC-3 


13.1 


Colon ca. HCT-116 


4.2 


Testis 


0.0 


Colon ca. CaCo-2 


19.3 


lvf^lanrwria 

IVlC/ICUlUlllO- 

Hs688(A).T 


14.7 


Colon ca. 
tissue(OD03866) 


0.0 


Ivf pfanrkmsi* ( m^t^ 
ivic-iaiiuilla ^IHCLJ 

Hs688(B).T 


5.4 


Colon ca. HCC-2998 


3.6 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


20.6 


Melanoma M14 


0.0 


Bladder 


13.4 


Melanoma LOX 
IMVI 


25.5 


Trachea 


3.6 


Melanoma* (met) 


1.7 
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SK-MEL-5 




Kidney | 5.1 


Adipose 


13.9 



Table BD. Panel 4D 



l ia&ue iiaine 


Rel. Exp.(%) 
164306318 


i issue iName 


Rel. Exp.(%) 
AgZy55, Kun 
164306318 


SeconHarv Th 1 art 






0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


20.2 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


15.7 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


8.4 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


6.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.7 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


7.7 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


6.4 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


9.5 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


5.3 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


19.9 


CD45RACD4 
lympnocyte act 


13.5 


Coronery artery SMC rest 


22.4 


CD45RO CD4 
lympnocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


12.0 . 


LUo lympnocyte act 


0.0 


Astrocytes rest 


37.6 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-1 beta 


29.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


25.7 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


100.0 


2ryThl/Th2ATrl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


4.2 


LAK cells rest 


0.0 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


1.3 


LAK cells IL-2 


0.0 [Liver cirrhosis 


7.5 


LAK cells IL-2+IL-12 


0.0 jLupus kidney 


5.0 
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LAK cells EL-2+IFN 
gamma 


0.0 


NCI-H292 none 


31.9 


LAKcellsIL-2+IL-18 


0.0 


NC1-H292 IL-4 


52.1 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


39.5 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


16.6 


Two Way MLR 3 day 


3.9 


NCI-H292 IFN gamma 


35.4 


Two Way MLR 5 day 


0.0 


HPAEC none 


4.0 


l wo Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


3.0 


PBMC rest 


0.0 


Lung fibroblast none 


4.9 


PBMCPWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


5.5 


Lung fibroblast IL-4 


9.9 


Ramos fB celll none 


0 0 


juung iiDroDiosi iL»-y 


O.J 


Ramos (B cell) 
ionomvcin 


0.0 


Lung fibroblast IL-1 3 


1.7 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


11.7 


B lymphocytes CD40L 
and IL-4 


0.0 

— 


Dermal fibroblast 
i u / u rest 


59.9 


EOL-1 dbcAMP 


0.0 


jjermai nnroDiast 
CCD1070 TNF alpha 


80.7 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


uermai nDroDiast 
CCD 1070 IL-1 beta 


35.4 


Dendritic cells none 


10.6 


Dermal fibroblast IFN 
gamma 


70.2 


Dendritic cells LPS 


4.5 


Dermal fibroblast IL-4 


72.7 


Dendritic cells anti- 
CD40 


n A 1 


IrJIJ Colitis 2 


4.2 


Monocytes rest 


7.2 


IBD Crohn's • 


0.0 


Monocytes LPS 


2.4 


Colon 


6.7 


Macrophages rest 


3.3 


Lung 


22.4 


Macrophages LPS 


0.0 


Thymus 


23.2 


HUVEC none 


7.3 


Kidney 


0.0 


HUVEC starved 


19.6 







General_screening_panel_vl.4 Summary: Ag2955 Highest expression of the 
NOV4b gene is seen in a pancreatic cancer cell line (CT=32.6). Low but significant levels of 
expression are also seen in melanoma, lung, brain, ovarian, breast and prostate cancer cell 
5 lines. Thus, expression of this gene might be used as a diagnostic marker for the presence of 
these cancers. Furthermore, therapeutic inhibition of this gene product may be useful in the 
treatment of melanoma, lung, brain, ovarian, breast and prostate cancers. 
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Panel 1.3D Summary: Ag2955 Highest expression of the NOV4b gene is seen in an 
ovarian cancer cell line, SK-OV-3, (CT=33.1). Low but significant levels of expression are 
also seen in melanoma, lung, brain and pancreatic cancer cell lines. Thus, expression of this 
gene might be used as a diagnostic marker for the presence of these cancers. Furthermore, 
5 therapeutic inhibition of this gene product may be useful in the treatment of melanoma, lung, 
brain, and pancreatic cancers. 

Panel 4D Summary: Ag 2955 The NOV4b gene is expressed at low but significant 
levels in treated and untreated dermal fibroblasts and in the basophil cell line treated with 
PMA and ionomycin. The latter mimics the condition that leads to the degranulation and 

10 release of various mediators which contribute to the symptomatology of allergic diseases. This 
transcript encodes a claudin 6 like protein, a member of the Claudin tight junction family. The 
expression of this transcript could potentially be used as a marker for activated basophils and 
dermal fibroblasts. Furthermore, modulation of the activity or expression of this putative 
protein by antibodies may reduce the symptoms of patients suffering from allergic diseases 

15 asthma, ulcerative colitis, atopic diseases such as contact dermatitis and eczema, or 
inflammatory skin diseases. 

NOV3b 

Expression of gene NO V3b was assessed using the primer-probe set Ag2957, 
described in Table CA. Results of the RTQ-PCR runs are shown in Tables CB, CC, CD and 
20 CE. 

Table CA. Probe Name Ag2957 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -caggcctctgcactttgac-3 ' 


19 


438 


1006 


Probe 


TET- 5 1 - ctgt ct cgtggtatgccaccctggt - 3 1 - 
TAMRA 


25 


459 


1007 


Reverse 


5 • -ccaaattctgggttgaagaact-3 1 


22 


492 


1008 



Table CB. General screening panel vl.4 



Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 216861284 


Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 216861284 


Adipose 


1.8 


Renal ca. TK-10 


6.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 
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ivieianoma Mi 4 


A A 

v.v 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


ivicianoma oJv- 
MEL-5 


0.0 


Colon ca. SW480 . 


0.0 


Sauamous cell 
carcinoma SCC-4 


0.0 


Colon ca * 
met) SW620 


4.6 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca * ( bone 
met)PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Prostate Pool 


0 0 


Colon ra CaPo-? 




Placenta 


41.5 


Colon cancer tissue 


1.4 


uxerus Jrooi 


A A 

0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 


2.7 


Colon ca. SW-48 


0.0 


Ovarian ca. 

rw?r* ad /i 
UVLAK-4 


0.0 

— - — ■ — 


Colon Pool 


0.0 


uvanan ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


1.7 


Fetal Skeletal Muscle 


8.0 


Breast ca. T47D 


0.0 




n n 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Dicasi rooi 


A A 

u.o 


Thymus Pool 


3.0 


Trachea 


2.8 


CNS cancer (glio/astro) 
U87-MG 


0.0 


Lung 


2.9 


CNS cancer (glio/astro) 

t t i i o x nr> 

U-118-MG 


0.0 


Fetal Lung 


24.5 


CNS cancer 


0.0 


Lungca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lungca. LX-1 


0.9 


CNS cancer (astro) 
SNB-75 


0.0 


Lungca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 
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Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lungca.NCI-H526 


6.0 


Brain (cerebellum) 


0.0 


Lungca.NCI-H23 


0.0 


Brain (fetal) 


1.7 


r .« "XTV«T XT A /TV 

Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


4.7 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


Lune ca NCI-H527 


0 0 


Brain (Substantia nigra) 
Pool 


n n 


Liyer 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


31.4 


Brain (whole) 


0.0 


Liver ca. HepG2 


7.6 


Spinal Cord Pool 


22.4 


Kidney Pool 


1.7 


Adrenal Gland j 


0.0 


Fetal Kidney 


100.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


o.o 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


2.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



Table CC. Panel 13D 



Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 167906390 


Tissue Name 


Rel.Exp.(%)Ag2957, 
Run 167906390 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


34.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 
2 


0.1 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivaiy gland 


0.2 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


100.0 


Brain (whole) 


0.0 


Liver (fetal) 


2.6 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


2.7 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.2 


Lung (fetal) 


0.8 


Brain (substantia nigra) 


0.3 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


1.5 
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Spinal cord 


23.8 


Lung ca. (large 
celI)NCI-ri4oO 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.3 


astrocytoma SW1783 


0.0 

™ - - 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.2 


Lung ca. (squam.) 
SW 900 


2.1 


astrocytoma SNB-75 


0.1 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma oiNo-iy 


U.U 


Mammary gland 


1.7 


glioma U251 


0.0 


Breast ca.* (pl.ef) 


0.0 


glioma SF-295 


0.0 


MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


T47D 


0.0 


Heart 


0 6 


JOJ Ca£>l ta. O 1 -J*t" 


U.3 


Skeletal muscle (fetal) 


22 


Breast ca. MDA-N 


0.0 


oKeietai muscle 


1.5 ! 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


2.3 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.1 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.1 


Small intestine 


0.0 


T Itpn i q 1 


u.u 


Colon ca. SW480 


0.0 


Placenta 


0.8 


Colon ca.* 
SW620(SW480 met) 


0.3 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


22 


Melanoma 
Hs688(A).T 


0.0 
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V^UJUIl V/d.. 

tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 




u.u 


Melanoma UAtL-oz 


n a 
0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.1 


Trachea 


0.2 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


30.6 


Adipose 


0.3 



Table CD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2957,Run 
170858345 


Tissue Name 


Rel. Exp,(%) 
Ag2957,Run 
170858345 


Normal Colon 


0.0 


Kidney Margin 
8120608 


68 8 


CC Well to ModDiff 
(OD03866) 


1.0 


Kidney Cancer 
8120613 


0 0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


24 3 


CC Gr.2 rectosigmoid 
(OD03868) 


13.6 


Kidney Cancer 
9010320 


3.4 


CC Margin (OD03868) 


1.9 


Kidney Margin 
9010321 


85.9 


CC Mod Diff (ODfnQ^fft 


u.u 


iNormai uterus 


A A 

0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


1,6 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


1.2 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


3.3 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


1.3 


Breast Cancer 
(OD04590-01) 


3.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 


2.6 


Breast Cancer 


0.0 
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(OD04410) 




Metastasis 
(OD04655-05) 




Prostate Cancer 

//~\l~\f\A T^f\ A1\ 

(UD04720-01) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 

//"YP^A/l ^OA 

(UU04 720-02) 


0.0 


Breast Cancer 1024 


9.5 


Normal Lung 061010 


0.0 


Breast Cancer 
9100266 


3.3 


Lung Met to Muscle 
fOD04286^ 


0.0 


Breast Margin 


5.4 


Muscle Marcin 
(OD04286) 


0.0 


£>redi>i oncer 
A209073 


4.1 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


4.9 


Lung Margin (OD03 126) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


0.0 


Lung Margin ffYn04404 , l 

J-zUllg 1»ACU gill yvyJ-ZV/T^VT^ 1 


0 0 


i-fiver cancer iuz3 


A A 
U.U 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


12.4 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


5.7 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


1.4 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


85.3 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


6.2 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


0.0 


Kidney Margin 

(vJJL)U433o) ! 


32.3 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339} 


13.5 


Ovarian Cancer 


0.0 


Kidney Margin 
(OD04339) 


92.0 


Ovarian Cancer 
(OD04768-07) 


28.7 


Kidney Ca, Clear cell 
type (OD04340) 


9.7 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


87.7 


Normal Stomach 


0.0 
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Kidney Ca, Nuclear grade 
3 COD04348^ 


0.0 


Gastric Cancer 


0.0 


Kidney Margin 


100.0 


Stomach Margin 
9060359 

7UUVJJ7 


0.0 


Kidney Cancer 


0.0 


Gastric Cancer 
9060395 

7UUUJ ✓ J 


0.0 


Kidney Margin 
(OD04622-03) 


50.3 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


58.6 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table CE. Panel 4D 



Tissue Name 


Rel. £xp.(%) 
Ag2957, Run 
164306319 


Tissue Name 


Rel. Exp.(%) 
Ag2957, Run 
164306319 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.8 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0,6 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 . 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 
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Secondary CD8 
lympnocyie rest 


0.0 


Astrocytes TNFalpha + 

IT 1 kiata 
IJLr 1 ue Id 


0.0 


Secondary CD8 
lympnocyie aci 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


Js.u-01/ (oasopnijj 
PMA/ionomycin 


0.0 


2ryThl/Th2/TrI_anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


1.7 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


5.8 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Wav MLR 5 dav 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF abha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lunff fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Kamos (d ceil j none 


U.U 


Lung iLDroDiast UL-y 


u.u 


Ramos (B cell) 
lonomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

ar\A TT A 
and lL/-*+ 


0.0 


Dermal fibroblast 
KsK^LJ i u /u resx 


0.0 


EOL-l dbcAMP 


0.0 


MAttYIQl Oct 

ucrmdi iiDroDiaSL 
CCD1070 TNF alpha 


0.0 


POT -1 HVirAMP 

PMA/ionomycin 


0.0 


uermai iiDrouiasx 
CCD 1 070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


WD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


2.3 


Macrophages rest 


0.0 


|Lung 


0.0 
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Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


0.0 


Kidney 


7.6 


HUVEC starved 


0.0 







General, screening panel vl.4 Summary: Ag2957 Expression of the NOV3b gene 
is restricted to placenta, fetal kidney and liver. Thus, expression of this gene could be used to 
differentiate between these samples and other samples on this panel. In addition, this gene 
5 shows no or very low expression in the cancer cell lines used in this panel. Thus, the absence 
of expression could potentially be used as a diagnostic marker for cancer. 

Panel 1.3D Summary: Ag2957 Expression of the NOV3b gene is restricted to kidney, 
spinal cord and liver. Thus, expression of this gene could be used to differentiate between 
these samples and other samples on this panel. This gene encodes a putative claudin. Claudins 
10 are components of tight junction strands. Thus, this specific pattern of expression may indicate 
that this gene product is involved in the formation of TJ strands in these tissues. 

Among the CNS regions on this panel, this tight junction protein is expressed only in 
the spinal cord and may be involved in the blood brain barrier in this region. This molecule 
may therefore be of utility in the treatment of spinal cord injury. Growth factors such as BDNF 
15 and NGF have been shown in animal models to enhance repair after spinal crush injury; 

however in the clinical condition it is hard to administer protein therapeutics due to the blood 
brain barrier. The selective downregulation of this molecule may therefore increase the 
amount of protein crossing the blood brain barrier in the spinal cord, while not hampering its 
function in the rest of the CNS. 
20 In addition, this gene shows no or very low expression in the cancer cell lines used in 

this panel. Thus, the absence of expression could potentially be used as a diagnostic marker for 
cancer. 

Panel 2D Summary: Ag2957 The NOV3b gene is consistently expressed in the 
normal kidney samples (CTs=32-33) but not in the adjacent kidney tumors. This result is in 
25 agreement with the expression in the previous panels. Thus, absence of expression of this gene 
could be used as a diagnostic marker for kidney cancer. Furthermore, therapeutic modulation 
of the function or expression of this gene may be a possible treatment for this cancer. 

Panel 4D Summary: Ag2957 The expression of the NOV3b transcript is restricted to 
the thymus (CT=32.1) but not in T cells. Thus, expression of this transcript could be used as a 
30 marker for this tissue. 



NOVla, NOVld, NOVlc, and NOVlb 
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Expression of gene NOV la and variants NOV ld 5 NOVlc and NOV lb was assessed 
using the primer-probe sets Ag2954 and Ag2956, described in Tables DA and DB. Results of 
the RTQ-PCR runs are shown in Tables DC and DD. 



Table DA. Probe Name Ag2954 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cttcagccactaccctcctt-3 1 


20 


391 


1009 


Probe 


TET-5 1 -ccatgccacaatccaagacttctgg-3 1 - 
TAMRA 


25 


428 


1010 


Reverse 


5 ' -atgtcagggatgctgtcatc-3 1 


20 


453 


1011 



Table DB. Probe Name Ag2956 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -agtcagatggctccttcca-3 1 


19 


643 


1012 


Probe 


TET-5 ' -cctcatgctaagacctaggaacctgg-3 1 - 
TAMRA 


26 


662 


1013 


Reverse 


5 1 -ccagagat.ccttggcagaag-3 ' 


20 


702 


1014 



Table DC. General screening panel vl. 4 



Tissue Name 


Rel. Exp.(%) 
Ag2954, Run 
216607767 


Rel. Exp.(%) 
Ag2956,Run 
216607768 


Tissue Name 


Rel. Exp.(%) 
Ag2954,Run 
216607767 


Rel. Exp.(%) 
Ag2956, Run 
216607768 


Adipose 


0.0 


0.0 


Renal ca. TK-10 


58.6 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


0.0 


Bladder 


0.0 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


0.0 


Gastric ca. (liver 
met.)NCI-N87 


51.8 


0.0 


Melanoma* 
M14 


25.0 


0.0 


Gastric ca. 
KATO III 


0.0 


0.0 . 


Melanoma* 
LOXIMVI 


0.0 


0.0 


Colon ca. SW- 
948 


0.0 


0.0 


Melanoma* 
SK-MEL-5 


0.0 


0.0 


Colon ca. SW480 


20.9 


0.0 


Squamous cell 

carcinoma 

SCC-4 


0.0 


0.0 


Colon ca.* 

(SW480met) 

SW620 


0.0 


0.0 


Testis Pool 


24.3 


7.1 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met) 
PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate Pool 


0.0 


8.8 


Colon ca. CaCo-2 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon cancer 
tissue 


0.0 


0.0 
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Uterus Pool 


0.0 


0.0 


Colon ca. 

OYW1 1 1 iC 

aWl 1 lo 


0.0 


0.0 


Ovarian ca. 
UVL,AK-J 


62.4 


0.0 


Colon ca. Colo- 

205 


0.0 


0.0 


Ovarian ca. 

O JS.-\J V -J 


0.0 


7.9 


Colon ca. SW-48 


0.0 


. 0.0 


Ovarian ca. 

U V LAK-4 


69.7 


0.0 


Colon Pool 


99.3 


0.0 


Levari an ca. 
OVCAR-5 


58.2 


0.0 


bmaJI Intestine 
Pool 


0.0 


10.1 


L/ van an ca. 
IGROV-1 


18.7 


0.0 


Stomach Pool 


25.9 


10.3 


Ovarian ca. 


0.0 


0.0 


Bone Marrow 

rOOl 


28.3 


0.0 


Ovary 


0.0 


0.0 


Fetal Heart 


0.0 


0.0 


Breast ca. 
MCF-7 


78.5 


0.0 


Heart Pool 


0.0 


0.0 


Breast ca. 
MDA-MB- 


0.0 


0.0 


Lymph Node 
Pool 


0.0 


0.0 


oreasi ca. o i 
549 


0.0 


0.0 


retal Skeletal 
Muscle 


22.5 


1.7 


Breast ca. 


25.5 


0.0 


Skeletal Muscle 
Pool 


62.0 


262 


Breast ca. 
MDA-N 


57.8 


0.0 


Spleen Pool 


59.5 


0.0 


Breast Pool 


0.0 


0.0 


Thymus Pool 


23.8 


11.1 


Trachea 


0.0 


0.0 


CNS cancer 
(glio/astro) U87- 
MG 


0.0 


0.0 


Lung 


27.9 


0.0 


CNS cancer 
(glio/astro) U- 
118-MG 


0.0 


9.7 ! 


Fetal Lung 


0.0 


0.0 


CNS cancer 
(neuro;met) SK- 

XT AO 


17.0 


0.0 


Lung ca. NCI- 

N417 


0.0 


0.0 


CNS cancer 
(astro) br-5iy 


0.0 


0.0 


Lung ca. LX- 
1 


0.0 


0.0 


CNS cancer 
Castro^ SNB-75 


0.0 


0.0 


Lung ca. NCI- 
H146 


0.0 


0.0 


CNS cancer 
(glio) SNB-19 


0.0 


0.0 


Lung ca. 
SHP-77 


0.0 


0.0 


CNS cancer 
(glio) SF-295 


0.0 


0.0 


Lung ca. 
A549 


28.7 


0.0 


Brain 

(Amygdala) Pool 


0.0 


0.0 
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Lung ca. NCI- 
H526 


0.0 


6.0 


Brain 

(cerebellum) 


56.3 


3.3 


Lung ca. NCI- 
H23 


28.1 


11.0 


Brain (fetal) 


27.5 


100.0 


Lung ca. NCI- 
H460 

; 


100.0 


7.4 


Brain 

(Hippocampus) 

rOOl 


0.0 


0.0 


Lung ca. 
HOP-62 


0.0 


0.0 


Cerebral cortex 
Pool 


0.0 


0.0 


Lung ca. JNCl- 
H522 


0.0 


0.0 


Jirain (Substantia 
niffra^ Pool 


26.8 


0.0 


Liver 


0.0 


0.0 


Brain (Thalamus) 
Pool 


0.0 


0.0 


Fetal Liver 


54.7 


0.0 


Brain (whole) 


58.2 


6.2 


Liver ca. 
HepG2 


0.0 


0.0 


Spinal Cord Pool 


0.0 


0.0 


Kidney Pool 


A A 

u.o 




Adrenal UJand 


A A 
0.0 


A A 

0.0 


Fetal Kidney 


0.0 


0.0 


Pituitary gland 
rool 


25.3 


5.4 


Renal ca. 786- 
0 


0.0 


0.0 


Salivary Gland 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.0 


Thyroid (female) 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


Pancreatic ca. 
CAPAN2 


85.3 


0.0 


Renal ca. UO- 

31 


0.0 


0.0 


Pancreas Pool 


0.0 


0.0 



Table DD. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2954, 

Run 
164329620 


Rel. 
Exp.(%) 
Ag2956, 

Run 
164401746 


Tissue Name 


Rel. 
Exp.(%) 
Ag2954, 

Run 
164329620 


Rel. 
Exp.(%) 
Ag2956, 

Run 
164401746 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


48.3 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


0.0 


0.0 


Secondary Trl act 


20.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 


Secondary Thl rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 


0.0 


0.0 
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none 




Primary Thl act 


0.0 • 


o.o 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 




0.0 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


0 0 


0.0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


38.7 


0.0 


Primary Trl rest 


17.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


CD45KA CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC rest 


0.0 


19.1 


CD45ROCD4 . 
lymphocyte act 


A A 
0.0 


A A 

0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


{ 1 1 "%0 1 111- t ■.I.j.jl, ,j j 1 

CJJo lymphocyte 
act 


- 0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


A A 

o.o 


A A 

0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Secondary CD8 
lympnocyie act 


0.0 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


2ry 

mi/mz/iri anil- 
CD95CH11 


u.u 


u.u 


CCD1106 

(Keratinocytes) 

none 


0.0 


A A 

0.0 


LAK cells rest 


0.0 


0.0 


l^CDI lUo 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


72.2 


100.0 


LAK cells IL-2+IL- 
12 


0.0 


0.0 


Lupus kidney 


0.0 


0.0 


LAK cells IL- 


11.3 


0.0 


NCI-H292 none 


0.0 


0.0 
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2+IFN gamma 












t-./VlV CCliS ILr/rr 

IL-18 


0.0 


0.0 


NCI-H292 IL-4 


14.5 


0.0 


PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-9 


0.0 


0.0 


iNrv v^eus lJL-z re si 


u.u 


f\ n 
u.u 




u.u 


u.u 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 IFN 
gamma 


o:o 


0.0 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


24.7 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


0.0 


0.0 


PBMC rest 


6.9 


0.0 


Lung fibroblast 
none 


100.0 


0.0 


PBMC PWM 


15.3 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


0.0 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 

TT A 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 

TT O. 


9.7 


32.8 


Ramos (B cell) 
ionomycin 


10.7 


0.0 


Lung fibroblast 
IL-13 


0.0 


0.0 


B lymphocytes 
PWM 


13.0 


0.0 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


13.1 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD1070 IL-1 
beta 


0.0 


0.0 


none 


0.0 


0.0 


Dermal fibroblast 
IFN gamma 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


Dermal fibroblast 
IL-4 


34.2 


26.8 


Dendritic cells anti- 
CD40 


U.U 


A ft 

0.0 


1BD Colitis 2 


34.9 


A A 

0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


20.0 


0.0 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 


0.0 


Macrophages rest 


13.0 


0.0 


Lung 


0.0 


0.0 


Macrophages LPS 


24.7 


0.0 


Thymus 


12.2 


0.0 


HUVEC none 


0.0 


0.0 


Kidney 


0.0 


0.0 


HUVEC starved 


40.1 


0.0 
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CNS_neurodegeneration_vl.O Summary: Ag2954/Ag2956 Expression of the 
NOV 1 1 gene is Iow/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General_screening_panel_vl.4 Summary: Ag2954 The NOV la gene is expressed at 
a very low level or not at all in most of the cancer cell lines on this panel. Very low expression 
5 in cell lines from pancreatic, lung, breast and ovarian cancers suggests that it may be involved 
in these cancers. 

Ag2956 This gene is a member of the claudin family of proteins, and is only expressed 
in the fetal brain. It may be involved in the process of axonal growth or targeting and 
synaptogenesis (specifically in the development of tight junctions between neurons and other 
10 cell types). Therefore, this gene product may be of therapeutic benefit in the treatment of 
neuronal loss in clinical conditions such as head trauma or stroke where increased 
compensatory synaptogenesis is desireable. 

Panel 1 .3D Summary: Ag2954/Ag2956 Expression of the NOV1 1 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
15 Panel 4D Summary: Ag2954/Ag 2956 Two experiments with two different sets of 

primers show low but significant levels of expression of this transcript in liver cirrhosis, 
dermal and lung fibroblasts and endothelium. Thus, the NOV1 1 transcript may serve as a 
marker for these tissues and play a role in maintaining the integrity of these tissues. 

NOV2 

20 Expression of gene NOV2 was assessed using the primer-probe set Ag2958, described 

in Table EA. Results of the RTQ-PCR runs are shown in Table EB. 



Table EA. Probe Name Ag2958 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID HO: 


Forward 


5 ' -gttgtcagggtagagcagaaga-3 1 


22 


341 


1015 


Probe 


TET-5 1 -ccaccaagaaaccttttgcaataaaa-3 1 - 
TAMRA 


26 


363 


1016 


Reverse 


5 1 -taccttccctctctctggtttc-3 1 


22 


395 


1017 



Table EB. Panel 1 3D 



Tissue Name 


ReI.Exp.(%)Ag2958, 
Run 167809085 


Tissue Name 


Rel. Exp.(%) Ag2958, 
Run 167809085 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.4 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0 
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Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca.TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0 0 




fi fi 

u.u 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
STTP-77 


0.0 


Spinal cord 


0.6 


Lung ca. (large 

ecu ^iN\^i~ri £ TOv 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
ecu j /\j 4 f y 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


n n |Lung ca. (squam.) 
|SW 900 


0.0 


astrocytoma SNB-75 


n n jLung ca. (squam.) 
JNCI-H596 


0.0 


glioma SNB-1 9 


0.0 


Mammary gland 


fi fi 

u.u 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.4 


glioma or-zio 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


ricori ^ieiai ) 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 ; 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


2.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 


0.0 
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5 




Lymph node 


0.0 


Ovarian ca OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca * 
SW620(SW480met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Pro<?tate ca * fhorie 
met)PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


0.6 


Colon ca. CaCo-2 


0.0 


Mf*lannma 

IVXVrf 1CU1U1 lid 

Hs688(A).T 


0.0 


Colon ca 

tissue(OD03866) 


0.0 


lvxwicui wiiia tuiwti 

Hs688(B).T 


0.0 


Colon ca HCC-2998 


0 0 




0 0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


100.0 


Adipose 


j 0.0 



Panel 13D Summary: Ag2958 Expression of the NOV2 is restricted to the kidney 
(CT=31.8). In addition, this gene is expressed at higher levels in adult kidney when compared 
to expression in fetal kidney (CT value = 40). Thus, this gene product may be useful for the 
5 differentiation of adult and fetal kidney tissue. This highly specific expression pattern also 
suggests that this gene product may be a small molecule drug for the treatment of diseases of 
the kidney. 

Panel 4D Summary: Ag2958 Expression of the NOV2 gene is low/undetectable in all 
samples on this panel (CTs>35). (Data not shown.) 

10 NOV10 

Expression of gene NOV10, also known as CG55964-02, was assessed using the 
primer-probe set Ag2857, described in Table FA. 
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Table FA. Probe Name Ag2857 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 1 -catttgccagagatttcttttg-3 » 


22 


300 


1018 


Probe 


TET-5 ' -caaatgtggcttattcactcattccagg- 
3 1 -TAMRA 


28 


336 


1019 


Reverse 


5 1 -agaaggatacccgattcaattg-3 ' 


22 


364 


1020 



CNS_neurodegeneration_vl.O Summary: Ag2857 Expression of the NOV10 gene is 
low/undetectable in all samples on this panel. (CTs>35). (Data not shown.) 
5 Panel 1.3D Summary: Ag2857 Expression of the NOV10 gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

Panel 2.2 Summary: Ag2857 Expression of the NOV 10 gene is low/undetectable in 
all samples on this panel. (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2857 Expression of the NOV 10 gene is low/undetectable in 
10 all samples on this panel. (CTs>35). (Data not shown.) 

NOV11 

Expression of gene NOV1 1 was assessed using the primer-probe set Ag2858, 
described in Table GA. 



Table GA. Probe Name Ag2858 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID HO: 




5 1 ~cttaagtcagtcctggcagttg-3 ' 


22 


674 


1021 


Probe 


TET- 5 ' -aaattatttcagacctgcatctccca-3 » - 
TAMRA 


26 


716 


1022 


Reverse 


5 ' -agaacacaaggacagcacagat-3 1 


22 


743 


1023 



15 

CNS_neurodegeneration_vl.O Summary: Ag2858 Expressin of the NOVll gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 13D Summary: Ag2858 Expression of the NOV1 1 gene is low/undetectable in 
20 all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 22 Summary: Ag2858 Expression of the NOV1 1 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 
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Panel 4D Summary: Ag2858 Expression of the NOV1 1 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

NOV12 

5 Expression of gene NOV12 was assessed using the primer-probe set Ag2867, 

described in Table HA. Results of the RTQ-PCR runs are shown in Tables HB and HC. 



Table HA. Probe Name Ag2867 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -gtcacaccttgtgactgcaa-3 1 


20 


176 


1024 


Probe 


TET-5 1 -acttacttgtgtgcagtgccatgcct-3 1 - 
TAMRA 


26 


196 


1025 


Reverse 


5 1 -cattggaaacctttcaggaaa-3 ' 


21 


236 


1026 



Table HB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2867, 
Run 208699903 


Tissue Name 


Rel. Exp.(%) Ag2867, 
Run 208699903 


AD 1 Hippo 


7.9 


Control (Path) 3 
Temporal Ctx 


1.5 


AD 2 Hippo 


20.3 


Control (Path) 4 
Temporal Ctx 


22.1 


AD 3 Hippo 


5.0 


AD 1 Occipital Ctx 


13.4 


AD 4 Hippo 


6.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


65.5 


AD 3 Occipital Ctx 


3.6 


AD 6 Hippo 


44.4 


AD 4 Occipital Ctx 


12.9 


Control 2 Hippo 


14.3 


AD 5 Occipital Ctx 


15.4 


Control 4 Hippo 


9.1 


AD 6 Occipital Ctx 


8.7 


Control (Path) 3 
Hippo 


5.6 


Control 1 Occipital 
Ctx 


0.7 


AD 1 Temporal Ctx 


100.0 


Control 2 Occipital 
Ctx 


1.4 


AD 2 Temporal Ctx 


12.6 


Control 3 Occipital 
Ctx 


14.5 


AD 3 Temporal Ctx 


7.6 


Control 4 Occipital 
Ctx 


3.4 


AD 4 Temporal Ctx 


14.0 


Control (Path) 1 
Occipital Ctx 


62.0 


AD 5 Inf Temporal 
Ctx 


55.9 


Control (Path) 2 
Occipital Ctx 


8.1 


AD 5 SupTemporal 


71.2 


Control (Path) 3 


0.0 
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Ctx 




v/L'L'ipilaj v>lA 




AD 6 Inf Temporal 
Ctx 


45.7 


Control (Path) 4 


21.0 


AD 6 Sup Temporal 
Ctx 

- , 


16.5 




Control 1 Parietal 
Ctv 

v-» La 


0.9 


Control 1 Temporal 
Ctx 


0.9 


Control 2 Parietal 

Ctv 


8.3 


Control 2 Temporal 
Ctx 


5.4 


Control 3 Parietal 

Ctv 


4.4 


Control 3 Temporal 
Ctx 


2.5 


Control (Path) 1 
Parietal Ctx 


40.1 


Control 4 Temporal 
Ctx 


6.3 


Control (Path) 2 
Parietal Ctx 


20.7 


Control (Path) 1 
Temporal Ctx 


39.8 


Control (Path) 3 
Parietal Ctx 


5.2 


Control (Path) 2 
Temporal Ctx 


24.8 


Control (Path) 4 
Parietal Ctx 


21.3 



Table HC. Panel 4D 



Tissue Name 


Rel. 

Exp.(%)Ag2867, 
Run 164311002 


Tissue Name 


Rel. 

Exp.(%)Ag2867, 
Run 164311002 


Secondary Thl 

act 


5.6 


HUVEC IL-lbeta 


4.2 


Secondary Th2 

act 


15.7 


HUVECIFN 

gamma 


1.9 


Secondary Trl 

act 


11.8 


HUVEC TNF alpha 
+ IFN gamma ' 


2.7 


Secondary Thl 

rest 


3.9 


HUVEC TNF alpha 

+ IL4 


2.9 


Secondary Th2 

rest 


17.3 


HUVEC IL-11 


2.3 


Secondary Trl 

rest 


12.2 


Lung Microvascular 
EC none 


6.1 


Primary Thl act 


38.4 


Lung Microvascular 
EC TNFalpha + IL-lbeta 


7.2 


Primary Th2 act 


22.2 


Microvascular 
Dermal EC none 


6.7 


Primary Trl act 


29.7 


Microsvasular 
Dermal EC TNFalpha + IL- 
lbeta 


3.9 


Primary Thl rest 


93.3 


Bronchial epithelium 
TNFalpha + ILlbeta 


9.0 


Primary Th2 rest 


55.9 


Small airway 
epithelium none 


1.7 
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Primary Trl rest 


39.2 


Small airway 
epithelium TNFalpha + IL- 


16.7 


CD45RA CD4 
lvmnhocvte act 


6.0 


Coronery artery 

CUP rpef 


3.2 


CD45RO CD4 
Ivmnhnrvte act 


15.6 


Coronery artery 

OlVlL^ I iSraipna « l±y 1 Octet 


1.3 


CD8 lymphocyte 

act 


16.0 


Astrocytes rest 


8.8 


Secondary CD8 
lymphocyte rest 


17.2 


Astrocytes 
TNFalpha + IL-1 beta 


6.2 


oeconoary cuo 
lymphocyte act 


13.6 


KU-812 (Basophil) 

rest 


0.7 


CD4 lymphocyte 

none 


22.4 


KU-812 (Basophil) 

"Till MAI* • 

PMA/ionomycin 


0.0 


2ry 

Thl/Th2/Trl anti-CD95 
CH11 


24.0 


CCD1106 
(Keratinocytes) none 


7.0 


LAK cells rest 

wvjjj 1 VOL 


23 7 


CCD1 106 

^rvcraijnocyicoy llNJraJpna T 

IL-lbeta 


O A 


LAK cells IL-2 




jji ver cirrnosis 


1 A 
J.O 


LAK cells IL- 
2+IL-12 


32.8 


Lupus kidney 


1.1 


LAK cells IL- 
2+IFN 2am ma 


50.7 


NCI-H292 none 


17.8 


LAK cells IL-2+ 

IL-18 


39.0 


NCI-H292 IL-4 


15.5 


LAK cells 
PMA/ionomvcin 


4.1 


NCI-H292 IL-9 


13.9 


NK Cells IL-2 

rest 


22.5 


NCI-H292IL-13 


7.2 


Two Way MLR 

3 day 


26.1 


NCT-H7Q9 TFM 

gamma 


8.2 


Two Way MLR 

5 day 


9.3 


HPAEC none 


2.7 


Two Way MLR 

7 day 


12.0 


HPAEC TNFalpha 
+ IL-1 beta 


2.7 


PBMC rest 


11.6 


Lung fibroblast none 


5.5 


PBMC PWM 


65.1 


Lung fibroblast TNF 
alpha + IL-1 beta 


2.8 


PBMC PHA-L 


17.1 


Lung fibroblast IL-4 


7.2 


Ramos (B cell) 

none 


13.9 


Lung fibroblast IL-9 


6.9 


Ramos (B cell) 
ionomycin 


23.8 


Lung fibroblast IL- 

13 


6.5 
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B lymphocytes 

rWM 


73.2 


Lung fibroblast IFN 

gamma 


4.6 


B lymphocytes 
LL/4UL and IJL-4 


100.0 


Dermal fibroblast 
prr*in7n r^ct 


12.7 


EOL-1 dbcAMP 


31 


Dermal fibroblast 
la^lji u/u iisr aipna 


40.1 


EOL-1 dbcAMP 
PMA/ionomycin 


5.2 


Dermal fibroblast 
la^lj i u /u iLr 1 oeia 


6.0 


Dendritic cells 

none 


6.5 


L/ermai iiDrooiasi 
IFN gamma 


2.6 


Dendritic cells 

LPS 


3.1 


Dermal fibroblast 

IL-4 


5.8 


Dendritic cells 
anti-CD40 


42 


IBD Colitis 2 


2.7 


Monocytes rest 


8.5 


IBD Crohn's 


5.5 


Monocytes LPS 


5.0 


Colon 


22.5 


Macrophages rest 


4.2 


Lung 


6.7 


Macrophages 

LPS 


2.4 


Thymus 


•6.7 


HUVEC none 


2.6 


Kidney 


92.7 


HUVEC starved 


12.7 







CNS_neurodegeneration_vl.O Summary: Ag2867 The NOV12 gene represents a 
novel G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
receptors contains a large number of neurotransmitter receptors, including the dopamine, 
5 serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 

10 selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 
excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 

1 5 treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 

memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 
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In addition, this GPCR is found to be upregulated in the temporal cortex of Alzheimer f s 
disease patients. Blockade of this receptor may be of use in the treatment of this disease and 
decrease neuronal death. 

References: 

5 El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 

JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant 1 effects in animal models believed to be relevant to depressive disorders, 

10 while adenosine receptor antagonists have been found to reverse adenosine-mediated 

'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 
antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 

15 sensitive to 'depressanf challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mgkg(-l)) 

20 and ZM 241385 (15-60 mg kg(-l)) were effective in mice previously screened for having 

high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61 %, while KW 6002 decreased the total immobility time at the doses of 1 

25 and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 

30 Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001;62 Suppl 15:12-7 
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Although selective serotonin reuptake inhibitors (SSRls) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 

5 released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT1A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 

1 0 response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 

connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alphal-adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 

1 5 alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

20 Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 

Drugs 1999Nov;8(ll):1837-184S 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 

25 metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 
channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 

30 increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
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can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
5 Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
10 either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
1 5 potentially useful biological target for the reduction of brain injury. 

Panel 1 3D Summary: Ag2867 Results from one experiment with the NOV 12 gene 
are not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 2.2 Summary: Ag2867 Results from one experiment with theNOV12 gene are 
not included. The amp plot indicates that there were experimental difficulties with this run. 
20 Panel 4D Summary: Ag2867 Expression of the NOV1 2 gene is widespread among 

samples in this panel, with highest expression in B lymphocytes stimulated with CD40L and 
IL-4(CT=31.1). 

This transcript is also highly expressed in activated B cells and primary resting Thl 
and Th2 T cells. The expression of this transcript in PBMC treated with the B cell mitogen, 

25 PWM, confirms the importance of CG54575-01 gene expression in activated B cells. In 

addition, this transcript is also abundantly expressed on primary resting Thl cells (to a lesser 
degree on primary resting Th2 cells). Therefore, it appears that this gene, encoding a GPCR 
homolog, is a potential new member of the chemokine receptor family. The expression of this 
protein in activated B cells suggests a role for this protein in their trafficking to appropriate 

30 sites where they can fully activate antigen specific T cells. Thus, the protein encoded by this 
gene is likely to participate in the development of immune or inflammatory reactions. 

In addition, the high expression of this gene in the kidney suggests that the putative 
GPCR encoded for by this gene could allow cells within the kidney to respond to specific 
microenvironmental signals (For example, ref. 1). Therefore, antibody or small molecule 
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therapies designed with the protein encoded for by this gene could modulate kidney function 
and be important in the treatment of inflammatory or autoimmune diseases that affect the 
kidney, including lupus and glomerulonephritis. 

References: 

References: 

Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of recombinant 
P/Q-type calcium channels by regulators of G protein signalling proteins. J. Physiol. 528 Pt 1 : 
65-77. 

1 . Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 
channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 
(GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating proteins 
(GAPs), which are responsible for > 100-fold increases in the GTPase activity of G proteins 
and might be involved in the regulation of presynaptic Ca2+ channels. In this study we 
investigated the effects of RGS2 on G protein modulation of recombinant P/Q-type channels 
expressed in a human embryonic kidney (HEK293) cell line using whole-cell recordings. 2. 
RGS2 markedly accelerates transmitter-mediated inhibition and recovery from inhibition of 
Ba2+ currents (IBa) through P/Q-type channels heterologously expressed with the muscarinic 
acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 and RGS4 modulate the prepulse 
facilitation properties of P/Q-type Ca2+ channels. G protein reinhibition is accelerated, while 
release from inhibition is slowed. These kinetics depend on the availability of G protein alpha 
and betagamma subunits which is altered by RGS proteins. 4. RGS proteins unmask the Ca2+ 
channel beta subunit modulation of Ca2+ channel G protein inhibition. In the presence of 
RGS2, P/Q-type channels containing the beta2a and beta3 subunits reveal significantly altered 
kinetics of G protein modulation and increased facilitation compared to Ca2+ channels 
coexpressed with the betal b or beta4 subunit. 

PMID: 11018106 

NOV13aandNOV13b 

Expression of gene NOV13a and variant NOV13b was assessed using the primer-probe 
set Ag2869, described in Table IA. Results of the RTQ-PCR runs are shown in Table IB. 

Table IA. Probe Name Ag2869 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tgtctgtggtagacaccacctt-3 * 


22 


475 


1027 


Probe 


TET-5 1 -ctgaggctaccctaccgaggcagtaa-3 1 - 


26 


501 


1028 
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• Jtamra 


1 


, ,i 


1 




Reversejs ■ -cacaaaagaaatgagcaatgct-3 ■ 


|22 


j528 


t 


1029 



Table IB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2869, Run 
164311007 t 


Tissue Name 


Rel. Exp.(%) 
Ag2869, Run 
164311007 


Secondary Thl act 


5.5 


HUVEC IL-lbeta 


1.7 


Secondary Th2 act 


68.3 


HUVEC IFN gamma 


9.5 


Secondary Trl act 


33.9 


H1IVFC TNF alnha + TFN 
gamma 


5.6 


SppnnHnrv Thl r**ct 
uvwuiiuai y x li l icjl 


0 ft 


JT1U V I 1NJT dip I la i IL-rt 


R 0 


Secondary Th2 rest 


7.3 


HUVEC IL-11 


5.5 


Secondary Trl rest 


3.0 


Lung Microvascular EC 
none 


11.0 


Primary Thl act 


40.9 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


12.0 


Primary Th2 act 


19.1 


Microvascular Dermal EC 
none 


3.1 


Primary Trl act 


51.4 


Microsvasular Dermal EC 

TNTFalnha + TT -1 h*»t?i 
1 INrd-ipila T li-i-l QCla 


1.2 


Primary Thl rest 


23.5 


Bronchial epithelium 


2.4 


Primary Th2 rest 


11.8 


Small airway epithelium 

nnnp 


0.8 


Primary Trl rest 


172 


Small airway epithelium 
TNFalnha + IL-lheta 


9.6 


CD45RA CD4 
lymphocyte act 


1.6 


Coronery artery SMC rest 


5.5 \ 


CD45ROCD4 
lymphocyte act 


12.8 


Coronery artery SMC 
TNFalDha + IL-lbeta 


2.5 


CD8 lymphocyte act 


1.0 


Astrocvtes rest 


6.3 


Secondary CD8 
lymphocyte rest 


2.7 


Astrocytes TNFalpha + 
IL-lbeta 


1.8 


Secondary CD8 
lymphocyte act 


3.4 


KU-812 (Basophil) rest 


1.8 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


21.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


1.5 


CCD 1 106 (Keratinocytes) 
none 


6.3 


LAK cells rest 


0.7 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


1.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


5.4 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.9 
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LAK cells IL-2+TFN 
gamma 


3.1 


NC1-H292 none 


34.2 


LAK cells IL-2+IL-18 


2.4 


NC1-H292 IL-4 


46.3 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


29.7 


NK Cells IL-2 rest 


1.3 


NCI-H292 IL-13 


22.8 


Twn Wav MT R % Hav 


0 6 


NCT-H292 IFN eamma 


25.2 . 


Two Wav MLR 5 dav 


1 1 
1.1 


HPAEC none 


18.0 


Two Way MLR 7 day 


3.1 


HPAEC TNF aloha + IL-1 
beta 


19.9 


PBMC re<?t 


0 0 


T unf* fibroblast none 


0 8 


PBMC PWM 


4.1 


Lung fibroblast TNF alpha 
-f-IL-1 beta 

■ XXj X 1/W IH 


0.0 


PBMC PHA-L 


9.9 


Lung fibroblast IL-4 


8.9 


Ramos (B cell) none 


^ft *7 

50.7 


Lung fibroblast IL-9 


7.9 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-13 


4.9 


B lymphocytes PWM 


1.1 


Lung fibroblast IFN 
gamma 


4.9 


B lymphocytes CD40L 
and IL-4 


3.4 


Dermal fibroblast 

/^OTM ft*7/\ — 

CCD 1070 rest 


42.6 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


39.8 


LOL-1 dbcAMP 

P1V4 A/i onnmvrin 

i lviru i\Jii\jLkiy win 


0.0 


Dermal fibroblast 
CCD1070 TI -1 beta 

v^viy i v / v 11-*- 1 ucia 


13.4 


Dendritic cells none 


2.1 


Dermal fibroblast TFN 

gamma 


2.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


2.6 


Dendritic cells anti- 
CD40 


0.6 


IBD Colitis 2 


1.1 


Monocytes rest 


0.0 


IBD Crohn's 


2.2 


Monocytes LPS 


0.0 


Colon 


3.2 


Macrophages rest 


4.8 


Lung 


1.0 


Macrophages LPS 


2.4 


Thymus 


4.4 


HUVEC none 


16.5 


Kidney 


16.3 


HUVEC starved 


17.6 







Panel 1 3D Summary: Ag2869 Expression of the NOV13a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 
5 Panel 22 Summary: Ag2869 Expression of the NOV13a gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 
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Panel 4D Summary: Ag2869 Expression of the NOV 13a gene is widespread among 
the samples in this panel, with highest expression in the B cell line Ramos treated with 
ionomycin (CT=31.1), Lower but still significant levels of expression are seen in untreated 
Ramos B cells. B cells represent a principle component of immunity and contribute to the 
5 immune response in a number of important functional roles, including antibody production. 
For example, production of antibodies against self-antigens is a major component in 
autoimmune disorders such a systemic lupus erythematosus, with B cells playing a major role. 
Since B cells play an important role in autoimmunity, inflammatory processes and 
inflammatory cascades, therapeutic modulation of this gene product may reduce or eliminate 

1 0 the symptoms of patients suffering from asthma, allergies, chronic obstructive pulmonary 
disease, emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, psoriasis, 
osteoarthritis, and other autoimmune disorders including systemic lupus erythematosus. 

Significant levels of expression are also seen in IL-4, IL-9, IL-13 and IFN gamma 
activated-NCI-H292 mucoepidermoid cells as well as untreated NCI-H292 cells. Moderate 

1 5 expression is also detected in both treated and untreated human pulmonary aortic endothelial 
cells The expression of this gene in cells deri ved from or within the lung suggests that this 
gene may be involved in normal conditions as well as pathological and inflammatory lung 
disorders that include chronic obstructive pulmonary disease, asthma, allergy and emphysema 

NOV14 

20 Expression of gene NOV14 was assessed using the primer-probe set Ag2870, 

described in Table JA. Results of the RTQ-PCR runs are shown in Tables JB and JC. 



Table JA. Probe Name Ag2870 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -ttctcctgagagcaagaagttg-3 1 


22 


823 


1030 


Probe 


TET-5 ' -tgtgactcccatgttgaaccccatta-3 1 - 
TAMRA 


26 


865 


1031 


Reverse 


5 \ -tcttcacctcgctatttctcaa-3 * 


22 


899 


1032 



Table JB. Panel 1.3D 



Tissue Name 


Rel.Exp.(%)Ag2870, 
Run 167646334 


Tissue Name 


Rel. Exp.(%) Ag2870, 
Run 167646334 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


2.5 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. A498 


0.0 
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Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


7.5 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


VIA/ 


idling ^iciaij 


ft ft 

u.o 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 


0.0 


Brain (thalamus) 


6.1 


Lung ca. (small cell) 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 

orix ~ / / 


0.0 


Spinal cord 


0.0 


Lung ca. (large 


0.0 


glio/astro U87-MG 


20.0 


Lung ca. (non-sm. 
ecu ) /\j*ty 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
nur-oz 


0.0 


neuro*; met SK-N-AS 


0.0 


i^ung ca. jnon-sxi ) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


uing ca. ^squam.j 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


i>ung La. ^squam.j 
NCI-H596 


0.0 


glioma SNB-19 


0 0 


iviammary giana 


ft ft 

0.0 


glioma U251 


0.0 


MCF-7 


0.0 


glioma or-Zyj 


ft ft 

0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


3.8 


Heart (ietai) 


ft ft 

0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT-549 


00 ! 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow . 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


8.7 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 


28.9 
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Lymph node 


0.0 


V-/ VUI IdJ 1 La. W V V^/\Jx- 

8 


13.4 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


21.3 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca * 

SW620(SW480 met) 


100.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


x iUalalC l/<l» ^DUliC 

met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


5.0 


Colon ca. CaCo-2 


0.0 


iVxCiailOIIIa 

Hs688(A).T 


0.0 


Colon ca 
tissue(OD03866) 


0.0 


ivieidnouia {inci) 
Hs688(B).T 


0.0 


Colon ca HCC-2998 


0 0 

\J.\J 


ivieianoma uai^v-*o^ 


ft ft 

u.u 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


-, 

Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


33.9 


Kidney 


0.0 


Adipose 


0.0 



Table JC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2870, Run 
164328103 


Tissue Name 


Rel. Exp.(%) 
Ag2870, Run 
164328103 


Secondary Thl act 


9.7 


HUVEC IL-lbeta 


9.3 


Secondary Th2 act 


30.4 


HUVEC IFN gamma 


11.6 


Secondary Trl act 


25.2 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.2 


HUVEC TNF alpha + IL4 


8.2 


Secondary Th2 rest 


1.9 


HUVEC IL-11 


0.0 


Secondary Trl rest 


2.0 


Lung Microvascular EC 
none 


11.5 


Primary Thl act 


16.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.6 


Primary Th2 act 


12.3 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


29.1 . 


Microsvasular Dermal EC 


2.6 
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1 IN r alpna + 1L- 1 beta 




Primary Thl rest 


9.3 


Bronchial epithelium 
1 IN r alpna t 1L l oeta 


0.0 


Primary Th2 rest 


1.1 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


4.5 


Small airway epithelium 
TNFalpha + IL-lbeta 


13.3 


lymphocyte act 


3.2 


Coronery artery SMC rest 


1.7 


lymphocyte act 


6.5 


uoronery artery aJvlt 
TNFalpha + IL-lbeta 


0.0 


lympnocyte act 


1 £ 
1.0 


Astrocytes rest 


u.o 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 

TT 1 U^frt 

UL-ioeta 


0.0 


Secondary CD8 
lymphocyte act 


1.4 


KU-812 (Basophil) rest 


2.4 


CD4 lymphocyte none 


0.0 


KU-0I2 (Basophil) 
PMA/ionomvcin 

X IVJLtU l\JLl\JLM.*.J Will 


8.7 


2ryThl/Th2/Trl_anti- 
CD95 CH11 


4.7 


CCD1 106 (Keratinocytes) 

none 


6.6 


LAK cells rest 


0.0 


CCD1 106 (Keratinncvtes^ 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


T iver rirrhn^iQ 

J—f 1 V Vl WIllllWOlO 


1 2 


LAK cells IL-2+IL-12 


0.0 


Limim IciHnpv 

JUUpUj IvlUllv Y 


0 0 


LAK cells IL-2+IFN 
gamma 


0.6 


NCI-H292 none 


30.8 


LAK cells IL-2+IL-18 


o.b 


NCI-H292 IL-4 


32.5 


LAK cells 
PMA/ionomycin 


0.4 


NCI-H292 IL-9 


26.8 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


11.3 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


21 3 


Two Way MLR 5 day 


0.0 


HPAEC none 


9.2 


Two Way MLR 7 day 


0.0 


HPAEC TNF aloha + IL-1 
beta 


8.0 


PBMC rest 


0.0 


Lun ff fibroblast none 


1 9 
1 *s 


PBMCPWM 


0.9 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.8 


PBMC PHA-L 


5.0 


Lung fibroblast IL-4 


7.5 


Ramos fR celH nnnp 


36 3 




7 n 


Ramps (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-l 3 


0.5 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


1.4 


B lymphocytes CD40L 
andIL-4 


3.1 


Dermal fibroblast 
CCD1 070 rest 


26.8 
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EOL-l dbcAMP 


0.0 


JL/CI Illal llDiODldol 

CCD1070TNF alpha 


23.2 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


L/Cnilal liuruuiaoi 

CCD1070IL-1 beta 


14.6 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 j 


2.3 


Dendritic cells anti- 
CD40 








Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


1.8 


Macrophages rest 


3.5 


Lung 


o.o 


Macrophages LPS 


0.0 


Thymus 


0.3 


HUVEC none 


12.9 


Kidney 


8.1 


HUVEC starved 


22.7 







Panel 13D Summary: Ag2870 Expression of the NOV14 gene is restricted to a 
sample derived from a colon cancer cell line (CT=34.4). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel and as a marker to 
5 detect the presence of colon cancer. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of colon cancer. 

Panel 2.2 Summary: Ag2870 Expression of the NOV14 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2870 Expression of the NOV14 gene is highest in the B cell 
10 line Ramos treated with ionomycin (CT=30.2). . Lower but still significant levels of 

expression are seen in untreated Ramos B cells. . B cells represent a principle component of 
immunity and contribute to the immune response in a number of important functional roles, 
including antibody production. For example, production of antibodies against self-antigens is a 
major component in autoimmune disorders such a systemic lupus erythematosus, with B cells 
15 playing a major role. Since B cells play an important role in autoimmunity, inflammatory 

processes and inflammatory cascades, therapeutic modulation of this gene product may reduce 
or eliminate the symptoms of patients suffering from asthma, allergies, chronic obstructive 
pulmonary disease, emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, 
psoriasis, osteoarthritis, and other autoimmune disorders including systemic lupus 
20 erythematosus. 

Significant levels of expression are also seen in IL-4, IL-9, IL-13, IFN gamma 
activated and untreated NCI-H292 mucoepidermoid cells, IL-4, IL-9, IL-13 and IFN gamma 
activated lung fibroblasts, human pulmonary aortic endothelial cells (treated and untreated), 
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treated small airway epithelium and lung microvascular endothelial cells (treated and 
untreated). The expression of this gene in cells derived from or within the lung further 
suggests that this gene may be involved in normal conditions as well as pathological and 
inflammatoiy lung disorders that include chronic obstructive pulmonary disease, asthma, 
5 allergy and emphysema. 

NOVlSa and NOV15b 

Expression of gene NOV 15a and variant NOV 1 5b was assessed using the primer-probe 
set Ag2875, described in Table KA. Results of the RTQ-PCR runs are shown in Table KB. 



Table KA. Probe Name Ag2875 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -gaacatcatctcctaccctgaa-3 • 


22 


268 


1033 


Probe 


TET-5' -tgcatgactcagctctacttcttcctcg- 
3 ' -TAMRA 


28 


290 


1034 


Reverse 


5 1 -atgtgacactctgcaatagcaa-3 1 


22 


321 


1035 



Table KB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2875, Run 
164311029 


Tissue Name 


Rel. Exp.(%) 
Ag2875,Run 
164311029 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


3.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 
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lymphocyte act 


0.0 


Cororiery artery SMC rest 


0.0 


Vw^JL/'t Jrvvy V-*J-/'t 

lymphocyte act 


0.0 


P nrnnAri; artAn; Ql^Y"** 

^urunery arxery oiviv^ 
TNFalpha + IL-lbeta 


0.0 


PT*)R Ivmnlinrvtp art 


0 0 






Secondary CDS 

lvm , nVinf > vtf* rpct 

ijrlJiJJllUl/jr Lw ICol 


0.0 


Astrocytes TNFalpha + 

TT -Ihpta 
iLj- i ucm 


0.0 


Secondary CD8 

IvTYmVinfvtf* art 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


r\.u-o i jl ^oddupnuj 
PMA/ionomycin 


24.0 


2ry Thl/Th2/Trl anti- 
CD95CH11 


5,1 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


72.2 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


20.7 


LAKcellsIL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 1 
beta 


1.4 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-lbeta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B celH none 


0.0 


Luno fibroblast TT -Q 


v.VF 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 

era mm ft 


3.2 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070IL-lbeta^ 


0.0 


Dendritic cells none 


33.2 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


2.4 


Dermal fibroblast IL-4 


1.9 
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Dendritic cells anti- 
CD40 


inn ft 


1DU V^OlllIS Z 


io.3 


Monocytes rest 


0.0 


IBD Crohn's 


6.7 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


44.1 


Lung 


21.8 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


3.1 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2875 Results from one experiment with the NOV 15a gene 
are not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 2.2 Summary: Ag2875 Expression of the NOV 15a gene is low/undetectable in 
5 all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2875 Highest expression of the NOV15a is in anti-CD40 
treated dendritic cells (CT=33.2), with much lower expression in untreated dendritic cells. 
Thus, this gene product may be important in dendritic cell activation. Significant expression of 
this gene is also seen in liver cirrhosis. This gene encodes a putative GPCR; therefore, 
1 0 antibodies or small molecule therapeutics could reduce or inhibit fibrosis that occurs in liver 
cirrhosis. In addition, antibodies to this putative GPCR could also be used for the diagnosis of 
liver cirrhosis. In addition, significant expression of this gene is seen in resting macrophages. 
The putative GPCR encoded for by this transcript may therefore be important in macrophage 
detection of chemokine gradients and trafficking into specific sites within a tissue and in 
1 5 activation. Antibody or protein therapeutics designed against the protein encoded for by this 
transcript could reduce or inhibit inflammation in asthma, emphysema, allergy, psoriasis, 
arthritis, or any other condition in which macrophage localization/activation is important. 

NOV16A: Olfactory Receptor 

Expression of gene NOV 16a was assessed using the primer-probe set Ag2876, 
20 described in Table LA. Results of the RTQ-PCR runs are shown in Tables LB and LC. 

Table LA. Probe Name Ag2876 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -acatcatctcctaccctgaatg-3 ' 


22 


273 


1036 


Probe 


TET-5! -catgactcagctttacttcttcctcatt- 
3 ' -TAMRA 


28 


295 


1037 j 


Reverse 


5 1 -tacagccaacatgtgacactct-3 ' 


22 


334 


1038 
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Table LB. Panel 1.3D 



Tissue Name 


n A l t?vv\ iOA\ A#»^C7/C 

Kei. Ii/Xp,(ybj Ag^o7o, 
Run 167646343 


Tissue Name 


Kei. JLXp.(yo) Ag2o7o, 
Run 167646343 


Liver adenocarcinoma 


0.0 


Kidnev (fetall 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca UO-31 


0.0 ■ 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Drain ffatnW 


u.u 


Liver 


A A 

o.o 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


1.7 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex. 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell)A549 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 




n n 
u.u 


Mammary gland 


A A 

0.0 


glioma U251 


1.4 


Breast ca.* (pl.ef) 
MCF-7 


192 


glioma SF-295 


2.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca* (pl.ef) 
T47D 


100.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 
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Skeletal muscle (fetal) 


0,0 


3reast ca. MDA-N 


0.0 


Skeletal muscle 


2.1 < 


Dvary 




}nne marrow 


0.0 ! 


Dvarian ca. OVCAR- 

> 

5 


0.0 


Thvmus 

x j j j in w«3 


< 

0.0 

< 


Ovarian ca. OVCAR- 


0.0 




0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAK- 
c 

o 


0.0 


Colorectal 


2.0 


Ovarian ca. IGROV- 

i 

i 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 

Ol\ U V " J 


0.0 


Small intestine 


A A 
U.U 




0.0 


Colon ca. SW480 


0.0 


Placenta 


o.o 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 


3.6 


Colon ca. HCT-116 


0.0 


Testis 


8.5 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 




Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma Ml 4 

... _ „,,,, 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 



Table LC. Panel 4D 



Tissue Name 


Rel.Exp.(%) 
Ag2876, Run 
164328133 


Tissue Name 


Rel. Exp.(%) 
Ag2876,Run 
164328133 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


o.b 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 
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Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalnha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalDha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalDha 4- ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


2.1 


Primary Trl rest 


0.0 


Small airwav eoitheliurn 
TNFalpha* IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronerv arterv SMC 

TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


9.7 


Astrocytes re<rt 


0 0 


Secondary CD8 
lymphocyte rest 


o.o 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8I2 fBasonhift 
PMA/ionomycin 


30.6 


2ryThl/Th2/Trl anti- 
CD95CH11 


1.5 


CCD 11 06 (Keratinocytes) 
none 


0.0 


Ly/\js. cens rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells EL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 jLupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAKcellsIL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-1 3 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


8.S 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAECTNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none j 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 | 
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Ramos CQ celH none 


0.0 


T nno fihrnhla<;t TT -Q 


4 R 


Ramos (B cell) 
ionomvcin 

IVllwlJJ J Will 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 

g, annua 


15.6 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 

PfT) 1ft7ft rf>ot 


0.0 

- — ....... _ 


EOL-1 dbcAMP 


0.0 


JLrGIIIlal llUIULJlaM 

CCD1 070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dprmnl fiHmlVlsict 

JL/CIllidl llUiUUluol 

CCD1070 IL-1 beta 


5.4 


Dendritic cells none 


10.2 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


19.3 


Dendritic cells anti- 
CD40 


1 R R 
1 o.o 


lDU V^Oi llJS Z 


i < o 


Monocytes rest 


0.0 


IBD Crohn's 


2.2 


Monocytes LPS 


20.6 


Colon 


4.1 


Macrophages rest 


14.2 


Lung 


2L0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2876 Expression of the NOV 16a gene is restricted to a 
sample derived from a breast cancer cell line (CT=32.5). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel and as a marker to 
5 detect the presence of breast cancer. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast cancer. 

Panel 4D Summary: Ag2876 Significant expression of the NOV16a gene is detected 
in a liver cirrhosis sample (CT - 33.5). Furthermore, expression of this gene is not detected in 
normal liver in Panel 1 .3D, suggesting that its expression is unique to liver cirrhosis. This gene 
1 0 encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could reduce or 
inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this putative GPCR 
could also be used for the diagnosis of liver cirrhosis. 

NOV17a and NOV17b 

Expression of gene NOV17a and variant NOV17b was assessed using the primer-probe 
1 5 set Ag2969, described in Table MA. Results of the RTQ-PCR runs are shown in Tables MB 
and MC. 

939 



WO 02/068649 



PCT/US02/02785 



Table MA. Probe Name Ag2969 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ■ -cacccatgtatttcctgcttag-3 1 


22 i 


184 


1039 


Probe 


TET-5 1 -tcagctctccctcattgacctaaatt-3 ' - 
TAMRA 


26 


206 


1040 


Reverse 


5 1 -tcagaagccatcttaggaacaa-3 ' 


22 


244* 


1041 



Table MB, CNS_neurodegenerationjvl.O 



Tissue Name 


Rel. Exp.(%) Ag2969, 
xvuu zvy / /oyoA 


Tissue Name 


Rel. Exp.(%) Ag2969, 
Klin 2U977o!Ja2 


AD 1 Hippo 


0.5 


Control (Path) 3 
l emporai ctx 


0.5 


AD 2 Hippo 


9.1 


Control (Path) 4 
i emporai ctx 


23.7 


AD 3 Hippo 


0.7 


AD 1 Occipital Ctx 


42 


AD 4 Hippo 


1.6 . 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


1.9 


AD 6 Hippo 


24.3 


AD 4 Occipital Ctx 


9.5 


Control 2 Hippo 


4.7 


AD 5 Occipital Ctx 


17.8 


Control 4 Hippo 


2.1 


AD 6 Occipital Ctx 


14.5 


Control (Path) 3 
Hippo 


1.1 


Control 1 Occipital 
Ctx 


0.8 


AD 1 Temporal Ctx 


2.0 


Control 2 Occipital 
Ctx 


29.7 


AD 2 Temporal Ctx 


15.5 


Control 3 Occipital 
Ctx 


1 3.9 


AD 3 Temporal Ctx 


2.4 


Control 4 Occipital 
Ctx 


l.O 


AD 4 Temporal Ctx 


9.7 


Control (Path) 1 
Occipital Ctx 


65.5 


AD 5 Inf Temporal 
Ctx 


91.4 


Control (Path) 2 
Occipital Ctx 


4.8 


AD 5 Sup Temporal 
Ctx 


27.9 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 

Ctx 


42.9 


Control (Path) 4 
Occipital Ctx 


13.6 


AD 6 Sup Temporal 
Ctx 


39.2 


Control 1 Parietal 
Ctx 


1.2 


Control 1 Temporal 
Ctx 


0.0 


Control 2 Parietal 
Ctx 


28.9 


Control 2 Temporal 
Ctx 


11.8 


Control 3 Parietal 
Ctx 


9.6 


Control 3 Temporal 


9.3 


Control (Path) 1 


48.0 
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Ptv 

\^ LA 




Poriptnl Ptv 




Control 3 TemporaJ 
Ctx 


2.5 ' 


Control (Path) 2 
Parietal Ctx 


12.0 


Control (Path) 1 
Temporal Ctx 


38.4 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


12.2 


Control (Path) 4 
Parietal Ctx 


41.2 



Table MC. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag2969,Run 

1 0*t4U£00o 


Tissue Name 


Rel. Exp.(%) 
Ag2969, Run 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


3.0 


HUVEC IFN gamma 


2.8 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


6.1 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


4.1 


HUVEC IL-11 


0.0 


OGi/Vjiiucu y nil col 


0 0 


Lung Microvascular EC 
none 


o n 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha* IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


10.5 


Bronchial epithelium 
1 IN r alpha + \L, l beta 


0.0 


Primary Th2 rest 


13.6 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


7.9 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RACD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


2.0 


CD45ROCD4 
lymphocyte act 


7.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


3.4 


CDS lymphocyte act 


4.9 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


4.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


8.5 


CD4 lymphocyte none 


16.7 


KU-812 (Basophil) 
PMA/ionomycin 


51.1 


2ry Thl/Th2/Trl_anti- 


3.2 


CCD1106 (Keratinocytes) 


0.0 
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CD95 CH11 


none 




LAK cells rest 


23.2 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


29.3 


Liver cirrhosis 


18.6 


LAK cells IL-2+IL-12 


22.2 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


28.5 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-1 8 


21.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


7.2 


NCI-H292IL-13 


0.0 


Two Way MLR 3 day 


51.4 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


18.4 


HPAEC none 


0.0 


Two Way MLR 7 day 


2.9 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


45.7 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 




13. o 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lrung iiDroDiasi iL»-y 


U.U 


Ramos (B cell) 
ionomycin 


5.4 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


12.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
onu ii^-*T 


39.2 


Dermal fibroblast 

Ppni C\1C\ r*»ct 

\^\^xJx\) i\) res i 


0.0 


EOL-1 dbcAMP 


1.1 


UCIiUal IlurODJaSl 

CCD1070 TNF alpha 


0.0 


FOT -1 Hhr AMP 

PMA/ionomycin 


0.0 


l/Cimal IlDrODlaSl 

CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


4.2 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


5.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


6.4 


IDXJ \sO lltlS L 


111 

11.1 ! 


Monocytes rest 


1.5 


IBD Crohn's 


0.0 i 


Monocytes LPS 


10.7 


Colon 


0.0 


Macrophages rest 


11.4 


Lung 


2.4 


Macrophages LPS 


3.7 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 
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receptors contains a large number of neurotransmitter receptors, including the dopamine, 
serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
5 novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 
selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 

1 0 excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 

1 5 In addition, this panel shows that this GPCR is upregulated in the temporal cortex of 

Alzheimer's disease patients. Therefore, blockade of this receptor may be of use in the 
treatment of this disease and decrease neuronal death. 
References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
20 JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1 . Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
25 'depressant' effect 2. We have designed studies to assess whether adenosine A2A receptor 

antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
30 A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0,1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15-60 mg kg(-l)) were effective in mice previously screened for having 
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high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous helplessness' in this assay, 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
5 and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 

10 Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 

1 5 reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 

20 enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT 1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 
connections to forebrain areas believed to be involved in depression has been dissected. The 

25 firing of .5-HT neurons in the raphe nuclei is driven, at least partly, by alphal-adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 

30 mechanism that does not require a time-dependent desensitization of receptors. These 

neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 
Drugs 1999 Nov;8(ll): 1837-1 848 
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The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMD A, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
5 focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 

channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 

10 NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

15 Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 

receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec 1;9(17):3 955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 

20 assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 

25 infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2 A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 1.3D Summary: Ag2878 Expression of the NOV17a gene is Iow/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

30 Panel 4D Summary: Ag2878 Expression of the NOV1 7a gene is restricted to a few 

samples in this panel, with highest expression in the kidney (CT=33.1). Thus, the putative 
GPCR encoded for by this gene could allow cells within the kidney to respond to specific 
microenvironmental signals (For example, ref. 1). Therefore, antibody or small molecule 
therapies designed with the protein encoded for by this gene could modulate kidney function 
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and be important in the treatment of inflammatory or autoimmune diseases that affect the 
kidney, including lupus and glomerulonephritis. 

Furthermore, significant levels of expression are also seen in the PMA and ionomycin 
treated basophil cell line KU-812. GPCR-type receptors are important in multiple 
5 physiological responses mediated by basophils (ref. 2). Therefore, antibody or small molecule 
therapies designed with the protein encoded for by this gene could also block or inhibit 
inflammation or tissue damage due to basophil activation in response to asthma, allergies, 
hypersensitivity reactions, psoriasis, and viral infections. 

References: 

10 1. Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of recombinant 

P/Q-type calcium channels by regulators of G protein signalling proteins. J. Physiol. 528 Pt 1 : 
65-77. 

1. Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 
channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 

1 5 (GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating proteins 
(GAPs), which are responsible for > 100-fold increases in the GTPase activity of G proteins 
and might be involved in the regulation of presynaptic Ca2+ channels. In this study we 
investigated the effects of RGS2 on G protein modulation of recombinant P/Q-type channels 
expressed in a human embryonic kidney (HEK293) cell line using whole-cell recordings. 2. 

20 RGS2 markedly accelerates transmitter-mediated inhibition and recovery from inhibition of 
Ba2+ currents (IBa) through P/Q-type channels heterologously expressed with the muscarinic 
acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 and RGS4 modulate the prepulse 
facilitation properties of P/Q-type Ca2+ channels. G protein reinhibition is accelerated, while 
release from inhibition is slowed. These kinetics depend on the availability of G protein alpha 

25 and betagamma subunits which is altered by RGS proteins. 4. RGS proteins unmask the Ca2+ 
channel beta subunit modulation of Ca2+ channel G protein inhibition. In the presence of 
RGS2, P/Q-type channels containing the beta2a and beta3 subunits reveal significantly altered 
kinetics of G protein modulation and increased facilitation compared to Ca2+ channels 
coexpressed with the betal b or beta4 subunit. 

30 PMID: 11018106 

2. Heinemann A., Hartnell A., Stubbs V.E., Murakami K., Soler D., LaRosa G., 
Askenase P.W., Williams T.J., Sabroe I. (2000) Basophil responses to chemokines are 
regulated by both sequential and cooperative receptor signaling. J. Immunol. 165: 7224-7233. 
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To investigate human basophil responses to chemokines, we have developed a 
sensitive assay that uses flow cytometry to measure leukocyte shape change as a marker of cell 
responsiveness. PBMC were isolated from the blood of volunteers. Basophils were identified 
as a single population of cells that stained positive for IL-3Ralpha (CDwl23) and negative for 

5 HLA-DR, and their increase in forward scatter (as a result of cell shape change) in response to 
chemokines was measured. Shape change responses of basophils to chemokines were highly 
reproducible, with a rank order of potency: monocyte chemoattractant protein (MCP) 4 (peak 
at /= eotaxin-2 = eotaxin-3 >/= eotaxin > MCP-1 = MCP-3 > macrophage-inflammatory 
protein-1 alpha > RANTES = MCP-2 = IL-8. The CCR4-seIective ligand macrophage-derived 

1 0 chemokine did not elicit a response at concentrations up to 1 0 nM. Blocking mAbs to CCR2 
and CCR3 demonstrated that responses to higher concentrations (>10 nM) of MCP-1 were 
mediated by CCR3 rather than CCR2, whereas MCP-4 exhibited a biphasic response 
consistent with sequential activation of CCR3 at lower concentrations and CCR2 at 10 nM 
MCP-4 and above. In contrast, responses to MCP-3 were blocked only in the presence of both 

1 5 mAbs, but not after pretreatment with either anti-CCR2 or anti-CCR3 mAb alone. These 

patterns of receptor usage were different from those seen for eosinophils and monocytes. We 
suggest that cooperation between CCRs might be a mechanism for preferential recruitment of 
basophils, as occurs in tissue hypersensitivity responses in vivo. 
PMID: 11120855 

20 NOV17c 

Expression of gene NOV17c, also known as CG56659-02, was assessed using the 
primer-probe set Ag2970, described in Table NA. Results of the RTQ-PCRruns are shown in 
Table NB. 



Table NA. Probe Name Ag2970 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -acatccatctccacacacctat-3 1 


22 


90 


1042 


Probe 


TET-5 1 -agtcagctctccctcattgacctaaa-3 ' - 
TAMRA 


26 


125 


1043 


Reverse 


5 1 -taaaccatctttggaacaatgg-3 1 


22 


162 


1044 



25 



Table NB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel.Exp.(%)Ag2970, 
Run 211008706 


Tissue Name 


Rel.Exp.(%)Ag2970, 
Run 211008706 


AD 1 Hippo 


2.3 


Control (Path) 3 


2.1 
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Temporal Ctx 




AD 2 Hippo 


16.7 


Control (Path) 4 
Temporal Ctx 


38.2 


AD 3 Hippo 


3.5 


AD 1 Occipital Ctx 


3.7 


AD 4 Hippo 


2.2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital Ctx 


4.1 


AD 6 Hippo 


28.1 


AD 4 Occipital Ctx 


20.4 


Control 2 Hippo 


10.2 


AD 5 Occipital Ctx 


11.2 


Pnntml 4 Hinnn 


1 Q 


AHA fWvirntal Ptv 


ICO 


Control (Path) 3 

T-Tinnn 


2.4 


Control 1 Occipital 

Ptv 


0.0 


AD 1 Temporal Ctx 


3.4 


Control 2 Occipital 

PtY 


27.2 


AD 2 Temporal Ctx 


23.0 


Control 3 Occipital 
ptv 


14.0 


AD 3 Temporal Ctx 


1.0 


Control 4 Occipital 


4.8 


AD 4 Temporal Ctx 


18.2 


Control (Path) 1 

Or»r»iV»ita 1 Ptv 

wccipnai L^LX 


87.7 


AD 5 Inf Temporal 
Ctx 


99.3 


Control (Path) 2 


9.9 


AD 5 SupTemporal 
Ctx 


30.1 


Control (Path) 3 

fVvMnital Ptv 


0.5 


AD 6 Inf Temporal 
Ctx 


47.0 


Control (Path) 4 

v/CCipilai V^LX 


13.5 


AD 6 Sup Temporal 
Ctx 


59.9 


Control 1 Parietal 

PtY 


, 

1.1 


Control 1 Temporal 
Ctx 


0.0 


Control 2 Parietal 

Ptv 


40.1 


Control 2 Temporal 
Ctx 


17.0 


Control 3 Parietal 

V/LA 


13.2 


Control 3 Temporal 
Ctx 


10.7 


Control (Path) 1 
Parietal Ctx 


51.1 


Control 4 Temporal 
Ctx 


6.9 


Control (Path) 2 
Parietal Ctx 


14.9 


Control (Path) 1 
Temporal Ctx 


48.0 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


17.6 


Control (Path) 4 
Parietal Ctx 


38.4 



CNS_neurodegeneration_vl.O Summary: Ag2970 The NOV17c gene represents a 
novel G-protein coupled receptor (GPCJR) with expression in the brain. The GPCR family of . 
receptors contains a large number of neurotransmitter receptors, including the dopamine, 
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serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
5 adrenergic receptors as well. While the majority of antidepressants can be classified as 

selective serotonin reuptake inhibitors, blockade of the 5-HT1A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 
excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 

10 in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 

In addition, this GPCR is upregulated in the temporal cortex of Alzheimer's disease 

1 5 patients. Therefore, blockade of this receptor may be of use in the treatment of this disease and 
decrease neuronal death. 
References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 

20 pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 
1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 

25 antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 

30 p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 - 60 mg kg(-l)) were effective in mice previously screened for having 
high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
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selectively bred for their spontaneous helplessness* in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
5 antagonist haloperidol (50 - 200 microg kg(-l ) i.p.) prevented the antidepressanMike effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 

10 might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 

15 feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 

20 addition of pindolol, which blocks 5-HT1 A receptors, to SSRI treatment decouples the 

feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 
connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1-adrenoceptor- 

25 mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 

30 neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 
Drugs 1999 Nov;8(ll): 1837-1 848 
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The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGluR( 1-8)) mediating G-protein coupled responses. This review 
5 focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 

channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemical^, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 

1 0 NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

1 5 Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 

receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 

20 assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 

25 infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 13D Summary: Ag2970 Expression of the NOV 17c gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

30 Panel 4D Summary: Ag2970 Expression of the NOV17c gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

NOV19a 
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Expression of gene NOV 1 9a was assessed using the primer-probe set Ag2972, 
described in Table OA. Results of the RTQ-PCR runs are shown in Tables OB, OC, and OD. 



Table OA. Probe Name Ag2972 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 » -atgtgtttcagcttccattctg-3 ' 


22 


543 


1045 


Probe 


TET-5 • -taggtctcggctcattaaccactttt-3 ' - 
TAMRA 


26 


565 


1046 


Reverse 


5 1 -tgtcctgacacaccaatgatag-3 1 


22 


611 


1047 



5 Table OB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2972, 
Run 211008969 


Tissue Name 


Rel. Exp.(%) Ag2972, 
Run 211008969 


AD 1 Hippo 


2.1 


Control (Path) 3 
Temporal Ctx 


1.9 


AD 2 Hippo 


13.7 


Control (Path) 4 
Temporal Ctx 


70.2 


AD 3 Hippo 


2.2 


AD 1 Occipital Ctx 


3.3 


AD 4 Hippo 


1.4 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


75.8 


AD 3 Occipital Ctx 


2.2 


AD 6 Hippo 


33.4 


AD 4 Occipital Ctx 


46.3 


Control 2 Hippo 


11.9 


AD 5 Occipital Ctx 


22.5 


Control 4 Hippo 


2.5 


AD 6 Occipital Ctx 


19.8 


Control (Path) 3 
Hippo 




Control 1 Occipital 
Ctx 


1 1 
1.1 


AD 1 Temporal Ctx 


2.1 


Control 2 Occipital 
Ctx 


42.0 


AD 2 Temporal Ctx 


28.3 


Control 3 Occipital 
Ctx 


21.9 


AD 3 Temporal Ctx 


1.4 


Control 4 Occipital 
Ctx 


10.4 


AD 4 Temporal Ctx 


44.8 


Control (Path) 1 
Occipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


78.5 


Control (Path) 2 
Occipital Ctx 


3.5 


AD 5 Sup Temporal 
Ctx 


12.0 


Control (Path) 3 
Occipital Ctx 


1.1 


AD 6 Inf Temporal 
Ctx 


33.7 


Control (Path) 4 
Occipital Ctx 


26.6 


AD 6 Sup Temporal 
Ctx 


56.3 


Control 1 Parietal 
Ctx 


0.6 


Control 1 Temporal 
Ctx 


0.0 


Control 2 Parietal 
Ctx 


18.8 
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Control 2 Temporal 
ctx 


10.4 


Control 3 Parietal 
Ctx 


17.7 


Control 3 Temporal 
Ctx 


11.5 


Control (Path) 1 

Jrarlclul VAX. • 


77.4 


Control 3 Temporal 
Ctx 


7.8 


Control (Path) 2 
Parietal Ctx 


10.3 


Control (Path) 1 
Temporal Ctx 


52.9 


Control (Path) 3 
Parietal Ctx 


0.7 


Control (Path) 2 
Temporal Ctx 


7.9 


Control (Path) 4 
Parietal Ctx 


57.8 



Table OC. Panel 4D 



i issue r\ame 


Rel. Exp.(%) 
Ag2972, Run 
164314417 


Tissue Name 


Rel. Exp.(%) 
Ag2972, Run 
164314417 


oeconaary mi act 


A A 

0.0 


HUVEC IL-1 beta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


11.3 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


4.1 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2-rest 


2.1 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


l/.V 


Primary Thl act 


12.2 


Lung Microvascular EC 
l In r alpha + 1L-1 beta 


4.2 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


37.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


7.6 


Primary Th2 rest 


46.7 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


9.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


21.2 


Coronery artery SMC rest 


22.2 


CD45RO CD4 
lymphocyte act 


16.3 


Coronery artery SMC 
TNFalphaH- IL-1 beta 


0.0 


CD8 lymphocyte act 


8.4 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


11.6 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


20.3 
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CD4 lymphocyte none 


25.5 


KU-812 (Basophil) 

PM A/ionnmvein 

X XVX^i/ IvIIUlltJ' will 


34.6 


2ry Thl/Th2/Trl_anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 

nnnp 


0.0 


LAK ceils rest 


26.1 


CCD1 106 (Keratinocvtesi 
TNFalpha + IL-lbeta 


8.5 


LAK cells IL-2 


46 7 


T ivpr pttrhnsi^ 

ivl Vtl C> 11 1 1 Iv/Ol J> 


22 8 


LAK cells IL-2+IL-12 


32 5 


T nniiQ Widnpv 

X^/UJJUo IvlUi i^-'_y 


0 0 


LAK cells IT -2+TFN 
gamma 


49.0 


NCI-H292 none 


0.0 


LAK cells EL-2+IL-18 


38.2 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


5.7 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


7.7 


NCI-H292 IL-13 


b.o 


Two Wav MT R 1 dav 


71 2 


NCT-H292 TFN pamma 


0 0 


Two Wav MLR ^ dav 

i vyu ?t ay xvxx_#xv ^ kxqj 


24 8 


FTPAFC none 


0 0 


Two Way MLR 7 day 


0.0 


HP AFC TMF alnha + TT -1 
beta 


0.0 


PRMC rest 


1 7 


T liner fiKmHIjict nnnp 
i-jUllg lIUIUUlOol nunc 


0 A 

U.v 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 

4- TT -1 hpta 
» ixw"l tic La 


0.0 


PBMC PHA-L 


41.2 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


10.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


40.1 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and JL*4 


21.6 


Dermal fibroblast 
CCD 1 070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


6.6 


EAT 1 AUr> A \/TO 

PMA/ionomvcin 


0.0 


Dermal fibroblast 
CCD 1 070 IL-l beta 


0.0 


Dendritic cells none 


21.9 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


2.9 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


9.2 


Colon 


7.6 


Macrophages rest 


19.6 


Lung 


0.0 


Macrophages LPS 


8.1 


Thymus 


11.8 


HUVEC none 


0.0 


Kidney 


71.7 


HUVEC starved 


0.0 
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Table OD. Panel CNSJL 



Tissue Name 


Rel. Exp.(%) Ag2972, 
Run 171670020 


Tissue Name 


Rel. Exp.(%) Ag2972, 
Run 171670020 


BA4 Control 


17.8 


BA17PSP 


52.5 


BA4 Control2 


8.6 


BA17PSP2 


3.1 


BA4 

Alzheimer' s2 


7.1 


Suh Nipta rYintrnl 




BA4 Parkinson's 


42.9 


Sub Nigra Control2 


14.7 


BA4 

Parkinson 's2 


5l.l 


Sub Nigra 
Alzheimer's2 


j. l 


BA4 

Huntington's 


7.4 


Sub Nigra 
Parkinson's? 


zo.y 


BA4 

Huntington^ 


0.0 


Sub Nigra 
Huntington's 


inn n 


BA4 PSP 


o 

_J 


Sub Nigra 
Huntington's2 




BA4 PSP2 


3.7 


Sub Nigra PSP2 


1.3 


BA4 Depression 


16.7 


Sub Nigra 
Depression 


7.4 


BA4 

Depression2 


0.0 


Sub Nigra 
Depression2 


2.5 


BA7 Control 


15.7 


Glob Palladus 
Control 


7.8 


BA7 Control2 


15.7 


Glob Palladus 
Control2 


0.0 


BA7 

Alzheimer's2 


0.0 


Glob Palladus 
Alzheimer's 


0.0 


B A7 Parkinson's 


24.3 


Glob Palladus 
Alzheimer's2 




BA7 

Parkinson's2 


29.9 


Glob Palladus 
Parkinson's 


J / .u 


BA7 

xiunungton s 


43.5 


Glob Palladus 
Parkinsons2 


3.8 


BA7 

riunungton sz 


7.2 


Glob Palladus PSP 


0.0 


BA7PSP 


20.2 


Glob Palladus PSP2 


3.4 


BA7PSP2 


2.1 


Glob Palladus 


19.5 


BA7 Depression 


5.2 


Temp Pole Control 


0.0 


BA9 Control 


0.0 


Temp Pole Control2 


3.3 


BA9 Control2 


22.7 


Temp Pole 
Alzheimer's 


0.0 


BA9 Alzheimer's 


1.3 


Temp Pole 
Alzheimer^ 


8.6 


BA9 


9.0 


Temp Pole 


0.0 
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rYiz-nciinci ^z. 




rdrivinsori b 




BA9 Parkinson's 


11.7 


Temp Pole 

fal Kill bull dZ. 


23.0 


BA9 

Psi i*k i n eft ti c*7 
x oiivllloUli oZ> 


23.7 


Temp Pole 

iiUIUlIlglUIl o 


7.2 


RAO 

Huntington's 


12.4 


Temp Pole PSP 


5.4 


BA9 

Huntington's2 


7.0 


Temp Pole PSP2 


3.6 


BA9PSP 


13.6 


Temp Pole 
Depression2 


5.9 


BA9PSP2 


5.6 


Cing Gyr Control 


51.4 


BA9 Depression 


12.6 


Cing Gyr Control2 


0.0 


BA9 

Depression2 


2.7 


Corner Cy\rr 

Alzheimer's 


6.7 


BA1 7 Control 


75.8 


v^iiig vjyr 

Alzheimer f s2 


2.7 




1 S R 

IJ.O 


L/ing vjyr raiKinson s 


38 A 
jo.H 


BA17 

rVJZJicinier bz. 


8.4 


Cing Gyr 

r arKinson sz 


11.6 


BA17 

PflrHncnn'c 


32.3 


Cing Gyr 
xiunungion s 


60.3 


BA17 

X CUt^llloUIl 


51.4 


Cing Gyr 
riunungion sz 


0.0 


BA17 

Huntington's 


22.4 


Cing Gyr PSP 


23.7 


BAI7 

Huntington's2 


5.2 


Cing Gyr PSP2 


0.0 


BA17 
Depression 


17.6 


Cing Gyr Depression 


8.5 


BA17 

Depression2 


16.6 


Cing Gyr 
Depression2 


8.1 



CNS_neurodegeneration_vl,0 Summary: Ag2972 The NOV19a represents a novel 
G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
receptors contains a large number of neurotransmitter receptors, including the dopamine, 
5 serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 
10 selective serotonin reuptake inhibitors, blockade of the 5-HT1A and a2 adrenergic receptors 
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increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 
excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
5 treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 
References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 

10 JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1 . Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 

1 5 'depressant 1 effect. 2. We have designed studies to assess whether adenosine A2A receptor 
antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant 1 challenges than their wildtype littermates. Consistently, the adenosine 

20 A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2 A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 - 60 mg kg(-l)) were effective in mice previously screened for having 

25 high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 

30 antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
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Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001 ;62 Suppl 15:12-7 
5 Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 

reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 

10 treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 

15 connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1-adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 

20 NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 

25 Drugs 1999 Nov;8(l 1):1837-1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 

30 focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 

channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
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NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
5 tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 

10 ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261 , we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 

15 observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 1.3D Summary: Ag2972 Expression of the NOV19a gene is low/undetectable 

20 in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2972 Expression of the the NOV19a gene is restricted to a 
few samples in this panel, with highest expression in peripheral blood mononuclear cells 
(PBMC) treated with the B cell selective pokeweed mitogen. No significant levels of 
expression of the transcript are seen in PBMC that contain normal B cells. Therefore, the 

25 putative GPCR encoded by this gene could potentially be used diagnostically to identify 

activated B cells. In addition, the gene product could also potentially be used therapeutically in 
the treatment of diseases in which B cells are activated. 

Panel CNS_1 Summary: Ag2972 This panel confirms expression of the NOV19a 
gene in the brains of an independent group of subjects. Please see panel 1.3d for a discussion 

30 of utility of this gene in the central nervous system. 

NOV20 



Expression of gene NOV20 was assessed using the primer-probe set Ag2973, 
described in Table PA. Results of the RTQ-PCR runs are shown in Tables PB and PC. 
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Table PA. Probe Name Ag2973 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5' -gctgtgtggctcaagtctattt-3 ' 


22 


287 


1048 


Probe 


TET-5 ' -ttctctgcctttgcatctgctgagct-3 ' - 
TAMRA 


26 


310 


1049 


Reverse 


5 ' -agcggtcataagacatgacagt-3 1 


22 


346 


1050 



Table PB. CNS_neurodegeneration_yl.O 



Tissue Name 


Rel. Exp.(%) Ag2973, 
Run 209779096 


Tissue Name 


Rel. Exp.(%) Ag2973, 
Run 209779096 


AD 1 Hippo 


4.9 


Control (Path) 3 
Temporal Ctx 


1.7 


AD 2 Hippo 


16.2 


Control (Path) 4 
Temporal Ctx 


46.7 


AD 3 Hippo 


22 


AD 1 Occipital Ctx 


3.9 


AD 4 Hippo 


3.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


50.7 


AD 3 Occipital Ctx 


1.7 


AD 6 Hippo 


16.4 


AD 4 Occipital Ctx 


19.1 


Control 2 Hippo 


8.2 


AD 5 Occipital Ctx 


24.5 


Control 4 Hippo 


0.5 


AD 6 Occipital Ctx 


2.7 


Control (Path) 3 
Hippo 


0.0 


Control 1 Occipital 
Ctx 


0.0 


AD 1 Temporal Ctx 


4.5 


Control 2 Occipital 
Ctx 


17.8 


AD 2 Temporal Ctx 


27.7 


Control 3 Occipital 
Ctx 


8.7 


AD 3 Temporal Ctx 


3.6 


Control 4 Occipital 
Ctx 


0 7 


AD 4 Temporal Ctx 


17.6 


Control (Path) 1 
Occipital Ctx 


74.7 


AD 5 Inf Temporal 
Ctx 


64.2 


Control (Path) 2 
Occipital Ctx 


13.2 


AD 5 Sup Temporal 
Ctx 


21.8 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


24.3 


Control (Path) 4 
Occipital Ctx 


9.2 


AD 6 Sup Temporal 
Ctx 


22.2 


Control 1 Parietal 
Ctx 


1.5 


Control 1 Temporal 
Ctx 


1.4 


Control 2 Parietal 
Ctx 


27.9 


Control 2 Temporal 
Ctx 


33.9 


Control 3 Parietal 
Ctx 


16.8 


Control 3 Temporal 


18.4 


Control (Path) 1 


95.3 
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(_tx 




rarietal Ctx 




Control 3 Temporal 
Ctx 


3.2 


Control (Path) 2 
Parietal Ctx 


23.7 


Control (Path) 1 
Temporal Ctx 


100.0 


Control (Path) 3 
Parietal Ctx 


1.4 


Control (Path) 2 
Temporal Ctx 


68.8 


Control (Path) 4 
Parietal Ctx 


33.7 



Table PC. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag2973, Run 
164329850 


Tissue Name 


Rel. Exp.(%) 
Ag2973,Run 
164329850 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondarv Trl act 


0 0 


HUVEC TNF alpha + IFN 
gamma 


0 0 


Secondary Thl rest 


11.7 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 




Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 

rpvm It I TT t 1 a 

TNFalpha + ELI beta 


0.0 


Primary Th2 rest 


6.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 


0.0 


CCD 1 106 (Keratinocytes) 


0.0 
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CD95CH11 i jnone 




LAK cells rest 


n ft jCCD 1 1 06 (Keratinocytes) 
U jTOFalpha + IL-lbeta 


0.0 


T AK rplk II -9 


0.0 jLiver cirrhosis 


100 0 


T AK rplk IT -9+TT -19 


0 0 


I lfnnc kiHtipv 


9 7 


gamma 


0.0 


NCI-H292 none 


0.0 


T AK celkTT -9+ IT -1R 


0 0 


NCT-H9Q9 II -4 


0.0 


I AK ce\W 
PMA/ionomycin 


0.0 


NCI-H292JL-9 


0.0 


NK Celk IT -9 rest 


0.0 


NCI-H292IL-13 


0.0 


Twn W/av MI ft 1 rtav 
i wu way lviljI\ j \x<xy 


0 0 




0 0 


Two Wav MI R S dav 


0.0 


HPAEC none 


0 0 


Two Way MLR 7 day 


0.0 


Tilt A /"^ rf tv m 11 ■ TT 1 

HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0 0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


o.o 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0 0 

l/.v 


T imo *fiHrr»Vikcf TT -1 ^ 

LjUllg LiUlSJUiaol lJL-r" I J 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

J TT A 

and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 

rrD107ft TMF alnha 


0.0 


PAT 1 JL„ A X yTn 

P\4 A /i o n nm vc i n 


0.0 


Thermal fiWrnhlast 

CCD1070IL-1 beta 


0.0 


Dendritic cells none 




Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


BD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


8.7 


Macrophages rest 


0.0 


Lung 


6.6 


Macrophages LPS 


0.0 


Thymus 


13.9 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved. 


0.0 







CNS_neurodegeneration_vl.0 Summary: Ag2973 The NOV20 gene represents a 
novel G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
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receptors contains a large number of neurotransmitter receptors, including the dopamine, 
serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
5 novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 
selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 

10 excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 

15 References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 
1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 

20 produce 'depressant 1 effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
'depressant 1 effect. 2. We have designed studies to assess whether adenosine A2A receptor 
antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 

25 antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 

sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 

30 confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 - 60 mg kg(-l)) were effective in mice previously screened for having 
high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test SCH 58261 at 10 mg kg(-l) reduced the 
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immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
5 stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 
Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 

10 Psychiatry 2001;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 

15 released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT 1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 

20 response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 

connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alphal -adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 

25 alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

30 Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 

Drugs 1999Nov;8(ll):1837-1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
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metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 
channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
5 of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 

10 can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G 5 Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

1 5 Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 

ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 

20 cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

25 Panel 1 3D Summary: Ag2973 Expression of the NOV20 gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

Panel 3D Summary: Ag2973 Expression of the NOV20 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2973 Significant expression of the NOV20 gene is detected in 

30 a liver cirrhosis sample (CT = 32.7). Furthermore, expression of this gene is not detected in 

normal liver in Panel 1.3D, suggesting that its expression is unique to liver cirrhosis. This gene 
encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could reduce or 
inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this putative GPCR 
could also be used for the diagnosis of liver cirrhosis. 
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Panel CNS_1 Summary: Ag2973 Expression of the NOV20 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

NOV16b 

Expression of gene NOV16b was assessed using the primer-probe sets Ag2875 and 
5 Ag3010, described in Tables QA and QB. Results of the RTQ-PCR runs are shown in Table 
QC. 

Table QA. Probe Name Ag2875 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaacatcatctcctaccctgaa-3 1 


22 


267 


1051 


Probe 


TET- 5 1 - tgcatgactcagct c tacttctt cc t cg- 
3 ' -TAMRA 


28 


289 


1052 


Reverse 


5 1 -atgtgacactctgcaatagcaa-3 1 


22 


320 


1053 



Table QB. Probe Name Ag3010 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggggaaagtatcctctgtgttt-3 1 


22 


810 


1054 


Probe 


TET -5 ' - ttattgtgcccatgttgaaccctctg-3 1 - 
TAMRA • 


26 


839 


1055 


Reverse 


5 ' -cagggaaacatggacatcttta-3 1 


22 


882 


1056 



10 



Table QC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2875,Run 
164311029 


Tissue Name 


Rel. Exp.(%) 
Ag2875, Run 
164311029 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


3.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 
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Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CDS 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
r MA/ionomycin 


24.0 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


5.1 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


L,cu i i uo { Keratmocyxes ) 
TNFalpha + IL-lbeta 


0.0 


LAK cells 1L-2 


A A 
U.U 


Liver cirrhosis 


/Z.Z 


1 ATS nail,, TT ^ i TT n 

LAK cells 1L-Z+1L-1Z 


A A 
U.U 


Lupus kidney 


U.U 


T ALT _ TT OlTTJXT 

LAK cells LL-2+lrN 
gamma 


0.0 


NCI-H292 none 


20.7 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK ceils 
PMA/ionomvcin 


0.0 


NC1-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


i wo way iviLrK ^ oay 


A A 

U.U 


XT/^T TJOQO TTTM mamma 

jNL4-rL£yz lriN gamma 


a o. 

U.U 


Txurz-k \I7rt!, AyTT D ^ riot/ 

i wo way MLrK D aay 


n a 
U.U 


UDA T3P nnna 

rir AJfcL/ none 


U.U 


Two Way MLR 7 day 


0.0 


T-JPAPP TMF nlnVin 4- TT -1 

rxr/VJDL/ iiNr aipnd t tLt-i 
beta 


1.4 


"dml rest 


U.U 


tyUng riDroDiaSi none 


n n 

U.U 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 

4- TT 1 Koto 

t u_,-i ocia 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


3.2 


B lymphocytes CD40L 
andIL-4 


0.0 


Dermal fibroblast 
CCD1070 rest 


0.0 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 


0.0 
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CCD 1070 TNF alpha 




PMA/ionomvcin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


33.2 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


2.4 


Dermal fibroblast IL-4 


1.9 


Dendritic cells anti- 
CD40 


100.0 


IBD Colitis 2 


16.3 


Monocytes rest 


0.0 


IBD Crohn's 


6.7 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest ' 


44.1 


Lung 


21.8 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


3.1 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary:.Ag2875/Ag3010 Results from two experiments with the 
NOV 16b gene are not included. The amp plots indicate that there were experimental 
difficulties with these runs. 
5 Panel 2.2 Summary: Ag2875 Expression of the NOV16b gene is low/uridetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2875 Highest expression of the NOV 1 5a is in anti-CD40 
treated dendritic cells (CT=33.2), with much lower expression in untreated dendritic cells. 
Thus, this gene product may be important in dendritic cell activation. Significant expression of 

1 0 this gene is also seen in liver cirrhosis. This gene encodes a putative GPCR; therefore, 

antibodies or small molecule therapeutics could reduce or inhibit fibrosis that occurs in liver 
cirrhosis. In addition, antibodies to this putative GPCR could also be used for the diagnosis of 
liver cirrhosis. In addition, significant expression of this gene is seen in resting macrophages. 
The putative GPCR encoded for by this transcript may therefore be important in macrophage 

1 5 detection of chemokine gradients and trafficking into specific sites within a tissue and in 
activation. Antibody or protein therapeutics designed against the protein encoded for by this 
transcript could reduce or inhibit inflammation in asthma, emphysema, allergy, psoriasis, 
arthritis, or any other condition in which macrophage localization/activation is important. A 
second experiment with the probe/primer set Ag3010 shows low/undetectable expression in all 

20 samples on this panel (CTs>35). (Data not shown.) 

NOV21a 
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Expression of gene NOV21a was assessed using the primer-probe sets Ag2963 and 
Agl292, described in Tables RA and RB. Results of the RTQ-PCR runs are shown in Tables 
RC, RD and RE. 



Table RA, Probe Name Ag2963 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggtaccaccctttgt tccaat- 3 1 


21 


903 


. 1057 


Probe 


TET-5 ' -aaacacagcccctccaccctagct-3 ' - 
TAMRA 


24 


924 


1058 


Reverse 


5 1 -gtcctcgctgtgacactga-3 ' 


19 


963 


1059 



Table RB. Probe Name Agl292 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cagctgtgagaggctggata-3 ' 


20 


731 


1060 


Probe 


TET-5 ■ -atccaggagtgaccaccacgtgact-3 1 - 
TAMRA 


25 


760 


1061 


Reverse 


5 1 -tcctgttgctttcacgtagagt-3 1 


22 


797 


1062 



Table RC. CNS_neurodegeneration_yLO 



Tissue Name 


Rel. Exp.(%) 
Ag2963,Run 
209778824 


Rel. Exp.(%) 
Ag2963, Run 
230512509 


Tissue 
Name 


Rel. Exp.(%) 
Ag2963, Run 
209778824 


Rel. Exp.(%) 
Ag2963,Run 
230512509 


AD 1 Hippo 


6.5 


6.8 


Control 
(Path) 3 
Temporal 
Ctx 


6.1 


3.6 


AD 2 Hippo 


21.2 


16.8 


Control 
(Path) 4 
Temporal ■ 
Ctx 


32.5 


26.2 


AD 3 Hippo 


5.5 


6.3 


AD 1 

Occipital 

Ctx 


5.3 


4.9 


AD 4 Hippo 


7.4 


7.9 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


21.0 


22.2 


AD 3 

Occipital 

Ctx 


7.9 


6.2 


AD 6 Hippo 


12.2 


19.6 


AD 4 

Occipital 

Ctx 


12.1 

/ 


13.8 


Control 2 


58.6 


44.8 


AD 5 


6.7 


3.3 
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Hippo 






Occipital 
Ctx 






Control 4 
Hippo 


12.9 


12.9 


AD 6 

Occipital 

Ctx 


73.7 


81.8 


Control (Path) 
3 Hippo 


2.0 


4.5 


Control 1 
Occipital 
Ctx 


5.9 


2.5 


AD 1 Temporal 
Ctx 


2.3 


6.3 


Control 2 
Occipital 
Ctx 


89.5 


100.0 


AD 2 Temporal 
Ctx 


13.2 


19.5 


Control 3 
Occipital 
Ctx 


27.4 


25.7 


AD 3 Temporal 
Ctx 


3.3 


7.1 


Control 4 
Occipital 
Ctx 


11.3 


12.4 


AD 4 Temporal 
Ctx 


15.8 


20.7 


Control 
(Path)l 
Occipital 
Ctx 


100.0 


93.3 


AD5Inf 
Temporal Ctx 


16.7 


21.5. 


Control 
(Path) 2 
Occipital 
Ctx 


8.4 


11.0 


AD ^ 

SupTemporai 
Ctx 


12.5 


11.8 


Control 
(Path) 3 
Occipital 
Ctx 


4.4 


2.2 


AD6M 
Temporal Ctx 


25.0 


25.2 


Control 
(Path) 4 
Occipital 
ctx 


16.7 


20.6 


AD 6 Sup 
Temporal Ctx 


22.7 


17.1 


Control 1 
Parietal Ctx 


9.8 


4.6 


Control 1 
Temporal Ctx 


2.1 


6.1 


Control 2 
Parietal Ctx 


8.2 


8.4 


Control 2 
Temporal Ctx 


82.4 


51.8 


Control 3 
Parietal Ctx 


26.6 


24.7 


Control 3 
Temporal Ctx 


31.2 


20.2 


Control 
(Path) 1 
Parietal Ctx 


94.6 


66.9 


Control 4 
Temporal Ctx 


11.6 


12.9 . 


Control 
(Path) 2 
Parietal Ctx 


19.3 


24.8 


Control (Path) 
1 Temporal Ctx 


77.9 


56.3 


Control 
(Path) 3 
Parietal Ctx 


2.6 


3.8 
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Control (Path) 
2 Temporal Ctx 






Control 






17.0 


12.3 


(Path) 4 
Parietal Ctx 


35.8 


23.0 



Table RD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2963, 
Run 167809191 


Tissue Name 


Rel. Exp.(%) Ag2963, 
Run 167809191 


Liver adenocarcinoma 


82.4 


Kidney (fetal) 


45.4 


Pancreas 


50.0 


Renal ca. 786-0 


53.6 


Pancreatic ca. CAP AN 
2 


10.7 


Renal ca. A498 


19.6 


Adrenal gland 


14.2 


Renal ca. RXF 393 


21.2 


Thyroid 


26.2 


Renal ca. ACHN 


12.5 


Salivary gland 


21.2 


Renal ca.UO-31 


13.7 


r liuiiary giana 




rvCUal Ca. 1 1V~ 1 1/ 


97 9 


Brain (fetal) 


40.9 


Liver 


49.7 


Brain (whole) 


71.7 


Liver (fetal) 


45.1 


Brain (amygdala) 


11.6 


Liver ca. 

(hepatoblast) HepG2 


16.3 


Brain (cerebellum) 


62.4 


Lung 


26.4 


Brain (hippocampus) 


47.0 


Lung (fetal) 


66.0 


TVrfiin f <iiilvitJiritia niorji^ 


89 5 


Lung ca. (small cell) 
LX-1 


42.0 


Brain f thalamus^ 


53.2 


Lung ca. (small cell) 
NCI-H69 


21.6 


Cerebral Cortex 


92.7 


Lung ca. (s.cell var.) 
SHP-77 


59.9 


Sninal cord 


30.6 


Lung ca. (large 
cell)NCI-H460 


12.9 


glio/astro U87-MG 


28.5 


Lung ca. (non-sm. 
cell)A549 


37.1 


glio/astroU-118-MG 


44.4 


Lung ca. (non-s.cell) 
NCI-H23 


47.3 


astrocytoma SW1783 


50.3 


Lung ca. (non-s.cell) 
HOP-62 


11.8 


neuro*; met SK-N-AS 


40.6 


Lung ca. (non-s.cl) 
NCI-H522 


16.8 


astrocytoma SF-539 


37.1 


Lung ca. (squam.) 
SW 900 


29.3 


astrocytoma SNB-75 


49.3 


Lung ca. (squam.) 
NCI-H596 


51.1 


glioma SNB-19 


31.0 


Mammary gland 


70.2 


glioma U251 


52.9 


Breast ca.* (pl.ef) 
MCF-7 


15.2 
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glioma SF-295 


27.0 


Rrpa^t ra * fnl ef\ 

MDA-MB-231 


29.7 


Heart (fetal) 


90.8 


Rreast ca * fol eft 
T47D 


41.2 


Heart 


22.7 


Rreast ca RT-549 




Skeletal muscle (fetal) 


83.5 


Breast ca. MDA-N 


54.3 


oKeiexai muscie 


57.0 


Ovary 


47.0 


Bone marrow 


25.2 


Ovarian ca. OVCAR- 

3 


26.1 


Thymus 


74.7 


Ovarian ca. OVCAR- 
4 


18.7 


Spleen 


28.1 


Ovarian ca. OVCAR- 
5 


82.9 


Lymph node 


54.7 


U van an ca. UVCAR- 


3.4 


Colorectal 


7.9 


Ovarian ca. IGROV- 
1 


22.2 


Stomach 


7.0 


Ovarian ca.* (ascites) 

OIv v V *«j 


100.0 


Small intestine 


25.3 


T Tt^niQ 

w Id Uj 




Colon ca. SW480 


25.9 


Placenta 


3.2 


v^oion ca. 

SW620(SW480 met) 


40.1 


Prostate 


22.1 


Colon ca. HT29 


24.1 


Prostate ca.* (bone 


293 


Colon ca.HCT-1 16 


35.8 


Testis 


19.3 


Colon ca. CaCo-2 


52.9 


Melanoma 
Hs688(A).T 


19.5 


v^uion Cd. 

tissue(OD03866) 


4.4 


Melanoma* (met) 
Hs688(B).T 


22.4 


Cnlnn rn WPP 90Q8 


Dl.y 


Melanoma UACC-62 


AO i~\ 

42.9 


Gastric ca.* (liver met) 
NCI-N87 


27.2 


Melanoma M 14 


8.3 


Bladder 


20.6 


Melanoma LOX 
IMVI 


25.0 


Trachea 


11.7 


Melanoma* (met) 
SK-MEL-5 


23.3 


Kidney 


56.6 


Adipose 


21.0 



Table RE. Panel 4D 





Rel. 


Rel. 




Rel. 


Rel. 


Tissue Name 


Exp.(%) 


Exp.(%) 


Tissue Name 


Exp.(%) 


Exp.(%) 




Agl292, 


Ag2963, 


Agl292, 


Ag2963, 




Run 


Run 




Run 


Run 
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138719232 


164333805 


138719232 


164333805 


Secondary Thl act 


14.5 


18.6 


HUVEC IL-lbeta 


7.6 


3.0 


Secondary Th2 act 


11.3 


47.0 


HUVECIFN 
gamma 


13.5 


31.6 


Secondary Trl act 


23.2 


29.1 


HUVEC TNF 
alpha + EFN 
gamma 


2.2 


26.1 


Secondary Thl rest 


4.0 


8.8 


HUVEC TNF 
alpha + IL4 


8.3 


33.2 


Secondary Th2 rest 


5.0 


18.2 


HUVEC IL-11 


2.8 


26.6 


Secondary Trl rest 


5.8 


2.2 


T nno 

Microvascular EC 
none 


7.1 


62.0 


Primary Th l act 


19.1 


55.9 


Lung 

!\4ifirnv?i^fiii1flr T*jC* 

lVi.ll>l \J V ClJV UlCLk 1— 

TNFalpha + IL- 
lbeta 


2.7 


40.9 


Primary Th2 act 


16.4 


33.4 


Microvascular 

iVCliilal U/V-^ nunc 


8.8 


. 41.2 


Primary Trl act 


253 


66.9 


Microsvasular 

U\Zi Illal JCiV-/ 

TNFalpha + IL- 
lbeta 


7.5 


31.6 


Primary Thl rest 


13.6 


33.0 


Bronchial 

(*r\i tin p H i i in 

TNFalpha + 
ILlbeta 


8.7 


37.1 


Primary Th2 rest 


9.3 


10. / 


Small airway 
epithelium none 


1.7 


8 A 


Primary Trl rest 


18.8 


ZJ.U 


Small airwav 

epithelium 
TNFalpha + IL- 
lbeta 


6.6 




CD45RA CD4 
lymphocyte act 


7.9 


39.0 


Coronery artery 
SMC rest 


55.9 


21.6 


CD45RO CD4 
lymphocyte act 


10.3 


27.2 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


5.3 


14.6 


CD8 lymphocyte 
act 


y.y 


29.7 


Astrocytes rest 


5.6 


24.1 


Secondary CD8 
lymphocyte rest 


11.9 


44.4 


Astrocytes 
TNFalpha + IL- 
lbeta 


4.4 


19.5 


Secondary CD8 
lymphocyte act 


13.0 


22.8 


KU-812 
(Basophil) rest 


16.0 


66.9 


CD4 lymphocyte 
none 


6.3 


21.3 


KU-812 
(Basophil) 


8.1 


87.7 
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PMA/ionomycin 




2ry 

Thl/Th2/Trl anti- 
CD95CH11 


7.9 


20.3 


CCD1106 

(Keratinocytes) 

none 


5.8 


41.2 


LAK cells rest 


10.1 


39.5 


CCD1106 
(Keratinocytes) 
TNFalpha + IL- 
Ibeta 


6,5 


11.5 


T A V «/vl1- TT O 

LAK cells LL-2 


5.7 


33.2 


Liver cirrhosis 


4.9 


7.9 


LAK cells IL-2+IL- 


9.0 


34.2 


Lupus kidney 


4.5 


9.2 


LAiv cells 1L- 
2+IFN gamma 


12.4 


31.9 


NCI-H292 none 


11.5 


63.7 


LAK Cells lL-2-r 

IL-18 


16.8 


16.2 


NCI-H292 IL-4 


17.7 


77.9 


T A IS _ -11 _ 

LAK cells 
PMA/ionomycin 


15.4 


50.7 


NCI-H292 IL-9 


59.5 


54.3 


NK. Cells 1L-2 rest 


7.6 


22.S 


NCI-H292IL-13 


44.1 


63.7 


Two Way MLR 3 
day 


7.7 


49.0 


NCI-H292 IFN 
gamma 


10.4 


89.5 


Two Way MLR 5 
day 


5.4 


29.9 


T_JT> ACr* nnnA 

rir/iLL none 


ft 0 
o.Z 


28.1 


Two Way MLR 7 
day 


5.3 


17.0 


HPAECTNF 
alpha + DL-1 beta 


O f\ 

y.o 


30.1 


PBMC rest 


5.2 


11.2 


Lung fibroblast 
none 


R 7 
o. / 


28.7 


PBMC PWM 


17.1 


100.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


4.5 


6.9 


PBMCPHA-L 


10.9 


51.4 


Lung fibroblast 
IL-4 


10.2 


53.6 


Ramos (B cell) 
none 


24.1 


88.3 


Lung fibroblast 
IL-9 


9.7 


49.7 


Ramos (B cell) 
ionomycin 


14.6 


92.7 


Lung fibroblast 
IL-13 


11 1 


44.1 


B lymphocytes 
PWM 


11.8 


59.0 


Lung fibroblast 
IFN gamma 


9.5 


29.5 


B lymphocytes 
CD40L and IL-4 


20.7 


21.8 


Dermal fibroblast 
CCD 1070 rest 


32.3 


37 6 


EOL-1 dbcAMP 


233 


32.8 


Dermal fibroblast 
CCD1070 TNF 
alpha 


100.0 


39.0 


EOL-1 dbcAMP 
PMA/ionomycin 


4.2 


25.5 


Dermal fibroblast 
CCD1070 IL-1 
beta 


14.3 


28.7 


Dendritic cells 


4.3 


22.8 


Dermal fibroblast 


7.5 


17.8 
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none 






irrN gal I 111 la 






Dendritic cells LPS 


4.4 


10.5 


JL/cmial lIDrvJUlaai 

IL-4 


30.4 


48.6 


Dendritic cells anti- 
CD40 


6.4 


24.3 


IBD Colitis 2 


1.3 


1.2 


Monocytes rest 


7.6 


38.4 


IBD Crohn's 


2.6 


2.2 


Monocytes LPS 


' 4.5 


8.0 


Colon 


7.3 


37.9 


Macrophages rest 


6.3 


21.5 


Lung 


5.8 


22.8 


Macrophages LPS 


5.5 


1.9 


Thymus 


12.7 


39.2 


HUVEC none 


4.8 


30.8 


Kidney 


19.1 


98.6 | 


HUVEC starved 


7.8 


27.7 









CNS_neurodegeneration_vl,0 Summary: Ag2963 The NOV21a gene, a secretory 
protease homolog, appears to be downregulated in the temporal cortex of Alzheimer's disease 
patients when compared to non-demented controls. Up regulation of this protease may 
5 therefore be of use in the treatment of Alzheimer's, particularly because Alzheimer's disease is 
believed to result at least in part from the improper processing of proteins (APP, Tau). This 
protease may serve to lower the levels of these disease proteins and ameliorate the 
dementia/pathology associated with Alzheimer's. 

Panel 13D Summary: Ag2963 The NOV21a gene, a putative secretory serine- 
10 protease, is widely expressed in this panel. Highest expression is in an ovarian cancer cell line 
(CT=32), with expression detected in all cancer cell lines in this panel. Thus, inhibition of the 
protease domain might lead to a decrease in cell survival and proliferation and serve as a small 
molecule target in cancer. 

This gene product also has low levels of expression in pancreas, thyroid, pituitary, 
15 adipose, and adult and fetal types of heart, skeletal muscle and liver. Therefore, this serine 
protease-like gene product may be a small molecule target for the treatment of endocrine and 
metabolic diseases, including obesity and Types 1 and 2 diabetes. 

The expression in this panel further confirms expression of this gene in the CNS. 
Please see CNS_neurodegeneration for discussion of utility of this gene in the CNS. 
20 Panel 4D Summary: Agl292/Ag2693 The NOV21a transcript is expressed on most 

tissues in panel 4D. This widespread expression is consistent with the results in Panel 1 .3D. 
This transcript encodes a serine protease like protein with potential enzymatic activity and 
may important in maintaining normal cellular functions in a number of tissues. Therefore, 
therapies designed with the protein encoded by this transcript could be important in regulating 
25 cellular viability or function. 
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NOV22a 



Expression of gene NOV22a was assessed using the primer-probe set Ag2964, 
described in Table SA. 

Table SA. Probe Name Ag2964 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gaaatacgcaggaaggaaatct-3 1 


22 


473 


1063 


Probe 


TET-5 1 -cgtgggcatcatagaccagaaaacct-3 ' - 
TAMRA 


26 


507 


1064 


Reverse 


5' -ggtctgtgagggagaagttgta-3 ' 


22 


544 


1065 



5 

CNS_neurodegeneration_vLO Summary: Ag2964 Expression of the NOV22a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 13D Summary: Ag2964 Expression of the NOV22a gene is low/undetectable 
10 in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag2964 Expression of the NOV22a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

15 NOV23a 



Expression of gene NOV23a was assessed using the primer-probe set Ag2967, 
described in Table f A. 

Table TA. Probe Name Ag2967 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -ctcaaggaggacttccgtaag-3 ' 


21 


478 


1066 


Probe 


TET-5 1 -atcttcctctgcagaaagccactgtg-3 * - 
TAMRA 


26 


500 


1067 


Reverse 


5 * -agtgagtgaatggccgtaca-3 1 


20 


557 


1068 



20 CNS_neurodegeneration_vl.O Summary: Ag2967 Expression of the NOV23a gene 

is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 
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Panel 1.3D Summary: Ag2967 Expression of the NOV23a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag2967 Expression of the NOV23a gene is low/undetectable in 
5 all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

NOV23a and NOV23b 

Expression of gene NOV23a and variant NOV23b was assessed using the primer-probe 
set Ag2996, described in Table UA. Results of the RTQ-PCR runs are shown in Table UB. 

10 Table UA. Probe Name Ag2996 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -cagaggttacgccctgaagt-3 1 


20 


54 


1069 


Probe 


TET-5 1 -ctcagatcctgggccaggcactg-3 ' - 
TAMRA 


23 


77 


1070 


Reverse 


5 • -cagaaagaggtcctgctcatg-3 ' 


21 


113 


1071 



Table UB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2996, Run 
165296353 


Tissue Name 


Rel. Exp.(%) 
Ag2996,Run 
165296353 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


23.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + BLlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 


0.0 
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TNFalpha + IL-lbeta 




l~JJ*f Ji\/\ V^JL/H 

lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


lymphocyte act 


0.0 


Coronery artery 5>MC 
TNFalpha + IL-lbeta 


0.0 


luo lyinpnocyie act 


u.u 


Astrocytes rest 


0.0 


Secondary CD8 
jyrnpnuwyic real 


0.0 


Astrocytes TNFalpha + 
iLr-lbeta 


0.0 


Secondary CD8 
jyuipnucyic act 


0.0 


KU-812 (Basophil) rest 


0.0 j 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomvcin 

mJ lujiuiu j will 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 nCeratinocvte^ 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


iilVvl wlllAIwolO 


inn a 


LAK cells IL-2+IL-12 


0.0 


Ludlis kidnev 




LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 ! 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCT-H292 TFN ramma 
rx^.7^> irii gamma 


v.U 


Two Way MLR 5 day 


0.0 


HPAEC none 


v.v 


Two Way MLR 7 day 


0.0 


HPAEC TNF alnha + IL-1 
beta . 


0.0 


PBMC rest 


0.0 


Luncr fibroblast none 




PBMC PWM 


11.3 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Axaiiiua \xj nunc 


111 6 


bung tibroblast IL-9 


0.0 


Ramos (B cell) 

innnmvoin 

iKjiii jr^ 111 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
?amma 


0.0 


B lymphocytes CD40L 
and IL-4 


15.3 


dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 
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Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


49.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


o.o 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 13D Summary: Ag2996 Results from one experiment with the NOV23a gene 
are not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4D Summary: Ag2996 Significant expression of the NOV23a gene is detected 
5 in a liver cirrhosis sample and normal lung tissue(CTs=33-35). Thus, antibodies to this protein 
product could potentially be used for the diagnosis of liver cirrhosis or as a marker of normal 
lung tissue. Furthermore, therapeutic modulation of the expression or function of this gene 
may reduce or inhibit fibrosis that occurs in liver cirrhosis. 

NOV24a 

1 0 Expression of gene NOV24a was assessed using the primer-probe set Ag2934, 

described in Table VA. 

Table VA. Probe Name Ag2934 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID HO: 


Forward 


5 1 -gaatctaattccgcaggaaatg-3 ' 


22 


1645 


1072 


Probe 


TET- 5 1 -cgtgtttgccaaattctcagcgttta-3 1 - 
TAMRA 


26 


1668 


1073 


Reverse 


5 * - tcttttcatgaacctcatttgc-3 1 


22 


1714 


1074 



CNS_neurodegeneration_vl.O Summary: Ag2934 Expression of the NOV24a gene 
15 is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 1.3D Summary: Ag2934 Expression of the NOV24a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2934 Expression of the NOV24a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
20 high probability of a probe failure. 

NOV25 
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Expression of gene NOV25 was assessed using the primer-probe sets Ag2935 and 
Ag3039, described in Tables WA and WB. Results of the RTQ-PCR runs are shown in Tables 
WC, WD and WE. 



Table WA. Probe Name Ag2935 



Primers 


Sequences 


Length 


Start • 
Position 


SEQ ID NO: 


Forward 


5 • -ctggactctacctcggaaactt-3 1 


22 


88 


1075 


Probe 


TET- 5 1 -ctgggccgaaataagatcacacacat-3 • - 
TAMRA 


26 


135 


1076 


Reverse 


5 1 -gggtgactcatggatagagatg-3 1 


22 


161 


1077 



Table WB. Probe Name Ag3039 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -gccgaaataagatcacacacat-3 • 


22 


139 


1078 


Probe 


TET- 5 ' -tctatccatgagtcaccccagcctct-3 1 - 
TAMRA 


26 


165 


1079 


Reverse 


5 1 -atgcgaaggtaggtgatatcct-3 ' 


22 


196 


1080 



Table WC. CNS_neurodegeneration_vl.O 



Tissue 
Name 


Re!. Exp.(%) 
Ag2935, Run 
209777519 


Rel. Exp.(%) 
Ag3039, Run 
211012103 


Tissue 
Name 


Rel. Exp.(%) 
Ag2935, Run 
209777519 


Rel. Exp.(%) 
Ag3039, Run 
211012103 


AD 1 Hippo 


34.2 


18.4 


Control 
(Path) 3 
Temporal 
Ctx 


11.9 


8.2 


AD 2 Hippo 


59.5 


48.0 


Control 
(Path) 4 
Temporal 
Ctx 


33.7 


36.3 


AD 3 Hippo 


15.3 


9.8 


AD 1 

Occipital 

Ctx 


24.3 


9.5 


AD 4 Hippo 


18.7 


13.6 


AD2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 Hippo 


90.8 


70.2 


AD3 

Occipital 

Ctx 


12.9 


6.3 


AD 6 Hippo 


50.7 


69.3 


AD4 

Occipital 

Ctx 


32.8 


20.9 


Control 2 


36.3 25.5 AD 5 


66.9 


18.3 
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Hippo 




( 
I 


Occipital 

CtX 






Control 4 
Hippo 


33.4 


i 

24.0 


\D6 

Occipital 

Ctx 


28.9 


43.2 


Control 
(Path) 3 
Hippo 


7.2 


7,6 


Control 1 
Occipital 
Ctx 


7.6 


6.0 


AD 1 

Temporal 

Ctx 


20.7 


24.3 


Control 2 
Occipital 
Ctx 


90.8 


57.0 


AD 2 

Temporal 

Ctx 


53.2 


36.9 


Control 3 
Occipital 
Ctx 


24.7 


18.7 


AD 3 

Temporal 

Ctx 


5.8 


4.7 


Control 4 
Occipital 
Ctx 


17.7 


13.9 


AD4 
Temporal 

Ctx 


35.8 


24.5 


Control 
(Path) 1 
Occipital 
Ctx 


93.3 


74.2 


AD5Inf 
Temporal 
Ctx 


100.0 


100.0 


Control 
(Path) 2 
Occipital 
Ctx 


19.1 


14.8 


AD 5 Sup 
Temporal 
Ctx 


81.2. 


62.9 


Control 
(Path) 3 
Occipital 
Ctx 


8.6 


4.3 


AD6Inf 
Temporal 

L-tX 


62.4 


58.2 


Control 
(Path) 4 
Occipital 
Ctx 


27.9 


252 


AD 6 Sup 
Temporal 
Ctx 


50.3 


49.3 ' 


Control 1 

Pariptal Ptv 

JT OllwUU vlA 


16.2 


15.9 


Control 1 
Temporal 
Ctx 


14.5 


11.6 


Control 2 

Parietal Ptv 


68.3 


58.2 


Control 2 
Temporal 
Ctx 


51.4 


34.4 


Control 3 
Parietal Ctx 


31.0 


32.1 


Control 3 
Temporal 
Ctx 


33.0 


20.0 


Control 
(Path) 1 
Parietal Ctx 


69.7 


66.9 


Control 3 
Temporal 
Ctx 


19.6 


20.7 


Control 
(Path) 2 
Parietal Ctx 


31.2 


39.0 
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Control 
(Path) 1 
Temporal 
Ctx 


55.1 


44.4 


Control 

(Path) 3 
Parietal Ctx 


3.1 


4.6 


Control 
(Path) 2 
Temporal 
Ctx 


56.3 


30.4 


Control 
(Path) 4 
Parietal Ctx 


61.6 


35.8 



Table WD. Pane! 1.3D 



Tissue Name 


"PaI T7vn (OA\ 

Ag2935, Run 
167646849 


Dal Vim f°/ \ 

Ag3039, Run 
167961816 


Tissue Name 


Kei. n»xp.( /o) 
Ae2935, Run 
167646849 


Kei. H,xp.(%) 
Ae3039. Run 
167961816 


Liver 

adenocarcinoma 


1.4 


1.7 


Kidney (fetal) 


49.3 


38.2 


Pancreas 


1.2 


2.0 


Renal ca. 786- 
0 


0.0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.0 


0.0 


Renal ca. 
A498 


1.7 


0.4 


Adrenal gland 


3.1 


1.2 


Renal ca. RXF 
393 


0.0 


0.1 


Thyroid 


11.7 


6.0 


Renal ca. 
ACHN 


5.7 


1.5 


Salivary gland 


1.3 


0.8 


Ppnal ra T TO- 
jxciiat La. UU" 

31 


0.0 


0.0 


Pituitary gland 


6.3 


3.0 


Renal ca. TK- 
10 


0.0 


0.0 


Brain (fetal) 


10.0 


7.4 - 


Liver 


0.7 


0.3 


Brain (whole) 


12.8 


7.7 


Liver (fetal) 


1.8 


0.7 


Brain (amygdala) 


6.9 


6.0 


Liver ca. 

(hepatoblast) 

HepG2 


1.0 


0.1 


Brain (cerebellum) 


9.3 


5.9 


Lung 


1.3 


1.3 


Brain 

(hippocampus) 


12.1 


3.9 j 


Lung (fetal) 


6.5 


3.0 


Brain (substantia 
nigra) 


24.0 


16.0 


Lung ca. 
(small cell) 
LX-1 


0.3 


0.0 


Brain (thalamus) 


5.1 


3.8 


Lung ca. 
(small cell) 
NCI-H69 


2.8 


1.5 


Cerebral Cortex 


15.6 


10.8 


Lung ca. 
(s.cell var.) 
SHP-77 


7.2 


3.7 


Spinal cord 


37.4 


15.2 


Lung ca. (large 


0.0 


0.1 
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ceII)NCI-H460 






glio/astro U87-MG 


0.0 


0.2 


Lung ca. (non- 
sm. cell) A549 


2.4 


1.6 


giio/astro u-iio- 
MG 


0.4 


0.1 


Lung ca. (non- 

s.cell)NCI- 

H23 


4.0 


0.4 


astrocytoma 
SW1783 


J. J 


u.o 


Lung ca. (non- 
s.cell) HOP-62 


1 1 

1.1 


0 0 


neuro*; met SK-N- 

A C 


0.8 


0.3 


Lung ca. (non- 

s.cl)NCI- 

H522 


3.9 


1.4 


astrocytoma SF- 
539 


3.0 


1.5 


Lung ca. 
(squam.) SW 
900 


2.7 


0.7 


astrocytoma SNB- 
75 


2.5 


1.2 


Lung ca. 
(squam.) NCI- 
H596 


122 


5.7 


glioma SNB-19 


2.7 ; 


2.1 


Mammary ; 
gland 


3.9 


2.4 


glioma U251 


1.0 


0.3 


Breast ca.* 
(pl.ef) MCF-7 


0.7 


0.7 


glioma SF-295 


2.9 


2.3 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.4 


0.1 


Heart (fetal) 


24.0 


14.5 


Breast ca.* 
(pl.ef)T47D 


23.3 


13.5 


Heart 


5.6 


3.4 


Breast ca. BT- 
549 


0.5 


1.2 


Skeletal muscle 
(fetal) 


11.3 


5.1 


Breast ca. 
MDA-N 


6.2 


6.4 


Skeletal muscle 


1.1 


0.0 


Ovary 


11.4 


3.9 


Bone marrow 


0.0 


0.4 


Ovarian ca. 
OVCAR-3 


1.0 


0.3 


Thymus 


1.4 


0.2 


Ovarian ca. 
OVCAR-4 


21.3 


13.8 


Spleen 


3.5 


3.1 


Ovarian ca. 
OVCAR-5 


5.6 


1.5 


Lymph node 


0.7 


0.9 


Ovarian ca. 
OVCAR-8 


0.1 


0.0 


Colorectal 


1.5 


0.4 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


2.5 


jOvarian ca.* 
0.6 (ascites) SK- 
JOV-3 


0.4 


0.4 


Small intestine 


3.4 


1.0 juterus 


3.0 


2.2 


Colon ca. SW480 


8.8 


6.7 jPlacenta 


0.6 


0.0 
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Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


3.1 


2.7 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


100.0 


100.0 


Colon ca. CaCo-2 


23.0 


12.2 


Melanoma 
Hs688(A).T 


0.0 


0.1 


Colon ca. 
tissue(OD03866) 


1.3 


1.2 


Melanoma* 
(met) 

Hso88(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


6.4 


3.0 


Melanoma 
UACC-62 


12.0 


5-2 


Gastric ca.* (liver 
mert NCT-NR7 


2.7 


1.2 


Melanoma 

JVllH 


2.7 


0.8 


Bladder 


3.1 


1.8 


Melanoma 
LOXIMVI 


0.0 


0.0 


Trachea 


1.4 


0.6 


Melanoma* 
(met) SK- 
MEL-5 


6.5 


2.0 


Kidney 


84.7 


41.8 


Adipose 


5.0 


0.6 



Table WE. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2935, 

Run 
164403313 


Rel. 
Exp.(%) 
Ag3039, 

Run 
162427949 


Tissue Name 


Rel. 
Exp.(%) 
Ag2935, 

Run 
164403313 


Rel. 

Exp.(%) 
Ag3039, 

Run 
162427949 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


0.2 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


0.2 


0.3 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 


Secondary Thl rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.3 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


0.0 


1.3 


Primary Thl act 


0.0 


0.0 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 
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Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.3 


0.3 


Primary Trl act 


0.0 


. 0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
Ibeta 


0.3 


0.4 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


1.7 


1.4 


Primarv Th? re<tf 

XillllCUj X lit* 


0 0 

V/.V 


0 0 

v/.v 


Small airway 
epithelium none 


02 


1 1 

X . X 


X UlllCH J 111 Iwol 


0 0 


0 0 

V/.V/ 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


03 


0 3 


CD45RA CD4 
lymphocyte act 


0.4 


0.0 


Coronery artery 
SMC rest 


0.2 


0.0 


PD4SRO rr>4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CDS lymphocyte 
act 


0.1 


0.0 


Astrocytes rest 


1.9 


4.0 


lymphocyte rest 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


2.4 


2.3 


Secondary CD8 
lymphocyte act 


o n 


0 0 

V/.v/ 


KU-812 
(Basophil) rest 


00 

V/.V 


0 0 

V/.V/ 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


0.0 


CCD1106 

(Keratinocytes) 

none 


0.8 


0.4 


LAK cells rest 


0.6 


0.1 


CCD 11 06 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.5 


0.4 


LAK cells IL-2 


0.0 


0.4 


Liver cirrhosis 


0.0 


0.7 


LAK cells IL-2+IL- 


0.3 


0.0 


Lupus kidney 


4.0 


5.2 


LAK cells IL- 
2+IFN gamma 


0.0 


0.4 


NCI-H292 none 


1.4 - 


3.5 


LAK cells IL-2+ 
IL-18 


0.8 


0.8 


NCI-H292 IL-4 


1.2 


0.9 


LAK cells 
PMA/ionomycin 


0.3 


0.0 


NCI-H292 IL-9 


1.6 


1.8 
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\TT/ y"S 1| TT , 

NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292 IL-13 


2,1 


1.2 


Two Way MLR 3 

j 

day 


0.0 


0.0 


NCI-H292 IFN 
gamma 


2.3 


3.6 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-l beta 


0.0 


0.0 


PBMC rest 


0.0 


0.3 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


0.2 


2.5 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


0.0 


PBMC PHA-L 


0.4 


1.6 


Lung fibroblast 

TT A 

IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 

tt r\ 

IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-13 


0.0 


0.5 


B lymphocytes 
PWM 


3.0 


5.5 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 
CD40Land IL-4 


1 0 


1 3 


Dermal fibroblast 
CCD 1070 rest 


0 0 


0 0 


EOL-1 dbcAMP 


0.5 


0.6 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


0.0 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.3 


0.0 


Dermal fibroblast 

CCD1070IL-1 

beta 


0.0 


0.0 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 

1JT1N gamma 


0.0 


0.0 


Dendritic cells LPS 


0.1 


0.4 


T^pmtijiI fiKrAKlnct 
J_/CIIIlal llUIUUIaM 

IL-4 


0.0 


0.0 


Dendritic cells anti- 
CD40 


0.3 


0.0 


D3D Colitis 2 


0.0 


0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


0.2 


1.1 


Monocytes LPS 


0.0 


0.4 


Colon 


2.1 


2.5 


Macrophages rest 


1.1 


1.0 


Lung 


2.1 


5.1 


Macrophages LPS 


0.3 


0.4 


Thymus 


100.0 


100.0 


HUVEC none 


0.0 


0.6 


Kidney 


3.7 


3.1 


HUVEC starved 


0.0 


0.0 









CNS_neurodegeneration_vl.0 Summary: Ag2935/Ag3039 No differential 
expression of the NOV25 gene is detected between the postmortem brains of Alzheimer's 
diseased patients and those of non-demented controls. However, this panel confirms the 
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expression of this gene in the CNS. Please see panel 1.3D for a discussion of utility of this 
gene in the central nervous system. 

Panel L3D Summary: Ag2935/Ag3039 The expression of the NOV25 gene was 
assessed in two independent runs with good concordance between runs. Highest expression is 
5 seen in the testis (CTs=29). In addition, expression of this gene is extremely low in renal and 
brain cancer cell lines but is expressed in the normal brain and kidney tissues on this sample. 
Therefore, this gene may be used as a diagnostic marker for brain and kidney cancer and 
prostate tissue. Furthermore, therapeutic modulation of the expression or function of this gene 
may be effective in the treatment of brain and renal cancers. 
10 This gene encodes a novel protein phosphatase expressed at moderate to low levels in 

the CNS that may therefore be a small molecule target for the treatment of neurologic 
diseases. 

In addition, this gene is expressed at low levels in metabolic tissues including pancreas, 
adrenal, thyroid, pituitary, adult and fetal heart, and adipose. This novel protein phosphatase 

15 may be a small molecule target for the treatment of metabolic and endocrine disease* including 
obesity and Types 1 and 2 diabetes. This gene is also differentially expressed in fetal skeletal 
muscle (CT values = 32-33) when compared to expression in adult skeletal muscle (CT values 
= 35-40). Therefore, expression of this gene may also be useful for the differentiation of adult 
and fetal skeletal muscle. 

20 Panel 4D Summary: Ag2935/Ag3039 Expression of the NOV25 gene is highest and 

almost exclusive to the thymus (CTs=29-30). Expression of this gene could be used to 
distinguish thymus from the other samples on this panel. The putative phosphatase encoded by 
this gene may play an important role in T cell development. Small molecule therapeutics 
designed against the protein encoded by this gene could therefore be utilized to modulate 

25 immune function (T cell development) and be important for organ transplant, AIDS treatment 
or post chemotherapy immune reconstitution. 

NOV26a and NOV26b 

Expression of gene NOV26a and variant NOV26b was assessed using the 
primer-probe set Ag2936, described in Table XA. 

30 Table XA. Probe Name Ag2936 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 ' -gttcctgctgtctggactttt-3 ' 


21 


1 


1061 
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Probe 


TET-5 ■ -cccactgagacgcagctgtattctgt-3 ' - 
TAMRA • 


26 ^21 


1082 


Reverse 


5 • -tcgccaaatcatatttcacact-3 • 


22 j57 


1083 



CNS_neurodegeneration_vl.O Summary: Ag2936 Expression of the NOV26a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 
5 Panel 1 3D Summary: Ag2936 Expression of the NOV26a gene is low/undetectable 

in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag2936 Expression of the NOV26a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
10 high probability of a probe failure. 

NOV24a and NOV24b 

Expression of gene NOV24a and variant NOV24b was assessed using the primer-probe 
set Ag2966> described in Table YA. Results of the RTQ-PCR runs are shown in Tables YB 
and YC. 

1 5 Table YA. Probe Name Ag2966 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -agcgggcaactcttcatcta-3 1 


20 


1356 


1084 


Probe 


TET-5 1 -atgcagtcagtgaccagctccctg-3 1 - 
TAMRA 


24 


1377 


1085 


Reverse 


5 1 -caggacaaagactgcagtcact-3 ' 


22 


1409 


1086 



Table YB. Panel 13D 



Tissue Name 


Rel. Exp.(%) 
Ag2966,Run 
160658385 


Rel. Exp.(%) 
Ag2966, Run 
165701959 


Tissue Name 


Rel. Exp.(%) 
Ag2966,Run 
160658385 


Rel. Exp.(%) 
Ag2966, Run 
165701959 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


1.0 


0.0 


Pancreas 


0.2 


1.3 


Renal ca. 786- 
0 


0.2 


0.0 


Pancreatic ca. 
CAP AN 2 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.3 


Adrenal gland 


0.2 


0.0 


Renal ca. RXF 
393 


0.0 


0.5 


Thyroid 


0.3 


0.0 


Renal ca. 
ACHN 


0.3 


0.0 
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Salivary gland 


0.1 


0.4 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


10 


0.0 


0.0 


Brain (fetal) 


0.0 


0.0 


Liver 


0.6 


0.9 


tsrain ^wnoie ) 


ft ft 


ft ^ 
U.J 


T i \/*»r ( fiokfci 1 ^ 
JUIVCi \\X>uX\) 


0 4 


0 0 


Brain (amygdala) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

riepoz 


0.3 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.8 


0.6 


Brain 

(hippocampus) 


0.5 


0.3 


Lung (fetal) 


1.4 


1.4 


Brain (substantia 
nigra) 


0.0 


0.0 


Lung ca. 
(small cell) 
LX-l 


0.6 


0.0 


Brain (thalamus) 


0.0 


0.5 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Cerebral Cortex 


0.0 


0.0 


Lung ca. 
(sxell var.) 
SHP-77 


0.3 


0.4 


Spinal cord 


0.2 


0.2 


Lung ca. (large 
cell)NCI-H460 


0.2 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.8 


0.7 


glio/astroU-118- 
MG 


0.0 


0.0 


Lung ca. (non- 

s.cell)NCI- 

H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


1.6 


2.2 


neuro*; met SK-N- 
AS 


0.0 


0.4 


Lung ca. (non- 

s.cl)NCI- 

H522 


0.5 


0.0 


astrocytoma SF- 
539 


0.0 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.4 


Mammary 

crlanH 


0.3 


0.7 


glioma U251 


0.0 


0.4 


Breast ca.* 
(pl.ef) MCF-7 


0.2 


0.0 . 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


' 0.0 


0.0 


Heart (fetal) 


0.0 


0.0 


Breast ca.* 


0.0 


0.0 
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(pl.ef) T47D 






Heart 


0.0 


0.0 


549 


0.2 


0.0 


Skeletal muscle 
(fetal) 


3.9 


0.3 


MDA-N 


0.0 


0.0 


Skeletal muscle 


0 0 


0 0 


Ovflrv 
v/vai y 


U.J 


0 0 


Bone marrow 


6.9 


2.5 


Ovarian ca. 

V V/AIV'J 


0.0 


0.0 


Thymus 


1.9 


0.9 


Ovarian ca. 

fWPAR-4 


0.0 


0.4 


Spleen 


4.2 


1.0 


Ovarian ca. 
OVCAR-5 


0.3 


0.0 


Lymph node 


3.0 


5.4 


Ovarian ca. 

\J v v^/\i\.-o 


0.0 


0.0 


Colorectal 


0.5 


0.0 


v^val Jail ta. 

IGROV-1 


0.0 


0.0 


5 torn anli 




ft 0 


Ovarian ca.* 
OV-3 


MM 


u.u 


Small intestine 


1.5 


0.5 


Uterus 


0.0 


0.5 


Colon ca. SW480 


0.8 


0.0 


Placenta 


0.0 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


.0.0 


Testis 


1.6 


0.8 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.3 


0.3 


Melanoma* 
(met) 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.3 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


0.2 


0.7 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.2 


0.3 


Melanoma 
LOXIMVI 


0.0 


0.0 


Trachea 


0.6 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


100.0 


100.0 


Adipose 


o.i ! 


0.6 
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Table YC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2966, Run 
160660646 


Tissue Name 


Rel. Exp:(%) 
AgZiJoo, Kun 
160660646 


Secondary Thl act 


0.0 


riU V bC 1L.- 1 beta 




Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.9 


Secondary Trl act 


0.7 


HUVEC TNF alpha + IFN 
gamma 


0.3 


Secondary Thl rest 


0.2 


HUVEC TNF alpha + IL4 


0.5 


Secondary Th2 rest 


1.6 


HUVEC IL-11 


0.8 


Secondary Trl rest 


1.5 


Lung Microvascular EC 
none 


2.0 


Primary Thl act 


0.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.8 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


1.9 


Primary Trl act 


0.3 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


1.7 


Primary Thl rest 


4.8 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primarv Th?. rest 


2.1 


Small airway epithelium 
none 


0.0 


Primarv Tr1 rest 

1 1 11I1CU V All. IvoU 


3.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.3 


CD45RACD4 
lymphocyte act 


0.6 


Coronerv arterv SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


1.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.3 


CDS lymphocyte act 


0.3 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


1.3 


Astrocytes TNFalpha + 
IL-lbeta 


0.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.3 


CD4 lymphocyte none 


1.7 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


0.8 


CCD1106 (Keratinocytes) 
none 


0.1 


LAK cells rest 


2.1 


llui iuo ^rwcraunot/yicsj 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.4 


Liver cirrhosis 


0.4 


LAK cells IL-2+IL-12 


0.3 


Lupus kidney 


1.0 


LAK cells IL-2+IFN 
gamma 


0.6 


NCI-H292 none 


0.6 


LAKcellsIL-2+IL-18 


0.6 


NCI-H292 IL-4 


0.0 


LAK cells 


0.0 


NCI-H292 IL-9 


0.0 
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PMA/ionomycin 




L . ... 


NK Cells IL-2 rest 


0.6 


NCI-H292 IL-13 


! o.o 


Two Way MLR 3 day 


1.1 


NCI-H292 IFN gamma 


| 0.0 


Two Way MLR 5 day 


1.0 


HPAEC none 


6.6 


Two Way MLR 7 day 


0.5 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.6 


Lung fibroblast none 


0.0 


PBMC PWM 


1.1 


T una -PiU^/^WI ' 1 'KTT7 n Lko 

jLung iiDroDiast liNr aipna 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.2 


Lung IiDroDiast il,-** 


u.u 


Ramos fR celH none 


1 7 


Lung fibroblast EL-9 


0.0 


Ramos (B cell) 
ionomvcin 

XVllVllXT Will 


1.1 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


1.4 


Lung fibroblast IFN 
gamma 


0.2 


B lymphocytes CD40L 
and IL-4 


4.7 


Dermal fibroblast 
cuuiu/u rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CULM U/U INr alpna 


1.4 


EOL-1 dbcAMP 
PMA/ionomycin 


0.1 


Dermal tibro blast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.5 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


6.6 


Dendritic cells anti- 
CD40 


U.J 


TRF) Politic 0 


u.Z 


Monocytes rest 


1.2 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


4.9 


Macrophages rest 


0.2 


Lung 


3.1 


Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


0.5 


Kidney 


4.6 


HUVEC starved 


0.3 







Panel 1.3D Summary: Ag2966 Two experiments both show that expression of the 
NOV24a gene, a sodium-glucose cotransporter homolog, is limited to the kidney (CTs=29). 
This restricted expression is in agreement with published data which has shown that secondary 
active transport of glucose in the kidney is mediated by sodium glucose cotransporter. (See 
ref. 1). Thus, expression of this gene could be used as a marker for kidney tissue. Furthermore, 
the protein product may be important for normal function of the kidney. Thus, therapeutic 
modulation of the expression or function of this protein may be useful in treating diseases that 
affect the kidney, including diabetes. 
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Bissonnette P, Noel J, Coady MJ, Lapointe JY. Functional expression of tagged human 
Na+-g]ucose cotransporter in Xenopus laevis oocytes. J Physiol 1999 Oct 15;520 Pt 2:359-71 

1. High-affinity, secondary active transport of glucose in the intestine and kidney is 
mediated by an integral membrane protein named SGLT1 (sodium glucose cotransporter). 

5 Though basic properties of the transporter are now defined, many questions regarding the 
structure- function relationship of the protein, its biosynthesis and targeting remain 
unanswered. In order to better address these questions, we produced a functional hSGLTl 
protein (from human) containing a reporter tag. 2. Six constructs, made from three tags (myc, 
haemaglutinin and poly-His) inserted at both the C- and N-terminal positions, were thus tested 

10 using the Xenopus oocyte expression system via electrophysiology and 

immunohistochemistry. Of these, only the hSGLTl construct with the myc tag inserted at the 
N-terminal position proved to be of interest, all other constructs showing no or little transport 
activity. A systematic comparison of transport properties was therefore performed between the 
myc-tagged and the untagged hSGLTl proteins. 3. On the basis of both steady-state (affinities 

15 for substrate (glucose) and inhibitor (phlorizin) as well as expression levels) and presteady- 
state parameters (transient currents) we conclude that the two proteins are functionally 
indistinguishable, at least under these criteria. Immunological detection confirmed the 
appropriate targeting of the tagged protein to the plasma membrane of the oocyte with the 
epitope located at the extracellular side. 4. The myc-tagged hSGLTl was also successfully 

20 expressed in polarized MDCK cells. alpha-Methylglucose uptake studies on transfected cells 
showed an exclusively apical uptake pathway, thus indicating that the expressed protein was 
correctly targeted to the apical domain of the cell. 5. These comparative studies demonstrate 
that the myc epitope inserted at the N-terminus of hSGLTl produces a fully functional protein 
while other epitopes of similar size inserted at either end of the protein inactivated the final 

25 protein. 

PMID: 10523405 

Panel 2D Summary: Ag2966 Expression of the NOV24a gene is predominantly 
limited to the kidney. This result is in agreement with the expression seen in Panel 1 .3D. 

Panel 3D Summary: Ag2966 Results from one experiment with the NOV24a gene are 
30 not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4D Summary: Ag2966 Expression of the NOV24a gene is predominantly found 
in normal tissue from thymus, lung, colon and kidney. This expression profile suggests that the 
protein product may be involved in glucose transport in these tissues. Therefore, therapeutic 
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modulation of the expression or function of this protein may be useful in treating diseases that 
affect these organs. 

NOV28 

Expression of gene NOV28 was assessed using the primer-probe set Ag2891, 
5 described in Table ZA. Results of the RTQ-PCR runs .are shown in Tables ZB, ZC, ZD and 
ZE. 

Table ZA. Probe Name Ag2891 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ccggaaattaatctaccatcaa-3 ' 


22 


252 


1087 


Probe 


TET-5 1 -ccccgggtaacaactgtttcagatct-3 1 - 
TAMRA 


26 


289 


1088 


Reverse 


5 ' -gatggaaaagtccatgttggt-3 1 


21 


325 


1089 



Table ZB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2891, 
Run 224116294 


Tissue Name 


Rel. Exp.(%) Ag2891, 
Run 224116294 


AD 1 Hippo 


13.9 


Control (Path) 3 
Temporal Ctx 


43.2 


AD 2 Hippo 


27.0 


Control (Path) 4 
Temporal Ctx 


20.9 


AD 3 Hippo 


36.1 


AD 1 Occipital Ctx 


6.4 


AD 4 Hippo 


16.5 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


35.1 


AD 3 Occipital Ctx 


19.1 


AD 6 Hippo 


29.9 


AD 4 Occipital Ctx 


51.1 


Control 2 Hippo 


6.0 


AD 5 Occipital Ctx 


0.0 


Control .4 Hippo 


34.4 


AD 6 Occipital Ctx 


lhO 


Control (Path) 3 
Hippo 


17.4 


Control 1 Occipital 
Ctx 


20.6 


AD 1 Temporal Ctx 


76.3 


Control 2 Occipital 
Ctx 


18.2 


AD 2 Temporal Ctx 


23.3 


Control 3 Occipital 
Ctx 


0.0 


AD 3 Temporal Ctx 


0.0 


Control 4 Occipital 
Ctx 


17.3 


AD 4 Temporal Ctx 


34.6 


Control (Path) 1 
Occipital Ctx 


65.1 


AD 5 Inf Temporal 
Ctx 


24.5 


Control (Path) 2 
Occipital Ctx 


13.5 


AD 5 SupTemporal 


58.2 


Control (Path) 3 


0.0 



994 



WO 02/068649 



PCT/US02/02785 



Ctx 








AD 6 Inf Temporal 
Ctx 


45.1 


Control (Path) 4 

Hr-pi r\ itn 1 PtY 

uccipiuu i-xiA 


17.6 


AD 6 Sup Temporal 

LtX 


95.9 


Control 1 Parietal 

Ptv 


0.0 


Control 1 Temporal 
Ctx 


5.2 


Control 2 Parietal 


19.2 


Control 2 Temporal 
Ltx 


0.0 


Control 3 Parietal 


37.9 


Control 3 Temporal 

Ptv 


12.4 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 4 Temporal 
Ctx 


21.6 


Control (Path) 2 
Parietal Ctx 


25.9 


Control (Path) 1 
Temporal Ctx 


65.1 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


55.9 


Control (Path) 4 
Parietal Ctx 


11.2 



Table ZC Panel 13D 



Tissue Name 


Rel. Exp.(%) 
Ag2891,Run 
160898914 


Rel.Exp.(%) 
Ag2891,Run 
165701349 


Tissue Name 


Rel. Exp.(%) 
Ag2891,Run 
160898914 


Rel. Exp.(%) 
Ag2891,Run 
165701349 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


0.0 


0.0 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


72.7 


17.7 


Pancreatic ca. 
CAPAN2 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 

393 


98.6 


100.0 


Thyroid 


0.0 


0.0 


Renal ca. 
ACHN 


35.6 


21.3 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


43.5 


17.9 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 
10 


0.0 


0.0 


Brain (fetal) 


0.0 


0.0 


Liver 


0.0 


0.0 


Brain (whole) 


0.0 


0.0 


Liver (fetal) 


o.o 


0.0 


Brain (amygdala) 


18.4 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


1.9 


10.2 


Lung 


o.o 


0.0 . 


Brain 

(hippocampus) 


19.6 


12.2 


Lung (fetal) 


0.0 


0.0 


Brain (substantia 


0.0 


0.0 


Lung ca. 


0.0 


0.0 
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nigra) 






(small cell) 
LX-1 






Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Cerebral Cortex 


100.0 


11.8 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


12.4 


Spinal cord 


14.9 


13.5 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.0 


26.1 


gIio/astroU-118- 
MG 


0.0 


0.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


16.3 


0.0 


astrocytoma 

oW I /oi 


0.0 


0.0 


Lung ca* (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK-N- 
AS 


0.0 


0.0 


Lung ca. (non- 

s.cI)NCI- 

H522 


19.8 


0.0 


astrocytoma SF- 
539 


0.0 


10.5 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.0 


37.4 


glioma U251 


14.4 


0.0 


Breast ca.* 
(pl.ef)MCF-7 


0.0 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 


0.0 


0.0 


Heart (fetal) 


0.0 


0.0 


(pl.ef)T47D 


14.6 


0.0 


Heart 


0.0 


o.o j 


549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


15.8 


0.0 


MDA-N 


0.0 


0.0 


Skeletal muscle 


0.0 


0 0 


wvcu jr 


JO.O 


u.u 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


4.6 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.0 


0.0 


Ovarian ca. 


0.0 


0.0 
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OVCAR-8 






Colorectal 


26.2 


0.0 


Ovarian cn 

IGROV-1 


0.0 


0.0 


Stomach 


0.0 


0 0 


Ovarian ca * 

^aaUllwo/ OJV 

OV-3 


u.u 




Small intestine 


0.0 


30.4 


Uterus 


0.0 


0.0 


colon ca. oW4ou 


6.7 


23.3 


Placenta 


0.0 


0.0 


Colon ca.* 
met) 


ft ft 

u.u 


ft n 
U.U 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca * 
(bone met)PC- 
3 


0.0 


0.0 


cojon ca. riu i - 
116 


0.0 


0.0 


Testis 


92.0 


71.2 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


0.0 


Melanoma* 
(met) 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


0.0 


0.0 


Melanoma 
M14 


3.3 


0.0 


Bladder 


0.0 


0.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


0.0 


0.0 


Adipose 


0.0 


0.0 



Table ZD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2891,Run 
160899401 


Tissue Name 


Rel. Exp.(%) 
Ag2891, Run 
160899401 


Normal Colon 


0.0 ' 


Kidney Margin 
8120608 


18.2 


CCWelltoModDiff 
(OD03866) j 


15.6 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


37.1 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


100.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 


63 
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9010321 




CCModDiff(ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


0.0 


L.C Or.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


0.0 


trom rartial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


20.6 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 


0.0 


Colon mets to lung 
fOD04451-0n 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


17.2 


Breast Cancer 
Metastasis 


0.0 


Prostate Cancer 
COD04720-01 , > 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02} 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


24.7 


Breast Cancer - 


0.0 


Lung Met to Muscle 
(ODC4286) . 


15.7 


cabi iviai gin 

9100265 


0.0 


Muscle Margin 
(OD04286) 


0.0 


Breast Cancer 
A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


0 0 


Breast Margin 
A2090734 


u.u 


Lung Margin (OD03126) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003, 


0.0 


Lung Margin (OD04404) 


34.4 


Liver Cancer 1025 


8.7 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


36.6 | 


Lung Margin (OD04565) 


8.4 


Liver Cancer 6004-T 


8.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


8.2 


Ocular Mel Met to Liver 


17.0 


Liver Tissue 6005-N 


0.0 
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(ODO4310) 








Liver Margin [}JL>\j^fj iuj 




Normal ruaauer 


O.J 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


45.7 


Bladder Cancer 
A302173 


10.8 


Normal Kidney 


41.5 


Bladder Cancer 
(OD04718-01) 


8.2 


Kidney Ca, Nuclear grade 
2 (OD04338) 


28.3 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


0.0 


Kidney Margin 


47.3 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/Z yKJiJKJHODy) 


62.9 


Ovarian Cancer 

U04UUo 


9.1 


Kidney Margin 


40.9 


Ovarian Cancer 

(UDU4 1 oo-U /) 


0.0 


Kidney Ca, Clear cell 
type {kjljvhdW) 


97.3 


Ovary Margin 

(UDU4 /Oo-Uo) 


0.0 


Kidney Margin 

frVTlhAIAfW 
(yJUW dh U ) 


17.4 


Normal Stomach 


35.1 


Kidney Ca, Nuclear grade 


. 0.0 


Gastric Cancer 


0.0 


Kidney Margin 
frxr\f\Ai aq\ 


3.2 


Stomach Margin 


0.0 


Kidney Cancer 
ff\r%(\A6.T) ni^ 

^UUU4v)ZZ-Ul ) 


8.3 


Gastric Cancer 


0.0 


Kidney Margin 


17.7 


Stomach Margin 


23.5 


Kidney Cancer 
(OD04450-01) 


.0.0 


Gastric Cancer 
9060397 


8.1 


Kidney Margin 
(OD04450-03) 


4.3 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8 120607 


10.6 


Gastric Cancer 
064005 


0.0 



Table ZE. Panel 4D 



Tissue Name 


Rel.Exp.(%) 
Ag2891,Run 
159632970 


Tissue Name 


Rel.Exp.(%) 
Ag2891, Run 
159632970 


Secondary Thl act 


0.0 


HUVEC EL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 
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0 0 

U.V/ 


H7?VFP II -1 1 


V/.U 


Secondary Trl rest 


0.0 ■-. 


Lung Microvascular EC 

nunc 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 

I xNFaipila > li> i L/C la 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 

TMPalnha 4- TT -IKpta 
X rHraipucL < 11*- iucia 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 

1 INFdipna » 11^ lucid 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 

IJUJ1C 


0.0 


Primary Trl rest 


0.0 


Ctrl oil q tnirov **nitl"if»lliim 

Oman airway epimcjium 
TNFalpha + IL-lbeta 


0.0 


CD4SRA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD4SRO CD4 
lymphocyte act 


0.0 


L^oronery ancry oiviL/ 
TNFalpha + IL-lbeta 


0.0 




VJ*\J 


/\j>[rocyics re si 


ft ft 


Secondary CD8 

Ivmnhnpvtp rf*Qt 


0.0 


Astrocytes TNFalpha + 

TT -1hf*t?i 
LL/- 1 UcUi 


0.0 


Secondary CD8 

lvmnhrvf*vtp ar*t • 


3.5 


KU-812 (Basophil) rest 


0,0 


CD4 lymphocyte none 


0.0 


PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCD1106 (Keratinocytes) 
none 


0.0 


T A If 11 

LAK cells rest 


0.0 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


41.8 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAKcellsIL-2+IL-18 


0.0 


NCI-H292 IL-4 


1.5 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


19.5 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 
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PBMC PHA-L 


0.0 


jLung fibroblast IL-4 


0.0 


Ramos {p ceil) none 


U.U 


(-Lung TiDroDiasi \\j-y 


0 0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


jLung fibroblast IFN 
j gam ma 


0.0 


B lymphocytes CD40L 

„ J TT yl 

ana 1L-4 


o.o 


jDermal fibroblast 
jiAsUiu/u resx 


0.0 


EOL-1 dbcAMP 


7.3 


juermai iiuroDiasx 
JCCD1070TNF alpha 


0.0 


JbUL-1 abcAMr 
PMA/ionomvcin 

X IVlTi/ lWllvFlllJr will 


0.0 


iLiermai iiDroDiasi 
JCCD1070 IL-l beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
j gamma 


0.0 


Dendritic cells LPS 


2.2 


]Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


I1BD Colitis 2 




Monocytes rest 


0.0 


|IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


JColon 


100.0 


Macrophages re£t 


0.0 


|Lung 


32.3 


Macrophages LPS 


1.3 


jThymus 


40.3 


HUVEC none 


0.0 


(Kidney 


0.0 


HUVEC starved 


0.0 | 





CNS_neurodegeneration_vl.O Summary: Ag2891 The NOV28 gene is expressed at 
very low levels in the CNS. No differential expression is detected in the postmortem brains of 
Alzheimer's patients when compared with non-demented controls. The widespread expression 
5 in the brain however suggests that this gene may be of utility in the treatment of neurological 
diseases. 

Panel 13D Summary: Ag2891 Two experiments with the same probe and primer set 
produce results that are in excellent agreement, with highest expression of the NOV28 gene in 
a lung cancer cell line and the brain (CTs=33-34). Significant expression is also seen in the 
10 testis and a cluster of lung cancer cell lines. Thus, expression of this gene could be used to 
differentiate these samples from other samples on this panel, and as a marker of testis tissue 
and lung cancer. 

Panel 2D Summary: Ag2891 Expression of the NOV28 gene is limited to samples 

derived from kidney cancer (CTs=33-34). Thus, expression of this gene could be used to 

1 5 differentiate between these samples and other samples on this panel and as a marker to detect 

the presence of kidney cancer. Furthermore, therapeutic modulation of the expression or 

function of this gene may be effective in the treatment of kidney cancer. 
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Panel 3D Summary: Ag2891 Expression of the NOV28 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2891 The NOV28 transcript is expressed at low but 
significant levels in the colon and thymus (CTs=33-35). Thus, the transcript or the protein it 
5 encodes could be used for detection of these tissues. The protein encoded by this transcript 
may also play an important role in the normal homeostasis of these tissues. Therefore, 
therapeutics designed with the protein encoded by this transcript could be important for 
modulating T cell development in the thymus or maintaining or restoring normal function to 
these organs during inflammation due to inflammatory bowel disease in the colon. 

10 NOV29a 

Expression of gene NOV29a was assessed using the primer-probe set Ag2892, 
described in Table AAA. Results of the RTQ-PCR runs are shown in Tables AAB, AAC, 
AADand AAE. 



Table AAA. Probe Name Ag2892 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -ggcagctcgaaactttgac-3 1 


19 


171 


1090 


Probe 


TET-5 ' -cagaaatccgaagacatgctccgaag-3 1 - 
TAMRA 


26 


193 


1091 


Reverse 


5 ■ -gacaatgttgtccaggtcttgt-3 1 


22 


240 


1092 



15 



Table AAB. Panel 1.3D 



Tissue Name 


ReL Exp.(%) Ag2892, 
Run 165721697 


Tissue Name 


Rel. Exp.(%) Ag2892, 
Run 165721697 


Liver adenocarcinoma 


4.6 


Kidney (fetal) 


4.4 


Pancreas 


0.7 


Renal ca. 786-0 


3.6 


Pancreatic ca. CAPAN 

2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


42 


Renal ca.RXF393 


7.0 


Thyroid 


0.0 


Renal ca. ACHN 


2.1 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca.TK-10 


0.0 


Brain (fetal) 


o.o 


Liver 


41.5 


Brain (whole) 


0.6 


Liver (fetal) 


76.8 


Brain (amygdala) 


4.0 


Liver ca. 

(hepatoblast) HepG2 


8.0, 


Brain (cerebellum) 


2.0 


Lung 


26.8 


Brain (hippocampus) 


0.0 


Lung (fetal) 


11.5 
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Brain (substantia nigra) 


1.9 


Lung ca. (small cell) 
LX-1 


100.0 


Brain (thalamus) 


4.1 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


4.2 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


2.6 


Lung ca. (large 
cell)NCI-H460 


19.1 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


16.6 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


28.3 


astrocytoma SW1783 


2.1 


Lung ca. (non-s.cell) 
HOP-62 


10.4 


neuro*; met SK-N-AS 


1.3 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.7 


Heart (fetal) 


7.3 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


o.u 


Breast ca. BT-549 


A A 
0.0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


77.4 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


2.2 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAR- 

Q 
O 


0.0 


Colorectal 


0.0 


Ovarian ra TfrRfYV- 

l 


0.0 


Stomach 


5.6 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


2.7 


Colon ca. SW480 


32.3 


Placenta 


0.0 


Colon ca.* 


31.6 


Prostate 


0.0 
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SW620(SW480 met) 








Colon ca. HT29 


1.6 


Prostate ca.* (bone 


25.0 


Colon ca.HCT-116 


28.9 


Testis 


39.2 


Colon ca. CaCo-2 


■ 0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


1.4 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


142 


Gastric ca.* (liver met) 
NCI-N87 


3.4 


Melanoma M 14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


4.4 


Kidney 


3.2 


Adipose 


4.1 



Table AAC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2892,Run 
160942674 


Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
160942674 


Normal Colon 


1.8 


Kidney Margin 
8120608 


0.9 


CC WelltoModDiff 
(OD03866) 


1.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


0.7 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


2.0 


CC Margin (OD03 868) 


0.0 


Kidney Margin 
9010321 


0.0 


CCModDiff(ODO3920) 


2.4 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 0640 11 


1.1 


CC Gr.2 ascend colon 
(OD03921) 


1.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


1.5 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


5.3- 


Thyroid Cancer 
A302152 


1.6 


Liver Margin (ODO4309) 


100.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


5.6 


Normal Breast 


0.7 


Lung Margin (OD04451- 
02) 


42 


Breast Cancer 
(OD04566) 


0.0 
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Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


0.7 


Breast Cancer 

Metastasis 

(OD04655-05) 


3.3 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.7 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


6.7 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
rOD04286^ 


3.4 


Breast Margin 


0.0 


iviuouic i vial gin 

(OD04286) 


0.0 


Breast Cancer 
A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


l.O 


Breast Margin 
A2090734 


0.0 


Lung Margin (OD03126) 


9.2 


Normal Liver 

11U11UU1 JUlVvl 


59 0 


Lung Cancer (OD04404) 


3.5 


Liver Cancer 064003 


19.2 


T lino \Aara\n (CWl(\AAf\A\ 
J^mig ivlalglll \\JLJ\j L r^T\j t -r ) 


5.Z 


Liver Cancer 1U25 


60.7 


Lung Cancer (OD04565) 


1.4 


Liver Cancer 1026 


39.5 


Lung Margin (OD04565) 


12.3 


Liver Cancer 6004-T 


73.2 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


9.8 


Lung Margin (OD04237- 
02) 


7.2 


Liver Cancer 6005-T 


39.5 


Ocular Mel Met to Liver 
(ODO4310) 


0.6 


Liver Tissue 6005-N 


7.9 


Hver Margin (ODO4310) 


23.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


l.O 


Bladder Cancer 1023 


0.0 


Lung Margin (OD0432 1 ) 


18.8 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


2.0 


Bladder Cancer 
(OD04718-01) 


6.1 


Kidney Ca, Nuclear grade 
2 (OD04338) 


12 


Bladder Normal 
Adjacent (OD04718- 
U3) 


0.0 


Kidney Margin 
(OD04338) 


l.O 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


1.8 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


0.0 
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Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.6 


Kidney Margin 
(OD04340) 


3.3 


Normal Stomach 


1.6 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


0.6 


Kidney Margin 
(OD04348) 


0.0 


Stomach Margin 
9060359 


1.7 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 

(U1JU40Z.£-U 5 ) 


0.8 


Stomach Margin 


0.6 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.7 


Kidney Margin 
(OD04450-03) 


2.7 


Stomach Margin 
9060396 


0:9 


Kidney Cancer 8120607 


1.4 


Gastric Cancer 
064005 


0.0 



Table AAD. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
164629840 


Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
164629840 


Daoy- Medulloblastoma 


5.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE67 1 - Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


2.2 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


3.2 


Jurkat- T cell leukemia 


0.0 ! 


NCI-H292- 


0.0 


TF-1- Erythroleukemia 


0.0 
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Mucoepidermoid lung 
carcinoma 








DMS-114- Small cell 
lung cancer 


22.8 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


0.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


14.5 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


2.0 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


1.7 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


20.4 


G401- Wilms 1 tumor 


0.5 


NCI-H1155-Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


1.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


2.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.5 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


100.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


2.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 

• • 
adenocarcinoma 


0.7 


KM20L2- Colon cancer 


0.6 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


6.5 


NCI-H716- Colon cancer 


0.4 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


6.6 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


4.4 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


2.8 


HT-1 080- Fibrosarcoma 


0.0 
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NCI-SNU-5- Gastric 
carcinoma 


63.7 


MG-63- Osteosarcoma 


4.3 


KATO III- Gastric 
carcinoma 


23.7 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-I6- Gastric 
carcinoma 


1.3 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


NCI-SNIM- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


6.9 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


13.1 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


5.0 


NCI-N87- Gastric 

Oal U1UUJ Ha 


2.1 


SCC-4- Squamous cell 
carcinoma oi tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC- 15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


8.8 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table AAE. Panel 4D 



Tissue Name 


Rel.Exp.(%) 
Ag2892, Run 
164033139 


Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
164033139 


Secondary Thl act 


0.0 


HUVECEL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVECTNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


21.5 
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Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


65.1 


CD45KA LJJ4 
lymphocyte act 


6.7 


Coronery artery SMC rest 


0.0 


lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


LUo lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


32.5 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
r ivi/v/ 1 on omy c l n 


31.6 


2ryThimi2/Trl anti- 
CD95 CHI 1 


0.0 

— - — - ■ 


CCDl 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


y^y^u 1 1 uo ^j\.craiinocyxes ) 
TNFalpha + IL-lbeta 


3.8 


LAK cells IL-2 


0 0 


t-iver cinriosis 


1 AA A 


LAK cells IL-2+IL-12 




iwupus Kianey 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


11.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Wav MLR ^ dav 




iNv^j-rizyz iriN gamma 


A A 

u.o 


Two Wav MLR 5 dav 

* yv\j »t ix j itxxjxx mj uay 




rirf\£jK^ none 


A A 
U.O 


Two Way MLR 7 day 


0.0 


RPAPP TMP alnhn J- TT 1 

rir/vij^ i rsr aipna t 
beta 


0.0 


PBMC rest 


0 0 


XwUng iiDruDiaSL none 


A A 


PBMCPWM 


0.0 


Lung fibroblast TNF alpha 

+ IT - 1 hpfa 


8.1 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


10.4 


xxamos ^x> cenj none 


0.0 


T XT 1 1_ 1 a. TT f\ 

Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
lonomycm 


0.0 


Lung fibroblast IL-13 


4.3 


B lymphocytes PWM 


4.5 


Lung fibroblast IFN 
gamma 


5.8 


B lymphocytes CD40L 
and IL-4 


0.0 


UCilUal lIUrUDJdSl 

CCD1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


10.4 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 


13.0 
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gamma 




Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


21.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


12.2 


Macrophages rest 


0.0 


Lung 


63.7 


Macrophages LPS 


0.0 


Thymus 


43.8 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2892 Highest expression of the NOV29a gene is seen in a 
lung cancer cell line (CT=32.7). Significant expression is also seen in a colon cancer cell line 
and the liver. Thus, expression of this gene could be used to differentiate between these 
samples and other samples on this panel and as a diagnostic marker for the presence of colon 



5 and lung cancer. Furthermore, therapeutic modulation of the expression or function of this 
gene may be effective in the treatment of colon and lung cancers. Results from a second run 
with the same probe and primer set are not included because of a potential problem in one of 
the sample wells. 

Panel 2D Summary: Ag2892 Expression of the NOV29a gene is restricted to liver 
10 derived tissue, with highest expression in normal liver tissue (CT=32.4). Significant 

expression is also seen in liver cancer samples. Thus, expression of this gene could be used to 
differentiate liver derived samples from other samples on this panel and from other tissue 
samples. 

Panel 3D Summary: Ag2892 Highest expression of the NOV29a gene is seen in a 
15 lung cancer cell line (CT=31.4). Significant expression is also seen in a gastric cancer cell line. 
Thus, expression of this gene could be used to differentiate between these samples and other 
samples on this panel and as a diagnostic marker for the presence of gastric and lung cancer. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of gastric and lung cancers. 
20 Panel 4D Summary: Ag2892 Expression of the NOV29a gene is restricted to liver 

cirrhosis (CT=34.8). This liver specific expression is in agreement with the expression in 
Panels 1 .3D and 2D. Thus, expression of this gene could be used to differentiate between this 
sample and other samples on this panel, and as a marker of liver tissue. 

NOV29c 
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Expression of gene NOV29c was assessed using the primer-probe set Ag2893, 
described in Table ABA. Results of the RTQ-PCR runs are shown in Tables ABB, ABC, 
ABD, ABE and ABF. 



Table ABA Probe Name Ag2893 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gcccaatcctgatgactacttc-3 1 


22 


39 


1093 


Probe 


TET-5 * -ctccaagctcggagctttgacctg-3 1 - 
TAMRA 


24 


73 


1094 


Reverse 


5 * -ctcagcatgtcctctgatttct-3 * 


22 


98 


1095 



Table ABB. CNS_neurodegeneration_vl.O 



Tissue Name 


Run 224116295 


Tissue Name 


Kei. JLXp.(vo) AgZoW, 
Run 224116295 


AD 1 Hippo 


18.6 


v^unirOJ {rain) j 

Temporal Ctx 


15.8 


AD 2 Hippo 


59.0 


Temporal Ctx 


43.5 


AD 3 Hippo 


25 3 




41.6 


AD 4 Hippo 


56.3 


AD 2 Occipital Ctx 


0.0 


AD 5 Hippo 


95.9 


AD 3 Occipital Ctx 


27.5 


AD 6 Hippo 


86.5 


AD 4 Occinital Ptv 




Control 2 Hippo 


26.1 


AD 5 Occipital Ctx 


47.0 


Control 4 Hippo 


59.5 


AD 6 Occipital Ctx 


31.2 


Control (Path) 3 
Hippo 


17.3 


Control 1 Occipital 
Ctx 


20.9 


AD 1 Temporal Ctx 


48.6 


Control 2 Occipital 
Ctx 


63.7 


AD 2 Temporal Ctx 


61.1 


Control 3 Occipital 
Ctx 


62.0 


AD 3 Temporal Ctx 


25.2 


Control 4 Occipital 
Ctx 


28.1 


AD 4 Temporal Ctx 


70.7 


Control (Path) 1 
Occipital Ctx 


82.9 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


24.5 


AD 5 Sup Temporal 
Ctx 


55.9 


Control (Path) 3 
Occipital Ctx 


15.0 


AD 6 Inf Temporal 
Ctx 


86.5 


Control (Path) 4 
Occipital Ctx 


28.3 


AD 6 Sup Temporal 
Ctx 


92.7 


Control 1 Parietal 
Ctx 


20.4 
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Control 1 Temporal 
Ctx 


24.7 


Control 2 Parietal 
Ctx 


55.1 


Control 2 Temporal 
Ctx 


53.2 


Control 3 Parietal 
Ctx 


30.4 


Control 3 Temporal 


26.8 


Control (Path) 1 
r arietai l-tx 


74.2 


Control 3 Temporal 
Ctx 


40.6 


Control (Path) 2 
Parietal Ctx 


35.8 


Control (Path) 1 
Temporal Ctx 


63.7 


Control (Path) 3 
Parietal Ctx 


6.9 


Control (Path) 2 
Temporal Ctx 


44.1 


Control (Path) 4 
Parietal Ctx 


38.4 



Table ABC. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
160944329 


Rel. Exp.(%) 
Ag2893,Run 
165701489 


Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
160944329 


Rel. Exp.(%) 
Ag2893, Run 
165701489 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


19.5 


4.3 


Pancreas 


2.5 


4.0 


Renal ca. 786- 
0 


73.7 


64.2 


Pancreatic ca. 
CAP AN 2 


0.0 


0.0 


Renal ca. 
A498 


0.5 


0.6 


Adrenal gland 


0.8 


1.2 


Renal ca. RXF 
393 


100.0 


100.0 


Thyroid 


0.5 


0:3 


Renal ca. 
ACHN 


28.1 


18.8 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


23.5 


24.3 


Pituitary gland 


2.0 


1.8 


Renal ca. TK- 
10 


21.2 


8.9 


Brain (fetal) 


0.9 


2.0 


Liver 


0.0 


1.3 


Brain (whole) 


3.6 


6.3 


Liver (fetal) 


1.2 


1.6 


Brain (amygdala) 


3.6 


6.1 


Liver ca. 

(hepatoblast) 

HepG2 


10.9 


8.1 


Brain (cerebellum) 


6.3 


6.2 


Lung 


8.9 


[ 4.8 


Brain 

(hippocampus) 


9.3 


10.4 


Lung (fetal) 


6.6 


3.6 


Brain (substantia 
nigra) 


2.6 


5.4 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


Brain (thalamus) 


7.2 


13.8 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 
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Cerebral Cortex 


25.5 


3.3 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


17.0 


10.7 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astroU87-MG 


0.0 


0.0 


T / 

Lung ca. (non- 
sm. cell) A549 


0.0 


o.p 


glio/astroU-118- 
MG 


A A 

0.0 


A A 

0.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


2.3 


1.1 


astrocytoma 


0.6 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


3.6 


2.5 


neuro*; met SK-N- 
AS 


0.0 


0.0 


Lung ca. (non- 

s.cl)NCI- 

H522 


0.0 


0.0 


astrocytoma SF- 
539 


A A 

0.0 


A A 

0.0 


Lung ca, 
(squam.) SW 
900 


0.0 


2.4 


astrocytoma SNB- 
75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.8 


0.0 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.5 


0.0 


glioma U251 


0.0 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 

IVJLD-Zj 1 


0.4 


1.7 


Heart (fetal) 


14 


0.0 


(pl.ef)T47D 


4.8 


0.9 


Heart 


0.0 


0.0 


±jI Cab I wa. Dl" 

549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


3.3 


0.0 


Dl Caol Wa. 

MDA-N 


0.0 


0.0 


Skeletal muscle 


04 


0 0 


KJvaiy 


0 0 




Bone marrow 


0.0 


0.3 


Ovarian ca. 


3.2 


2.2 


Thymus 


1.4 


0.2 


Ovarian ca. 

OVrAP-4 


0.0 


0.0 


Spleen 


1.7 


0.6 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.0 


1.1 


Ovarian ca. 
OVCAR-8 


1.0 


1.2 . 


Colorectal 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.0 


2.0 


Ovarian ca.* 
(ascites) SK- 


0.0 


0.0 
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OV-3 






Small intestine 


| 0.0 


2.3 


Uterus 


1.9 


5.4 


Colon ca. SW480 


2.3 


0.5 


Placenta 


0.5 


0.9 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.6 


Prostate 


1.0 


1.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


25.7 


9.3 


Colon ca. CaCo-2 


0.9 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


1.5 


0.0 . 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 

_~f\ XTr>T VJC7 

met) 1NL.I-IN0/ 


0.0 


0.6 


Melanoma 
MJ4 


0.0 


0.0 


Bladder 


11.3 


5.0 


Melanoma 
LOXIMVI 


0.0 


0.0 


Trachea 


2.5 


0.0 


Melanoma* 
(met) SK- 
MEJL-5 


0.0 


0.0 


Kidney 


26.2 


5.3 


Adipose 


1.0 


0.0 



Table ABD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
160966072 


Tissue Name 


Rel.Exp.(%) 
Ag2893,Run 
160966072 


Normal Colon 


2.5 


Kidney Margin 
8120608 


16.8 


CCWelltoModDiff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.5 


Kidney Margin 
8120614 


18.0 


CC Gt2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


74.7 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


18.3 


CC Mod Diff(ODO3920) 


0.5 


Normal Uterus 


0.7 


CC Margin (ODO3920) 


0.4 


Uterus Cancer 064011 


10.8 


CC Gr.2 ascend colon 
(ODQ3921) 


0.5 


Normal Thyroid 


0.0 
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CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


1.7 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.7 


Thyroid Cancer 
A302152 


3.4 


Liver Margin (ODO4309) 


0.9 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 

[KJlJyjWj I -U 1 ) 


5.0 


Normal Breast 


2.0 


Lung Margin (OD0445 1 - 
02) 


18.0 


Breast Cancer 
(OD04566) 


0.2 


Normal Prostate 6546-1 


0.7 


Breast Cancer 
(OD04590-01) 


0.2 


Prostate Cancer 
(OD04410) 


2.0 


Breast Cancer Mets 
(OD04590-03) 


0.8 


Prostate Margin 
(OD04410) 


8.7 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.6 


Prostate Cancer 

[KJUvh /ZU-U 1 ) 


3.8 


Breast Cancer 064006 


0.3 


Prostate Margin 


3.5 


Breast Cancer 1024 


0.5 


Normal Lung 061010 


12.9 


Breast Cancer 
9100266 


0.5 


JLung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 


1.1 


Muscle Margin 
(OD04286) 


0.3 


-Drcobi cancer 
A209073 


0.2 


Lung Malignant Cancer 
(OD03126) 


8.7 


Breast Margin 
A2090734 j 


0.4 


Lung Margin (OD03126) 


21.0 


formal Liver 


0 0 


Lung Cancer (OD04404) 


8.0 


Liver Cancer 064003 


0.8 


Lung Margin (OD04404) 


28 7 


j^ivcr dancer iuzj 




Lung.Cancer (OD04565) 


0.7 


Liver Cancer 1026 


8.1 


lino- A/farcrin ( fYT>fi/l 
Juuug lvlaigul yVJlJvJHJO J J 


33.0 


Liver Cancer 6004-T 


1.2 


01) 


1.3 


Liver Tissue 6004-N 


0.0 


lino- \/f arrrin *7 

i^ung margin jujuu^f/j /- 
02) 


11.3 


Liver Cancer 6005-T 


5.4 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.3 


Liver Margin (ODO4310) 


0.4 


Normal Bladder 


17.3 


Melanoma Mets to Lung 
(OD04321) 


1.0 


Bladder Cancer 1023 


0.2 


Lung Margin (OD04321) 


32.1 


Bladder Cancer 
A302173 


1.3 
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Normal Kidney 


42.0 


Bladder Cancer 
(OD04718-01) 


7.9 


Kidney Ca, Nuclear grade 
2 (OD04338) 


55.5 


Bladder Normal 
Adjacent (OD04718- 


0.4 


Kidney Margin 


40.9 


Normal Ovary 


2.0 • 


Kidney Ca Nuclear grade 


61.1 


Ovarian Cancer 

U04UUo 


10.7 


Kidney Margin 


22.8 


Ovarian Cancer 

[kJUvQ / Oo-U / ) 


0.3 


Kidney Ca, Clear cell 
type ^wi^u t t j^uj 


100.0 


Ovary Margin 

[KJL/v) £ * fOo-K)o) 


42 


Kidney Margin 


31.9 


Normal Stomach 


3.1 


Kidney Ca, Nuclear grade 


3.3 


Gastric Cancer 


0.4 


Kidney Margin 


27.0 


Stomach Margin 


1.1 


Kidney Cancer 

(UJL/WOZZ-U I ) 


30.4 


Gastric Cancer 

yv)0\)3yD 


02 


Kidney Margin 
COD04622-03} 


17.4 


Stomach Margin 
90601 94 


3.0 


Kidney Cancer 
(OD04450-01) 


21.5 


Gastric Cancer 
9060397 


0.2 


Kidney Margin 
(OD04450-03) 


21.3. 


Stomach Margin 
9060396 


2.6 


Kidney Cancer 8120607 


70.7 


Gastric Cancer 
064005 


0.8 



Table ABE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
165924139 


Tissue Name 


Rel. Exp.(%) 
Ag2893,Run 
165924139 


Daoy- Medulloblastoma 


2.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Qvarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498-CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 
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SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.9 


Daudi- Burkitt s lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


OTy XT OTT 

SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin f s B-cell 
lymphoma 


0.0 


Cerebellum 


6.4 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


9.9 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


2.0 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


1.4 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


4.5 


769-P- Clear cell renal 
carcinoma 


7.7 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


100.0 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


73.2 


NCI-HI 57- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Urge cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


5.3 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


1.3 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


0.0 


ivu/v ra^a-j.- rancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


1.6 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal 'carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 


0.0 
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(transitional cell) 




SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder carcinoma 


0.0 


SW111 6- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhahdomvo^arcoma 


0 0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1 080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


9.7 


MG-63- Osteosarcoma 


0.0 


KATOIII- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-16- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0 0 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A 43 1 - Kni dermoid carcinoma 


0 0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0 0 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0 0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


0 0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-1 5- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma. 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table ABF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
159633002 


Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
159633002 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


b.o 


Secondary Th2 rest 


1.2 


HUVEC IL-11 


b.o 
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Secondary Trl rest 


q q jLung Microvascular EC 
(none 


0.0 


Primary Thl act 


0 0 jLung Microvascular EC 
}TNTalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.7 


Small airway epithelium 
none 


1.7 ' 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


3.0 


(JL>45KA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


4.4 


Secondary CD8 
lymphocyte rest 


0.2 


Astrocytes TNFalpha + 
IL-lbeta 


5.8 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


00 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


0.8 


CCD1 106 (Keratinocytes) 

nunc 


0.0 


LAK cells rest 


1.0 


1 1 uo ^Jveraiinocyxes ) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IT -2 


0.0 


Liver cirrhosis 


12.7 


LAK cells TI -2+TT -1? 


0 4 




in / 

18.6 


LAK cells TL-2+TFN 
gamma 


0.0 | 


NCI-H292 none 


6.5 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 TI -4 


5.6 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


6.8 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


3.6 | 


Two Wav MLR 3 dav 


0.0 


NCI-H292 IFN gamma 


2.9 


Two Way MLR 5 day 


0.0 


HPAEC none 


0 0 


Two Way MLR 7 day 


0.0 


HPAECTNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.3 


Lung fibroblast none 


0.0 


PBMC PWM 


0.4 


Lung fibroblast TNF alpha 
+ IL-lbeta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 
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Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.8 

-- — — - 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

J TT A 

and IL-4 


0.6 


Dermal fibroblast 
CCD 1070 rest 


0.7 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 

PA/fA/innnmvrin 


0.0 


Dermal fibroblast 

rfT)1070 TT -1 hpta 


0.0 


Dendritic cells none 


0.0 


LJCimal HDrODlaM XT IN 

gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


1.2 • 


Monocytes LPS 


0.0 


Colon 


0.8 


Macrophages rest 


0.0 


Lung 


342 


Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_yl.O Summary: Ag2893 This panel does not show 
differential expression of the NOV29c gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for a discussion 
5 of utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag2893 Two experiments with the same probe and primer set 
produce results that are in excellent agreement, with highest expression of the NOV29c gene 
in a renal cancer cell line (CTs=28-30). Significant expression is also seen in a cluster of renal 
cancer cell lines. Thus, expression of this gene could be used to differentiate between this 
10 sample and other samples on this panel and as a marker to detect the presence of renal cancer. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of renal cancer. 

This gene also is expressed at low, but significant levels in the brain. Expression of the 
NOV29C gene in the cerebral cortex suggests a role in CNS-specific processes. Homology to 
15 the tocopherol-associated protein (TAP) transcription factor suggests a role for NOV29C in 
tocopherol mediated gene transcription. Tocopherol is an essential vitamin involved in many 
CNS processes that may be mediated by both its antioxidant properties and ability to regulate 
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gene transcription via NOV29c. Genetic disruption of tocopherol processing results in 
tocopherol deficiency and CNS disorders such as ataxia and neurodegeneration. Agents that 
modulate NOV29c may thus have utility in the treatment of ataxia and neurodegenerative 
diseases. 
5 References: 

Yamauchi J, Iwamoto T, Kida S, Masushige S, Yamada K, Esashi T. Tocopherol- 
associated protein is a ligand-dependent transcriptional activator. Biochem Biophys Res 
Commun 2001 Jul 13;285(2):295-9 

Vitamin E is a term that encompasses a group of potent, lipid-soluble, chain-breaking 

10 antioxidants. Structural analysis reveals that molecules having vitamin E activity include four 
isomers (alpha, beta, gamma, and delta) of both tocopherols and tocotrienols. Alpha- 
tocopherol has been shown to have the highest biological vitamin E activity in mammalian 
tissues based on fetal resorption assays, and it reverses vitamin E deficiency symptoms. 
Although the molecular functions fulfilled specifically by alpha-tocopherol have yet to be 

15 fully described, it is unlikely that they are limited to general antioxidant functions. Here we 
show the functional characterization of alpha-tocopherol associated protein, TAP, which 
displays significant sequence similarity to the alpha-tocopherol transfer protein. Ligand 
competition analysis showed that recombinant TAP binds to alpha-tocopherol but not to other 
isomers of tocopherols. Using GFP fusion protein expression system, we observed that TAP 

20 translocates from cytosol to nuclei in alpha-tocopherol-dependent fashion. Transient 

transfection experiment showed that TAP activates transcription of the reporter gene in alpha- 
tocopherol-dependent manner. These results suggest that the biological function of alpha- 
tocopherol is not only as an antioxidant but also as a transcriptional regulator of gene 
expression via association with a transcription factor TAP. 

25 Yokota T, Igarashi K, Uchihara T, Jishage K, Tomita H, Inaba A, Li Y, Arita M, 

Suzuki H, Mizusawa H, Arai H. Delayed-onset ataxia in mice lacking alpha -tocopherol 
transfer protein: model for neuronal degeneration caused by chronic oxidative stress. Proc Natl 
Acad Sci U S A 2001 Dec 18;98(26): 15 185-90 

alpha-Tocopherol transfer protein (alpha-TTP) maintains the concentration of serum 

30 alpha-tocopherol (vitamin E), one of the most potent fat-soluble antioxidants, by facilitating 
alpha-tocopherol export from the liver. Mutations of the alpha-TTP gene are linked to ataxia 
with isolated vitamin E deficiency (AVED). We produced a model mouse of AVED by 
deleting the alpha-TTP gene, which showed ataxia and retinal degeneration after 1 year of age. 
Because the brain alpha-TTP functions in maintaining alpha-tocopherol levels in the brain, 
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alpha-tocopherol was completely depleted in the alpha-TTP(-/-) mouse brain, and the 
neurological phenotype of alpha-TTP(-/-) mice is much more severe than that of wild-type 
mice when maintained on an alpha-tocopherol-deficient diet. Lipid peroxidation in alpha- 
TTP(-/-) mice brains showed a significant increase, especially in degenerating neurons, alpha- 
5 Tocopherol supplementation suppressed lipid peroxidation and almost completely prevented 
the development of neurological symptoms. This therapy almost completely corrects the 
abnormalities in a mouse model of human neurodegenerative disease. Moreover, alpha-TTP(- 
/-) mice may prove to be excellent animal models of delayed onset, slowly progressive 
neuronal degeneration caused by chronic oxidative stress. 

10 Panel 2D Summary: Ag2893 Highest expression of the NOV29c gene is seen in a 

sample derived from a kidney cancer cell line (CT=29.5). In addition, this sample is more 
highly expressed in kidney cancer than in adjacent normal tissue. Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of kidney cancer. Furthermore, therapeutic modulation of the 

1 5 expression or function of this gene may be effective in the treatment of kidney cancer. 

Panel 3D Summary: Ag2893 Expression of the NOV29c gene is detected primarily in 
samples derived from kidney cancer cell lines (CTs=30). Thus, expression of this gene could 
be used to differentiate between these samples and other samples on this panel and as a marker 
to detect the presence of kidney cancer. Furthermore, therapeutic modulation of the expression 

20 or function of this gene may be effective in the treatment of kidney cancer. 

Panel 4D Summary: Ag2893 The NOV29c transcript is expressed at low but 
significant levels in the lung and thymus and in lupus kidney and cirrhotic liver. Thus, the 
transcript or the protein it encodes could be used for detection of these tissues. The expression 
of this gene suggests that the protein encoded by this transcript may play an important role in 

25 the normal homeostasis of the thymus and lung tissues. Therefore, therapeutics designed with 
the protein encoded by this transcript could be important for modulating T cell development in 
the thymus and for maintaining or restoring normal function to these lung during inflammation 
due to diseases such as asthma and emphysema. Additionally, induction of this transcript in 
other tissues such as the kidney and liver may be detrimental and antagonistic therapies 

30 designed with the protein encoded for by this transcript could be important in the treatment of 
diseases of these tissues. 

NOV24b 
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Expression of gene NOV24b was assessed using the primer-probe set Agl688, 
described in Table ACA. Results of the RTQ-PCR runs are shown in Tables ACB, ACC and 
ACD. 



Table ACA. Probe Name Agl688 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tcagaagggaatcatgatatcg-3 ' 


22 


577 


1096 


Probe 


TET-5 ' -ccttgataaaactccaggctcctttga- 
3 1 -TAMRA 


27 


550 


1097 


Reverse 


5 • -tttggaaggtaggcatattgg-3 ' 


21 


509 


1098 



Table ACB. Panel 1 3D 



Tissue Name 


Rel. Exp.(%)Agl688, 
Run 147249266 


Tissue Name 


ReLExp.(%) Agl688, 
Run 147249266 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


9.2 


Pancreas 


6.7 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.2 


Renal ca. A498 


1.7 


Adrenal gland 


1.8 


Renal ca. RXF 393 


0.0 


Thyroid 


3.8 


Renal ca. ACHN 


0.0 


Salivary gland 


1.5 


Renal ca.UO-31 


0.0 


Pituitarv eland 


6 1 


Renal ca TK-10 


n n 

U.v 


Brain (fetal) 


0.5 


Liver 


100.0 


Brain (whole) 


3.6 


Liver (fetal) 


99.3 


Brain (amygdala) 


3.3 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.4 


Lung 


1.3 


Brain (hippocampus) 


6.2 


Lung (fetal) 


1.8 


Brain (substantia nigra) 


1.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


2.1 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


6.3 


Lung ca. (s.cell var.) 
SHP-77 


0.8 


Spinal cord 


3.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell)A549 


02 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


02 


Lung ca. (non-s.cl) 


0.0 
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< 




NC1-H522 




astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.2 


astrocytoma SNB-75 


0.1 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma aNB-19 


0.2 


Mammary gland 


2.9 


glioma U251 


1.2 


Breast ca.* (pl.ei) 
MCF-7 


0.0 


glioma SF-295 


0.0, 


RrPAot pa * (n\ pfi 

MDA-MB-231 


0.0 


Heart (fetal) 


0.2 


Rrpjict ph * fnl pf\ 

T47D 


0.0 


Heart 


1 6 




0 0 


Skeletal muscle (fetal) 


0.7 


Breast ca. MDA-N 


0.0 


skeletal muscle 


1.2 


Ovary 


0.0 


Bone marrow 


0.5 


Ovarian ca, OVCAR- 
3 


0.2 


Thymus 


32 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


1.0 


Ovarian ca. OVCAR- 
5 


0.3 


Lymph node 


2.9 


Ovarian ca. OVCAR- 

C 
o 


0.0 


Colorectal 


0.8 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


3.3 


Ovarian ca.* (ascites) 

Jl\"v V -) 


1.0 


Small intestine 


6 2 


yj id uo 


1 4 
i .*t 


Colon ca. SW480 


0.0 


Placenta 


0.4 


Colon ca.* 

SW620(SW480met) 


0.0 


Prostate 


1.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
metiPC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


6.1 


Colon ca. CaCo-2 


0.2 


Melanoma 
Hs688(A).T 


0.4 


uoion ca. 
tissue(0DO3866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.9 


Colon ca HCC-99QR 




Ttifolcmrtmo T 1 A C^C* £S) 

ivieianoma u/\i^l^-o^ 


ft ft 


Gastric ca.* (liver met) 
NCI-N87 


4.4 


Melanoma M 14 


0.0 


Bladder 


3.1 


Melanoma LOX 
IMVI 


0.0 


Trachea 


3.0 


Melanoma* (met) 
SK-MEL-5 


0.0 
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Kidney 



6.8 



Adipose 



0.5 



Table ACC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Agl688, Run 
162646059 


Tissue Name 


Rel. Exp.(%) 
Agl688, Run 
162646059 


Normal Colon 


1.7 


Kidney Margin 
8120608 


0.7 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.2 


Kidney Margin 
8120614 


0.5 


CC Gr.2 rectosigmoid 
(OD03868) 


0.2 


Kidney Cancer 
9010320 


0.2 


CC Margin (OD03868) 


0.1 


Kidney Margin 
9010321 


1.0 


CCMod Diff(ODO3920) 


0.1 


Normal Uterus 


0.2 


CC Margin (ODO3920) 


0.9 


Uterus Cancer 064011 


0.8 


CC Gr.2 ascend colon 
(OD03921) 


0.1 


Normal Thyroid 


0.9 


CC Margin (OD03921) 


0.1 


Thyroid Cancer 
064010 


02 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


4.7 


Thyroid Cancer 

AjUZ 1 


0.5 


Liver Margin (ODO4309) 


100.0 


Thyroid Margin 
A302153 


1.0 


Colon mets to lung 
(OD04451-01) 


0.1 


Normal Breast 


0.3 


Lung Margin (OD04451- 
02) 


0.1 


Breast Cancer 
(OD04566) 


0.1 


Normal Prostate 6546-1 


2.1 


Breast Cancer 
(OD04590-01) 


0.1 


Prostate Cancer 
(OD04410) 


0.6 


Breast Cancer Mets 
(OD04590-03) 


04 


Prostate Margin 
(OD04410) 


0.5 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.9 


Prostate Cancer 
(OD04720-01) 


1.1 


Breast Cancer 064006 


0.6 


Prostate Margin 
(OD04720-02) 


1.6 


Breast Cancer 1024 


\2 


Normal Lung 061010 


2.0 


Breast Cancer 
9100266 


0.1 


Lung Met to Muscle 


0.0 


Breast Margin 


0.1 
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IVlUbCiC i Vial gill 

(OD04286) 


0.2 


A209073 


0.3 


Lung Malignant Cancer 
(OD03126) 


0.1 


Breast Margin 
A2090734 


0.3 


Lung Margin (OD03126) 


6.5 


Normal Liver 


69.7 


Lung Cancer (OD04404) 


0.1 


Liver Cancer 064003 


13.7 


T imp Manrin ffYn04404Y 

idling iviaigui ^vyi/utti/t / 


0 2 

VJ .__ 


T iver Panrpr 


18 0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


12 


Lung Margin (OD04565) 


0.1 


T /"^ f A A A T* 

Liver Cancer 6004-T 


22.2 


Lung Cancer (UDU42J7- 
01) 


0.1 


Liver Tissue 6004-N 


1.0 


Lung Margin (OD04237- 
02) 


0.4 


Liver Cancer 6005-T 


1-9 


Ocular Mel Met to Liver 
(ODO4310) 


0.1 


Liver Tissue 6005-N 


42 


Liver Margin (ODO4310) 


77.4 


Normal Bladder 


2.7 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.1 


Bladder Cancer 
A302173 


0.2 


Normal Kidney 


12.9 


Bladder Cancer 
(OD04718-01) 


0.1 


Kidney Ca, Nuclear grade 
2 (OD04338) 


3.8 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.5 


Kidney Margin 
(OD04338) 


1.6 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


2.8 


Ovarian Cancer 
064008 


0.1 


Kidney Margin 


9.3 


Ovarian Cancer 

//~\T\f\ A *7/TO at\ 

(OD04768-07) 


0.2 


Kidney Ca, Clear cell 
type 


_. . 1.4 


Ovary Margin 
(OD04768-08) 


0.1 


Kidney Margin 

(ULJU4_J4U} 


4.1 


Normal Stomach 


0.3 


Kidney Ca, Nuclear grade 
J (UUU4J4S) 


0.1 


Gastric Cancer 
9060358 


0.1 


Ic'iflnpv \AnTOtn 

(OD04348) 


3.8 


oiomacn iviargin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.2 


Gastric Cancer 
9060395 


0.2 


Kidney Margin 
(OD04622-03) 


0.7 


Stomach Margin 
9060394 


0.3 


Kidney Cancer 


02 


Gastric Cancer 


0.3 
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(OD04450-01) 




9060397 




Kidney Margin 
(OD04450-03) 


2.6 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


1.1 



Table ACD. Panel 5 Islet 



Tissue Name 


Rel. Exp.(%) 
Agl688, Run 
226587524 


Tissue Name 


Rel. Exp.(%) 
Agl688,Run 
226587524 


97457JPatient- 
02go_adipose 


41.2 


94709_Donor 2 AM - A_adlpose 


0.0 


97476_Patient- 
07sk skeletal muscle 


9.9 


94710JDonor 2 AM - B_adipose 


0.0 


97477_Patient- 
07ut_uterus 


8.1 


9471 l_Donor 2 AM - C_adipose 


0.0 


97478J>atient- 
07pl_placenta 


0.0 


94712JDonor 2 AD - A_adipose 


11.4 


99 167_Bayer Patient 1 


84.7 


94713_Donor 2 AD - B_adipose 


0.0 


97482_Patient- 
08ut_uterus 


2.4 


947 1 4_Donor 2 AD - C_adipose 


29.1 


97483_Patient- 
08plj?lacenta 


0.0 


94742_Donor 3 U - 
AJMesenchymal Stem Cells 


19.2 


97486JPatient- 
09sk skeletal muscle 


8.0 


94743_Donor 3 U - 
B_Mesenchymal Stem Cells 


0.0 


97487J>atient- 
09ut_uterus 


9.6 


94730_Donor 3 AM - A_adipose 


15.0 


97488_Patient- 
09pl_placenta 


0.0 


9473 1 JDonor 3 AM - B_adipose 


37.9 


97492_Patient- 
10ut_uterus 


u.u 


y4 /32_JJonor i AM - c_aaipose 


A A 

U.U 


97493_Patient- 
10pl_placenta 


0.0 


94733_Donor 3 AD - A_adipose 


39.2 


97495 JPatient- 
llgo_adipose 


0.0 


94734_Donor 3 AD - B_adipose 


11.4 


97496JPatient- 
llsk skeletal muscle 


52.9 


94735_Donor 3 AD - C_adipose 


34.4 


97497JPatient- 
llut_uterus 


35.8 


771 38JLiver_HepG2untreated 


8.4 


97498_Patient- 
llpljlacenta 


10.5 


73556JHeart_Cardiac stromal 
cells (primary) 


0.0 


97500 JPatient- 
12go_adipose 


0.0 


81735_Small Intestine 


100.0 


97501_Patient- 
12sk_skeletal muscle 


35.4 


72409_Kidney_Proximal 
Convoluted Tubule 


9.9 
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97502_Patient- 
12ut_uterus 


20.7 


82685_Small intestine_Duodenum 


70.2 


97503 JPatient- 
12pl_placenta 


0.0 


90650_Adrenal_Adrenocortical 
adenoma 


25 5 


94721_Donor2U- 
AJVIesenchymal Stem 
Cells 


0.0 


72410_Kidney_HRCE 


10.4 


94722_Donor2U- 
B Mesenchymal Stem 
Cells 


0.0 


72411_Kidney_HRE 


7.2 


94723_Donor2U- 
C Mesenchymal Stem 
Cells 


0.0 


73139_Uterus_Uterine smooth 
muscle cells 


0.0 



Panel 13D Summary: Agl688 Expression of the NOV24b gene, a plasma kallikrein 
homolog, is significantly higher in liver (CTs=28) than in any other sample on this panel. 
Thus, expression of this gene could be used as a marker of liver tissue. Plasma kallikrein is a 
5 serine protease that, among other roles, plays a part in blood coagulation, fibrinolysis, and 
complement activation and has been implicated in adipose differentiation by remodelling of 
the fibronectin-rich ECM of preadipocytes. Therefore, an antagonist to this gene product may 
be beneficial in the treatment of obesity. 

References: 

10 Hoover-Plow J, Yuen L. Plasminogen binding is increased with adipocyte 

differentiation. Biochem.Biophys.Res.Commun. (2001) 284, 389-394 

The purpose of this study was to examine the role of the plasminogen system in the 
development of adipose tissue. Plasminogen binding capacity was determined in differentiated 
and undifferentiated cells from adipose tissue of plasminogen deficient mice and 3T3 cells, a 

1 5 well-characterized tissue culture model. In 3T3 cells, plasminogen binding was fivefold higher 
in differentiated cells compared to the undifferentiated cells. Inhibition of binding by 
carboxyl-terminal lysine analogs was similar for the differentiated and undifferentiated cells 
with tranexamic acid > EACA > lysine. The binding of plasminogen was concentration- 
dependent and approaches saturation in the both cell types. The number of plasminogen 

20 binding sites was tenfold higher in the differentiated compared to the undifferentiated cells. In 
isolated mature fat cells and stromal cell cultures from mouse adipose tissue, plasminogen . 
binding was also higher in the differentiated mature fat cells and differentiated stromal cells 
compared to undifferentiated stromal cells. Plasminogen binding was elevated in the 
differentiated cells from the Pig-/- mice compared to cells from the WT mice. These results 
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suggest that the plasminogen system plays an important role in adipose tissue development. 
Copyright 2001 Academic Press. 
PMID: 11394891 

Selvarajan S, Lund LR, Takeuchi T, Craik CS, Werb Z.A plasma kallikrein-dependent 
5 plasminogen cascade required for adipocyte differentiation. Nature Cell Biol. (2001) 3, 267- 
275. 

Here we show that plasma kallikrein (PKal) mediates a plasminogen (Pig) cascade in 
adipocyte differentiation. Ecotin, an inhibitor of serine proteases, inhibits cell-shape change, 
adipocyte-specific gene expression, and lipid accumulation during adipogenesis in culture. 

10 Deficiency of Pig, but not of urokinase or tissue-type plasminogen activator, suppresses 
adipogenesis during differentiation of 3T3-L1 cells and mammary-gland involution. PKal, 
which is inhibited by ecotin, is required for adipose conversion, Pig activation and 3T3-L1 
differentiation. Human plasma lacking PKal does not support differentiation of 3T3-L1 cells. 
PKal is therefore a physiological regulator that acts in the Pig cascade during adipogenesis. 

1 5 We propose that the Pig cascade fosters adipocyte differentiation by degradation of the 
fibronectin-rich preadipocyte stromal matrix. 
PMID: 11231576 

Panel 2D Summary: Agl688 The expression of the NOV24b gene appears to be 
highest in a sample derived from a sample of normal liver tissue adjacent to a metastatic colon 

20 cancer CT=26.2). In addition, there is substantial expression in other samples of normal liver, 
and to a much lesser degree, malignant liver tissue. This liver specific expression is consistent 
with the expression seen in Panel 1.3D. Thus, the expression of this gene could be used to 
distinguish liver derived tissue from the other samples in the panel, and more specifically the 
expression of this gene could be used to distinguish normal liVer from malignant liver tissue. 

25 Moreover, therapeutic modulation of this gene, through the use of small molecule drugs, 
protein therapeutics or antibodies might be of benefit in the treatment of liver cancer. 

Panel 5 Islet Summary: Agl688 Expression of the NOV24b gene is limited to 
pancreatic islets and small intestines. Please see Panel 1 .3 for discussion of utility of this gene 
in metabolic disease. 

30 NOV30 

Expression of gene NOV30 was assessed using the primer-probe set Ag2894, 
described in Table ADA. Results of the RTQ-PCR runs are shown in Tables ADB, ADC and 
ADD. 
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Table ADA. Probe Name Ag2894 



Primers 


Sequences 


Length 


Start 
Position 


SBQ ID HO: 


Forward 


5' -cctgagtcaatccaagaaactg-3 1 


22 


94 


1099 


Probe 


TET-5 1 -aggtcatcaacccaggaccgcctag-3 ' - 
TAMRA 


25 


116 


1100 


Reverse 


5 1 - 1 ccagtagggat ctggagaagt - 3 • 


22 


149 


1101 



Table ADB. Panel 13D 



Tissue Name 


Rel Exof%^Ae2894 
Run 160968507 


Tissue Name 


Rel Exd (%\ Ap2894 
Run 160968507 


Liver adenocarcinoma 


0.0 


Kidnev Cfetall 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


r ancreaiic ca. wvr ajn 
2 


0.0 


Renal ca. A498 


0.0 


/vurenai giana 


ft ft 
U.U 


Kenai ca. KAr 3V3 


ft ft 
U.U 


Thyroid 


0.0 


Renal ca. ACHN 


50.3 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


8.1 


Brain (whole) 


0.0 ... 


Liver (fetal) 


6.4 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


6.7 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 




U.U 


Lung ca. (small cell) 
NCI-H69 


U.U 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


9.7 


gIio/astroU-118-MG . 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


19.8 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


3.4 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 


0.0 
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NCI-H596 




glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 

\Ar"T! 7 

ivi^r-/ 


0.0 


glioma SF-295 


0.0 j 


oreasi ca. {pi.eij 
MDA-MB-231 


3.3 


Heart (fetal) 


3.5 


ureast ca. \p\.ei) 
T47D 


0.0 


Heart 


U.U 


oreasi ca. d i 


U.U 


Skeletal muscle (fetal) 


12 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


6.5 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAR- 

o 
0 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 


0.0 


Small intestine 


U.U 


Uterus 


n n 
U.U 


Colon ca. SW480 


0.0 


Placenta 


o.o 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
metjrC-J 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


100.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


5.7 


Melanoma M 14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


14.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


.4.2 
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Table ADC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2894, Run 
160966709 


Tissue Name 


Rel. Exp.(%) 
Ag2894, Run 
160966709 


Normal Colon 


0.0 


Kidney Margin 
8120608 


0.0 


CC WelltoModDiff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer . 
9010320 


0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


0.0 


CCModDiff(ODO3920) 


0.0 


Normal Uterus 


o.o 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


OA 


Normal inyroid 


U.U 


i^c Margin (\JU\jjyZi ) 


u.u 


Thyroid Cancer 
064010 


A A 
U.U 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


3.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


40.3 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


a n 


Breast Cancer 
(OD04566) 


A A 

u.u 


"Wnrmjil Prnctatp* fc^Afc-l 


u.u 


Breast Cancer 
(OD04590-01) 


U.U 


Prostate Cancer 
(OD044.10) 


u.u 


Breast Cancer Mets 
(OD04590-03) 


A A 

U.U 


(OD04410) 


0.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.0 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


9.7 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
(OD04286) 


3.9 

• 


Breast Margin 
9100265 


0.0 


Muscle Margin 
(OD04286) 


0.0 


Breast Cancer 
A209073 


0.0 
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Lung Malignant Cancer 
(OD03126) 


4.4 


Breast Margin 
A2090734 


0.0 


Lung Margin (OD03126) 


25.0 


Normal Liver 


100.0 


Lung Cancer (OD04404) 


12.5 


Liver Cancer 064003 


5.3 


idling lvialglll ^v-/i-/v/ < + fc rV7*T ) 




T ivpr Pnnrpr 109^ 


20 2 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


12.7 


Lung Margin (OD04565) 


8.1 


Liver Cancer 6004-T 


13.0 


Lung Cancer (OD04237- 
01) 


2.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


19.3 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


16.2 


Bladder Cancer 
A302173 


15.7 


Normal Kidney 


0.0, 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


2.9 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


3.7 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


4.1 


Kidney Ca, Nuclear grade 
J (UD04348) 


0.0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 


0.0 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 


0.0 


Gastric Cancer 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 
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Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table ADD. Panel 4D 



1 issue IName 


Rel. Exp.(%) 
AgZ894, Run 
164033148 


Tissue Name 


Rel. Exp.(%) 

Ag2oV4, Kun 
164033148 


Secondary Thl act 


u.u 


riUVJbU iL-loeta 


U.U 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 

• 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


17.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.6 


oeconaary L/i/o 
lymphocyte rest 


0.0 


Astrocytes lJNralpna + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 
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JLAK CeJJS li^-Z"rlrjN 

gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


1 AT/ nt*\\c 

PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


i wo way jyllk j oay 


ft ft 

u.u 


jNci-rLdyz irvH gamma 


ft ft 
U.U 


l wo way iviLK j aay 


ft ft 

u.u 


rirAbC none 


ft ft 
U.U 


Two Way MLR 7 day 


0.0 


niVvbC, 1 INr alpna t jX,-1 

beta 


0.0 


r JDiviv^ re si 


ft ft 
U.U 


JLung tiDroDiast none 


ft ft 
U.U 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
t il-i oeta 


0.0 


PBMC PHA-L 


13.8 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD1 070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
riVLAyionomycin 


0.0 


Dermal fibroblast 
CCU1U/U 1L-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast irN 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


7.6 


Monocytes rest 


b.o 


ffiD Crohn's • 


0.0 


Monocytes LPS 


0.0 


Colon 


23.0 


Macrophages rest 


0.0 


Lung 


13.8 


Macrophages LPS 


0.0 


Thymus 


12.8 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 13D Summary: Ag2894 Expression of the NOV30 gene is restricted to the 
testis and a renal cancer cell line(CTs=33-35). Thus, expression of this gene could be used to 
differentiate these samples from other samples on this panel and as a marker for testis tissue 
5 and renal cancer. 

Panel 2D Summary: Ag2894 Expression of the NOV30 gene is restricted to normal 
liver tissue (CTs=33-35). This gene enodes a ryudocan homolog. Ryudocan is a cell-surface 
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heparan sulfate proteoglycan, which is involved in regulation of blood coagulation, among 
other biological functions. Thus, based on its expression profile, expression of this gene could 
be used to identify liver tissue and to differentiate between normal and malignant liver. 
Furthermore, this gene product may be involved in normal homeostasis of the liver. Thus, 
5 therapeutic modulation of the expression or function of this gene product may be effective in 
the treatment of liver disease and liver cancer. 
References: 

Kojima T, Inazawa J, Takamatsu J, Rosenberg RD, Saito ELHuman ryudocan core 
protein: molecular cloning and characterization of the cDNA, and chromosomal localization of 

10 the gene. Biochem Biophys Res Commun 1993 Feb 15;190(3):814-22 

We have isolated a series of overlapping cDNA clones encoding a 2,628 bp transcript, 
which potentially codes for a 198 amino acid protein with predicted molecular mass of 21,641 
daltons, for the human ryudocan core protein. The deduced core proteins of the human and the 
rat ryudocan have high structural conservation, particularly in the NH2 and COOH terminus 

1 5 regions of the putative mature core protein, including the combined 

transmembrane/cytoplasmic domains with conserved positions of all 4 tyrosine groups and 3 
conserved glycosaminoglycan chain attachment regions, which might serve important roles for 
biological function of ryudocan. A major 2.7 kb transcript was detected in all tissues tested, 
with relatively high levels of expression observed in mRNA from lung, liver, skeletal muscle 

20 and kidney. A minor 1 .9 kb transcript was also observed in some of tissues, which would be 
caused by alternative polyadenylation. Human ryudocan gene has been localized on the 
chromosome 20ql2 by fluorescence in situ hybridization. 
PMID: 7916598 

Kojima T. Molecular biology of ryudocan, an endothelial heparan sulfate proteoglycan. 

25 Semin Thromb Hemost 2000;26(l):67-73 

Ryudocan is a type I integral membrane heparan sulfate proteoglycan, which was 
originally cloned from rat microvascular endothelial cells. We have cloned the cDNA of rat 
ryudocan. The deduced amino acids of ryudocan has homologous transmembrane and 
intracellular domains with syndecan but very distinct extracellular regions. We also cloned the 

30 human ryudocan cDNA, of which the gene localizes on the chromosome 20ql2. To better 
understand the regulation of ryudocan expression, we have determined the structural 
organization of the human ryudocan gene. The human ryudocan gene extends approximately 
24 kb and is divided into five exons that appear conserved in syndecan family members. The 
5 -flanking sequences of the human.ryudocan gene contain a variety of potential binding sites 
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for transcription factors and are capable of functioning as a promoter. We purified human 
ryudocan and evaluated its interactions with several extracellular ligands. It was found that 
basic fibroblast growth factor (bFGF), midkine, and tissue factor pathway inhibitor exhibited 
significant ryudocan bindings. Heparitinase, but not chondroitin ABC lyase treatment, 
5 destroyed those ryudocan bindings; thus, the heparan sulfate chains of ryudocan appear to be 
responsible for those bindings. Immunohistochemical analysis revealed that ryudocan is 
expressed in peripheral nerve tissues, fibrous connective tissues, and placental trophoblasts. 
These findings suggest that ryudocan may possess multiple biologic functions, such as bFGF 
modulation, neurite growth promotion, and anticoagulation, via heparan sulfate-binding 
1 0 effectors in the cellular microenvironment. 
PMD: 10805285 

Panel 4D Summary: Ag2894 Expression of the NOV30 gene is restricted to a sample 
derived from liver cirrhosis. This gene is also expressed in normal liver in panel 2D. This 
expression suggests that this protein product, a ryudocan homolog, is essential to liver 
15 function. Thus, expression of this gene could be used as a marker for liver derived tissue. 

Furthermore, therapeutic modulation of the expression or function of this gene product may be 
effective in the treatment of diseases that affect the liver, including liver cirrhosis. 

NOV31 

Expression of gene NOV31 was assessed using the primer-probe set Ag2922, 
20 described in Table AEA. Results of the RTQ-PCR runs are shown in Table AEB. 

Table AEA. Probe Name Ag2922 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cttcaggctgtgggtcatc-3 1 


19 


407 


1102 


Probe 


TET-5 1 -caagccctactgctcccagtccag-3 ' - 
TAMRA 


24 


447 


1103 


Reverse 


5' -cagcaacagggcttacaaca-3 1 


20 


472 


1104 



Table AEB. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag2922, Run 
171619741 


Tissue Name 


Rel. Exp.(%) 
Ag2922, Run 
171619741 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


1.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


1.3 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 
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Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.6 


Lung Microvascular EC 
none 


3.2 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.9 


Primary Th2 act 


0.4 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 

nnvrp i i < tt il o_ 

TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


1.1 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


1.3 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.6 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


1.8 


CD45RO CD4 
lymphocyte act 


12 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


/TT"X O 1 1_ . . 

CD8 lymphocyte act 


1.6 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


2.1 


Astrocytes TNFalpha + 
IL-lbeta 


0.6 


Secondary CD8 
lymphocyte act 


1.2 


KU-812 (Basophil) rest 


3.4 . 


CD4 lymphocyte none 


0.4 


KU-812 (Basophil) 

A ft />n f\ty% xts* i r» 

x ivi/\/ionoinycin 


1.8 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


2.3 


CCD 1 106 (Keratinocytes) 

nunc 


0.0 


LAK cells rest 


0.9 


v^v^jlv i luo ^xvcraLin ocyics ) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0 8 




fi 7 

U./ 


LAK cells IL-2+IL-12 


2 0 


NrT-H?Q2 none 




LAK cells IL-2+IFN 
gamma 


1.9 


NCI-H292 DL-4 


4.3 


LAK cells IL-2+ IL-18 


0 0 




i n 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292IL-13 


1.8 


NK Cells IL-2 rest 


2.0 


NCI-H292 IFN gamma 


1.3 


Two Way MLR 3 day 


3.4 


HPAEC none 


0.4 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL- 1 
beta 


0.6 


Two Way MLR 7 day 


0.6 


Lung fibroblast none 


0.6 


PBMC rest 


0.5 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 



1038 



WO 02/068649 



PCT/US02/02785 



PBMC PWM 


0 0 


T liner fiKrnnlQct TT _/l 
lAUlg IlUJ UDIaSl 1L»"^+ 


i i 
i . i 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


ft A 

0.0 


T IT\ 11 a TT * O 

Lung fibroblast IL-13 


3.5 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


2.8 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


2.2 


B lymphocytes CD40L 
and TL-4 


2.4 


Dermal fibroblast 
ka^yjv kj/v iiMr aipna 


7.3 


EOL-1 dbcAMP 


5.3 


Dermal fibroblast 
pmifiTnTT i Vi^to 

LA^JLJ1U/U JLJL/-1 Deia 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


1.2 


nprmal fiKrrkKlact TT7K7 

gamma 


0.0 


Dendritic cells none 


2.5 


Dermal fibroblast IL-4 


1.3 


Dendritic cells LPS 


0.9 


Dermal Fibroblasts rest 


0.5 


Dendritic cells anti- 
CD40 


2.3 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


i.2 


Monocytes LPS 


0.0 


Colon 


3.1 


Macrophages rest 


1.8 


Lung 


6.7 


Macrophages LPS 


0.9 


Thymus 


32.5 


HUVEC none 


0.8 


Kidney 


100.0 


HUVEC starved 


0.5 







CNS_neurodegeneration_vl.O Summary: Ag2922 Expression of the NOV31 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 13D Summary: Ag2922 Expression of the NOV31 gene is low/undetectable in 
5 all samples on this panel (CTs>35). (Data not shown.) 

Panel 2D Summary: Ag2922 Results from one experiment with the NOV31 gene are 
not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4.1D Summary: Ag2922 Expression of the NOV3 1 gene is restricted to normal 
thymus and kidney (CTs=32-33). Thus, expression of this gene could be used as a marker for 
1 0 kidney and thymic tissue. 

Panel 4D Summary: Ag2922 Results from one experiment with the NOV31 gene are 
not included. The amp plot indicates that there were experimental difficulties with this run. 

NOV36a and NOV36b 
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Expression of gene NOV36a and variant NOV36b was assessed using the 
primer-probe sets Agl 136 and Ag2999, described in Tables AFA and AFB. Results of 
the RTQ-PCR runs are shown in Tables AFC, AFD, AFE and AFF. 



Table AFA. Probe Name Agl 136 



Primersj Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs 1 -tcatcaaagtgcaagacatcaa-3 ' 


22 


455 


1105 


|tet-5 ■ -ttttccccttgggccctaccatg-3 » - 
r ° S jTAMRA 


23 


492 


1106 


Reversejs' -atgtaccgacattggacatctc-3 1 


22 


526 


1107 



Table AFB. Probe Name Ag2999 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -gccctacgatacttttgtgtgt-3 1 


22 


1488 


1108 


Probe 


TET-5 1 -ctcctggccagctgattcaggtcat-3 ■ - 
TAMRA 


25 


1520 


1109 


Reverse 


5 » -gctactgttgccaacttcatct-3 * 


22 


1560 


1110 



Table AFC CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Agll36, 
Run 206992276 


Tissue Name 


Rel. Exp.(%) Agll36, 
Run 206992276 


AD 1 Hippo 


14.4 


Control (Path) 3 
Temporal Ctx 


6.2 


AD 2 Hippo 


44.4 


Control (Path) 4 
Temporal Ctx 


25.7 


AD 3 Hippo 


1.6 


AD 1 Occipital Ctx 


12.2 


AD 4 Hippo 


6.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


91.4 


AD 3 Occipital Ctx 


9.0. 


AD 6 Hippo 


48.3 


AD 4 Occipital Ctx 


17.7 


Control 2 Hippo 


35.6 


AD 5 Occipital Ctx 


12.5 


Control 4 Hippo 


14.3 


AD 6 Occipital Ctx 


35.4 


Control (Path) 3 
Hippo 


6.4 


Control 1 Occipital 
Ctx 


13.6 


AD 1 Temporal Ctx 


16.7 


Control 2 Occipital 
Ctx 


61.1 


AD 2 Temporal Ctx 


31.6 


Control 3 Occipital 
Ctx ; 


25.5 


AD 3 Temporal Ctx 


15.5 


Control 4 Occipital 
Ctx 


8.4 


AD 4 Temporal Ctx 


21.6 


Control (Path) 1 
Occipital Ctx 


73.7 
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AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


9.2 


AD 5 SupTemporal 
Ctx 


41.8 


Control (Path) 3 
Occipital Ctx 


4.6 


AD 6 Inf Temporal 
Ctx 


57.0 


Control (Path) 4 
Occipital Ctx 


21.3 


AD 6 Sup Temporal 
Ctx 


37.4 


Control 1 Parietal 
Ctx 


10.6 


Control 1 Temporal 
Ctx 


16.5 


Control 2 Parietal 
Ctx 


46.0 


Control 2 Temporal 
Ctx 


45.7 


Control 3 Parietal 
Ctx 


18.7 


Control 3 Temporal 
Ctx 


20.4 


Control (Path) 1 
ranetai ctx 


62.9 


Control 4 Temporal 
Ctx 


12.6 


Control (Path) 2 
Parietal Ctx 


17.9 


Control (Path) 1 
Temporal Ctx 


51.1 


Control (Path) 3 
Parietal Ctx 


10.1 


Control (Path) 2 
Temporal Ctx 


31.6 


Control (Path) 4 
Parietal Ctx 


39.5 



Table AFD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Agll36, 
Run 165528214 


Tissue Name 


Rel.Exp.(%)Agll36, 
Run 165528214 


Liver adenocarcinoma 


36.6 


Kidney (fetal) 


19.1 


Pancreas 


4.9 


Renal ca. 786-0 


30.1 


Pancreatic ca. CAPAN 
2 


12.0 


Renal ca. A498 


25.9 


Adrenal gland 


4.7 


Renal ca. RXF 393 


18.6 


Thyroid 


4.5 


Renal ca. ACHN 


22.2 


Salivary gland 


0.6 


Renal ca.UO-31 


22.4 


Pituitary gland 


10.0 


Renal ca. TK-10 


! 31.4 


Brain (fetal) 


53.2 


Liver 


0.0 


Brain (whole) 


76.3 


Liver (fetal) 


0.9 


Brain (amygdala) 


43.5 


Liver ca. 

(hepatoblast)HepG2 


4.3 


Brain (cerebellum) 


48.3 


Lung 


9.7 


Brain (hippocampus) 


42.6 


Lung (fetal) 


9.7 


Brain (substantia nigra) 


13.9 


Lung ca. (small cell) 
LX-1 


6.7 


Brain (thalamus) 


37.6 


Lung ca. (small cell) 
NCI-H69 


23.3 


Cerebral Cortex 


28.1 


Lung ca. (s.cell var.) 
SHP-77 


49.7 
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Spinal cord 


7.6 


Lung ca. (large 
ceIl)NCI-H460 


13.9 


glio/astro U87-MG 


12.7 


Lung ca. (non-sm. 
cell) A549 


8.4 


glio/astroU-118-MG 


85.9 


Lung ca. (non-s.cell) 
NCI-H23 


19.2 


astrocytoma SW1783 


33.9 


Lung ca. (non-s.cell) 
HOP-62 


30.4 


neuro*; met SK-N-AS 


72.2 


Lung ca. (non-s.cl) 
NCI-H522 


32.5 


astrocytoma SF-539 


51.1 


Lung ca. (squam.) 
SW 900 


47.6 


astrocytoma SNB-75 


39.5 


Lung ca. (squam.) 
NCI-H596 


80.7 


glioma SNB-19 


17.8 


Mammary gland 


17.2 ! 


glioma U251 


100.0 


Breast ca.* (pl.ei) 


38.4 


glioma SF-295 


13.5 


JDICasl U<1. ^pi.Ci ) 

MDA-MB-231 


43.5 


Heart (fetal) 


12.3 


Rrpoct r»o ♦ {r\\ o"P\ 

DiCaM Ca. \JJl.cI/ 

T47D 


13.8 


Heart 




Rr*»act pa TVT«.^4Q 
JDlCabt iva. £> lv*r7 


£1 1 

Ol.I 


Skeletal muscle (fetal) 


8.0 


Breast ca. MDA-N 


3.2 


Skeletal muscle ' 


0.0 


Ovary 


5.2 


Bone marrow 


1.6 


Ovarian ca. OVCAR- 
3 


192 


Thymus 


26.4 


Ovarian ca. OVCAR- 
4 


35.8 


Spleen 


15.1 


Ovarian ca. OVCAR- 
5 


46.0 


Lymph node 


23.3 


Ovarian ca. OVCAR- 
g 


54.7 


Colorectal 


6.3 


Ovarian ca. IGROV- 
1 


10.6 


Stomach 


19.8 ! 


Ovarian ca.* (ascites) 
SK-OV-3 


24.8 


Small intestine 


40 1 


1 Jtt?ri!Q 

\J Iwl UJ 


SO 0 


Colon ca. SW480 


16.0 


Placenta 


18.0 


CVilon en * 

SW620(SW480 met) 


4.7 


Prostate 


5.0 


Colon ca. HT29 


3.0 


Prostate ca.* (bone 
met)PC-3 


15.8 


Colon ca.HCT-1 16 


46.0 


Testis 


13.3 


Colon ca. CaCo-2 


10.1 


Melanoma 
Hs688(A).T 


3.6 
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Colon ca. 
tissue(0DO3866) 


25.7 


Melanoma* (met) 
Hs688(B).T 


6.2 


Colon ca. HCC-2998 


52.1 


Melanoma UACC-oZ 


Q Q 
O.O 


uasinc td. ^iiver meij 
NCI-N87 


26.2 


Melanoma M 14 


6.9 


Bladder 


9.0 


Melanoma LOX 
IMVI 


5.6 


Trachea 


20.6 


Melanoma* (met) 
SK-MEL-5 


0.8 


Kidney 


8.5 


Adipose 


9.9 



Table AFE. Panel 2D 



Tissue Name 


Rel.Exp.(%) 
Agll36,Run 
162599391 


Tissue Name 


Rel. Exp.(%) 
Agll36, Run 
162599391 


jNormai t^oion 




Kidney Margin 
8120608 


1A O 


CC Well to Mod DifF 
(OD03866) 


00 7 


Kidney Cancer 
8120613 


O 7 


CC \Anra\n {ClT)TMt(\£\ 

\^\^ ividrgin [KJXJKjjoooj 


19 9 


Kidney Margin 
8120614 




CC Gr.2 rectosigmoid 
(0DO3868) 




Kidney Cancer 
9010320 




CC Ma ram fOr)fY3R65Tl 


7 ft 


Kidney Margin 
9010321 


90 1 


CC Mod Diff (ODO3920) 


24.7 


Normal Uterus 


13.4 


CC Margin (ODO3920) 


10.2 


Uterus Cancer 0640 11 


42.9 


CC Gr.2 ascend colon 
(0DO3921) 


31.9 


Normal Thyroid 


12.8 


CC Margin (OD03921) 


10.4 


Thyroid Cancer 
064010 


20.4 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


9.1 


Thyroid Cancer 
A302152 


27.0 


Liver Margin (ODO4309) 


2.3 


Thyroid Margin 
A302153 


23.2 


Colon mets to lung 
(OD04451-01) 


22.1 


Normal Breast 


31.9 


Lung Margin (OD04451- 
02) 


13.2 


Breast Cancer 
(OD04566) 


37.9 


Normal Prostate 6546-1 


85.9 


Breast Cancer 
(OD04590-01) 


31.6 


Prostate Cancer 
(OD04410) 


34.9 


Breast Cancer Mets 
(OD04590-03) 


44.1 


Prostate Margin 


25.9 


Breast Cancer 


26.6 
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(OD04410) 




Metastasis 
(OD04655-05) 




Prostate Cancer 
(OD04720-01) 


24.1 


Breast Cancer 064006 


21.6 


Prostate Margin 
(OD04 720-02) 


26.4 


Breast Cancer 1 024 


21.6 


Normal Lung 061010 


40.3 


Breast Cancer 
9100266 


38.7 


Lung Met to Muscle 
(OD04286) 


100.0 


Breast Margin 

7 J UU^UJ 


21.9 


Muscle Margin 
(OD04286) 


8.2 


A209073 


41.8 


Lung Malignant Cancer 
(OD03126) 


28.3 


Breast Margin 
A2090734 


14.4 


Lung Margin (OD03 126) 


28.1 


Normal Liver 


2.2 


Lung Cancer (OD04404) 


51.4 


Liver Cancer 064003 


42 


Lung Marein (OD044041 

UUilg XTJlKiU gill 1 'X-' \/ 1 W \J 1 1 


5 6 


T ivpr f" , anr*i»r 1H9^ 


j. i 


Lung Cancer (OD04565) 


27.7 


Liver Cancer 1026 


8.1 


Lung Margin (UD04365) 


62 


Liver Cancer 6004-T 


1.3 


Lung Lancer (UDU4237- 
01) 


84.1 


Liver Tissue 6004-N 


3.6 


Lung Margin (UUU42i7- 
02) 


10.6 


Liver Cancer 6005-T 


20.6 


Ocular Mel Met to Liver 
(ODO4310) 


4.5 


Liver Tissue 6005-N 


1.2 


T inn* A/i\«/*<m /AAA/111 A\ 

Liver Margin (ULHJ43JU) 


3.5 


Normal Bladder 


30.1 


Melanoma Mets to Lung 
(OD04321) 


19.8 


Bladder Cancer 1023 


16.3 


Lung Margin (OD04321) 


21.9 


Bladder Cancer 
A302173 


22.2 


Normal Kidney 


29.7 


Bladder Cancer 
(OD04718-01) 


37.1 


Kidney Ca, Nuclear grade 
2 (OD04338) 


30.4 


Bladder Normal 
Adjacent (OD04718- 

AON 

03) 


12.1 


Kidney Margin 


16.2 


Normal Ovary 


25.2 


Kidney Ca Nuclear grade 
1/2 (OD04339) ' 


17.9 


Ovarian Cancer 


33.2 


Kidney Margin 
(OD04339) 


18.2 


Ovarian Cancer 
(OD04768-07) 


77.9 


Kidney Ca, Clear cell 
type (OD04340) 


21.6 


Ovary Margin 
(OD04768-08) 


11.9 


Kidney Margin 
(OD04340) 


18.8 


Normal Stomach 


16.5 
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Kidney Ca, Nuclear grade 
3 (OD04348) 


10.4 


Gastric Cancer 


8.4 


Kidney Margin 
(OD04348) 


22.1 


Stomach Margin 


5.8 


Kidney Cancer 
(OD04622-01) 


8.4 


Gastric Cancer 

y\)\>\)jyj 


25.3 


Kidney Margin 


5.6 


Stomach Margin 
9060394 


14.7 


Kidney Cancer 
(OD04450-01) 


42.0 


Gastric Cancer 
9060397 


42.3 


Kidney Margin 
(OD04450-03) 


13.6 


Stomach Margin 
9060396 


9.7 


Kidney Cancer 8120607 


20.2 


Gastric Cancer 
064005 


21.9 



Table AFF. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
AgllJb, 

Run 
164037277 


Rel. 
Exp.(%) 

Run 

165296355 


Tissue Name 


Rel. 
Exp.(%) 
Aell36, 

Run 
164037277 


Rel. 
Exp.(%) 
Ag2999, 

Run 
165296355 


Secondary Thl act 


4.9 


3.8 


HUVEC IL-lbeta 


15.8 


9.5 


Secondary Th2 act 


_ 10.7 


8.0 


HUVEC IFN 
gamma 


47.0 


61.1 


Secondary Trl act 


12.2 


10.2 


HUVEC TNF 
alpha + IFN 
gamma 


10.5 


8.9 


Secondary Thl rest 


4.9 


5.8 


HUVEC TNF 
alpha + IL4 


24.1 


15.1 


Secondary Th2 rest 


7.5 


3.0 


HUVEC IL-11 


37.9 


29.5 


Secondary Trl rest 


7.7 


. 4.2 


Lung 

Microvascular EC 
none 


85.9 


. 72.2 


Primary Thl act 


5.7 


7.3 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


37.1 


33.9 


Primary Th2 act 


9.1 


6.7 


Microvascular 
Dermal EC none 


85.9 


24.7 


Primary Trl act 


12.2 


9.2 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


26.6 


14.8 


Primary Thl rest 


21.0 


20.7 


Bronchial 
epithelium 
TNFalpha + 


21.8 


15.8 
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ILlbeta 






Prim on/ TT-iO rpct 

rviniary i nz rest 


1 J.O 


O.J 


Small airway 
epithelium none 


% i 




Primflrv TV1 rpct 


10 0 




Small airway 
epithelium 
TNFalpha* IL- 
lbeta 




1 J.U 


CD45RACD4 
lymphocyte act 


10.7 


8.2 


Coronery artery 
SMC rest 


15.3 


5.7 


CD4 

lymphocyte act 


5.2 


2.6 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


3.1 


4.6 


CD8 lymphocyte 
act 


4.5 


3.2 


Astrocytes rest 


33.2 


14.8 


lymphocyte rest 


2.8 


3.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


18.7 


15.4 


Secondary CD8 
lymphocyte act 


0. 1 




KU-812 
(Basophil) rest 


in o 


11 A 1 

I l.U 


CD4 lymphocyte 
none 


2.4 


1.6 


KU-812 

(Basophil) 

PMA/ionomycin 


41.8 


22.2 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


7.5 


10.0 


CCDII06 

(Keratinocytes) 

none 


48.3 


52.1 


LAK cells rest 


3.1 


1.8 


CCD1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


47.3 


46.7 


LAK cells IL-2 


4.7 


3.0 


Liver cirrhosis 


5.7 


1.8 


LAK cells IL-2+IL- 
12 


2.5 


0.8 


Lupus kidney 


5.0 


2.8 


LAK cells IL- 
2+IFN gamma 


4.1 


2.3 . 


NCI-H292 none 


22.8 


11.4 


LAK cells IL-2+ 
IL-18 


3.1 


1.0 


NCI-H292 IL-4 


40.1 


26.1 


T A V 11^ 

LAK cells 
PMA/ionomycin 


6.3 


4.3 


NCI-H292 IL-9 


32.1 


23.2 . 


VTT7 _ 1 1 _ TT ^ * 

NK Cells IL-2 rest 


11.2 


7.7 


NCI-H292 IL-13 


20.9 


24.0 


Two Way MLR 3 
day 


2.9 


3.0 


NCI-H292IFN 
gamma 


20.4 


16.6 


Two Way MLR 5 
day 


2.7 


2.4 


HPAEC none 


67.8 


66.4 


Two Way MLR 7 
day 


2.9 


0.8 


HPAEC TNF 
alpha + IL-1 beta 


16.5 


17.1 


PBMC rest 


0.7 


2.8 | 


Lung fibroblast 
none 


19.8 


15.8. 
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PBMC PWM 


12.7 


6.6 


Lung fibroblast 
TNF alpha + IL-1 
beta 


5.0 


6.7 


PBMC PHA-L 

x jliivxv^ x x in xj 


6 7 


3 2 


Lung fibroblast 
IL-4 


42 9 


30 1 


Ramos (B cell) 
none 


0 8 

v/.O 


0 0 

V/.v/ 


Lung fibroblast 
IL-9 


40 3 




Ramos (B cell) 
ionomycin 


1.0 


2.2 


Lung fibroblast 
IL-13 


24.7 


21.0 


B lymphocytes 
PWM 


9.7 


4.3 


Lung fibroblast 
IFN gamma 


40.3 


33.9 


B lymphocytes 
CD40L and IL-4 


5.2 


5.0 


Dermal fibroblast 
CCD1070 rest 


75.3 


52.5 


EOL-1 dbcAMP 


11.9 


3.7 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


55.5 


22.2 • 


EOL-1 dbcAMP 
PMA/ionomvcin 

j. ixxr\j iv/iiv/iiijr will 


17.8 


7.7 


Dermal fibroblast 
CCD1070 IL-1 
beta 


18.6 


18.9 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 
IFN gamma 


12.3 


16.0 


Dendritic cells LPS 


0.4 

— — — - — — 


0.0 


Dermal fibroblast 

TT -A 


16.6 


13.2 


lycnunuc ecus ami- 
CD40 


0.0 


0.5 


IBD Colitis 2 


0.8 


2.2 


Monocytes rest 


0.0 


0.0 


IBD Crohrfs 


1.2 


1.0 


Monocytes LPS 


0.6 


0.4 


Colon 


12.6 


6.9 


Macrophages rest 


0.8 


0.8 


Lung 


•7.7 


4.0 


Macrophages LPS 


0.0 


0.4 


Thymus 


13.8 


9.5 


HUVEC none 


45.4 


54.0 


Kidney 


73.7 


39.0 


HUVEC starved 


100.0 


100.0 









CNS_neurodegeneration_vl.O Summary: Agl 136 This panel does not show 
differential expression of the NOV36a gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Agl 136 Highest expression of the NOV36a gene, a cadherin 
1 1 homolog, is seen in a glioma cell line U251 (CT=30.1). There is also low but significant 
significant expression in clusters of ovarian, lung, breast, kidney and colon cancer cell lines. 
Thus, expression of this gene could potentially be used as a diagnostic marker for these 
1 0 cancers. Furthermore, therapeutic inhibition of this gene product may be useful in the 
treatment of these cancers. 
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This gene also exhibits brain-preferential expression, indicating a role in CNS-specific 
processes. Recent research has shown that genetic deletion of cadherin-1 1 function acts to 
increase long term potentiation, a process thought to underlie learning and memory. Thus, 
drugs that target this gene prodcut may have utility as memory enhancing drugs. Such drugs 
5 would have utility in treatment of CNS disorders involving memory dysfunction, including 
Alzheimer's disease and normal aging. In behavioral tests, Cadherin 1 1 deletion mutant mice 
show reduced fear- or anxiety-related responses. Thus, inhibitory agents targeting this gene 
product may also have utility as sedatives or anxiolytic agents for the treatment of CNS 
disorders including anxiety. 

10 Among tissues with metabolic function, this gene has low levels of expression in 

pancreas, adrenal, thyroid, pituitary, adult and fetal heart, and adipose. Thus, this cadherin-like 
gene product may be important in the pathogenesis, diagnosis, and/or treatment of metabolic 
and endocrine disease, including obesity and Types 1 and 2 diabetes. Decreased levels of 
cadherin have been associated with decreased insulin secretion, suggesting that increasing 

1 5 cadherin levels may be a potent therapeutic for Type 2 diabetes. In addition, this gene is 
expressed at higher levels in fetal (CT=34) vs adult skeletal muscle (CT=40) and may be 
useful for differentiation between the two sources of tissue. Ag2999 Results from one 
experiment with the CG56003-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

20 References: 

Manabe T, Togashi H, Uchida N, Suzuki SC, Hayakawa Y, Yamamoto M, Yoda H, 
Miyakawa T, Takeichi M, Chisaka O. Loss of cadherin-1 1 adhesion receptor enhances plastic 
changes in hippocampal synapses and modifies behavioral responses. Mol Cell Neurosci 2000 
Jun;15(6):534-46 

25 Cadherins organize symmetrical junctions between the pre- and postsynaptic 

membranes in central synapses. One of them, cadherin-1 1 (cadi 1), is expressed in the limbic 
system of the brain, most strongly in the hippocampus. Immunohistochemical studies of the 
hippocampus showed that cad 11 proteins were densely distributed in its synaptic neuropil 
zones; in cultured hippocampal neurons, their distribution often overlapped with that of 

30 synaptophysin, and also occasionally with that of GluRl at spines. To assess the role of cadi 1 
in synaptic formation and/or function, we analyzed brains of cadi 1 -deficient mice. In these 
mice, long-term potentiation (LTP) in the CA1 region of the hippocampus was, unexpectedly, 
enhanced; and the level of LTP saturation was increased. In behavioral tests, the mutant mice 
showed reduced fear- or anxiety-related responses. These results suggest that the cadi 1- 
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mediated junctions may modulate synaptic efficacy, confining its dynamic changes to a 
limited range, or these junctions are required for normal development of synaptic organization 
in the hippocampus. 

PMID: 10860580 

5 Yamagata K, Nammo T, Moriwaki M, Ihara A, Iizuka K, Yang Q, Satoh T, Li M, 

Uenaka R, Okita K, Iwahashi H, Zhu Q, Cao Y, Imagawa A, Tochino Y, Hanafusa T, 
Miyagawa Ji J, Matsuzawa Y. Overexpression of Dominant-Negative Mutant Hepatocyte 
Nuclear Factor-lalpha in Pancreatic beta-Cells Causes Abnormal Islet Architecture With 
Decreased Expression of E-Cadherin, Reduced beta-cell Proliferation, and Diabetes. Diabetes. 

10 2002 Jan;51(l):l 14-23. 

One subtype of maturity-onset diabetes of the young (MODY)-3 results from 
mutations in the gene encoding hepatocyte nuclear factor (HNF)-lalpha. We generated 
transgenic mice expressing a naturally occurring dominant-negative form of human HNF- 
1 alpha (P291fsinsC) in pancreatic beta-cells. A progressive hyperglycemia with age was seen 

15 in these transgenic mice, and the mice developed diabetes with impaired glucose-stimulated 
insulin secretion. The pancreatic islets exhibited abnormal architecture with reduced 
expression of glucose transporter (GLUT2) and E-cadherin. Blockade of E-cadherin-mediated 
cell adhesion in pancreatic islets abolished the glucose-stimulated increases in intracellular 
Ca(2+) levels and insulin secretion, suggesting that loss of E-cadherin in beta-cells is 

20 associated with impaired insulin secretion. There was also a reduction in beta-cell number 

(50%), proliferation rate (15%), and pancreatic insulin content (45%) in 2-day-old transgenic 
mice and a further reduction in 4-week-old animals. Our findings suggest various roles for 
HNF-1 alpha in normal glucose metabolism, including the regulation of glucose transport, 
beta-cell growth, and beta-cell-to-beta-cell communication. 

25 PMID: 11756330 

Panel 2D Summary: Agl 136 The NOV36a gene is a good target for ovarian, gastric, 
breast, lung, colon, uterine and kidney cancers because it is expressed at a higher level in these 
cancers than the adjacent normal tissue. Therefore, expression of this gene could potentially be 
used as a diagnostic marker for these cancers and therapeutic inhibition may be useful in 

30 treatment of these cancers. 

Panel 4D Summary: Agl 136/Ag2999 Two experiments produce results that are in 
excellent agreement. The NOV36A transcript is expressed at low levels in hematopoietic cells 
and at higher levels in endothelial cells. EL-1 beta and TNFalpha treatment reduce transcript 
levels consistently in endothelium samples including HPAEC, HUVEC and lung 
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microvascular EC. Fibroblasts also express this transcript and dermal fibroblasts down 
regulate expression in response to IL-1 beta, gamma interferon and IL-4. This transcript 
encodes a putative cadherin 1 1 like molecule. Cadherins are adhesion molecules that regulate 
normal homeostasis. Loss of cadherin 1 1 expression can reduce the expression of factors such 

5 as VEGF-D in fibroblasts (see reference 1). Therapies designed with the protein encoded by 
this transcript could be important in the regulating endothelium function including leukocyte 
extravasation, a major component of inflammation during asthma, IBD, and psoriasis. 
Therapeutics designed with the protein encoded by this transcript could also be important in 
the treatment of osteoarthritis and osteoporosis since this protein may be important in 

1 0 maintaining bone density (see reference 2). 
References: 

Orlandini M, Oliviero S In fibroblasts Vegf-D expression is induced by cell-cell 
contact mediated by cadherin- 1 1 . J Biol Chem 2001 Mar 2;276(9):6576-81 

Vascular endothelial growth factors (VEGFs) are a highly conserved family of growth 

15 factors all angiogenic in vivo with mitogenic and chemotactic activity on endothelial cells. 

VEGFs are expressed in fibroblasts either in hypoxia or in response to growth factors. Here we 
report that, differently from the other members of the family, Vegf-D is induced by cell-cell 
contact. By in situ hybridization we demonstrated that noninteracting fibroblasts express low 
levels of Vegf-D mRNA, whereas contacting cells express high levels of Vegf-D transcripts. 

20 By immunostaining we observed that the surface protein cadherin-1 1 is localized at the 
opposite sites of interacting cell surfaces. Ca(2+) deprivation from the culture medium 
determined the loss of cadherin-1 1 from the cell surfaces and down-regulation of Vegf-D 
mRNA. Moreover, a cadherin-1 1 antisense RNA construct inhibited Vegf-D expression in 
confluent BALB/c fibroblasts, whereas in NIH 3T3 cells, which express low levels of 

25 cadherin- 1 1 , Vegf-D induction could be obtained by overexpression of cadherin- 1 1 . This 
suggests that cell interaction mediated by cadherin-1 1 induces the expression of the 
angiogenic factor Vegf-D in fibroblasts. 
PMID: 11108717 

Kawaguchi J, Azuma Y, Hoshi K, Kii I, Takeshita S, Ohta T, Ozawa H, Takeichi M, 
30 Chisaka O, Kudo A. Targeted disruption of cadherin-1 1 leads to a reduction in bone density in 
calvaria and long bone metaphyses. J Bone Miner Res 2001 Jul;16(7):1265-71 

The migration and adhesion of osteoblasts requires several classical cadherins. 
Cadherin-1 1, one of the classical cadherins, was expressed in mouse osteoblasts in skull bone 
and femur, revealed by immunohistochemistry. To elucidate the function of cadherin-1 1 in 
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osteohistogenesis,, cadherin-1 1 null mutant mice were investigated. Although apparently 
normal at birth, Alizarin red staining of null mutant mice showed a reduced calcified area at 
the frontal suture that caused a round-shaped calvaria with increasing animal age to 3 months. 
Consequently, there was a reduction in bone density at the femoral metaphyses and the diploe 
5 of calvaria in null mutant mice. In the in vitro culture of newborn calvarial cells, the calcified 
area of mutant cells was smaller than those derived from wild-type littermates. These results 
show that absence of cadherin-1 1 leads to reduced bone density in some parts of skeletons 
including calvaria and long bone metaphyses, and thus suggest that cadherin-1 1 plays roles in 
the regulation of osteoblast differentiation and in the mineralization of the osteoid matrix. 
10 PMID: 11450702 

NOV37 

Expression of gene NOV37 was assessed using the primer-probe sets Ag047, Ag2679, 
Ag2728, Ag332, Ag47b, Ag712, Ag2732 and Ag2975, described in Tables AGA, AGB, AGC, 
AGD, AGE, AGF, AGG and AGH. Results of the RTQ-PCR runs are shown in Tables AGI, 
1 5 AGJ, AGK, AGL, AGM, AGN, AGO, AGP, AGQ and AGR. 

Table AGA. Probe Name Ag047 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 • -ccaatgacctggccacca-3 1 


18 


X064 


1111 


Probe 


TET-5 1 -ccagagtccgttcagcttcaggacagc- 
3 1 -TAMRA 


27 


1084 


1112 


Reverse 


5 1 -gtggcacgttgctgtttagc-3 ' 


20 


1116 


1113 



Table AGB. Probe Name Ag2679 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ttgacctcaggaacggtttac-3 1 


21 


1213 


1114 


Probe 


TET-5 1 -ctgctgcccaggaatactttctccag-3 1 - 
TAMRA 


26 


1249 


1115 


Reverse 


5 ' -agtatttggagggcttcttcag-3 ' 


22 


1288 


1116 



20 Table AGC. Probe Name Ag2728 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 ' -ggagcttggtctcatgaccta-3 ' 


21 


5178 


1117 


Probe 


TET-5 1 -actgggctcctggccaccaagag-3 ' - 
TAMRA 


23 


5209 


1118 


Reverse 


5\-agtcgtccatcctgtttcatc-3 1 


21 


5233 


1119 
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Table AGD. Probe Name Ag332 



Primers 


Sequences 


Length 


Start 
Position 


SBQ ID NO: 


Forward 


5 1 -gctgccctgacttgtgcaa-3 1 


19 


2594 


1120 


Probe 


TET-5 ' -tctgacccagtgtgcatctcccgtt-3 1 - 
TAMRA 


25 


2617 


1121 


Reverse 


5 ' -ccggtctggcagacacact-3 1 


19 


2651 


1122 



Table AGE. Probe Name Ag47b 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaacgccggagcatacaga-3 1 


19 


1774 


1123 


Probe 


TET-5 ' -ccaggtactgcacaaacacggcttcat- 
3 ' -TAMRA 


27 


1805 


1124 


Reverse 


5 1 -gatgccacaggcccaca-3 1 


17 


1833 


1125 



5 Table AGF. Probe Name Ag712 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tgcaagggtacgagggatat-3 » 


20 


8369 


1126 


Probe 


TET-5 ' -cttcccgtggagcaatacccagag-3 * - 
TAMRA 


24 


8395 


1127 


Reverse 


5 1 -tggatgttgctgctactgtct-3 1 


21 


8424 


1128 



Table AGG. Probe Name Ag2732 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggagcttggtctcatgaccta-3 ' 


21 


5178 


1129 


Probe 


TET-5 » -actgggctcctggccaccaagag-3 ' - 
TAMRA 


23 


5209 


1130 


Reverse 


5 1 -agtcgtccatcctgtttcatc-3 1 


21 


5233 


1131 



Table AGH. Probe Name Ag2975 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggagcttggtctcatgaccta-3 ' 


21 


5178 


1132 


Probe 


TET-5 1 -actgggctcctggccaccaagag-3 1 - 
TAMRA 


23 


5209 


1133 


Reverse 


5 1 -agtcgtccatcctgtttcatc-3 ' 


21 


5233 


1134 



10 



Table AGI. CNS_neurodegeneration_vl.O 





Rel. 


Rel. 


Rel. 




Rel. 


Rel. 


Rel. 


Tissue 


Exp.(%) 


Exp.(%) 


Exp.(%) 


Tissue 


Exp.(%) 


Exp.(%) 


Exp.(%) 


Name 


Ag047, 


Ag2679, 


Ag2728, 


Name 


Ag047, 


Ag2679, 


Ag2728, 




Run 


Run 


Run 




Run 


Run 


Run 
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206228021 


209751329 


206483376 




206228021 


209751329 


206483376 


AD 1 
Hippo 


10.3 


6.6 


8.4 


Control 
(Path) 3 
Temporal 
Ctx 


4.1 


3.2 


2.3 


AD 2 
Hippo 


18.0 


20.9 


17.3 


Control 
(Path) 4 
Temporal 

Ctx 


13.8 


33.9 


29.7 


AD 3 
Hippo 


6.6 


4.6 


6.4 


AD 1 

Occipital 

Ctx 


18.8 


16.8 


17.1 


AD4 
Hippo 


5.6 


4.4 


7.7 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


0.0 


AD 5 
Hippo 


100.0 


100.0 


95.9 


AD 3 

Occipital 

Ctx 


5.7 


3.4 


3.1 


AD 6 
Hippo 


37.6 


33.0 


31.4 


AD4 

Occipital 

Ctx 


12.9 


15.1 


19.5 


Control 2 
Hippo 


22.1 


17.3 


43.2 


AD 5 

Occipital 

Ctx 


25.2 


33.4 


51.4 


Control 4 
Hippo 


7.7 


6.4 


5.3 


AD 6, 

Occipital 

Ctx 


3.2 


18.9 


19.8 


Control 
(Path) 3 
Hippo 


1.5 


3.4 


3.7 


Control 1 
Occipital 
Ctx 


3.4 • 


3.5 


4.4 


AD 1 

Temporal 

Ctx 


8.1 


6.7 


8.6 


Control 2 
Occipital 
Ctx 


83.5 


82.9 


100.0 


AD 2 

Temporal 

Ctx 


21.2 


40.3 


21.2 


Control 3 
Occipital 
Ctx 


25.3 


202 


17.3 


AD 3 

Temporal 

Ctx 


3.3 


4.2 


4.2 


Control 4 
Occipital 
Ctx 


5.4 


4.2 


4.4 


AT\ A 

AD 4 

Temporal 

Ctx 


18.7 


13.1 


20.0 


Control 
(Path) 1 
Occipital 
Ctx 


66.4 


74.7 


75.8 


AD5Inf 
Temporal 
Ctx 


27.0 


57.4 


42.9 


Control 
(Path) 2 
Occipital 
Ctx 


15.2 


11.3 


13.0 
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AD 5 Sup 
Temporal 
Ctx 


14.6 


27.0 


20.7 


Control 
(Path) 3 
Occipital 
Ctx 


3.0 


2.4 


1.5 


AD 6 Inf 
Temporal 

Ctx 


25.2 


16.8 


19.1 


Control 
(Path) 4 
Occipital 
Ctx 


38.7 


29.3 


19.6 


AD 6 Sup 
Temporal 
Ctx 


26.1 


18.9 


17.2 


Control 1 

Parietal 

Ctx 


4.4 


3.7 


4.1 


Control 1 
Temporal 
Ctx 


3.9 


3.3 


3.8 


Control 2 

Parietal 

Ctx 


31.4 


19.8 


18.2 


Control 2 
Temporal 
Ctx 


21.3 


'27.4 


33.4 


Control 3 

Parietal 

Ctx 


7.9 


10.7 


9.6 


Control 3 
Temporal 
Ctx 


13.7 


10.0 


10.4 


Control 
(Path) 1 
Parietal 
Ctx 


46.3 


56.3 


58.6 


Control 3 
Temporal 
Ctx 


8.1 


7.9 


8.7 


Control 
(Path) 2 
Parietal 
Ctx 


13.2 


16.6 


18.4 


Control 
(Path) 1 
Temporal 
Ctx 


34.4 


. 42.3 


5.1.4 


Control 
(Path) 3 
Parietal 
Ctx 


2.5 


2.4 


1.2 


Control 
(Path) 2 
Temporal 
Ctx 


27.2 


29.9 


24.5 


Control 
(Path) 4 
Parietal 

Ctx 


35.1 


41.2 


35.6 



Table AGJ. Panel 1 



Tissue Name 


Rel. Exp.(%) 
Ag047, Run 
87354354 


Rel. Exp.(%) 
Ag047,Run 
87354779 


Rel. Exp.(%) 
Ag332, Run 
97803603 


Rel. Exp.(%) 
Ag332, Run 
98747043 


Rel. Exp.(%) 
Ag47b, Run 
88164379 


Endothelial cells 


0.0 


0.0 


0.0 


0.0 


0.0 


Endothelial cells 
(treated) 


0.0 


0.0 


0.0 


0.0 


0.0 


Pancreas 


0.0 


0.3 


0.0 


' 0.6 


0.0 


Pancreatic ca. 
CAP AN 2 


0.0 


0.0 


0.0 


0.0 


0.0 


Adrenal gland 


0.0 


1.3 


0.2 


0.0 


0.0 


Thyroid 


0.0 


0.4 


100.0 


0.1 


0.0 
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Salivary gland 


0.0 


0.2 


0.0 


0.0 \ 0.0 


Pituitary gland 


0.0 


0.1 


2.1 


0.0 j 0.0 


Brain CfetaH 


0 0 


i — ' .V/ 


J .*T 


11.0 


j 21.8 


Brain (whole) 


95.9 


67.8 


3.2 


4.5 


j 32.5 


Brain (amygdala) 


A A 

0.0 


o o 

8.8 


3.8 


8.7 


12.2 


Brain 

(cerebellum) 


0.0 


22.2 


1.4 


0.1 


14.3 


Brain 

( h lDDOcatn mi O 


0.0 


24.7 


3.4 


8.7 


15.6 


Brain (substantia 


3.4 


3.6 


1.3 


2.3 


3.8 


Brain (thalamus) 


3.5 


4.7 


3.3 


5.0 


9.1 


Brain 

(hypothalamus) 


0.0 


0.7 


0.2 


0.0 


0.0 


Spinal cord 


A T 

0.7 


1.5 


0.9 


1.4 


1.1 


glio/astro U87- 
MCj 


0.6 


2.6 


1.6 


i i 
l./ 


3.0 


glio/astro U-l 18- 
MG 


0.0 


0.6 


0.2 


l.l 


A 1 
0.1 


astrocytoma 

on j 1 too 

SW1783 


33.7 


29.5 


9.0 


42.9 


36 1 


neuro*; met SK- 
N-AS 


0.0 


0.0 


0.0 


0.0 


0.0 


(Mil Uvjr IVJlltCi Oi 

539 


31.0 


36.9 


10.3 


68.3 


48.3 


astrocytoma SNB- 
75 


33.7 


32.8 


7.1 


— 

23.0 


50.3 


glioma SNB- 19 


100.0 


100.0 


30.1 


100.0 


100.0 


glioma U251 


49.0 


44.1 


16.5 


57.8 


41.5 


glioma SF-295 


6.0 


8.1 


2.1 


19.8 


8.7 


Heart 


61.1 


26.8 


36.6 


70.7 


39.8 


Skeletal muscle 


0.0 


0.1 


0.2 


0.0 


0.0 


Bone marrow 


0.0 


0.1 


0.0 


0.0 


0.0 


Thymus 


18.7 


18.4 


1.1 


2.7 


17.2 


Spleen 


0.0 


0.0 


0.0 


A A 
U.U 


A A 

0.0 


Lymph node 


0.0 


0.2 


0.0 


0.0 


0 0 


Colon (ascending^ ! 


0 5 


0 7 

V/. / 


u. / 


0.9 


4.5 


Stomach 


0.1 


1.1 


0.2 


0.2 


02 


Small intestine 


0.0 


0.1 


0.0 


0.0 


6.0 


Colon ca. SW480 


0.6 


1.0 


1.6 


7.0 


1.4 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


0.0 


0.0 


0.0 


Colon ca. HT29 


0.0 | 0.0 


0.0 


0.0 


0.0 
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Colon ca. HCT- 
1 16 


0.0 


0.0 


0.0 


0.0 


0.0 




0 0 


ft l 


0 0 


0 ft 


0 0 




0 0 


ft 1 

v. 1 


0 2 


0 0 


0 0 


Pnlnn ca HCC- 
2998 


0.0 


0.0 


0.0 


0.0 


0.0 


Gastric ca. * (liver 
met)NCI-N87 


0.0 


0.4 


0.2 


0.0 


0.0 


Bladder 


0.3 


1.0 


0.2 


0.0 


0.1 


Trachea 


0.0 


0.4 


0.2 


0.0 


0.0 


Kidney 


0.2 


0.9 


0.7 


1.2 


1.0 


Kidney (fetal) 


1.3 


3.9 


0.8 


3.2 


2.7 


Renal ca. 786-0 


10.6 


11.7 


3.4 


7.9 


12.2 


Renal ca. A498 


d.o 


0.1 


0.0 


0.0 


0.0 


Renal ca. RXF 
393 


17.9 


14.0 


4.1 


16.6 


15.2 


Renal ca. ACHN 


0.0 


0.0 


0.0 


0.0 


0.0 


Renal ca TJO-31 


0 0 


ft ? 


ft 1 


ft ft 


ft 0 






ft ft 


ft ft 


ft ft 


ft ft 


T ivpr 


0 0 

v.vr 




ft 8 

v.O 


ft Q 


1 0 


Liver (fetal) 


0.0 


0.1 


0.0 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HenG2 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 


0.0 


0.2 


0.0 


0.0 


0.0 


Lung (ietai) 


0.0 


0.5 


0.0 


0.0 


0.0 


Lung ca. (small 
cell) LX-l 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung ca. (small 
cell) NCI-Hoy 


2.0 


3.3 


1.4 


5.4 


2.9 


Lung ca. (s.cell 
var.) orlr - / / 


0.0 


0.0 


0.0 


0.0 

...... 


0.0 


Lung ca. (large 

Ceil )INd-rl40U 


0.0 


0.0 


1.3 


0.7 


0.3 


Lung ca. (non-sm. 
ceil) AD4y 


0.0 


0.0 


0.1 


0.0 


0.0 


Lung ca. (non- 

o XT/^T UT2 

S.cell) iNd-rizi 


0.0 


0.0 


0.0 


0.0 


0.0 


T i in rr r» »a ( ti nn. 
.LrUllg td. ^IlOll" 

s.cell) HOP-62 


5.2 


4.7 


3.9 


11.3 


6.7 


Lung ca. (non- 
s.cl)NCI-H522 


1.7 


3.1 


1.5 


2.8 


4.8 


Lung ca. (squam.) 
SW 900 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung ca. (squam.) 


1.3 


2.5 


2.0 


4.2 


3.0 
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NC1-H596 












Mammary gland 


11.9 


9.1 


4.6 


7.0 


10.6 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


0.0 


0.0 


0.0 


0.0 


Breast ca.* (pl.ef) 

lV/THA 1V/TR 0^1 
IVlJJA-lvio-Zj I 


0.0 


0.1 


0.0 


0.0 


0.0 


Breast ca.* (pi. ef) 


11.8 


7.5 


7.9 


31.6 


31.4 j 


Breast ca. BT-549 


0.0 


0.0 


4.5 


13.8 


28.3 


Breast ca. MDA- 
N 


0.0 


0.1 


0.0 


0.0 


0.0 


Ovary 


0.1 


0.6 


0.4 


0.1 


0.2 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


0.0 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.1 


1.3 


4.4 


0.0 


Ovarian ca. 
OVCAR-5 


73.2 


382 


14.4 


57.0 


36.1 


Ovarian ca. 

UVtAK-o 


0.0 


0.7 


1.0 


3.4 


0.2 


Ovarian ca. 
IGROV-l 


0.0 


0.0 


0.8 


2.6 


0.0 


uvanan ca. 
(ascites) SK-OV-3 


0.0 


0.0 


0.0 


0.0 


0.0 


Uterus 


0.4 


1.2 


0.6 


1.7 


2.9 


riacenia 


u.u 


A 1 
U.i 


A A 
U.U 


A A 
U.U 


A A 
- U.U 


Prostate 


0.9 


1.9 


0.7 


1.9 


1.7 


Prostate ca.* 
(bone met) PC-3 


0.0 


0.0 


0.1 


0.0 


0.0 


Testis 


25.7 


22.2 


0.7 


0.7 


26.4 


Melanoma 
Hs688(A).T 


23.8 


21.6 


5.0 


20.0 


28.9 


Melanoma* (met) 
Hs688(B).T 


4.6 


6.5 


3.4 


9.6 


9.2 


Melanoma 
UACC-62 


0.0 


0.0 


0.0 


0.0 


0.0 


Melanoma Ml 4 


A A 


A 1 

0.1 


0.0 


0.0 


0.0 


Melanoma LOX 
IMVI 


3.7 


3.4 


0.6 


0.2 


2.7 


Melanoma* (met) 
SK-MEL-5 


0.0 


0.0 


0.0 


0.0 


0.0 


Melanoma SK- 
MEL-28 


0.0 


0.0 


3.6 


17.4 


0.0 
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Table AGK. Panel 1.1 



Tissue Name 


Rel. Exp.(%) Ag047, 
Run 109663520 


Tissue Name 


Rel. Exp.(%) Ag047, 
Run 109663520 


Adrenal gland 


0.5 


Renal ca. UO-31 


0.1 


Bladder 


1.2 


Renal ca. RXF 393 


8.7 


Brain (amygdala) 


2.8 


Liver 


1.3 


Brain (cerebellum) 


4.1 


Liver (fetal) 


0.0 


Rrciin • ntTirxnpcimT^iic^ 
XjIO.111 ylllL/LHJl/alllL/UDj 


0 fx 


Liver ca. 

(hepatoblast) HepG2 


ft ft 


Brain (substantia 
nigra) 


1 ^ ft 


j^ung 


ft 1 


Brain (thalamus) 


4.7 


Lung (fetal) 


0.4 


Cerebral Cortex 


48.3 


Lung ca. (non-s.cell) 
HOP-62 


26.6 


Brain (fetal) 


21.2 


Lung ca. (large 
cell)NCI-H460 


1.2 


Brain (whole) 


9.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


gIio/astroU-118-MG 


0.7 


Lung ca. (non-s.cl) 
NCI-H522 


5.8 


astrocytoma SF-539 


38.4 


Lung ca. (non-sm. 
cell) A549 


0.0 


astrocytoma SNB-75 


16.4 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


a«!trftrvtnmfl ^W17R3 
ajuuu^ luiiia O W 1 / OD 


10 1 
17. J 


Lung ca. (small cell) 
LX-1 


ft ft 


glioma U251 


55.5 


Lung ca. (small cell) 
NCI-H69 


5.3 


glioma SF-295 


8.7 


Lung ca. (squam.) 
SW 900 


0.0 


glioma SNB-19 


100.0 


Lung ca. (squam.) 
NCI-H596 


5.0 


gho/astro U87-MG 


3.3 


Lymph node 


0.3 


neuro*; met SK-N-AS 


0.0 


Spleen 


0.0 


Mammary gland 


2.0 


Thymus 


0.6 


Breast ca. BT-549 


6.4 


Ovary 


0.6 


Breast ca. MDA-N 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Breast ca.* (pl.ef) 
T47D 


5.7 


Ovarian ca. OVCAR- 

3 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


Ovarian ca. OVCAR- 
4 


0.1 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Ovarian ca. OVCAR- 
5 


49.0 


Small intestine 


0.0 


Ovarian ca. OVCAR- 


0.5 
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0 

o 




Colorectal 


0.1 


ovarian ca. (ascitesj 
SK-OV-3 


0.1 


Colon ca. HT29 


0.0 


Pancreas 


0.6 


Colon ca. CaCo-2 


0.0 


rancreaiic ca. 
CAPAN 2 


0.0 


Colon ca HCT-15 


0 1 


riLuiiary giano 


A C 

U.3 


Colon ca HCT-116 


0 0 


.riacenia 


A A 
0.0 


Colon ca. HCC-2998 


0.0 


Prostate 


0.8 


Colon ca. SW480 


1.2 


Prostate ca.* (bone 
meij rL-i 


0.1 


Colon ca * SW620 
(SW480 met) 


0.0 


Salivary gland 


0.2 


Stomach 


0.2 


Trachea 


U.J 


Gastric ca. (liver met) 
NCI-N87 


0.1 


Spinal cord 


1.4 


Heart 


80.7 


Testis 




Skeletal muscle (Fetal) 


1.6 


Thyroid 


0.3 


Skeletal mu^elp 


n a 


Uterus 


0.0 


Endothelial cells 


0.0 


Melanoma M 14 


o.o 


Heart (Fetal) 


14.2 


Melanoma LOX 
IMVI 


0.7 


Kidney 


2.4 


Melanoma UACC-62 


0.0 


Kidney (fetal) 


1.5 


Melanoma SK-MEL- 
28 


0.0 


Renal ca. 786-0 


3.6 


ivieianoma (met) 
SK-MEL-5 


0.0 


Renal ca. A498 


0.0 


Melanoma 
Hs688(A).T 


16.5 


Renal ca. ACHN 


0.0 


Melanoma* (met) 
Hs688(B).T 


5.6 


Renal ca. TK-10 


0.0 







Table AGL. Panel 1.2 



Tissue Name 


Rel. Exp.(%) 
Ag712,Ron 
114986148 


Rel. Exp.(%) 
Ag712, Run 
119452123 


Tissue Name 


Rel. Exp.(%) 
Ag712, Run 
114986148 


Rel. Exp.(%) 
Ag712, Run 
119452123 


Endothelial cells 


0.0 


0.0 


Renal ca. 786- 
0 


4.0 


3.0 


Heart (Fetal) 


4.0 


2.1 


Renal ca. A498 


0.1 


0.1 


Pancreas 


2.1 


1.1 


Renal ca. RXF 

393 


4.9 


3.5 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
ACHN 


0.6 


0.3 
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Adrenal Gland 


2.8 


„ _ jRenal ca. UO- 
J31 


0.3 


0.1 


Thyroid 


2.1 


. <- Renal ca. TK- 
}10 


0.2 


0.1 


Salivary gland 


1.9 


1.1 pLiver 


4.6 


2.9 


trllUliaTy glallU 




3.7 


Liver (fetal) 


ft A 


ft i 
U.3 


Rrnin ffatnW 

Didin ^ieuii ) 


00 0 


29.3 


Liver ca. 

(hepatoblast) 

HepG2 


ft a 
U.U 


ft ft 
U.U 


orain ^wntHoj 




26.1 


Lung 


ft Q 
U.O 


ft c 

U.O 


Brain 

^amygaaiaj 


18.6 


12.9 


Lung (fetal) 


1.3 


1.2 


Brain 

^creoenurn ) 


2.3 


1.8 


Lung ca. (small 
cell)LX-l 


0.0 


0.0 


Brain 

^Hippocampus ^ 


27.5 


18 0 


Lung ca. (small 
cell)NCI-H69 


4.4 


3.7 


Brain (thalamus) 


10.4 


77 


Lung ca. (s.cell 
var.) SHP-77 


0.1 


0.0 


Cerebral Cortex 


25.3 


22.2 


Lung ca. (large 
cell)NCI-H460 


1.0 


0.6 


Spinal cord 


3.8 


3.2 


Lung ca. (non- 
sm. cell) A549 


0.3 


0.1 


glio/astro U87- 
MG 


2.5 


2.1 


Lung ca. (non- 

sxell)NCI- 

H23 


0.1 


0.1 


glio/astro U- 118- 
MG 


0.8 


0.4 


Lung ca. (non- 
s.cell)HOP-62 


7.2 


5.8 


astrocytoma 
SW1783 


19.9 


9.2 


Lung ca. (non- 
s.cl)NCI-H522 


5.9 


4.0 


neuro*; met SK- 
N-AS 


0.1 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SF- 
539 


36.1 


19.8 


Lung ca. 
(squam.) NCI- 
H596 


4.6 


3.7 


astrocytoma 
SNB-75 


10.7 


3.6 


Mammary 
gland 


7.6 


4.5 


glioma SNB-l 9 


71.2 


51.4 


Breast ca.* 
(pl.ef)MCF-7 


0.0 


0.0 


glioma U25l 


38.7 


19.9 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


glioma SF-295 


4.6 


4.1 


Breast ca.* (pi. 
ef)T47D 


10.2 


7.8 


Heart 


100.0 


100.0 


Breast ca.BT- 
549 


8.4 


5.5 
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Skeletal Muscle 


1.4 


1.1 


Breast ca. 
MDA-N • 


0.5 


0.1 


Bone marrow 


0.3 


0.0 


Ovary 


0.5 


0.3 


Thymus 


2.1 


1.0 


Ovarian ca. 
OVCAR-3 


0.4 


0.1 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


1.8 


1.4 


Lymph node 


0.8 


0.3 


Ovarian ca. 
OVCAR-5 


39.0 


30.1 


Colorectal 
Tissue 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.6 


0.3 


Stomach 


1.2 


0.7 


Ovarian ca. 
IGROV-1 


8.8 


8.3 


Small intestine 


0.8 


0.4 


Ovarian ca. 
(ascites) SK- 
OV-3 


0.1 


0.0 


Colon ca. 
SW480 


2.5 


1.7 


Uterus 


0.3 


0.2 


Colon ca.* 
SW620 (SW480 
met) 


0.0 


0.0 


Placenta 


0.3 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate 


5.1 


3.2 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate ca.* 
(bone met) PC- 
3 


0.1 


0.0 


Colon ca. CaCo- 
2 


03 


0.2 


Testis 


9.7 


6.8 


Colon ca. Tissue 
(OD03866) 


O 1 


n ft 


Melanoma 
Hs688(A).T 




i n a 


2998 


0.0 


0.0 


Melanoma* 
(met) 

Hs688(B).T 


5.0 


4.3 


Gastric ca.* 
(liver met) NCI- 
N87 


0.5 


0.2 


Melanoma 
UACC-62 


0.0 


0.0 


Bladder 


0.9 


0.6 


Melanoma 
M14 


0.0 


0.0 


Trachea 


1.2 


0.9 


Melanoma 
LOX IMVI 


0.7 


0.2 


Kidney 


4.0 


2.7 


Melanoma* 
(met) SK- 
MEL-5 


1.0 


0.5 


Kidney (fetal) 


6.0 


0.0 
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Table AGM. Panel 1. 3D 



Tissue 
Name 


Rel. 

JLXp.^ /O ) 

Ag2679, 
Run 
158633802 


Rel. 

JHiXp.( /o) 

Ag2728, 

Run 
158560797 


Rel. 

JiiXp.^ /o ) 

Ag2732, 

Run 
162400878 


Tissue 
Name 


Rel. 

CjXp*^ /O ) 

Ag2679, 

Run 
158633802 


Rel. 

ITvn 

iLAp.^ /O ) 

Ag2728, 

Run 
158560797 


Rel. 

Ag2732, 
Run 
162400878 


Liver 

adenocarci 

noma 


3.8 


2.0 


2.9 


Kidney 
(fetal) 


0.2 


1.6 


1.2 


Pancreas 


0.0 


0.0 


0.0 


Renal ca. 
786-0 


9.8 


11.5 


3.3 


Pancreatic 
ca. 

CAPAN2 


0.0 


0.0 


0.0 


Renal ca. 
A498 


54.7 


40.6 


15.1 


Adrenal 
gland 


0.2 


0.5 


0.2 


Renal ca. 
RXF393 


11.9 


7.3 


8.2 


Thyroid 


0.6 


1.3 


0.2 


Renal ca. 
ACHN 


0.0 


0.0 


0.0 


Salivary 
gland 


0.1 


0.3 


0.1 


Renal ca. 
UO-31 


0.8 


0.3 


0.0 


Pituitary 
gland 


0.4 


0.3 


0.0 


Renal ca 
TK-10 


0.0 


0.0 


0.0 


Brain 
(fetal) 


12.4 


15.2 


2.5 


Liver 


0.7 


0.4 


0.4 


Brain 
(whole) 


7.2 


15.5 


6.1 


Liver 
(fetal) 


0.2 


0.4 


0.0 


Brain 
(amygdala) 


11.3 


17.4 


7.0 


juiver ca. 
(hepatoblas 
t) HepG2 


0.0 


0.0 


0.0 


D J all J 

(cerebellu 
m) 


1.3 


1.0 


0.4 


Lung 


0.1 


0.8 


0.1 


Ul dill 

(hippocam 
pus) 


60.7 


80.1 


16.8 


Lung 
(fetal) 


0.4 


0.6 


0.4 


ui am 

(substantia 
nigra) 


1.7 


1.7 


0.5 


T lino f*5i 

JUU11 I'd. 

(small cell) 
LX-1 


0.0 


0.0 


0.0 


Brain 
(thalamus) 


3.7 


10.4 


4.5 


Lung ca. 
f small cell) 

l UillUlJ vwll J 

NCI-H69 


3 9 


14.0 


3 7 


Cerebral 
Cortex 


100.0 


100.0 


93.3 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.2 


0.0 


Spinal cord 


1.8 


1.8 


1.3 


Lung ca. 
(large 
cell)NCI- 
H460 


0.2 


0.1 


0.5 
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glio/astro 
U87-MG 


3.0 


4.7 


3.0 


Lung ca. 
(non-sm. 
cell) A549 


0.0 


0.2 


0.0 


glio/astro 
U-118-MG 


5.3 


5.4 


0.8 


Lung ca. 

(non-s.cell) 

NCI-H23 


0.0 


0.0 


0.0 


astrocytom 
a SW1783 


99.3 


87.7 


100.0 


Lung ca. 

(non-s.cell) 

HOP-62 


6.4 


4.3 


3.6 


neuro*; 
met SK-N- 
AS 


0.1 


0.0 


0.0 


Lung ca. 

(non-s.cl) 

NCI-H522 


1.3 


1.5 


0.9 


astro cvtom 
a SF-539 


80.7 


75.3 


42.0 


Lung ca. 
(squam.) 
SW 900 


0.0 


0.0 


0.0 


a^trocvtom 
a SNB-75 


77.4 


64.6 


18.2 


Lung ca. 
(squam.) 
NCI-H596 


1.0 


2.5 


2.6 


glioma 
SNB-19 


94.0 


75.3 


63.7 


Mammary 
gland 


1.9 


4.3 


0.6 


glioma 
U251 


68.8 


62.4 


29.1 


Breast ca.* 

(pl-ef) 

MCF-7 


0.0 


0.0 


0.0 


glioma SF- 
295 


8 1 






Breast ca.* 
(pl.ef) 
MDA-MB- 
231 


U.o 




A 1 

0.1 


Heart 
(fetal) 


34.6 


10.6 


11.9 


Breast ca.* 

(Pl-ef) 
T47D 


15.9 


7.6 


4.9 


Heart 


12.9 


12.1 


20.2 


Breast ca. 
BT-549 


58.2 


36.6 


4.0 


Skeletal 

muscle 

(fetal) 


11.0 


7.0 


4.1 


Breast ca. 
MDA-N 


0.2 


0.0 


0.0 


Skeletal 
muscle 


0.0 


0.5 


0.0 


Ovary 


1.8 


1.5 


1 4 


Bone 
marrow 


0.4 


0.2 


0.1 


Ovarian ca. 
OVCAR-3 


0.0 


00 


0 0 


Thymus 


0.9 


1.2 


3.8 


Ovarian ca. 
OVCAR-4 


0.1 


0 5 




Spleen 


0.0 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


41.8 


33.9 


14.3 


Lymph 
node 


0.1 


0.6 


0.0 


Ovarian ca. 
OVCAR-8 


1.0 


0.6 


0.5 


Colorectal 


1.3 


0.5 


0.9 


Ovarian ca. 
IGROV-1 


0.0 


4.5 


1.8 


Stomach 


0.4 


0.5 


0.1 


Ovarian 


0.1 


0.1 


0.0 
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ca.* 

(ascites) 
SK-OV-3 








Small 
intestine 


0.0 


0.6 


0.2 


Uterus 


0.0 


0.0 


0.0 


Colon ca. 
SW480 


1 1 2 




i i 
i.i 


JrialCHla 


0 1 


ft ft 

U.U 


ft ft 
u.u 


Colon ca.* 
SW620(S 
W480 met) 


0.0 


0.0 


0.0 


Prostate 


1.2 


0.5 


0.6 


\s\J1SlJIL \*cl» 

HT29 


0.0 


0.0 


0.0 


Prostate 
ca.* (bone 
met)PC-3 


0.5 


0.0 


0.0 


Colon ca. 
HCT-116 


0.0 


0.0 


0.0 


Testis 


1.8 


1.4 


0.9 


Colon ca. 
CaCo-2 


0.4 


0.1 


0.1 


Melanoma 
Hs688(A). 
T 


42.3 


22.4 


17.1 


Colon ca. 
tissue(OD 


0.4 


0.4 


0.3 


Melanoma 
* (met) 
Hs688(B). 
T 


1.7 


1.6 


1.0 


Colon ca. 
HCC-2998 




ft ft 
u.u 


u.u 


Melanoma 
UACC-62 


n a 
U.U . 


A A 

u.u 


A A 
U.U 


Gastric 
ca.* (liver 
met) NCI- 

KTQ*7 
INo/ 


ft 0 


ft A 


A 1 


Melanoma 
M14 


a a 
u.u 


A A 
U.U 


A A 
U.U 


Bladder 


0.5 


0.3 


0.3 


Melanoma 
LOXIMVI 


2.4 


1.1 


0.4 


Trachea 


1.3 


1.5 


1.0 


Melanoma 
* (met) 
SK-MEL-5 


0.0 


0.0 


0.0 


Kidney 


0.7 


0.3 


0.2 


Adipose 


0.2 


0.7 


0.0 



Table AGN. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag2975, Run 
173763053 


Tissue Name 


Rel. Exp.(%) 
Ag2975, Run 
173763053 


Normal Colon 


2.5 


Kidney Margin 
(OD04348) '. 


29.7 


Colon cancer 
(OD06064) 


4.7 


Kidney malignant 
cancer (OD06204B) 


0.0 


Colon Margin 
(OD06064) 


4.3 


Kidney normal adjacent 
tissue (OD06204E) 


0.0 


Colon cancer 


2.2 


Kidney Cancer 


0.0 
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(OD06159) 




UJLHW4jU-U1 ) 




Colon Margin 
(OD06159) 


7.3 


Kidney Margin 


13.6 


Colon cancer 

//\t\ r\ f r\ g\ a \ 

(OD06297-04) 


0.0 


Kidney Cancer 8120613 


2.3 


Colon Margin 
(OD06297-015) 


0.0 


Kidney Margin 
ol20ol4 


11.5 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Kidney Cancer 9010320 


23.8 


CC Margin (OD03921) 


• 0.0 


Kidney Margin 
9010321 


20.0 


Colon cancer metastasis 
(OD06104) 


0.0 


Kidney Cancer 8120607 


2.0 


Lung Margin 
(OD06104) 


0.0 


Kidney Margin 
8120608 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Uterus 


4.4 


Lung Margin 
(OD04451-02) 


2.0 


Uterine Cancer 06401 1 


0.0 


Normal Prostate 


7.5 


Normal Thyroid 


A A 

4.4 


Prostate Cancer 
(OD04410) 


2.3 


Thyroid Cancer 064010 


0.0 


Prostate Margin 
(OD04410) 


4.6 


Thyroid Cancer 
A302152 


2.2 


Normal Ovary 


2.2 


Thyroid Margin 
A302153 


3.6 


Ovarian cancer 
(OD06283-03) 


0.0 


Normal Breast 


55.1 


Ovarian Margin 
(OD06283-07) 


0.0 


Breast Cancer 
(OD04566) 


3.9 


Ovarian Cancer 064008 


13.4 


Breast Cancer 1024 


68.3 


Ovarian cancer 
(OD06145) 


18.S 


Breast Cancer 
(OD04590-01) 


5.3 


Ovarian Margin 
(OD06145) 


8.1 


Breast Cancer Mets 
(UlJU45yU-03) 


0.0 


Ovarian cancer 
(OD06455-03) 


D.U 


Breast Cancer 
Metastasis (UJJU4to3- 
05) 


13 1 

13.1 


Ovarian Margin 
(0006455-07) 


2.3 


Breast Cancer 064006 


12.5 


Normal Lung 


2.3 


Breast Cancer 9100266 


34.6 


Invasive poor diff. lung 
adeno(ODO4945-01 


0.0 


Breast Margin 9100265 


17.0 


Lung Margin 
(ODO4945-03) 


0.0 


Breast Cancer A209073 


16.7 


Lung Malignant Cancer 


1.4 


Breast Margin : ! 71.7 
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(OD03126) 




A2090734 




Lung Margin 
(OD03126) 


0.0 


Breast cancer 
(OD06083) 


20.7 


Lung Cancer 
(OD05014A) 


0.0 


Breast cancer node 
metastasis (OD06083) 


6.3 


Lung Margin 
(OD05014B) 


2.1 


Normal Liver 


0.0 


Lung cancer (OD06081) 


100.0 


Liver Cancer 1026 


8.1 


Lung Margin 
(OD06081) 


4.1 


Liver Cancer 1025 


29.5 


Lung Cancer 
(OD04237-01) 


0.0 


Liver Cancer 6004-T 


8.4 


Lung Margin 
(OD04237-02) 


0.0 


Liver Tissue 6004-N 


4.7 


Ocular Melanoma 

NA<*tfl Qtfl QIC 


6.7 


Liver Cancer 6005-T 


23.0 


Ocular Melanoma 
iviargin \i>ivcrj 


2.8 


Liver Tissue 6005-N 


32.8 


Melanoma Metastasis 


2.2 


Liver Cancer 064003 


12.8 


Melanoma Margin 
(Lung) 


3.7 


Normal Bladder 


0.0 


Normal Kidney 


9.0 


Bladder Cancer 1023 


6.3 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


19.5 


Bladder Cancer 
A302173 


16.7 


Kidney Margin 
(OD04338) 


3.1 


Normal Stomach 


15.3 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


0.0 


Gastric Cancer 9060397 


0.0 


Kidney Margin 


9.3 


Stomach Margin 
yuouoyo 


3.8 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Gastric Cancer 9060395 


6.5 


Kidney Margin 
(OD04340) 


4.6 


Stomach Margin 
9060394 


2.3 


Kidney Ca, Nuclear 
grade3(OD04348) 


77.4 


Gastric Cancer 064005 


0.0 



Table AGO. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag047, Run 
144771648 


Rel. Exp.(%) 
Ag047, Run 
152940364 


Rel. Exp.(%) 
Ag2679, Run 
158633803 


Rel.Exp.(%) 
Ag2728, Run 
158561830 


Normal Colon 


5.5 


8.9 


7.4 


10.4 


CC Well to Mod 
Diff(OD03866) 


1.4 i 


0.2 


0.2 


1.3 
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CC Margin 
(OD03866) 


0.4 


0.5 


0.4 


0.1 


CC Gr.2 

rectosigmoid 

(OD03868) 


0.4 


0.1 


1.7 


0.2 


CC Margin 
(OD03868) 


2.5 


0.9 


0.7 


0.7 


CCModDiff 
(ODO3920) 


0.0 


0.0 


0.0 


0.2 


CC Margin 
(ODO3920) 


0.7 


0.9 


1.1 


0.3 


CC Gr.2 ascend 
colon (OD03921) 


0.0 


0.7 


0.6 


0.2 


CC Margin 
(OD03921) 


0.2 


0.2 


0.6 


0.8 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


0.6 


0.4 


1.1 


1.0 


Liver Margin 
(ODO4309) 


11.1 


13.6 


30.4 


212 


Colon mets to lung 
(OD04451-01) 


0.0 


0.4 


0.1 


0.2 


Lung Margin 
(OD04451-02) 


1.3 


1.0 


0.5 


0.7 


Normal Prostate 
6546-1 


19.1 


0.7 


1.0 


2.3 


Prostate Cancer 
(OD04410) 


5.8 


2.9 


4.7 


3.4 


Prostate Margin 
(OD04410) 


4.7 


5.7 


7.2 


4.9 


Prostate Cancer 
(OD04720-01) 


2.9 


3.3 


3.4 


3.3 


Prostate Margin 
(OD04720-02) 


12.5 


10.7 


8.3 


152 


Normal Lung 
061010 


1.5 


2.7 


6.8 


5.4 


Lung Met to Muscle 
(ODG4286) 


0.3 


0.1 


0.0 


0.5 


Muscle Margin 
(OD04286) 


0.3 


0.2 


0.3 


0.6 


Lung Malignant 
Cancer (OD03 126) 


0.4 


0.2 


0.8 


1.5 


Lung Margin 
(OD03126) 


0.4 


1.0 


1.2 


0.5 


Lung Cancer 
(OD04404) 


86.5 


100.0 


100.0 


100.0 
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Lung Margin 
(OD04404) 


18.3 


3.3 


2.2 


3.0 


Lung Cancer 
(OD04565) 


100.0 


52.1 


62.0 


77.9 


Lung Margin 
(OD04565) 


0.2 


0.1 


0.0 


0.6 


Lung Cancer 
(OD04237-01) 


6.3 


1.5 


3.2 


3.3 


Lung Margin 
(OD04237-02) 


1.4 


0.5 


0.5 


0.6 


Ocular Mel Met to 
Liver (ODO4310) 


0.4 


0.3 


0.5 


0.7 


Liver Margin 
(ODO4310) 


2.3 


1.8 


3.5 


2.8 


Melanoma Mets to 
Lung (OD04321) 


0.0 


0.3 


0.4 


1.6 


T X A • 

Lung Margin 
(OD04321) 


2.1 


2.6 


2.2 


3.0 


Normal Kidney 


6.7 


4.9 


8.4 


7.0 


Kidney Ca, Nuclear 
grade2(OD04338) 


0.0 


0.0 


0.3 


1.4 


Kidney Margin 
(OD04338) 


3.5 


1.5 


4.1 


3.5 


Kidney Ca Nuclear 
grade 1/2 
(OD04339) — 


0.0 


0.1 


0.3 


0.7 


Kidney Margin 
(UD04339) 


18.4 


10.3 


8.4 


15.5 


Kidney Ca, Clear 
cell type (UJJU4340) 


0.0 


0.8 


0.7 


1.1 


Kidney Margin 
(UJJU4J4U) 


6.5 


4.4 


4.5 


6.5 


Kidney Ca, Nuclear 
graae j (LJL/U434o ) 


90.8 


36.1 


54.3 ; 


50.3 


Kidney Margin 

(UJJU434o ) 


4.6 


3.2 j 


3.8 


4.0 


Kidney Cancer 

srxr\f\A O A 1 \ 
(UL)U4ozz-Ul) 


2.7 


1.8 


4.1 


3.5 


Kidney Margin 


2.0 


0.2 


0.3 


1.1 


Kidney Cancer 
(OD04450-01) 


0.0 


0.0 


0.0 


0.0 


Kidney Margin 
(OD04450-03) 


3.1 


1.4 


6.9 


5.4 


Kidney Cancer 
8120607 


1.5 


0.3 


0.5 


1.8 
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Kidney Margin 
8120608 


3.4 


0.8 


1.3 


2.2 


Kidney Cancer 
8120613 


0.4 


0.8 


4.3 


2.0 


Kidney Margin 
8120614 


2.8 


1.2 


4.9 


4.0 


Kidney Cancer 
9010320 


76.8 


39.0 


36.3 


52.9 


Kidney Margin 

AA 1 ATO 1 

9010321 


10.4 


5.1 


4.9 


4.3 


Normal Uterus 


0.0 


0.2 


0.3 


0.0 


Uterus Cancer 
064011 


0.9 


0.0 


0.3 


1.9 


Normal Thyroid 


0.3 


0.0 


0.9 


2.2 


Thyroid Cancer 
064010 


0.0 


0.0 


0.0 


0.0 


Thyroid Cancer 
A302152 


1.4 


0.1 


0.4 


0.0 


Thyroid Margin 
A302153 


1.7 


0.9 


6.4 


4.1 


Normal Breast 


20.3 


5.7 


13.0 


20.9 


Breast Cancer 
(OD04566) 


0.3 


0.1 


0.5 


03 


Breast Cancer 
(OD04590-01) 


2.4 


2.2 


1.1 


1.9 


Breast Cancer Mets 
(OD04590-03) 


0.7 


0.1 


0.3 


0.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


6.9 


3.3 


7.7 


9.3 


Breast Cancer 
064006 


5.6 


8.3 


4.2 


4.9 


Breast Cancer 1024 


47.0 


19.3 


30.1 


23.5 


Breast Cancer 
9100266 


14.9 


9.3 


15.7 


21.8 


Breast Margin 
9100255 


4.6 


1.5 


4.4 


8.4 


Breast Cancer 
A209073 


48.3 


12.9 


28.3 


40.9 


Breast Margin 
A2090734 


38.7 


16.6 


27.5 


29.7 


Normal Liver 


0.2 


0.1 


0.0 


0.2 


Liver Cancer 064003 


11.8 


5.6 


4.6 


4.5 


Liver Cancer 1025 


4.5 


1.6 


4.2 


2.8 


Liver Cancer 1026 


6.2 


6.7 


9.0 


6.4 
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Liver Cancer 6004-T 


15.6 


3.3 


3.5 


2.8 


Liver Tissue 6004-N 


0.1 


0.2 


0.2 


0.4 


Liver Cancer 6005-T 


14.6 


8.0 


8.7 


7.9 


T ivpr Ti^^iif* ^flOS-"W 


6 4 


7 0 

/ .V/ 


5 9 


3 2 


Normal Bladder 


1.3 


0.9 


1.6 


0.9 


Bladder Cancer 1023 


0.4 


0.2 


0.3 


0.3 


Bladder Cancer 

A "> A^ 1 T> 

A3 02 1 73 


7.5 


5.3 


12.1 


19.8 


Bladder Cancer 
(OD047 18-01) 


27.7 


23.0 


35.6 


41.5 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


0.4 


A "5 

0.3 


A ZT 

0.6 


A *7 

0.7 


Normal Ovary 


1 A 

1.4 


A A 

0.4 


i.i 


A ^ 

0.3 


Ovarian Cancer 
064008 


1.3 


0.5 


1.3 


1.3 


Ovarian Cancer 
(OD04768-07) 


0.4 


0.2 


0.0 


0.0 


Ovary Margin 
(OD04768-08) 


1.0 


0.8 


0.6 


1.6 


Normal Stomach 


4.3 


2.0 


0.3 


3.1 


Gastric Cancer 

C\f\ f f\ r ^ c o 

9060358 


2.4 - 


0.7 


0.7 


1.2 


Stomach Margin 
9060359 


0.0 


0.0 


0.7 


■4.3 


Gastric Cancer 
9060395 


0.9 


2.0 


1.9 


2.4 


Stomach Margin 


0.2 


0.3 


1.1 


0.2 


Gastric Cancer 
9060397 


0.4 


0.5 


1.6 


0.7 


Stomach Margin 
9060396 


0.0 


0.1 


0.2 


0.2 


Gastric Cancer 
064005 


0.7 


0.7 


0.9 


1.5 



Table AGP. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag047,Run 
158634002 


Tissue Name 


Rel. Exp.(%) 
Ag047,Run 
158634002 


Daoy- Medulloblastoma 


0.9 


Ca Ski- Cervical epidermoid 1 
carcinoma (metastasis) 


0.3 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


6.7 
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D283 Med- 
Medulloblastonia 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


rrbK-1- Primitive 
N e 1 1 vn p^tnH prm a I 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


100.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


79.0 


Daudi- Burkitt's lymphoma 


0.0 


1 98G- Glioblastoma 


1.4 


U266- B-cell plasmacytoma 


0.0 


Of XT CT T 

Neuroblastoma 
(metastasis) 


2.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.1 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


4.5 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


2.9 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


3.8 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lvmnhnma 

AXW A / V x WWII Ij HILJIIKJUICL 


0 ft 


DMS-79- Small cell lung 
cancer 


0.1 


U937- Histiocvtic Ivmnhnma 


0 ft 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0 ft 


NCI-H526- Small cell 
lung cancer 


1.0 


769-P- Clear cell renal 
carcinoma 


0.1 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


0.1 


NCI-H82- Small cell lung 
cancer 


7.6 


SW 839- Clear cell renal 
carcinoma 


0.0 . 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.8 


G401- Wilms' tumor 


6.0 


NCI-HI 155- Large cell 
lung cancer 


o.o. 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.2 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727-Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.1 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.6 


LX-1- Small cell lung 
cancer 


q q JHPAC- Pancreatic . 
[adenocarcinoma 


0.0 
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Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.1 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- B ladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder carcinoma 


3.8 


SW111 6- Colon 
adenocarcinoma 


o.o - 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


10.3 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma 


59.9 


NCI-SNU-5- Gastric 
carcinoma 


0.7 


MG-63- Osteosarcoma 


0.5 


KATO in- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


4.2 


NCI-SNU-16- Gastric 
carcinoma 


0.2 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.3 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


0.6 


RF-1- Gastric 
adenocarcinoma 


0.3 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.1 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma oi tongue 


0.2 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


1.1 


RL95-2- Uterine 
carcinoma 


4.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.3 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


7.0 



Table AGQ. Panel 4.1D 





Rel. Exp.(%) 




Rel. £xp.(%) 


Tissue Name 


Ag2975, Run 


Tissue Name 


Ag2975,Run 




171818669 




171818669 



1072 



WO 02/068649 



PCT/US02/02785 



Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 




u.u 


nU VbC IrlN gamma 


A A 

0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.1 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


37.4 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


47 6 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


0.8 


Coronery artery SMC rest 


0.8 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.6 


CD8 lymphocyte act 


6.6 


Astrocytes rest 


51.8 


Secondary CD8 
lymphocyte rest 


0.1 


Astrocytes TNFalpha + 
IL-lbeta 




Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil, rest 


0 0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_ anti- 
CD95 CH1 1 


0.0 


CCD1 106 (Keratinocytes) 
none 


45.1 


LAK cells rest 


0.0 


cciJl lUo (Keratinocytes) 
TNFalpha + IL-lbeta 


33.7 


LAK cells IL-2 




Liver cirrhosis 


0.2 


-AK cells IL-2+TI -12 


fl ft 
u.u 


JNUi-nzyz none 


0.5 


^AK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


12.6 


LAKcellsIL-2+IL-18 


0.1 


NCI-H292 IL-9 


0.6 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292IL-13 


10.9 


NK Cells IL-2 rest 


0.0 


NCI-H292IFN gamma 


0.9 


Two Way MLR 3 day j 


0.0 


HPAEC none 


1.2 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 1 0.3 
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beta 


Two Way MLR 7 day 


0.1 


Lung fibroblast none 


1.5 


PBMC rest 


0.0 


Lung fibroblast TNF alpha 

1 TT 1 1 A _ 

+ IL-1 beta 


0.0 


rDML r WiVl 


c\ n 
U.U 


Lung fibroblast IL-4 


0.1 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.2 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-1 3 


1.3 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.8 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD1070 rest 


4.1 


B lymphocytes CD40L 

on rl TT -A 
anu 1J_*-H 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


2.2 


EOL-l dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.8 


CAT 1 A\^r> A \AT> 

PMA/ionomycin 


0.0 


Dermal fibroblast IFN 

cam ma 


2.4 


Dendritic cells none 


0.0 


UC i III ail IlurUDlaSl JLLj-H 




Dendritic cells LPS 


0.0 


UCllllal JrlOrUDIaSlS icSL 




Dendritic cells anti- 
CD40 


0.1 


Neutrophils TNFa+LPS 


0.6 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.0 


Colon 


0.5 


Macrophages rest 


0.0 


Lung 


1.6 


Macrophages LPS 


0.0 


Thymus 


9.3 


HUVEC none 


0.0 


Kidney 


11.3 


HUVEC starved 


0.4 







Table AGR. Panel 4D 



Tissue 
Name 


Rel. 
Exp.(%) 
Ag047, 

Run 
146087309 


Rel. 
Exp.(%) 
Ag047, 
Run 
151918119 


ReL 
Exp.(%) 
Ag047, 
Run 
152893522 


Rel. 

Exp.(%) 
Ag2679, 

Run 
158535667 


Rel. 
Exp.(%) 
Ag2728, 

Run 
158562582 


Rel. 
Exp.(%) 
Ag2975, 

Run 
164314612 


Rel. 
Exp.(%) 
Ag47b, 
Run 
158664022 


Secondary 
Thl act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


Secondary 
Th2act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
Trl act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
Thl rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
Th2rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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Secondary 
Trl rest 


0.0 


0.0 


0.0 


0.0 


0 0 


0 0 
u.u 


0 0 
u.u 


Primary 
Thl act 


0.0 


0.0 


0.0 


0.0 


0 0 


0 ft 


u.u 


Primary 
Th2 act 


0.0 


0.0 


0.0 


0.0 


0 0 
u.u 


ft 0 
u.u 


u.u 


Primary 
Trl act 


0.0 


0.0 


0.0 


0.0 


0 0 

U»U 


u.u 


ft ft 
u.u 


Primary 
Thl rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Primary 
Th2 rest 


0 0 

u.u 


n n 
u.u 


ft ft 

u.u 


u.u 


0.0 


0.0 


0.0 


Primary 
Trl rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


CD45RA 
CD4 

lymphocyt 
e act 


0.1 


0.2 


0.1 


0.3 


0.3 


0.0 


0.3 


CD45RO 
CD4 

lymphocyt 
e act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


CD8 

lymphocyt 
eact 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
CD8 

lymphocyt 
e rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
CD8 

lymphocyt 
e act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


CD4 

lymphocyt 
e none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


2ry 

Thimi2/T 
rl anti- 
CD95 
CH11 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


LAK cells 
rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


LAK cells 
IL-2 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


LAK cells 
IL-2+IL- 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 ; 
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12 












LAK cells 
IL-2+IFN 
gamma 


0.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


LAK cells 

IL-2+IL- 

18 


0.0 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


LAK cells 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


NK Cells 
IL-2 rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Two Way 
MLR 3 * 
day 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Two Way 
MLR 5 " 
day 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Two Way 
MLK / 

day 


0.0 


0.0 


A A 

u.u 


u.u 


0.0 


0.0 


A A 

u.u 


rDivit resi 


0.0 


0.0 


A A 

u.u 


A h 

u.u 


0.0 


0.0 


A A 

u.u 


PBMC 
PWM 


bo 


0.0 


0.0 


0.0 


0 1 


0 0 


0.0 


PBMC 
PHA-L 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Ramos (B 
cell) none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Ramos (B 
cell) 

ionomycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


B 

lymphocyt 
es PWM 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


B 

lymphocyt 
es CD40L 

J TT A 

and IL-4 


n n 
u.u 


n n 
u.u 


0.0 


0.0 


A 1 

U.l 


A A 

u.u 


0.0 


EOL-1 
dbcAMP 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


hXJL-l 

dbcAMP 

PMA/iono 

mycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Dendritic 
cells none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Dendritic 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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cells LPS 
















Dendritic 
cells anti- 
CD40 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Monocytes 
rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Monocytes 
LPS 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Macropha 
ges rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Macropha 
ges LPS 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
starved 


0.0 


0.0. 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
IL-lbeta 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 

IFN 

gamma 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
4- IFN 
gamma 


ft ft 


u.u 


ft ft 

u.u 


n n 
u.u 


A A 

u.u 


A A 

u.u. 


A A 

O.U 


HUVEC 
TNF alpha 
+ IL4 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
IL-11 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 

Microvasc 
ular EC 
none 


A A 

u.u 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 

Microvasc 
ular EC 
TNFalpha 
+ IL-lbeta 


0.0 


7.3 


0.0 


0.0 


0.0 


0.0 


0.0 


Microvasc 
ular 

Dermal EC 
none 


0.0 


0.0 


0.0 


f 

0.0 


0.0 


0.0 


0.0 


Microsvas 
ular 

Dermal EC 
TNFalpha 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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+ IL-lbeta 
















Bronchial 
epithelium 
TNFalpha 
+ ILlbeta 


8.6 


2.0 


1.4 


10.0 


4.6 


11.9 


13.3 


Small 
airway 
epithelium 
none 


11 .0 




17.8 


17.7 


17.8 


12.2 


17.2 


Small 
airway 
epithelium 
TNFalpha 
+ IL-lbeta 


100.0 


100.0 


100,0 


100.0 


100.0 


100.0 


100.0 


Coronery 
artery 
SMC rest 


0.2 


0.2 


0.3 


0.4 


0.4 


0.2 


0.2 


Coronery 
artery 
SMC 
TNFalpha 
+ IL-lbeta 


0.2 


0.3 


0.2 


0.2 


0.2 


0.1 


0.2 


Astrocytes 
rest 


23 3 


19 3 


?n 4 


in ft 


ZU. / 


10.0 


lie 

11. a 


Astrocytes 
TNFalpha 
+ IL-lbeta 


27.0 


25.0 


25.3 


27.9 


22.7 


19.3 


28.5 


KU-812 

(Basophil) 

rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


KU-812 
(Basophil) 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


CCD 11 06 
(Keratinoc 
ytes) none 


18.4 


12.7 


14.3 


16.7 


12.5 


10.2 


13.5 


CCD1106 
(Keratinoc 
ytes) 

TNFalpha 
+ IL-lbeta 


4.2 


1.4 


2.0 


7.1 


3.5 


5.1 


8.4 


Liver 
cirrhosis 


0.3 


0.2 


0.2 


0.3 


0.3 


0.2 


0.4 


Lupus 
kidney 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


NCI-H292 
none 


0.4 


0.2 


0.4 


0.3 


0.6 


0.1 


0.2 
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NCI-H292 
IL-4 


9.3 


4.8 


7.5 


8.9 


8.2 


5.3 


8.7 


NCI-H292 
IL-9 


0.6 


0.3 


0.6 


0.7 


0.7 


0.5 


0.4 


NCI-H292 
IL-13 


4.0 


5.1 


3.9 


4.5 


4.2 


2.9 


3.3 


NCI-H292 

IFN 

gamma 


0.2 


0.2 


0.1 


0.3 


0.2 


0.2 


0.2 


HPAEC 
none 




ft ft 
u.u 


v.u 


fi n 
u.u 


ft ft 

U.U 


n ft 
u.u 


ft 1 

U.l 


HPAEC 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 

fibroblast 

none 


0.1 


0.3 


0.3 


0.3 


0.3 


0.1 


0.2 


Lung 
fibroblast 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.1 


0.0 


0.0 


0.1 


0.1 


Lung 

fibroblast 

IL-4 


0.3 


0.1 


0.2 


0.3 


0.1 


0.2 


0.3 


Lung 

fibroblast 

IL-9 


0.0 


02 t 


0.1 


0.1 


0.1 


0.1 


0.1 


Lung 

fibroblast 

IL-13 


0.1 


0.1 


0.1 


' 02 


0.2 


0.0 


0.3 


Lung 
fibroblast 
IFN 
gamma 


0.1 


0.0 


0.0 


0.1 


0.1 


0.2 


0.1 


Dermal 
fibroblast 
CCD1070 
rest 


2.2 


1.3 


2.4 


3.1 


2.5 


1.5 


2.4 


Dermal 
fibroblast 
CCD1070 
TNF alpha 


1.0 


0.8 


1.1 


2.1 


1.8 


0.9 


1.2 


Dermal 
fibroblast 
CCD1070 
IL-1 beta 


0:3 


0.3 


0.6 


0.7 


0.6 


0.4 


0.6 


Dermal 
fibroblast 


0.6 


0.6 


1.1 


1.5 


1.1 


0.6 


0.7 
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IFN 
gamma 
















Dermal 
fibroblast 

TT -A 


1.8 


1.2 


2.0 


2.5 


1.5 


1.2 


1.4 


Colitis 2 


0.1 


0.1 


0.0 


0.0 


0.0 


0.0 


0.0 


IBD 
Crohn's 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Colon 


0.0 


0.0 


0.1 


0.2 


0.2 


0.1 


0.2. 


Lung 


0.3 


0.3 


0.2 


0.6 


0.3 


0.2 


0.2 


Thymus 


0.8 


1.5 


0.9 


1.4 


1.2 


1.0 


1.4 


Kidney 


3.0 


2.8 


3.6 


4.5 


3.3 


2.0 


2.6 ! 



CNS_neurodegeneration_vl.O Summary: Ag047/Ag2679/Ag2728 This panel does 
not show differential expression of the NOV37 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1 for 
5 discussion of utility of this gene in the central nervous system. 

Panel 1 Summary: Ag047/Ag332/Ag47b Multiple experiments with three different 
probe and primer sets produce results that are in excellent agreement, with highest expression 
oftheNOV37 gene in a brain cancer cell line (CT=23-25). There is also significant expression 
in clusters of samples from melanoma, ovarian cancer, breast, lung, renal colon and brain 
1 0 cancer lines. Thus, expression of this gene may be associated with these forms of cancer and 
could potentially be used as a diagnostic marker for the presence of these cancers. 
Furthermore, therapeutic modulation of the expression or function of these genes may be 
useful in the treatment of ovarian, breast, lung, renal, and brain cancer and melanoma. 

In addition to significant expression in brain cancer cell lines, this gene is 
1 5 preferentially expressed in the brain. This expression profile suggests that this gene prodcut 
may play a role in CNS processes. This gene encodes a homolog of a member of the neurestin 
family, Ten M2, and may play a role in neuronal regeneration. Thus, agents that induce the 
expression or activity of NOV37 may have utility as neuronal regeneration drugs. Such agents 
would have utility in neurodegenerative diseases, stroke, and neuronal trauma. 
20 Among tissues with metabolic function, this gene shows consistent expression in 

thyroid, adult and fetal heart, liver and skeletal muscle. Thus, this gene product may be an 
antibody target for the treatment of metabolic and endocrine disease, including obesity and 
Types 1 and 2 diabetes. In addition, this gene is expressed at higher levels in adult liver than in 
fetal liver and may be useful for differentiating between the two sources of liver tissue. 
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References: 

Otaki JM, Firestein S. Neurestin: putative transmembrane molecule implicated in 
neuronal development. Dev Biol 1999 Aug 1;212(1):165-81 

We have cloned a novel cDNA encoding a putative transmembrane protein, neurestin, 
5 from the rat olfactory bulb. Neurestin was identified based on a sequence similar to that of the 
second extracellular loops of odorant receptors in the cysteine-rich CC box located 
immediately after EGF-like motifs. Neurestin shows homology to a neuregulin gene product, 
human gamma-heregulin, a Drosophila receptor-type pair-rule gene product, Odd Oz (Odz) / 
Ten(m), and Ten(a), suggesting a possible function in synapse formation and morphogenesis. 
10 Recently, a mouse neurestin homolog has independently been cloned as DOC4 from the NIH- 
3T3 cell line. Northern blot analysis showed that neurestin is highly expressed in the brain and 
also in other tissues at much lower levels. In situ hybridization studies showed that neurestin is 
expressed in many types of neurons, including pyramidal cells in the cerebral cortex and tufted 
cells in the olfactory bulb during development. In adults, neurestin is mainly expressed in 
15 olfactory and hippocampal granule cells, which are known to be generated throughout 

adulthood. Nonetheless, in adults the expression of neurestin was experimentally induced in 
external tufted cells during regeneration of olfactory sensory neurons. These results suggest a 
role for neurestin in neuronal development and regeneration in the central nervous system. 
Panels 1.1/1.2/1.3D Summary: Ag047 TheNOV37 gene is expressed in melanoma, 
20 ovarian cancer, breast, lung, renal and brain cancer lines with good concordance for 3 

independent runs. Expression of this gene might be associated with these forms of cancer and 
therapeutic modulation of this gene might be of use in the treatment of these cancers. Please 
note that results from one experiment on Panel 1.3D with the probe/primer set Ag2975 are not 
included. The amp plot suggests that there were experimental difficulties with this run. 
25 Panel 2.2 Summary: Ag2975 This gene appears to be expressed at a very low level in 

the samples used in this panel. Significant expression is only seen in lung, kidney and breast 
cancer samples. Expression of this gene might be associated with these forms of cancer and 
therapeutic modulation of this gene might be of use in the treatment of these cancers. 

Panel 2D Summary: Ag047 The expression of the NOV37 gene was assessed in 
30 multiple runs on this panel, with excellent concordance between the runs. This gene appears to 
be expressed at a higher level in gastric, bladder, and 2 samples each of lung and kidney 
cancer when compared to the normal adjacent tissue. Thus, expression of this gene might be 
associated with these forms of cancer and therapeutic modulation of this gene might be of use 
in the treatment of these cancers. 
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Panel 3D Summary: Ag047 The NOV37 gene is expressed in squamous cell 
carcinoma, glioma, small cell lung cancer cell lines. Thus, expression of this gene might be 
associated with these cancers and therapeutic modulation of this gene might be of use in the 

treatment of these cancers. 

5 Panels 4D/4.1D Summary: Ag047/Ag2679/Ag2728/Ag2975/Ag047b Multiple runs 

with different set of primers give very consistent expression data. Highest expression of the 
NOV37 transcript is found in small airway epithelium upon treatment with TNF-a and II- 1 
(CT=25). This expression is significantly up-regulated when compared to.untreated tissue 
(CT=28). Moderate expression of this transcript is also found in keratinocytes and astrocytes. 

10 NOV37 encodes for a neurestin like molecule whose role in neuronal regeneration has been 
demonstrated. Therefore, the putative protein encoded by NOV37 may play an important role 
in the regeneration or repair mechanism of these tissues in inflammation. Thus, therapeutic 
modulation of the expression of this gene product may be beneficial for the treatment of 
inflammatory lung diseases such as bronchitis, chronic obstructive pulmonary disease, 

1 5 emphysema. Therapeutics designed against this putative protein may also be useful for in the 
CNS for reducing inflammation, including inflammation that results from multiple sclerosis or 
stroke. 

NOV38, NOV39a, and NOV39b 

Expression of gene NOV38 and variants NOV39a and NOV39b was assessed 
20 using the primer-probe set Ag3753, described in Table AHA. Results of the RTQ-PCR 
runs are shown in Tables AHB and AHC. 



Table AHA. Probe Name Ag3753 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -cactgcagtaattcagctggta-3 1 


22 


835 


1135 


Probe 


TET-5 1 -agtatccagtcccgccatcccagtt-3 1 - 
TAMRA 


25 


797 


1136 


Reverse 


5' -aggcgagaccattacgtagact-3 • 


22 


769 


1137 



Table AHB. General_screeningjpanel_vl.4 



Tissue Name 


Rel.Exp.(%)Ag3753, 
Run 216707728 


Tissue Name 


Rel.Exp.(%)Ag3753, 
Run 216707728 


Adipose 


0.0 


Renal ca.TK-10 


1.0 


Melanoma* 
Hs688(A).T 


0.3 


Bladder 


0.0 
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Melanoma* 
Hs688(B).T 


0.3 


Gastric ca. (liver met.) 
NCI-N87 


0.1 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* biv- 
MEL-5 


0.0 


Colon ca. SW480 


0.2 


Sauamou^ cell 
carcinoma SCC-4 


02 


uojon ca. ^oW4oU 
met) SW620 


2.6 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca * fhrmp 

met) PC-3 


0.1 


Colon ca.HCT-1 16 


0.0 


Prostate Pool 


o n 


coion ca. caco-z 


0.0 


Placenta 


0.0 


Colon cancer tissue 


o.b 


uterus rool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 

r\\/r* ad "5 
UVLAR-j 


0.1 


Colon ca. Colo-205 


00 


Ovarian ca. SK-OV- 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
UVCAR-4 


0.1 


Colon Pool 


0 0 


Ovarian ca. 
OVCAR-5 


100.0 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
] 


0.1 


Stomach Pool 


0.0 


Ovarian ca 

^— ' » *** IUJJ wu • 

OVCAR-8 


2.1 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 




Breast ca. MCF-7 


0.1 


Heart Pool 


0 Q 
\j.y 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


n s 


oKeieiai Muscle rool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


b.o 


oreast rooi 


0.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer (glio/astro) 
U87-MG 


0.2 


Lung 


0.0 


CNS cancer (glio/astro) 

r T.I 1 ft AACl 


02 


Fetal Lung 


0.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.2 


CNS cancer (astro) SF- 
539 


1.1 


Lung ca. LX-1 


1.9 


CNS cancer (astro) 
SNB-75 


0.4 
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Lung ca. NCI-H146 


0.1 


\^rNo Lander ^gllUy 

SNB-19 


0.2 


Lungca. SHP-77 


0.9 


PT\J^ runner (o\\ri\ QP- 
\^.tno Cancer ^gnuj or- 

295 


3.1 


Lung ca. A549 


0.1 


Brain (Amygdala) Pool 


0.0 


T nncr ra NCT-HS26 


ft ft 


jjidin ^cercDeiiumj 


ft i 




0 2 


Ft rain {fatnW 


ft ft 


Lung ca. NCI-H460 


• 0.1 


jLJiain ^riippocainpus^ 
Pool 


0.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.2 


Brain (Substantia nigra) 
Pool 


0.0 


Liver 


1.2 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.9 


Brain (whole) 


0.1 


Liver ca. HepG2 


2.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.1 


Pancreatic ca. CAPAN2 


0.0 


Renal ca.UO-31 


0.0 


Pancreas Pool 


0.0 



Table AHC. Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3753, Run 
172209242 


Tissue Name 


Rel. Exp.(%) 
Ag3753, Run 
172209242 


Secondary Thl act 


0.5 


HUVEC IL-lbeta 


0.5 


Secondary Th2 act 


2.1 


HUVEC IFN gamma 


1.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


1.2 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.2 


HUVEC IL-11 


0.0 


Secondary Trl rest 


1.0 


Lung Microvascular EC 
none 


0.9 


Primary Thl act 


0.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.5 


Bronchial epithelium 
TNFalpha + IL1 beta 


1.2 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 
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none 




Primary Trl rest 


0.5 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


lymphocyte act 


1.7 


Coronery artery SMC rest 


2.6 


lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.3 


l^uo lympnocyte act 


0.0 


Astrocytes rest 


1.2 


Secondary CD8 
lympnocyte rest 


0.2 


Astrocytes TNFalpha + 
IL-lbeta 


n a 

U.4 


Secondary CD8 
jympnocyte act 


3.6 


KU-812 fBasonhin re<:t 


U.U 1 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
rMA/ionomycm 


0.0 


2ryThl/Th2yTrl anti- 
CD95 CHI 1 


0.5 


CCD1106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


2.5 


CCDl 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


U.U 


Liver cirrhosis 


39.5 


-AK cells IL-2+TT -19 


l.j 


NCI-H292 none 


0.3 


^AK cells IL-2+IFN 
gamma 


1.0 


NCI-H292 IL-4 


1.1 


-rAK cells IT -2+ TT -18 




NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


3.8 


NCI-H292 IL-13 


0.8 


NK Cells IL-2 rest 


U.J 


NCI-H292 IFN gamma 


0.3 j 


Two Wav MLR 3 dav 




HFAEC none 


0.0 


Two Way MLR 5 day 


2.6 


HPAEC TNF alpha + IL-1 
beta 


0.0 


Two Way MLR 7 day 


1.0 ] 


Lung fibroblast none 


15.1 


PBMCrest 


0.0 ] 


Lung fibroblast TNF alpha 
f IL-lbeta 


1.2 


PBMC PWM 


0.0 ] 


Lung fibroblast IL-4 


0.8 


PBMC PHA-L 


0.5 ] 


Lung fibroblast IL-9 


0.0 


jEvamos (d ecu) none 


12.8 jLung fibroblast IL- 1 3 


0.4 


Ramos (B cell) 
luiiumycin 


100.0 j Lung flbrobIa st IFN 
Jgamma 


U.o 


B lymphocytes PWM 


0 5 jDermal fibroblast 
jCCDl 070 rest 


3.7 


B lymphocytes CD40L 
and IL-4 


q 9 jDermal fibroblast 

CCD1070TNF alpha 


2.3 


EOL-1 dbcAMP 


0 o Dermal fibroblast 
|CCD1070IL-lbeta 


2.2 


EOL-1 dbcAMP 
PMA/ionomycin 


0 0 jDermal fibroblast IFN | 
Jgamma 


0.0 
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Dendritic cells none 


0.9 


Dermal fibroblast IL-4 


43 


Dendritic cells LPS 


6.2 


Dermal Fibroblasts rest 


0.9 


Dendritic cells anti- 
CD40 


3.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.5 


Colon 


0.8 


Macrophages rest 


3.7 


Lung 


0.0 


Macrophages LPS 


1.6 


Thymus 


1.1 


HUVEC none 


0.0 


Kidney 


15.3 


HUVEC starved 


0.0 







General screening pane! vl.4 Summary: Ag3753 Results from one experiment 
with the NOV38 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 

5 Panel 4.1D Summary: Ag3753 Highest expression of the NOV38 transcript is found 

in Ramos B cell line activated with PMA and ionomycin (CT=29). However, expression is not 
seen in primary activated B cells. Therefore, epxression of this gene could potentially be used 
as a marker for activated B lymphoma. This gene is also expressed at lower levels in liver 
cirrhosis, lung fibroblasts and kidney. This transcript encodes for a molecule that belongs to 
10 the activin family, a member of the TGF beta superfamily. These factors influence growth and 
differentiation and are present in many cells and tissues. Therefore, therapeutics using the 
protein encoded by NOV38 could be important for the normal homeostasis of these tissues. 

NOV40 

Expression of gene NOV40 was assessed using the primer-probe set Ag2907, 
1 5 described in Table AIA. Results of the RTQ-PCR runs are shown in Tables AIB, AIC, AID 
and AIE. 

Table AIA. Probe Name Ag2907 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggctcattcgaaactactggta-3 1 


22 


773 


1138 


Probe 


TET-5 ' -tggaatttcctcgcccactcttacct-3 1 - 
TAMRA 


26 


797 


1139 


Reverse 


5 1 -ggttgacaggtttgcagtagag-3 1 


22 


844 


1140 



Table AIB. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2907,Run 


Rel. Exp.(%) 
Ag2907, Run 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 


Rel. Exp.(%) 
Ag2907, Run 
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157Zo34Z3 


165701505 j 


157283423 


165701505 


Liver 

adenocarcinoma 


0.0 


0.0 jlCidney (fetal) 


0.0 


0.0 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.0 


0.0 


Pancreatic ca. 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 


Thyroid 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


— 

0.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


100.0 


100.0 


Renal ca. TK- 
10 


2.0 


28.3 


Brain (fetal) 


0.0 


0.0 


Liver 


0.0 


0.6 


oram jwnoiej 


u.u 


14.3 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


11.9 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


1.9 


0.0 


Lung (fetal) 




u.u 


Brain (substantia 
nigra) 


0.0 


0.0 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


1.1 


0.0 


Cerebral Cortex 


2.0 


0.0 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


1.3 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


12.2 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.0 


glio/astroU-118- 
MG 


0.0 


0.0 


Lung ca. (non- 
sxell) NCI- 
H23 


0.0 


0.0 


aairouycoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell)HOP-62 


o.o i 


0.0 


neuro*; met SK-N- 
AS 


1.0 


0.0 


Lung ca. (non- 

s.cl)NCI- 

H522 


0.0 ' 


0.0 


astrocytoma SF- 
539 


1.4 


0.0 


Lung ca. 
(squam.) SW 


0.9 


0.0 



1087 



WO 02/068649 



PCT/US02/02785 









900 






astrocytoma SNB- 
75 


1.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


1.8 


0.0 


Mammary 
gland 


0.0 


0.0 


glioma U251 


0.8 


0.0 


Breast ca.* 
(pl.ef)MCF-7 


1.4 


2.5 


glioma SF-295 


1.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


2.6 


0.0 


Heart (fetal) 


0.0 


0.0 


Breast ca.* 
(pl.ef) T47D 


0.0 


0.0 


Heart 


0.0 


, 0.0 


Breast ca. BT- 
549 


3.3 


0.0 


Skeletal muscle 
(fetal) 


1.0 


0.0 


Breast ca. 
MDA-N 


0.0 


0.0 


skeletal muscle 


0.0 


A A 

0.0 


Ovary 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


2.9 


0.0 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


0.9 


0.0 


Ovarian ca. 
OVCAR-5 


3.0 


10.9 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


otomacn 


U.U 


U.U 


Ovarian ca.* 
(ascites) SK- 
OV-3 


A A < 

0.0 


A A 

0.0 


Small intestine 


0.0 


0.0 


Uterus 


0.0 


0.0 


Colon ca. SW480 


0.0 


0.0 


Placenta 


0.9 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


1.0 


0.0 


Prostate ca.* 
(bone met)PC- 

3 


1.3 


0.0 


v^oion ca. ric l - 
116 


o.o ! 


0.0 


Testis 


16.6 


9.5 


Colon ca. CaCo-2 


1.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


11.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 
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Colon ca. HCC- 
2998 


2.6 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


1.2 


13.3 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.0 


6.7 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


0.0 


0.0 


Adipose 


1.5 


0.0 



Table AIC. Panel 2D 



Tissue Name 


Rel.Exp.(%) 
Ag2907,Run 
157284121 


Tissue Name 


Rel.Exp.(%) 
Ag2907, Run 
157284121 


Normal Colon 


A C 

4.5 


Kidney Margin 
8120608 


0.0 


CC WelltoModDiff 
(OD03866) 


2.1 


Kidney Cancer 
8120613 


0.0 


LC JVlargin (0003866) 


2.3 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


1.7 


Kidney Cancer 
9010320 


A A 1 

0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


4.8 


CCModDiff(ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


1.5 1 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


6.8 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


15.2 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


1.7 


Lung Margin (OD0445 1 - 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


2.2 


Prostate Margin 


0.0 


Breast Cancer 


2.8 
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(OD04410) 




Metastasis 

ii~\T\f\A /ICC f\C\ 

(OD04655-05) 




Prostate Cancer 

(UJL/U4 1 ) 


100.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(UL/U4//U-Uz) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 

— — — — 


7.2 


Breast Cancer 
9100266 


2.4 


Lung Met to Muscle 
(0004286) 


0.6 


Breast Margin 
9100265 


0.0 


Muscle Marein 

I'AUt/Vlv iiUU gill 

(OD04286) 


0.0 


A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


3.4 


Breast Margin 
A2090734 


0.0 | 


Lung Margin (OD03 126) 


0.0 


Normal Liver 


0.0 


Limg Cancer (OD04404) 


0.0 


Liver Cancer 064003 


3.5 i 


Lunff Margin (00044041 


4 1 


T iver Cancer 1 fl?S 


0 0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


.Lung Margui (UD04565) 


A A 

0.0 


Liver Cancer 6004-T 


0.0 


bung Cancer (UJJU4237- 
01) 


0.0 


Liver Tissue 6004-N 


0.5 


bung Margin (ULH/4237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


ucuiar Mel Met to Liver 
(ODO4310) 


0.0 • 


Liver Tissue 6005-N 


0.0 


L»iver Margin \\ju\jhd 1 


u.o 


Nonnal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


22.1 


Normal Kidney 


0.0 


Bladder Cancer 
(OD04718-01) 


2.4 


Kidney Ca, Nuclear grade 
2(OD04338) 


2.0 


Bladder Nonnal 
Adjacent (OD0471 8- 


0.0 


Kidney Margin 

\\JlJ\J t tjjo) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (0004339) 


2.0 


Ovarian Cancer 

UUtUvO 


0.0 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


. 14.4 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


2.4 • 


Nonnal Stomach 


0.0 
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Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


0.0 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397. 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


2.1 


Gastric Cancer 
064005 


2.9 



Table AID. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
164633936 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
164633936 


Daoy- Medulloblastoma 


0.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


49.7 


TE67 1 -. Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


15.7 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498-CNS 


b.o 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


49.7 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


94.0 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 
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DMS-79- Small cell lung 
cancer 


0.0 I 


J937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


CU-812- Myelogenous 
eukemia 

VUIVVII11U 


0.0 


NC1-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


27.2 


NCI-N417-Small cell 
lung cancer 


.0.0 ; 


^aki-2- Clear ceil renai 
carcinoma 


0.0 


NCI-H82- Small cell 
lung cancer 


0.0 


:>W o jy- dear ceil renai 
:arcinoma 


0.0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


J4U1- Wilms tumor 


0.0 


NCI-HI 155- Large cell 
lung cancer 


0.0 


rls/oo 1 - r ancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


16.4 


^ArAJN-i- rancreaiic 
adenocarcinoma (liver 

m eta s ta si s^ 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 

r>fiTr>inomf) Hiver metastasis^ 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 

sirlprificarcinnma 


0.0 


LX-1- Small cell lung 
cancer 


0.0 


HP AC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
^transitional cell) 


0.0 


SW-48- Colon 


0.0 


5637- Bladder carcinoma 


0.0 


SWl 116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


17.6 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATOIII- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


1 0.0 
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NCl-SNU-l 6- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


NCI-SNU-l- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


0.0 


RF-l- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adcnocarcin oma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


37.6 1 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 

r^niYMfirMTii r\f* tnnmi^ 
L/OlUlIlUIHu Ul lUllgUC 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


100.0 



Table AIE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
157284733 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
157284733 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


0.0 


HUVECTNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


12.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


10.4 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


11.2 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


11.7 


Bronchial epithelium 
TNFalpha + ILlbeta 


12.9 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


24.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 
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CD45RO CD4 
lymphocyte act 


6.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


12.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


4.5 


KU-812 (Basophil) 
rivLnj lunomy win 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCDl 106 (Keratinocytes) 

tiAnp 
llV/llw 


0.0 


LAK cells rest 


0.0 


v^v^i-/! iuu ^xVCfaunUwjr ICS J 

TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


n 4 


XwlVwX vIllllUolo 




LAK cells IT -2+TT -1? 

L/niv w wl lO i ■ ~f* > JL J / X Z# 


0 0 


xwupiid iviuncy 


u.u 


LAK cells TL-2+TFN 
gamma 


0.0 


NCI-H292 none 


33.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


36.6 


LAK cells 

L/iiXV WWllO 

PMA/ionomycin 


0.0 


NCI-H292 IL-9 


23.3 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


21.2 


Two Wav MT R 3 rtav 

x wv »t ajf ivxxvxv j \1cLj 


ft ft 


i\\^x-rxzyz. ir in gamma 


1 1 £ 

1 l.D 


Two Wav MT "R S Hav 


ft ft 

U.U 


XXX /vljV^ nunc 




Two Way MLR 7 day 


0.0 


RPAPP TMP nlnVia -4- TT 1 
jnii /\xjv^ liNr aipna^iJU"! 

beta 


0.0 


PBMC rest 


l/.v 


T f 1TKT Ti nrAnl q cf nnno 

i^ung liuroDiaSi none 


fi n I 
u.u 


PBMC PWM 


11.7 


Lung fibroblast TNF alpha 

4- TT -1 Kpta 
' 1L/ 1 DC la 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
lonomycm 


0.0 


Lung fibroblast IL- 1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

on/1 TT A 

ana 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-l dbcAMP 


0.0 


Dermal fibroblast 
CCDl 070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


J-SCMlial ilDiODiaSi 

CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


13.1 
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Monocytes rest 


0.0 


IBD Crohn's 


6.9 


Monocytes LPS 


26.1 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


16.0 


HUVEC none 


0.0 


Kidney 


13.1 


HUVEC starved 


0.0 







Panel 13D Summary: Ag2907 Results from two experiments with the same 
probe/primer set are in good agreement. Expression of the NOV40 gene is highest in a sample 
derived from pituitary tissue with little to no expression detected in any other tissue. Thus, 
5 expression of this gene could be used to distinguish pituitary gland from the other samples on 
this panel. 

The protein encoded for by this gene is most homologous to a glucuronosyltransferase, 
normally found in liver. UDP glycosyltransferases (UGT) are a superfamily of enzymes that 
catalyze the addition of the glycosyl group from a UTP-sugar to a small hydrophobic 

10 molecule. Glucuronosyltransferases are membrane-bound microsomal enzymes that catalyze 
the transfer of glucuronic acid to a wide variety of exogenous and endogenous lipophilic 
substrates. These enzymes are of major importance in the detoxification and subsequent 
elimination of xesnobiotics such as drugs and carcinogens. The pituitary plays a major role in 
the physiology of many different systems in the body. Therefore, this gene may play an 

1 5 essential role in maintaining proper function of the pituitary gland and many of its secreted 

peptides. Furthermore, therapeutic modulation of the activity of this gene or its protein product 
using small molecule drugs may be useful for the treatment of diabetes and diabetes as well as 
growth, reproductive, and endocrine disorders. 

Panel 2D Summary: Ag2907 Expression of the NOV40 gene is highest and almost 

20 exclusive to a sample derived from a prostate cancer (CT = 3 1 .7). Thus, the expression of this 
gene could be used to distinguish prostate cancer from the other samples in the panel. 
Moreover, therapeutic modulation of the activity of this gene or its protein product, through 
the use of small molecule drugs, protein therapeutics or antibodies, might be of benefit in the 
treatment of prostate cancer. 

25 Panel 3D Summary: Ag2907 Expression of the NOV40 gene is highest in a sample 

derived from a squamous cell carcinoma cell line (CT = 33.8). Thus, the expression of this 
gene could be used to distinguish this sample from the other samples in the panel. 
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Panel 4D Summary: Ag2907 Expression of the NOV40 gene is detected at a very low 
level in small airway epithelium treated with the inflammatory cytokines TNF-a and IL-lb 
(CT = 34.2). Thus, expression of this gene may be a marker of inflammation in the lung. 

NOV41a and NOV41b 

5 Expression of gene NOV41a and variant NOV41b was assessed using the primer-probe 

sets Agl361 and Ag2953, described in Tables AJA and AJB. Results of the RTQ-PCR runs 
are shown in Tables AJC, AJD and AJE. 



Table AJA. Probe Name Agl361 



Primers 


Sequences 


Length 


Staxt 
Position 


SEQ ID NO: 


Forward 


5 1 -ctggtcaggtacctggatgtta-3 1 


22 


1438 


1141 


Probe 


TET-5 » -tccatcaatgaagagcttcatattcg-3 ' - 
TAMRA 


26 


1480 


1142 


Reverse 


5 1 -cagcctttaagtgatccatcag-3 ' 


22 


1507 


1143 



1 0 Table AJB. Probe Name Ag2953 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ■ -ctggtcaggtacctggatgtta-3 1 


22 


1438 


1144 


Probe 


TET-5 1 -tccatcaatgaagagcttcatattcg-3 1 - 
TAMRA 


26 


1480 


1145 


Reverse 


5 1 -cagcctttaagtgatccatcag-3 ' 


22 


1507 


1146 



Table AJC. Panel 13D 



Tissue Name 


Rel. Exp.(%) Agl361, 
Run 152953143 


Tissue Name 


Rel.Exp.(%)Agl361, 
Run 152953143 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


2.1 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A49S 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


1.6 


Renal ca.UO-31 


0.0 


Pituitary gland 


0.1 


Renal ca.TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.2 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.7 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


1.0 


Lung (fetal) 


0.0 
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Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.2 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-l 18-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.1 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-1 9 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.*(pl.ef) 
MCr-/ 


02 


glioma SF-295 


0.0 


Breast ca.* (pl.et) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.et) 
T47D 


0.0 


Heart 


u.u 


Breast ca. Jd 1 -54y 


U.U 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.5 


Thymus 


0.4 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.7 


Lymph node 


0.0 


Ovarian ca. OVCAR- 

Q 
O 


0.0 


Colorectal 


0.0 


w Vallall La. IVjavv^ V - 

l 


0.0 


Stomach 


100.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca.SW480 


0.0 


Placenta 


0.0 


Colon ca.* 


0.0 


Prostate 


0.0 
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SW620(SW480 met) 








Colon ca. HT29 


0.0 


Prostate ca.* (bone 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


2.1 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.4 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


vjasinc ca. ^iiver met ) 
NCI-N87 


0.0 


Melanoma Ml 4 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


12.8 


Adipose 


0.0 



Table AJD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Agl361,Run 
152953321 


Tissue Name 


Rel. Exp.(%) 
Agl361, Run 
152953321 


Normal Colon 


0.2 


Kidney Margin 
8120608 


2.1 


CC WelltoModDiff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.5 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


2.6 


CC Gr.2 rectosigmoid 
(OD03868) 


1.3 


Kidney Cancer 
9010320 


0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


2.9 


CC ModDiff(ODO3920) 


0.6 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.2 


CC Gr.2 ascend colon 
(OD03921) 


0.2 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 


0.0 
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064010 




CC from Partial 
Hepatectomy (ODO4309.) 
ivieis 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 


0.0 


Colon mets to lung 

[\JLJ\J i T e rJ 1 -U I J 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 


0.0 


Breast Cancer 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 
Metastasis 

(UJL)U40-O-U}) 


0.0 


Prostate Cancer 

[\JLJ\J L t I ZU-U I ) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 

\\JLs\jH 1 4A)-\JJ. ) 


0.1 


Breast Cancer 1024 


0.3 


Normal Lung 061010 


0.3 


Breast Cancer 

Q1 MY) 6 A 


0.0 


Lung Met to Muscle 


0.0 


Breast Margin 


0.0 


(OD04286) 


0.0 


r>reasi cancer 
A209073 


0.1 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


0.1 


Lung Margin (OD03126) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.1 


Liver Cancer 064003 


0.0 
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Lung Margin (OD04404) 


0.1 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.2 


Lune Marein COD04565^ 

uUUg XT1U1 Kill ^V/l^VTJ VJ 1 


0 0 

v/.v 






Lung Cancer (OD04237- 

on 


0.0 


Liver Tissue 6004-N 


0.0 


Lune Marein COD04237- 

l— lUllfe iviul gill y VyL/ U II.J / 

02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Marein (OD043 1 0") 


0 0 


INUillldl XJldUUwl 


\}.H 


Melanoma Mets to Lung 
(OD0432H 


0.0 


Bladder Cancer 1023 


0.1 


Lung Margin (OD04321) 


* 

0.0 


Bladder Cancer 
A302173 


0.1 


Normal Kidney 


23.3 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


0.7 


Bladder Normal 
Adjacent (OD047 18- 

VO) 


0.0 


Kidney Margin 
(OD04338) 


92 


Normal Ovary 


o.o 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


0.2 


Kidney Margin 
(OD04339) 


28.3 


Ovarian Cancer 
(OD04768-07^ 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


45.4 


Normal Stomach 


100.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


5.3 
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Kidney Margin 

//~\T\r\ A~t A Q\ 

(UDU4J4o) 


18.6 


Stomach Margin 


78.5 


Kidney Cancer 

(UDU4622-U1) 


0.0 


Gastric Cancer 


0.3 


Kidney Margin 
(OD04622-03) 


3.4 


Stomach Margin 

yUoUiV4 


31.6 


Kidney Cancer 


0.0 


Gastric Cancer 


02 


Kidney Margin 
(OD04450-03) 


25.7 


Stomach Margin 
9060396 


29.5 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


4.5 



Table AJE. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Agl361, 

Run 
152953376 


Rel. 
Exp.(%) 
Ag2953, 

Run 
164306345 


Tissue Name 


Rel. 
Exp.(%) 
Agl361, 

Run 
152953376 


ReL 
Exp.(%) 
Ag2953, 

Run 
164306345 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


0.0 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVECIFN 
gamma 


0.0 


0.0 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 


Secondary Thl rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


0.0 


0.0 


Primary Thl act 


0.0 


0.0 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL-. 


0.0 


0.0 
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lbeta 






Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
IL lbeta 


0.0 


0.0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


0.0 


0.0 


Primarv Trl rest 

M. 1 11J1U1 T J. 1 X « Vijl 


0.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


00 


0.0 


CD45RACD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC rest 


0.0 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL- lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.1 


0.2 


Secondary CD8 
lymphocyte act 


0 0 


0 0 


KU-812 
(Basophil) rest 


0 0 


0 0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


0.0 


CCD1106 

(Keratinocytes) 

none 


0.0 


0.0 


LAK cells rest 


0.0 


0.0 


CCD1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


LAK Ceils LL-z 


A A 

u.o 


u.u 


Liver cirrhosis 


0.0 


0.0 


LAK cells IL-2+IL- 


0.0 


0.0 


Lupus kidney 


1.2 


0.8 


T AT/ TT 

luAJs. ceiis LL- 
2+IFN gamma 


0.0 


0.0 


NCI-H292none 


0.2 


0.0 


LAK cells 
IL-18 


0.0 


0.0 


NCI-H292 IL-4 


0.0 


0.0 


PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-9 


0.2 


0.0 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292IL-13 


0.0 


0.0 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 IFN 
gamma 


0.0 


0.0 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 
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Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


0.0 


0.2 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


0.0 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 
IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
EL-13 


0.0 


0.0 


B lymphocytes 
PWM 


0.0 


0.0 


Lung fibroblast 
IFN gamma 


OX) 


0.0 


B lymphocytes 
CD40LandIL~4 


A A 

u.u 


a a 
u.u 


Dermal fibroblast 
CCD 1070 rest 


A A 
U.U 


A A 
U.U 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD1070 TNF 
alpha 


0.0 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD1070 IL-1 
beta 


0.0 


0.0 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 

TT7KT 

IFN gamma 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


Dermal fibroblast 
II -4 


0.0 


0.0 


Dendritic cells anti- 
CD40 


0.0 


0.0 


IBD Colitis 2 


0.0 


0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


0.0 


0.0 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 


0.0 


Macrophages rest 


0.0 


0.0 


Lung 


0.0 


0.0 


Macrophages LPS 


0.0 


0.0 


Thymus 


100.0 


100.0 


HUVEC none 


0.0 


0.0 


Kidney 


0.0 


0.2 


HUVEC starved 


0.0 


0.0 









Panel 1.3D Summary: Agl361 Expression of the NOV41a gene is restricted to 
stomach (CT value = 29.9) and kidney (CT value « 32.9) tissue. This observation is consistent 
with the identification of this gene as a sodium/hydrogen ion exchanger because the function 
5 of both of these tissues requires sodium/hydrogen ion exchange activity. The inhibition of the 
NOV41 A protein activity, through the use of antibodies or small molecule drugs, might be of 
use in the treatment of kidney or gastric diseases related to the function of a sodium/hydrogen 
ion exchanger. For example, the activity of this gene may be related to over-production of 
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stomach acid leading to acid reflux disease or peptic ulcer. Results from a second experiment 
with the probe and primer set Ag2953 are not included. The amp plot indicates that there is a 
potential problem in one of the sample wells. 

Panel 2D Summary: Agl361 Consistent with what was observed in Panei 1.3D, 

. 5 expression of the NOV41a gene in panel 2D is restricted to both normal kidney and stomach 
adjacent to tumor tissue. Interestingly, expression of the gene is absent in 4/4 gastric tumors 
and 10/10 kidney cancers when compared to the normal adjacent tissue controls. Thus, the 
expression of this gene appears to be a consistent trait of the non-neoplastic kidney and 
stomach. Therefore the absence of expression of this gene could be used as a diagnostic 

10 marker for kidney or gastric cancer. In addition, the replacement of this gene, potentially 
through the direct application of the protein or using gene replacement therapy, could be of 
use in the treatment of kidney or gastric cancer. Na+/H+ exchangers have previously been 
implicated in modulation of cellular adhesion and tumor invasion (Refs. 1 and 2). 
References: 

15 1 . Denker S.P., Huang D.C., Orlowski J., Furthmayr H., Barber D.L. (2000) Direct 

binding of the Na~H exchanger NHE1 to ERM proteins regulates the cortical cytoskeleton 
and cell shape independently of H(+) translocation. Mol. Cell 6: 1425-1436. 

The association of actin filaments with the plasma membrane maintains cell shape and 
adhesion. Here, we show that the plasma membrane ion exchanger NHE1 acts as an anchor for 

20 actin filaments to control the integrity of the cortical cytoskeleton. This occurs through a 
previously unrecognized structural link between NHE1 and the actin binding proteins ezrin, 
radixin, and moesin (ERM). NHE1 and ERM proteins associate directly and colocalize in 
lamellipodia. Fibroblasts expressing NHE1 with mutations that disrupt ERM binding, but not 
ion translocation, have impaired organization of focal adhesions and actin stress fibers, and an 

25 irregular cell shape. We propose a structural role for NHE1 in regulating the cortical 
cytoskeleton that is independent of its function as an ion exchanger. 
PMID: 11163215 

2. Reshkin S.J., Bellizzi A., Albarani V., Guerra L., Tommasino M., Paradiso A., 
Casavola V. (2000) Phosphoinositide 3-kinase is involved in the tumor-specific activation of 
30 human breast cancer cell Na(+)/H(+) exchange, motility, and invasion induced by serum 
deprivation. J. Biol. Chem. 275: 5361-5369. 

Whereas the tumor acidic extracellular pH plays a crucial role in the invasive process, 
the mechanism(s) behind this acidification, especially in low nutrient conditions, are unclear. 
The regulation of the Na(+)/H(+) exchanger (NHE) and invasion by serum deprivation were 
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studied in a series of breast epithelial cell lines representing progression from non-tumor to 
highly metastatic cells. Whereas serum deprivation reduced lactate production in all three cells 
lines, it inhibited NHE activity in the non-tumor cells and stimulated it in the tumor cells with 
a larger stimulation in the metastatic cells. The stimulation of NHE in the tumor cell lines was 
5 the result of an increased affinity of the internal H(+) regulatory site of the NHE without 

changes in sodium kinetics or expression. Serum deprivation conferred increased cell motility 
and invasive ability that were abrogated by specific inhibition of the NHE. Inhibition of 
phosphoinositide 3-kinase by overexpression of a dominant-negative mutant or wortmannin 
incubation inhibited NHE activity and invasion in serum replete conditions while potentiating 
10 the serum deprivation-dependent activation of the NHE and invasion. These results indicate 
that the up-regulation of the NHE by a phosphoinositide 3-kinase-dependent mechanism plays 
an essential role in increased tumor cell invasion induced by serum deprivation. 
PMID: 10681510 

Panel 4D Summary: Agl361/Ag2953 Two experiments with the same probe and 
1 5 primer sets produce results that are in excellent agreement. The NOV4 1 a transcript is 
expressed in the thymus in Panel 4D (CT = 28.6), but not in Panel 1.3D (CT = 38). The 
NOV41 A gene encodes a putative ion exchange molecule and may therefore be important in 
signal transduction in the thymus. Antibodies against the protein encoded for by the NOV41 A 
gene may be used to identify thymic tissue. Additionally, small molecule or antibody 
20 therapeutics designed against this putative ion exchanger could disrupt T cell development in 
the thymus and serve an immunosuppresive function that could be important for tissue 
transplant. 

NOV42a and NOV42b 

Expression of gene NOV42a and variant NO V42b was assessed using the primer-probe 
25 set Ag3002, described in Table AKA. Results of the RTQ-PCR runs are shown in Tables AKB 
and AKC. 



Table AKA. Probe Name Ag3002 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 ■ -agaccctccatgtggtcatt-3 1 


20 


1188 


1147 


Probe 


TET-5 1 -tcacaggaacagctacaaagaaccca-3 * - 
TAMRA 


26 


1211 


1148 


Reverse 


5 ' -caggaccatctggagaagct-3 1 


20 


1245 


1149 
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Table AKB. Panel 1. 3D 



Tissue Name 


Rel. Exp.(%) Ag3002, 


Tissue Name 


Rel. Exp.(%) Ag3002 9 

Run 1£7Qft5704 


JUlVCr aUCUUCdi wUlUilld. 


0 0 
u.u 


JVlUUCjr {ICLalj 


0 0 

U.U 


r ailUrCclo 


ft ft 
u.u 


fvCIlal La. / Ov v 


0 ft 
u.u 


r all^ICallL- La. v^Ar Ai\ 

2 


0.0 


Renal ca. A498 


0.0 


AHrf»na1 crlnnH 
/\UICIlai gldilU 


4 R 


Renal ra RYF 


ft ft 
u.u 


Tfivrniri 


ft ft 


JVC 1 la I ta. /Vv^ril n 


ft ft 
u.u 


Oaiivaljr giaiiu 


3 ft 


ixClldl v^a. lj w-j i 


ft ft 
u.u 


Pituitary gland 


0.0 


Renal ca.TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


4.3 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


14.1 


Lung 


0.0 


Brain (hippocampus) 


6.7 


Lung (fetal) 


4.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell)A549 


0.0 


giio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


4.0 


Lung ca. (non-s.cell) 
HOP-62 


0 0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


3.2 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


00 


glioma SNB-19 


0.0 


Mammary gland 


100.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


29.9 


Heart 


15.7 


Breast ca. BT-549 


0.0 
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Skeletal muscle (fetal) 


6.5 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


20.6 


Ovary 


12.5 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


13.7 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


71.7 


Ovarian ca. OVCAR- 
5 


21.6 


Lymph node 


24.8 


Ovarian ca. OVCAR- 

e 
o 


15.2 


Colorectal 


3.8 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 

V "J 


0.0 


Olllctll IIILCMIIIC 


JJ.Z 


uterus 


z/.u 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


4.2 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 

mP t\pp i 
meijri^o 


0.0 


Colon ca.HCT-1 16 


2.4 


Testis 


7.7 


Colon ca. CaCo-2 


4.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


vjastric ca. ^iiver met; 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


13.8 



Table AKC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3002,Run 
164043126 


Tissue Name 


Rel. Exp.(%) 
Ag3002, Run 
164043126 


Secondary Thl act 


10.2 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


o.o 
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Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0-0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 

« I ii ill 1 1 ■ TT 11 j 

TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


14.2 


Bronchial epithelium 

f f tx TT"' 1 1_ ■ YT 1 L a. 

TNFalpha + IL 1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


6.2 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


/ll \ 0 1 _ 1 _A A 

CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest . 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 

A IVXrV/ lUliUIIIjr Is III 


0.0 


2ryThl/Th2/Trl_anti- 
CD95 CHI 1 


0.0 


CCD1 106 (Keratinocytes) 

nnnp 
ilUllv 


0.0 


LAK cells rest 


0.0 


fpni prat i nor* vrfptA 

TNFalpha + IL-lbeta 


0.0 


LAK celk TL-2 


0 0 


T ivpi* /*irrrif\cic 

L-/1VCI l/ll I Ill/Ma 


0 ft 


LAK cells IL-2+IL- 1 2 


0 0 


T umiQ IciHnpv 


in ft 

1 V.O 


LAK cells IL-2+IFN 
gamma 


23.3 


NCI-H292 none 


10.8 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


oi 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Wav MLR 3 dav 


0 0 




o ft 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAECTNF alpha + IL-1 
beta 


0.0 


PBMC rest 


30.6 


Lung fibroblast none 


0.0 


PBMC PWM 


24.8 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


13.5 


Lung fibroblast IL-4 


0.0 
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Kamos (d ceiij none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
• • 
lonomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


100.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 


0.0 


Dermal fibroblast 
CCD1 070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070TNF alpha 


0.0 


PMA/ionomvcin 


99.3 


Dermal fibroblast 
pmirnfiTT i koto 


0.0 


Dendritic cells none 


0.0 


lyermai HDroDiast LriN 
gamma 


0.0 


Dendritic cells LPS 


22.4 


Dermal fibroblast IL-4 


0 0 


Dendritic cells anti- 
CD40 


36.3 


IBD Colitis 2 


0.0 


Monocytes rest 


76.8 


IBD Crohn's 


22.2 


Monocytes LPS 


0.0 


Colon 


67.8 


Macrophages rest 


0.0 


Lung 


22.7 


Macrophages LPS 


0.0 


Thymus 


. 12.2 


HUVEC none 


0.0 {Kidney 


0.0 


HUVEC starved 


0.0 





Panel 13D Summary: Ag3002 Significant expression of the NOV42a gene is limited 
to the mammary gland and the spleen (CTs=33-34). Thus, expression of this gene could be 
used to differentiate these samples from other samples on this panel and as a marker for these 
tissues. 

Panel 4D Summary: Ag3002 Significant expression of the NOV42a gene is limited to 
activated B cells, an eosinophil cell line, and monocytes (CTs=33-35). Thus, this transcript 
could be. used as a marker for phagocytic cell types. 

NOV43 

Expression of gene NOV43 was assessed using the primer-probe set Ag2987, 
described in Table ALA. Results of the RTQ-PCR runs are shown in Tables ALB, ALC, ALD 
and ALE. 



Table ALA. Probe Name Ag2987 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -ctacctcaccatctgctttctg-3 ' 


22 jBSO 


1150 


Probe 


TET-5 * -tttctcaggactgccagctcttgatg-3 ' - 


26 |883 


1151 



1109 



WO 02/068649 



PCT/US02/02785 



(tamra j 




Reversej5 ■ - tccatatcttgtaggccacact-3 ' 


(22 


J91S 


1152 



Table ALB. CNS_neurodegeneration_yl.O 



Tissue Name 


Rel Exn (%\ Ap2987 
Run 211008970 


Tissue Name 


Run 211008970 


AD 1 Hippo 


31.2 


Control (Path) 3 
Temporal Ctx 


14.7 


AD 2 Hippo 


29.9 


Control (Path) 4 
Temporal Ctx 


43.8 


AD 3 Hippo 


37.4 


AD 1 Occipital Ctx 


44.1 


AD 4 Hippo 


10.8 


(Missing) 


0.0 


AD 5 hinno 


87 7 




77 7 


AD 6 Hippo 


85.3 


AD 4 Occipital Ctx 


37.4 


Control z Hippo 


14.5 


AD 5 Occipital Ctx 


17.0 


Control 4 Hippo 


13.4 


AD 6 Occipital Ctx 


12.5 


Control (Path) 3 
Hippo 


20.7 


Control 1 Occipital 
Ctx 


9.8 


AD 1 Temporal Ctx 


49.0 


Control 2 Occipital 
Ctx 


14.6 


AD 2 Temporal Ctx 


35.6 


Control 3 Occipital 
Ctx 


29.7 


AD 3 Temporal Ctx 


29.9 . 


Control 4 Occipital 
Ctx 


9.4 


AD 4 Temporal Ctx 


31.0 


Control (Path) 1 
Occipital Ctx 


44.4 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


13.2 


AD 5 SupTemporal 
Ctx 


64.2 


Control (Path) 3 
Occipital Ctx 


5.8 


AD 6 Inf Temporal 
Ctx 


69.3 


Control (Path) 4 
Occipital Ctx 


29.9 


AHA Cnn Tflm«Ai«l 

ajj 0 oup 1 emporai 

Ctx 


96.6 


Control 1 Parietal 
Ctx 


19.3 


Control 1 Temporal 
Ctx 


17.0 


Control 2 Parietal 
Ctx 


67.8 


Control 2 Temporal 
Ctx 


10.6 


Control 3 Parietal 
Ctx 


23.7 


Control 3 Temporal 
Ctx 


26.4 


Control (Path) 1 
Parietal Ctx 


23.0 


Control 4 Temporal 
Ctx 


11.0 


Control (Path) 2 
Parietal Ctx 


36.1 


Control (Path) 1 
Temporal Ctx 


35.6 


Control (Path) 3 
Parietal Ctx 


16.2 
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Control (Path) 2 j 
Temporal Ctx j 


35.4 


Control (Path) 4 
Parietal Ctx 


45.4 


Table ALC. Panel L3D 


Tissue Name 


ReI.Exp.(%)Ag2987,j „ 

Run 166232814 Tissue Name 


Uol 17vn tO/ \ An1A07 

Kei. JLXp.(To) AgZyo7, 
Run 166232814 


Liver adenocarcinoma 


5.9 {Kidney (fetal) 




Pancreas 


6.4 jRenal ca. 786-0 




Pancreatic ca. CAP AN 
2 


0.0 Renal ca. A498 


13.6 


Adrenal gland 


18.7 [Renal ca. RXF 393 


la. I 


Thyroid 


5.6 |Renai ca. ACHN 


0.0 


oaiivary giana 


40.9 (Renal ca.UO-31 


0.0 


Pituitary gland 


23.5 |Renalca.TK-10 


5.4 


Brain (fetal) 


67.4 


Liver 


0.0 


Brain (whole) 


54. / 


Liver (fetal) 


4.8 


Brain (amygdala) 


39.5 


Liver ca. 

(nepatoblast) HepG2 


7.0 


Brain (cerebellum) 


i An n 


Lung 


0.0 


Brain (hippocampus) 


21.8 


Lung (fetal) 


2.7 


Brain (substantia nigra) 


16.6 


Lung ca. (small cell) 

T Y 1 


0.0 


Brain (thalamus) 


96.6 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


39.8 


Lung ca. (s.cell var.) 
SHP-77 


10.3 


Spinal cord 


41.2 


Lung ca. (large 
cell)NCI-H460 


13.0 


elio/astro U87-MG 


5.6 


Lung ca. (non-sm. 
cell) A549 


0.0 


elio/astro U-l 1 8-MG 


51.8 


Lung ca. (non-s.cell) 
NCI-H23 


6.4 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


10.4 


neuro*; met SK-N-AS 


8.4 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


4.3 


Lung ca. (squam.) 
SW900 


34.2 


astrocytoma SNB-75 


10.4 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


28.5 


Mammary gland 


0.0 


glioma U251 


10.2 


Breast ca.* (pl.ef) 
MCF-7 


13.0 j 


glioma SF-295 


76.8 


Breast ca.* (pl.ef) 


0.0 
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MDA-MB-231 




Heart (fetal) 


5.1 


Rreast ea * (n\ ef\ 

T47D 


0.0 


Heart 


10 5 


Brea«!t ca BT-549 


15.8 


Skeletal muscle (fetal) 


10.8 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


15.7 


Ovary 


7.3 


Bone marrow 


5.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


5.8 


Spleen 


11.3 


Ovarian ca. OVCAR- 
5 


38.2 


Lymph node 


42.3 


Ovarian ca. OVCAR- 

o 


31.4 


Colorectal 


8.4 


Ovarian ca. IGROV- 
i 


10.2 


Stomach 


10.8 


Ovarian ca.* (ascites) 
SK-OV-3 


54.0 


Small intf*Qtinp 




T Tt/*n i c 
yj ici ua 


7 


Colon ca. SW480 


0.0 


Placenta 


11.7 


Colon ca.* 
SW620(SW480met) 


0.0 


Prostate 


18.9 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 

1 11CI )T\-/ D 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


9.1 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


4.8 


Melanoma* (met) 
Hs688(B).T 


0.0 


uoion ca. ricc-zyyo 


A A 
U.U 


Melanoma UACC-oz 


1 1.9 


VJdoll La. \1IVC1 illdj 

NCI-N87 


31.2 


Melanoma Nil 4 


0.0 


Bladder 


25.2 


Melanoma LOX 
IMVI 


5.2 


Trachea 


10.6 


Melanoma* (met) 
SK-MEL-5 


7.0 


Kidney 


0.0 


Adipose 


27.4 



Table ALD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2987,Run 
164314632 


Tissue Name 


Rel. Exp.(%) 
Ag2987,Run 
164314632 


Secondary Thl act 


8.1 


HUVEC IL-lbeta 


1.9 


Secondary Th2 act 


10.3 


HUVEC EFN gamma 


17.4 
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Secondary Trl act 


2.2 


HUVEC TNF alpha + IFN 


9.6 


Secondary Thl rest 


8.8 


j HUVEC TNF alpha + IL4 


4.1 


SecnnHarv Xh9 rpct 
uww^/ituoi y xiijL i Col 


inn 


HUVEC IL-ll 


9.3 


Secondary Trl rest 


25.0 


Lung Microvascular EC 
none 


31.9 


Primary Thl act 


4.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


22.8 


Primary Th2 act 


1.9 


Microvascular Dermal EC 
none 


25.9 


Primary Trl act 


4.5 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


7.3 


Primary Thl rest 


40.1 


Bronchial epithelium 
TrJFalpha + ILlbeta 


33.4 


Primary Th2 rest 


39.5 


Small airway epithelium 
none 


10.6 . 


Primary Trl rest 


74.7 


Small airway epithelium 
TNFalpha + IL-1 beta 


77.4 


JM\J\ K^LJH 

lymphocyte act 


0.0 


Coronery artery SMC rest 


15.0 


y^LJ^r JIxw \^LJ L r 

lymphocyte act 


15.4 


Coronery artery SMC 
TNFalpha -ML-lbeta 


3.0 


lympnocyie acx 


D.l 


Astrocytes rest 


4.6 


Secondary CD8 


16.3 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 

IvrnrNn onvrt'f* apt 
i j lJipilUL'jr IC aUL 


1.8 

— — — — . 


KU-812 (Basophil) rest 


1.7 


CD4 lymphocyte none 


10.0 


KU-812 (Basophil) 
r xvirv/ lununiycin 


20.9 


2ryThl/Th2/Trl anti- 
CD95CH11 


13.7 


CCD 11 06 (Keratinocytes) 


8.4 


LAK cells rest 


33.7 


1 1 uo ^rvcraiinocyies ) 
TNFalpha + IL-lbeta 


7.9 


LAK cells IL-2 


14.8 


TJvcr rttrhrvcic 


1 O £ 
lo.O 


LAK cells IL-2+IL-12 


14.9 




15.5 


LAK cells EL-2+IFN 
gamma 


28.9 


NCI-H292 none 


15.0 


LAK cells IL-2+IL-18 


14.4 j 


NCI-H292 IL-4 


4.6 


LAK cells 
PMA/ionomycin 




VT/^T T TO no tt n 

NC1-H292 IL-9 


132 


NK Cells IL-2 rest 


11.3 


NCI-H292 IL-13 


4.7 | 


Two Way MLR 3 day 


32.3 


NCI-H292IFN gamma 


2.8 


Two Way MLR 5 day 


2.4 ] 


HPAEC none 


8.0 


Two Way MLR 7 day 


0.0 j 


HPAEC TNF alpha + EL-1 
seta 


10.2 
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i uiviu^ rcbi 




Lung iiDrouidbi none 


3^ fx 


PBMC PWM 


18.8 


Lung fibroblast TNF alpha 

* 1.L- 1 DC Id 


16.3 


PBMC PHA-L 


20.2 


Lung fibroblast IL-4 


65.1 


Ramos (B cell) none 


1.4 


Lung fibroblast IL-9 


55.5 


Ramos (B cell) 
ionomycin 


5.6 


Lung fibroblast IL-13 


57.4 


B lymphocytes PWM 


19.2 


Lung fibroblast IFN 
gamma 


100.0 


B lymphocytes CD40L 

1 TT A 

and IL-4 


11.7 


Dermal fibroblast 
CCD 1070 rest 


9.7 


EOL-1 dbcAMP 


59.9 


Dermal fibroblast 
CCD1070 TNF alpha 


29.7 


EOL-1 dbcAMP 

P A /i n n nm vp i n 


11.9 


Dermal fibroblast 


3.2 


Dendritic cells none 


48.6 


nprmal -fitvrrkhlocr TT7KT 
UCnilal IlDrODlaSL IJrlN 

gamma 


11.4 


Dendritic cells LPS 


17.7 


Dermal fibroblast IL-4 


24.3 


Dendritic cells anti- 
CD40 


29.5 


IBD Colitis 2 


6.7 


Monocytes rest 


26.4 


IBD Crohn's 


4.5 


Monocytes LPS 


19.5 


Colon 


20.4 


Macrophages rest 


28.1 


Lung 


14.8 


Macrophages LPS 


8.9 


Thymus i 


52.5 


HUVEC none 


9.9 


Kidney 


74.7 


HUVEC starved 


21.2 







Table ALE. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) Ag2987, 
Run 171670053 


Tissue Name 


Rel. Exp.(%) Ag2987, 
Run 171670053 


BA4 Control 


21.3 


BA17PSP 


16.8 


BA4 Control2 


15.2 


BA17PSP2 


4.4 


BA4 

Alzheimer's2 


27.5 


Sub Nigra Control 


39.8 


BA4 Parkinson's 


100.0 


Sub Nigra Control2 


13.9 


BA4 

Parkinson's2 


66.9 


Sub Nigra 
Alzheimer*s2 


34.6 


BA4 

Huntington's 


33.2 


Sub Nigra 
Parkinson's2 


51.4 


BA4 

Huntington's2 


31.4 


Sub Nigra 
Huntington's 


69.3 


BA4PSP 


3.9 


Sub Nigra 
Huntington's2 


22.7 


BA4 PSP2 


5.1 


Sub Nigra PSP2 


17.2 
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BA4 Depression 


31.6 


Sub Nigra 
Depression 


23.0 


BA4 

ivcpressionz 


23.3 


Sub Nigra 
Depression2 


17.1 


BA7 Control 


39.5 


Glob Palladus 
Control 


23.7 


BA7 Control2 


6.0 


Glob Palladus 
Control2 


6.3 


BA7 

/vizneimer sz 


43.2 


Glob Palladus 

A 1 1 * t 

Alzheimer's. 


17.6 


BA7 Parkinson's 


33.2 


Glob Palladus 

All* 1 f\ 

Alzheimer^2 


20.2 


U A7 

Parkinson f s2 


61.1 


Glob Palladus 
Parkinson's 


76.8 


RA7 
Dr\ / 

Huntington's 


51.4 


Glob Palladus 
x oTKinson sz 


31.6 


BA7 

Huntington's2 


69.3 


Glob Palladus PSP 


4.4 


BA7 PSP 




oiod rauaaus rorz 


7.8 


BA7PSP2 


39.8 


Glob Palladus 
Depression 


19.6 


BA7 Depression 


15.7 


Temp Pole Control 


25.7 


dav control 


27.4 


Temp Pole Control2 


15.5 


BA9 Control2 


28.5 


Temp Pole 
Alzheimer's 


11.7 


BA9 Alzheimer's 


8.8 


Temp Pole 
Alzheimer's2 


16.4 


BA9 

/\izneimer sz 


42.3 


Temp Pole 
Parkinson's 


34.9 ; 


BA9 Parkinson's 


86.5 


Temp Pole 
Parkinson's2 


38.7 


BA9 

rarKinson sz 


49.7 


Temp Pole 
Huntington's 


65.5 


RAO 

Huntington's 


45.1 


Temp Pole PSP 


3.1 


BA9 

Huntington's2 


40.1 


Temp Pole PSP2 


12.2 


BA9PSP 


20.4 


Temp Pole 
Depression2 


19.1 


BA9 PSP2 


0.0 


Cing Gyr Control 


52.1 


BA9 Depression 


14.9 


Cing Gyr Control2 


14.1 


BA9 

Depression2 


17.7 


Cing Gyr 
Alzheimer's 


13.1 


BA 17 Control 


64.2 ' 


Cing Gyr | 
AJzheimer's2 


60.3 
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BA17Control2 


22.5 


Cing Gyr Parkinson's 


66.4 


BA17 

Alzheimer^ 


52.1 


Cing Gyr 
Parkinson's2 


25.5 


BA17 
Parkinson's 


85.3 


Cing Gyr 
Huntington's 


85.3 


BA17 

Parkinson's2 


94.0 


Cing Gyr 
Huntington's2 


432 


BA17 

TT a? l. f 

Huntingtons 


70.7 


Cing Gyr PSP 


262 


BA17 

Huntington's2 


34.4 


Cing Gyr PSP2 


62 


BA17 
Depression 


46.7 


Cing Gyr Depression 


12.9 


BA17 

Depression2 


82.9 


Cing Gyr 
Depression2 


23.8 



CNS_neurodegeneration_vl.O Summary: Ag2987 The NOV43 gene exhibits 
significantly higher expression in the brains of Alzheimer's disease patients than normal 
controls. This is consistant with reports of increased purinoceptor expression in AD. Please see 
5 Panel 1 .3D for discussion of utility of this gene in the central nervous system. 

References: 

Moore D, Iritani S, Chambers J, Emson P. Immunohistochemical localization of the 
P2Y1 purinergic receptor in Alzheimer's disease. Neuroreport 2000 Nov 27;1 1(17):3799-803 

The biological actions of extracellular nucleotides are mediated by two distinct classes 
10 of P2 receptor, P2X and P2Y. The G protein-coupled P2Y receptors comprise five mammalian 
subtypes, P2Y(1-1 1). The P2Y1 subtype is expressed abundantly throughout the human brain 
and is specifically localized to neuronal structures. In the present study, the distribution of the 
P2Y1 receptor was investigated in Alzheimer's disease (AD) brains. In contrast to control 
human brain, the P2Y1 receptor was localized to a number of characteristic AD structures 
15 such as neurofibrillary tangles, neuritic plaques and neuropil threads. Immunoblot analysis 
showed that this specific immunostaining observed over tangles was not a result of cross- 
reactivity between the anti-P2Yl antiserum and abnormal tau protein, the major constituent of 
tangles. The significance of this altered P2Y1 cellular distribution in AD brains is at present 
unclear. 

20 Panel 1.3D Summary: Ag2987 The NOV43 gene, a purinoceptor homolog, exhibits 

highly brain preferential expression in this panel. Purinoceptors found in GDNF sensitive 
sensory neurons mediate nociceptor function. Therefore, agents that block the action of this 
receptor may have utility in treating pain, either as analgesics or in inhibiting the establishment 
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of chronic pain. In addition, adenosine plays a significant neuromodulator role in brain 
regions such as the hippocampus, cortex, basal ganglia, and thalamus. Thus, this purinoceptor 
is localized in a position to participate with the action of adenosine in these brain regions. The 
NOV43 gene product may also influence Ca2+ mobilization, a function performed by other 
purinoceptors. Ca2+ mobilization is an important component of the molecular process leading 
to neurotransmitter release, such as dopamine and glutamate. P2Y receptors have been shown 
to affect the release of dopamine, a critical neurotransmitter deficient in Parkinson's disease. 
P2 receptor agonists are known to induce secretion. Therefore, agents that modulate NOV43 
may be effective treatments for Parkinson's disease via effecting enhanced dopamine release. 
Furthermore, glutamate is the-main excitatory amino acid neurotransmitter. Glutamate exerts 
excitotoxic neuronal damage and death in a number of pathological conditions, including 
stroke. Therefore, agents that inhibit this gene prodcut are likely to affect glutamate release in 
the brain and the subsequent cytotoxic action of glutamate in these regions. The 
overexpression of this gene in the brains of Alzheimer's disease patients in the 
CNS_neurodegeneration_vl.O panel indicates that antagonists of this receptor may also have 
utility in countering the processes associated with this disease. 
References: 

Liu DM, Katnik C, Stafford M, Adams DJ.P2Y purinoceptor activation mobilizes 
intracellular Ca2+ and induces a membrane current in rat intracardiac neurones. J Physiol 
2000 Jul 15;526Pt 2:287-98 

1. The mobilization of Ca2+ by purinoceptor activation and the relative contributions 
of intra- and extracellular sources of Ca2+ were investigated using microfluorimetric 
measurements of fura-2 loaded in cultured neurones from rat intracardiac ganglia. 2. Reverse 
transcriptase-polymerase chain reaction (RT-PCR) revealed expression ofmRNA for the G 
protein-coupled P2Y2 and P2Y4 receptors. 3. Brief application of either 300 microM ATP or 
300 microM UTP caused transient increases in [Ca2+]i of 277 +/- 22 nM and 267 +/- 39 nM, 
respectively. Removal of external Ca2+ did not significantly reduce these [Ca2+]i responses. 
4. The order of purinoceptor agonist potency for [Ca2+]i increases was ATP = UTP > 2- 
MeSATP > ADP » adenosine, consistent with the profile for P2Y2 purinoceptors. ATP- and 
UTP-induced rises in [Ca2+]i were completely and reversibly blocked by 10 microM PPADS 
(a P2 purinoceptor antagonist) and partially inhibited by 100 microM suramin (a relatively 
non-specific purinoceptor antagonist). 5. In the presence of the endoplasmic reticulum Ca2+- 
ATPase inhibitor cyclopiazonic acid (10 microM) in Ca2+-free media, the [Ca2+]i responses 
evoked by ATP were progressively decreased and abolished. 6. ATP- and UTP-induced 
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[Ca2+]i rises were insensitive to pertussis toxin, caffeine (5 mM) and ryanodine (10 microM) 
but were significantly reduced by U-73122, a phospholipase C (PLC) inhibitor. 7. In fura-2- 
loaded cells, perforated patch whole-cell recordings show that ATP and UTP evoked slow 
outward currents at -60 mV, concomitant with the rise in [Ca2+]i, in approximately 30 % of 
5 rat intracardiac neurones. 8. In conclusion, these results suggest that in r intracardiac neurones, 
ATP binds to P2Y2 purinoceptors to transiently raise [Ca2+]i and activate an outward current. 
The signalling pathway appears to involve a PTX-insensitive G protein coupled to PLC 
generation of IP3 which triggers the release of Ca2+ from a ryanodine-insensitive Ca2+ 
store(s). 

10 Driessen B, Bultmann R, Jurna I, Baldauf ^Depression of C fiber-evoked activity by 

intrathecal^ administered reactive red 2 in rat thalamic neurons. Brain Res 1998 Jun 
15;796(l-2):284-90 

To investigate the possible role of spinal purinoceptors in nociception, the potent P2- 
purinoceptor antagonist reactive red 2 was studied in rats under urethane anesthesia in which 

1 5 nociceptive activity was elicited by electrical stimulation of afferent C fibers in the sural nerve 
and recorded from single neurons in the ventrobasal complex of the thalamus. Intrathecal (i.t.) 
application of reactive red 2 (6-200 micrograms) caused a dose-dependent reduction of the 
evoked activity in thalamic neurons. The estimated ED50 was 30 micrograms, and the 
maximum depression of nociceptive activity amounted to about 70% of the control activity at 

20 a dose of 100 micrograms. Morphine, administered i.t. at a maximally effective dose (80 
micrograms), inhibited the evoked nociceptive activity by only up to 55% of the control 
activity. An i.t. co-injection of reactive red 2 (100 micrograms) and morphine (80 micrograms) 
caused a maximum reduction of the evoked thalamic activity by up to 85% of the control 
activity, thus, exceeding significantly the effect elicited by either drug alone. Similarly, i.t co- 

25 injection of almost equipotent dosages of reactive red 2 (30 micrograms) and morphine (30 
micrograms) caused a maximum reduction of the evoked activity by up to 72% of the control 
activity, which again exceeded significantly the effect of either drug alone. The results suggest 
that in rats reactive red 2 exerts antinociception by blockade of P2-purinoceptors in the spiiial 
cord and, hence, support the idea that ATP may play an important role in spinal transmission 

30 of nociceptive signals. An activation of the spinal opioid system does not seem to contribute to 
the effect of reactive red 2 but might act additive or even synergistically with its 
antinociceptive action. 
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' Krugel U, Kittner H, Franke H, Illes P. Stimulation of P2 receptors in the ventral 
tegmental area enhances dopaminergic mechanisms in vivo. Neuropharmacology 2001 
Jun;40(8):1084-93 

It has been shown that endogenous adenosine 5 -triphosphate (ATP) as well as its 
exogenously applied structural analogue, 2-methylthio ATP (2-MeSATP), facilitate the release 
of dopamine from axon terminals in the rat nucleus accumbens (NAc) by activating ATP- 
sensitive P2 receptors. In the present study, reversed microdialysis of 2-MeSATP (10 microM, 
100 microM and 1 mM), or its microinjection (0.5, 5.0 and 50 pmol) into the ventral tegmental 
area (VTA), dose-dependently increased the local extracellular level of dopamine and the 
locomotion in the open field, respectively. These effects were abolished by the P2-receptor 
antagonist pyridoxalphosphate-6-azophenyl^4^iisulfonic acid (PPADS). When applied 
alone, the antagonist decreased the basal dopamine concentration, indicating that endogenous 
ATP controls the somatodendritic release of dopamine. Repeated microinjections of 2- 
MeSATP (5 pmol) once daily for 4 days led to a reproducible locomotor stimulation in the 
open field. Conditioned locomotion was induced by re-exposure to the novel environment on 
the seventh day. A challenge with amphetamine (1 mg/kg intraperitoneally) on the eighth day 
enhanced the locomotor activity in the 2-MeSATP-treated group in the sense of a cross- 
sensitisation, but failed to do so in the control group. Neurons in the VTA were heavily stained 
with antibodies developed against the P2Y(1) subtype of P2 receptors. Taken together, our 
data suggest that P2 receptors (probably of the P2Y(1) subtype) are involved in the initiation 
of somatodendritic dopamine release in the VTA and thereby may have a profound influence 
on sensitisation and reward-motivated behaviour. 

Fernandez-Alvarez J, Hillaire-Buys D, Loubatieres-Mariani MM, Gomis R, Petit P. P2 
receptor agonists stimulate insulin release from human pancreatic islets. Pancreas 2001 
Jan;22(l):69-71 

Although P2 receptors for adenosine S'-triphosphate (ATP) and/or adenosine 5'- 
diphosphate (ADP) have been characterized in mammalian pancreatic beta cells, no evidence 
for an insulin-secreting effect of P2 receptor agonists has been reported as yet in humans. The 
present study aimed at investigating whether P2 receptor agonists could stimulate insulin 
release in human pancreatic islets obtained from brain-dead organ donors. Experiments were 
performed using different glucose concentrations and insulin was measured by 
radioimmunoassay. When the glucose concentration (8.3 mmol/L) was slightly stimulating for 
insulin release, alpha,beta-methylene ATP (200 micromol/L) and ADPbetaS (50 micromol/L) 
similarly amplified insulin secretion: both compounds induced a threefold increase in insulin 
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response. In the presence of a nonstimulating glucose concentration (3.0 mmol/L), only 
alpha,beta-methylene ATP could induce a significant .1.4-fold increase in insulin release, 
ADPbetaS being completely ineffective. These results give evidence that P2 receptor agonists 
are effective in stimulating insulin release in humans, the effect of the P2Y agonist being 
5 essentially glucose dependent 

Panel 4D Summary: Ag 2987 The NOV43 transcript is expressed in lung fibroblasts 
after treatment with IFNg, IL-4, IL-9 other cytokines (CTs=32). This gene is also expressed in 
small airway epithelium treated with the inflammatory cytokines TNF-a and IL-1. This 
expression profile suggests a role for this transcript in lung inflammation. Low but detectable 

1 0 expression of this transcript is found also in dermal fibroblasts, primary CD4 T cells, EOL and 
antigen presenting cells. 

This transcript encodes for a PY2receptor like molecule. Expression of this receptor 
has been reported in several cell types including eosinophils (Ref.l) and lung epithelium 
where it has been shown to mediate Cl(-) secretion via an increase in intracellular calcium 

1 5 concentration (ref. 2). Thus, the NOV43 gene product may influence Ca2+ mobilization, a 
funtion performed by other purinoceptors, and therefore lead to activation or secretion 
processes. As suggested by ref.3, the release of nucleotides by damaged cells in inflammation 
can lead to the activation of purinoreceptors and of other cells present in the inflammed 
tissues, including lung epithelium in asthma, COPD, emphysema and the skin in psoriasis or 

20 other skin inflammatory diseases. This release can also result in the activation of antigen 

presenting cells and T cells, which contribute to the perpetuation of the inflammatory process. 
Therefore, modulation of the expression or activity of the protein encoded by the NOV43 gene 
may prevent or reduce the inflammation process in all of these diseases and other autoimmune 
diseases, including as inflammatory bowel diseases. 

25 References: 

Idzko M, Dichmann S, Panther E, Ferrari D, Herouy Y, Virchow C Jr, Luttmann W, Di 
Virgilio F, Norgauer J. 

Functional characterization of P2Y and P2X receptors in human eosinophils. J Cell 
Physiol 2001 Sep;188(3):329-36 

30 Activation of purinoceptor by ATP induces in eosinophils various cell responses 

including calcium transients, actin polymerization, production of reactive oxygen metabolites, 
CD1 lb-expression, and chemotaxis. Here, the effect of ion channel-gated P2X and/or G 
protein-coupled P2Y receptor agonists ATP, ATPgammaS, alpha,beta-meATP, 2-MeSATP, 
BzATP, ADP, CTP, and UTP on the intracellular Ca(2+)-mobilization, actin polymerization, 
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production of reactive oxygen metabolites, CD1 lb expression and chemotaxis of human 
eosinophils were measured and the biological activity was analyzed. Although all tested 
nucleotides were able to induce all these cell responses, the biological activity of the analyzed 
nucleotides were distinct. Agonists of the G protein-coupled P2Y receptors such as 2- 
MeSATP, UTP, and ADP have a higher biological activity for production of reactive oxygen 
metabolites, actin polymerization and chemotaxis in comparison to the ion channel-gated P2X 
agonists alphabeta-meATP, BzATP, and CTP. In contrast, P2Y and P2X agonist showed 
similar potencies in respect to intracellular calcium transient and CD1 lb up-regulation. This 
conclusion was further supported by experiments with receptor iso-type antagonist KN62, 
EGTA or with the G(i) protein-inactivating pertussis toxin. These findings indicate 
participation of different purinorecptors in the regulation of cell responses in eosinophils. 

Laubinger W, Streubel G, Reiser G. Physiological evidence for a P2Y receptor 
responsive to diadenosine polyphosphates in human lung via Ca(2+) release studies in 
bronchial epithelial cells. Biochem Pharmacol 2001 Mar l;61(5):623-9 

P2Y(2) receptors that are activated by the extracellular nucleotides ATP or UTP 
mediate Cl(-) secretion via an increase in [Ca(2+)](i) (intracellular calcium concentration). 
Therefore, in the lung of patients suffering from cystic fibrosis, inhalation of aerosolized UTP 
offers a way to circumvent the defect in Cl(-) secretion by the cystic fibrosis transmembrane 
conductance regulator. A possible alternative for the relatively unstable UTP in inhalation 
therapy is the more resistant diadenosine tetraphosphate (Ap(4)A). In human and rat lung 
membranes, Ap(4)A binds to P2 receptor sites coupled to G proteins. Here, we showed that 
Ap(4)A caused an increase in [Ca(2+)](i) with an EC(50) of 17 microM in human bronchial 
epithelial cells (HBE1). The [Ca(2+)](i) rise evoked by ATP and UTP was completely, but 
that induced by Ap(4)A only partially, caused by release of Ca(2+) from internal stores. 
Moreover, the potency of Ap(4)A to mobilize Ca(2+) was lower than that of ATP and UTP 
(EC(50) 1.5 and 1.8 microM, respectively), and the maximal increase in [Ca(2+)](i) was 
considerably smaller than that after ATP or UTP. In accordance with our previous results 
providing evidence for a common binding site for various diadenosine polyphosphates in lung 
membranes, all Ap(n)A analogues tested (n = 3 to 6) caused a comparable [Ca(2+)](i) 
increase. Homologous or heterologous prestimulation largely diminished the increase in 
[Ca(2+)](i) found after a second pulse of either UTP or Ap(4)A. Although specific binding 
characteristics and functional responses of Ap(4)A on lung cells are in favor of a distinct 
receptor for Ap(4)A, the cross-talk between UTP and Ap(4)A in HBE1 cells and the only 
slight differences in Ca(2+) mobilization by ATP or UTP and Ap(4)A render it impossible at 
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this point to state unequivocally whether there exists a distinct P2Y receptor specific for 
diadenosine polyphosphates in lung epithelia or whether Ap(4)A activates one of the 
nucleotide receptors already described. 

Di Virgilio F, Chiozzi P, Ferrari D, Falzoni S, Sanz JM, Morelli A, Torboli M, 
5 Bolognesi G, Baricordi OR. Nucleotide receptors: an emerging family of regulatory molecules 
in blood cells. Blood 2001 Feb 1 ;97(3):587-600 

Nucleotides are emerging as an ubiquitous family of extracellular signaling molecules. 
It has been known for many years that adenosine diphosphate is a potent platelet aggregating 
factor, but it is now clear that virtually every circulating cell is responsive to nucleotides. 

1 0 Effects as different as proliferation or differentiation, chemotaxis, release of cytokines or 

lysosomal constituents, and generation of reactive oxygen or nitrogen species are elicited upon 
stimulation of blood cells with extracellular adenosine triphosphate (ATP). These effects are 
mediated through a specific class of plasma membrane receptors called purinergic P2 
receptors that, according to the molecular structure, are further subdivided into 2 subfamilies: 

15 P2Y and P2X. ATP and possibly other nucleotides are released from damaged cells or secreted 
via nonlytic mechanisms. Thus, during inflammation or vascular damage, nucleotides may 
provide an important mechanism involved in the activation of leukocytes and platelets. 
However, the cell physiology of these receptors is still at its dawn, and the precise function of 
the multiple P2X and P2Y receptor subtypes remains to be understood. 

20 Panel CNSJ Summary: Ag2987 The expression in this panel confirms expression of 

the NOV43 gene in the brain. Please see Panel 1 3D for discussion of utility of this gene in the 
central nervous system. 

NOV44 

Expression of gene NOV44 was assessed using the primer-probe sets Ag2988 and 
25 Ag2989, described in Tables AMA and AMB. Results of the RTQ-PCR runs are shown in 
Table AMC. 

Table AMA. Probe Name Ag2988 



Primers 


Sequences 


Length 


Staxt 
Position 


SEQ ID NO: 


Forward 


5 1 -ctggccaacctatcctttattg-3 ■ 


22 


195 


1153 


Probe 


TET-5 ' -tggctcctaaactcattgctgactca-3 ' - 
TAMRA 


26 


238 


1154 


Reverse 


5 ■ -agatggttctcccctcatacaa-3 ' 


22 


264 


1155 
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Table AMB. Probe Name Ag2989 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctggccaacctatcctttattg-3 ' 


22 


195 


1156 


Probe 


TET-5 1 -tggctcctaaactcattgctgactca-3 » - 
TAMRA 


26 


238 


1157 


Reverse 


5 1 -agatggttctcccctcatacaa-3 1 


22 


264 


1158 



Table AMC. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag2988,Run 
164523397 


Tissue Name 


Rel. Exp.(%) 
Ag2988,Run 
164523397 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC EFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 




Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lun£ Microvascular FC 
TNFalpha + IL-lbeta 


0.0 


Primary 1 nz act 


0.0 


Microvascular Dermal EC 
none 


0.0 


rnmary lrl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial eDithelium 

± m.^*m. 11U1 VL/lUlvll till A 

TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


ft ft 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0, 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CDS lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CDS 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 
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LAK cells rest 


0.0 


TNFalpha + IL-lbeta 


0.0 


T AK celk TT -2 






inn n 


LAK cells IT -2+II -12 


ft 0 






LAK cells TL-2+TFN 

jL^i&JTV WHO XXj £r 1 XJl 1^1 

gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Wav MT R 3 Hav 


0 0 

V/.V 


"NIPT-H909 TTTM an mm a 




Two Wav MT R 5 dav 

1 ™U » " *Xjf X.VXXjX\. •/ UttY 


0 0 


RPAFP nnnp 


i n n 


Two Way MLR 7 day 


0.0 


HPAFC TNF nlnha 4- TT -1 

beta 


0.0 


PBMC re<rt 


0 0 

\J»\J 


T l in cr TiHrohlacf" nrtnp 
JL/Uilg HUlUUiaaL I1UI1C 


ft ft 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 

4- TT -1 hf»ta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


5.3 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and 1L-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


JbUL-1 dbcAMP 
PMA/ionomvcin 


0.0 


Dermal fibroblast 


0.0 


Dendritic cells none 


0.0 


Formal fihrnKlflct FPTsJ 

gamma 


— — — 

0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


19.9 


Monocytes rest ' 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


10.2 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


24.5 







CNS_neurodegeneration_vl.O Summary: Ag2988/Ag2989 Expression of the 
NOV44 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
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Panel 1.3D Summary: Ag2988/Ag2989 Expression of the NOV44 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2988 Significant expression of this gene is detected in a liver 
cirrhosis sample (CT = 32.7). This gene encodes a putative GPCR; therefore, antibodies or 
5 small molecule therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In 
addition, antibodies to this putative GPCR could also be used for the diagnosis of liver 
cirrhosis. 

NOV45 

Expression of gene NOV45 was assessed using the primer-probe sets Ag2979, 
10 Ag2982, Ag2981 and Ag2984, described in Tables ANA, ANB, ANC and AND. Results of 
the RTQ-PCR runs are shown in Table ANE. 



Table ANA. Probe Name Ag2979 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tacttcttcgtgggctcctt-3 ' 


20 


827 


1159 


Probe 


TET-5' -aaggcagaacctgaagctggttctcc-3 ' - 
TAMRA 


26 


862 


1160 


Reverse 


5 1 -cattcacctcagtcgtgtcc-3 1 


20 


901 ; 


1161 



Table ANB. Probe Name Ag2982 



15 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tacttcttcgtgggctcctt-3 • 


20 


827 


1162 


Probe 


TET-5 ■ -aaggcagaacctgaagctggttctcc-3 » - 
TAMRA 


26 


862 


1163 


Reverse 


5 1 -cattcacctcagtcgtgtcc-3 1 


20 


901 


1164 




Table ANC. Probe* 


fame Ag2981 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 




5 1 -tacttcttcgtgggctcctt-3 * 


20 


827 


1165 


Probe 


TET-5 1 -aaggcagaacctgaagctggttctcc-3 » - 
TAMRA 


26 


662 


1166 


Reverse 


5 1 -cattcacctcagtcgtgtcc-3 1 


20 


901 


1167 




Table AND. Probe Name Ag2 


984 




Primers 


Sequences 


Length j 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -atcctccatcccatcttcaa-3 ' 


i° 1 


284 


1168 
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Probe 


TET-5 ' -cctcagccctgtgatgatgtttt.cct.-3 ' - j 

TAMRA j 26 ] 


307 


1169 


Reverse 


5 1 -cgcttagaaagctcaggctt-3 ' J20 


340 


1170 


Table ANE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2982 9 Run 
158603041 


Tissue Name 


Rel. Exp.(%) 
Ag2982,Run 
158603041 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.3 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


0\*\AJll\lal y l Hi. JTCdI 




t_tt TVPP TT11 


A A 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Hil act 


0.0 


Lung Microvascular EC 

i INTaipna 1 11^-1 DC Id 


19.8 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


lVlllTOSVaSUlar L/Cllllal Hv^ 

TNFalpha + IL-lbeta 


5.5 


Primary Thl rest 


0.0 


jDiuucuiai cpunciiuni 

TNFalpha + ILlbeta 


58.2 


Primary Th2 rest 


0.0 


QtTi5i11 q ir\A/nv pnitVi folium 
oiiiaii airway cpiuiciiuiii 

none 


3.0 


Primary Trl rest 


0.0 


.Small airwflv pnithplium 

TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


3.6 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronerv arferv SMi" 1 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.8 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95CH11 


2.4 


CCD1106 (Keratinocytes) 
none 


0.3 


LAK cells rest 


0.0 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


5.8 


LAK cells IL-2 


0.0 


Liver cirrhosis 


23.3 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


2.7 
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T A TV 1 1„ TT ^ i TT7X T 

LAK cells IL-2+IFN 


0.0 


NCI-H292 none 


0.0 


LAKcellsIL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 

P1VT A /intinm vrin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


0.0 


1 wo Way MLR 3 day 


4.8 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


5.7 


HPAEC TNF alpha + IL-l 
beta 


0.7 


TVT5 TV Af~* —4- 

rBML rest 


0.0 


Lung fibroblast none 


0.0 


PBMCPWM 


0.0 


Lung fibroblast TNF alpha 

I TT 1 I- 

+ IL-l beta 


1.8 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0 0 


T liner fiVirnrilact TT -1 ^ 


o n 

u.u 


B lvmnhocvtes PWM 




Lung fibroblast IFN 
gamma 




B lymphocytes CD40L 
andIL-4 


0 0 


Dermal fibroblast 
CCD1070rest 




EOL-1 dbcAMP 


0.7 


Dermal fibroblast 
CCD1070 TNF alpha 


4.1 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1 070 IL-l beta 


0.4 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cpIU T PS 


ft ft 


J-/ciiTlaI IlDrODlaSl 


u.u 


Dendritic cells anti- 

lllw wWllO CHILI 

CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


BBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag2979/Ag2982 Expression of the 

NOV45 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 13D Summary: Ag2981/Ag2984 Expression of the NOV45 gene is 

5 low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2982 Expression of the NOV45 gene is restricted to a few 

samples, with highest expression in small airway epithelium treated with TNF-alpha and IL-l 
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beta (CT=31.2). Significant expression is treated in bronchial epithelium and lung 
microvascular endothelial cells. Thus, expression of this gene could be used as a marker for 
activated epithelium. The expression in lung derived samples suggests that this protein may be 
involved in lung inflammatory disorders, including asthma and chronic obstructive pulmonary 
5 disorder. Results from a second experiment with the probe/primer set Ag2979 are not included 
because the amp plot indicates that there is a potential problem in one of the sample wells. 

NOV46a and NOV46b 

Expression of gene NOV46s and variant NOV46b was assessed using the primer-probe 
sets Ag2990 and Ag2991, described in Tables AOA and AOB. Results of the RTQ-PCR runs 
1 0 are shown in Tables AOC. Please note that variant NOV46B does not match the probe and 
primer set Ag2990. 



Table AOA. Probe Name Ag2990 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 ' -cggaactgaggagactctttg-3 1 


21 


59 


1171 


Probe 


TET-5* -tacaagcagaccttgagcctcacggt-3 ■ - 
TAMRA 


26 


81 


1172 


Reverse 


5 1 -gagcacaactgcgtttcct-3 1 


19 


140 


1173 



Table AOB. Probe Name Ag2991 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 - tggacagggaagtcttattttg-3 ' 


22 


742 


1174 


Probe 


TET-5 ' -tttcctgtccgctcttaacagcagtg-3 ' - 
TAMRA 


26 


785 


1175 


Reverse 


5 ' -agcccacgaagaagtaaatgat-3 1 


22 


819 


1176 



15 



Table AOC. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2990, 

Run 
164524407 


Rel. 
Exp.(%) 
Ag2991, 

Run 
164315033 


Tissue Name 


Rel. 
Exp.(%) 
Ag2990, 

Run 
164524407 


Rel. 
Exp.(%) 
Ag2991, 

Run 
164315033 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


0.0 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC EFN 
gamma 


0.0 


0.0 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 
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Secondary Thl rest 


0.0 


0.0 


HUVECTNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


0.0 


0.0 


Primary Thl act 


0.0 


0.0 


Lung 

Microvascular EC 
TNFalpha + IL- 
Ibeta 


3.1 


232 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


0.0 


11.6 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


12.3 


100.0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


u.u 


12.1 


Primary Trl rest 


0.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


117 
11./ 


no c. 


CD45RA CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC rest 


0.0 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


0.0 


KU-812 
(Basophil) rest 


Vf.U 


U.U 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


23.8 


5.0 

c 


2ry 

Thl/Th2/Trl anti- 
CD95CH11 


0.0 


0.0 

1 


CCD1106 

[Keratinocytes) 

none 


0.0 


0.0 


LAK cells rest 


0.0 


( 

0.0 


CCD1106 
[Keratinocytes) 
FNFalpha + IL- 


0.0 


10.7 
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lbeta 




LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


100.0 


18.0 


LAK cells IL-2+IL- 
12 


0.0 


0.0 


Lupus kidney 


0.0 


0.0 


T AT/" rt «1 1 « TT 

LAK. cells IL- 
2+IFN gamma 


0.0 


0.0 


NCI-H292 none 


0.0 


0.0 


TAT/ - TT 1 1 

LAK cells IL-2+ 
IL-18 


0.0 


0.0 


NCI-H292 IL-4 


0.0 


3.8 


lak cells 
PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-9 


0.0 


0.0 


NK Cells IL-2 rest 


27.5 


A A 

0.0 


NCI-H292 IL-13 


0.0 


0.0 ■ 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 IFN 
gamma 


0.0 


0.0 


Two Way MLR 5 
day 


2.1 


0.0 




0 0 

v.v/ 


0.0 


Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


l/.U 


0.0 


PBMC rest 


0.0 


0.0 


Lung fibroblast 
none 


0 0 


0.0 


PBMC PWM 


0.0 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


2.0 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 
IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-13 


0 0 


0.0 


B lymphocytes 
PWM 


0.0 


0.0 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


0.0 


Dermal fibroblast 
CCD1070rest 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


0.0 


4.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD1 070 IL-1 
beta 


0.0 


0.0 . 


uenuriuc ceils 
none 


0.0 


0.0 


uermai iiDrooiast 
IFN gamma 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


Dermal fibroblast 
IL-4 


7.0 


0.0 


Dendritic cells anti- 
CD^ 


0.0 


0.0 


IBD Colitis 2 


0.0 


6.2 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


0.0 


0.0 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 


0.0 
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Macrophages rest 


0.0 


0.0 


Lung 


0.0 


0.0 


Macrophages LPS 


0.0 


4.8 


Thymus 


0.0 


0.0 


HUVEC none 


0.0 


0.0 


Kidney 


0.0 


0.0 


HUVEC starved 


0.0 


0.0 




1 ! 



CNS_neurodegeneration_vl.O Summary: Ag2990/Ag2991 Expression of the 
NOV46a gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
Panel 1.3D Summary: Ag2990/Ag2991 Expression of the NOV46a gene is 
5 low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2990/Ag2991 Expression of the NOV46a gene is restricted to 
liver cirrhosis and TNFalpha + ILlbeta treated bronchial and small airway epithelium. This 
expression profile suggests that antibodies or small molecule therapeutics designed with the 
putative protein encoded by this gene could reduce or inhibit fibrosis that occurs in liver 
1 0 cirrhosis. In addition, antibodies to this putative protein product could also be used for the 

diagnosis of liver cirrhosis. In addition, the expression of this gene in tissues derived from the 
lung suggests that this gene product may be involved in pathological and inflammatory lung 
disorders that include chronic obstructive pulmonary disease, asthma, allergy and emphysema. 
A second experiment with Ag290 shows low/undetectable in all samples on this panel 
15 (CTs>35). (Data not shown.) 

NOV46d 

Expression of gene NOV46d was assessed using the primer-probe sets Ag2992 and 
Ag513, described in Tables APA and APB. Results of the RTQ-PCR runs are shown in Tables 
APC and APD. 



Table APA. Probe Name Ag2992 



Primers 


1 Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


I 5 1 -agggctggtcttcctcttct-3 ' 


20 


703 


1177 


Probe 


lTET-5 1 -cccctcagcattcagggattcctatt-3 1 - 

Itamra 


26 


731 


1178 


Reverse 


15 1 -agtcatccaaatccttctcgat-3 ' 


22 


764 


1179 




Table APB. Probe Name Agi 


513 




Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tatctgtggttctctgtgggttca-3 ' 


24 


600 


1180 


Probe 


TET-5 1 - 

atccacatatgatcctgacaagcaggaccag-3 1 - 


31 


626 


1181 
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|tamra _ | 




Reverse]5 1 - tggtcagcggcatcttctg-3 1 ji9 


659 1X82 



Table APC. Panel 1.1 



Tissue Name 


Rel. Exp.(%) Ag513, 
Run 124882567 


Tissue Name 


Rel. Exp.(%) Ag513, 
Run 124882567 


Adrenal gland 


0.0 


Renal ca.UO-31 


0.0 


Bladder 


0 0 

V.Vr 


Renal ca RVF 


o n 


Brain (amygdala) 


0.0 


Liver 


0.0 


Brain (cerebellum) 


0.0 


Liver (fetal) 


0.0 


Brain (hippocampus) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (substantia 
nigra) 


0.0 


Lung 


0.0 


Brain (thalamus) 


0.0 


Lung (fetal) 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


Brain (fetal) 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


Brain (whole) 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (sxell var.) 
SHP-77 


0.0 


astrocytoma SW1783 


0.0 


Lunff ca Ismail eel Y\ 

JLslllIg Vu. ^31 Hull vwll J 

LX-1 


0.0 


glioma U251 


0.0 


Lunirca Ismail celH 
NCI-H69 


81.8 


glioma SF-295 


0.0 


T liner ^Qfinam ^ 

SW 900 


0.0. 


glioma SNB-1 9 


0.0 


Lung ca. (squam.) 
NCI-H596 


1.4 


glio/astro U87-MG j 


0.0 


Lymph node 


0.0 


neuro*; met SK-N-AS 


0.0 


Spleen 


0.0 


Mammary gland 


0.0 


Thymus 


0.0 


Breast ca. BT-549 


0.0 


Ovary 


0.0 


Breast ca. MDA-N 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Breast ca.* (pl.ef) 


0.0 


Ovarian ca. OVCAR- 


0.0 



1132 



WO 02/068649 



PCT/US02/02785 



MLr-/ 


r-r-.Tr- P -„ . -„ „,, , , 


4 




Breast ca. (pl.ei) 
MDA-MB-231 


0.0 


Ovarian ca. OVCAR- 
5 


100.0 


Small intestine 


0.0 


Ovarian ca. OVCAR- 

8 
o 


0.0 


Colorectal 


0.0 


\jvanan ca. (ascites ) 
SK-OV-3 


— — — " > - 

0.0 


Colon ca. HT29 


0.0 


Pancreas 


0.0 


Colon ca. CaCo-2 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


Colon ca HCT-15 


1 < 

1 .J 


rituiiary gland 


0.0 


Colon ca HCT-116 

v^v/iv/ll wfc4* XJIV X 1 1 v 


u.u 


riacenia 


0.0 


Colon ca. HCC-2998 


0.0 


Prostate 


0.0 


Colon ca. SW480 


0.0 


Prostate ca.* (bone 
mei; rt-j 


0.0 


Colon ca * SW620 
(SW480 met) 


0.0 


Salivary gland 


0.0 


Stomach 


0.0 


Trachea 


n o 


Gastric ca. (liver met) 
NCI-N87 


0.0 


Spinal cord 


0.0 


Heart 


0.0 


Testis 


0 n 


Skeletal muscle (Fetal) 


0.0 


Thyroid 


0.0 


Skeletal muscle 


u.u 


Uterus 


0.0 


Endothelial cells 


0.0 


Melanoma Ml 4 


0.0 


Heart (Fetal) 


0.0 


Melanoma LOX 
IMVI 


0.0 


Kidney 


0.0 


Melanoma UACC-62 


0.0 


Kidney (fetal) 


0.0 


Melanoma SK-MEL- 
28 


0 0 


Renal ca. 786-0 


0.0 


ivieianoma {met; 
SK-MEL-5- 


0.0 


Renal ca. A498 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Renal ca. ACHN 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Renal ca. TK-10 


0.0 







Table APD. Panel 1.2 



Tissue Name 


Rel. Exp.(%) Ag513, 
Run 129119406 


Tissue Name 


Rel. Exp.(%) Ag513, 
Run 129119406 


Endothelial cells 


0.0 


Renal ca. 786-0 


0.0 


Heart (Fetal) 


0.0 


Renal ca. A498 


0.0 


Pancreas 


0.0 


Renal ca.RXF 393 


0.0 
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Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. ACHN 


0.0 


Adrenal Gland 


0.0 


Renal ca. UO-31 


0.0 


Thyroid 


0.0 


Renal ca. TK-10 


0.0 


Salivary gland 


0.0 


Liver 


0.0 


Jriiuiiary giana 


0 n 

u.u 




u.u 


Brain (fetal) 


0.0 


Liver ca. 

^ncpaiouiaoi/ ncpvjZr 


0.0 


Brain (whole) 


0.0 


Lung 


0.0 


Brain (amygdala) 


0.0 


Lung (fetal) 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (hippocampus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


100.0 


Brain (thalamus) 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


Spinal cord 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (squam.) 
SW900 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
NCI-H596 


40.6 


astrocytoma SNB-75 


0.0 


Mammary gland 


0.0 


glioma SNB-19 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma U251 


0.0 


MDA-MB-231 


0.0 


glioma SF-295 


0.0 


T47D 


0.0 


Heart 


0 0 




ft n 

u.u 


Skeletal Muscle 


0.0 


Breast ca MDA-N 


0.0 


Bone marrow 


0.0 


Ovary 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAR- 


91.4 
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Colorectal Tissue 


0.0 


Ovarian ca. OVCAR- 

o 


0.0 


Stomach 


0.0 


Ovarian ra IHROV-I 




Small intestine 


0.0 


Ovarian ca. (ascites) 

OXV~W V J 


0.0 


Colon ca. SW480 


0 0 


1 ]ff»t*iic 


u.u . 


Colon ca.* SW620 
(SW480 met) 


0.0 


Placenta 


0.0 


Colon ca. HT29 


0.0 


Prostate 


0.0 | 


Colon ca.HCT-1 16 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


v^oion ca. \^aL>o-z 


A A 

0.0 


Testis 


0.0 


^uiun ua. i issue 
(OD03866) 


59.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma* (met) 
Hs688(B).T 


2.0 


uastric ca. (liver met) 
NCI-N87 


0.0 


Melanoma UACC-62 


0.0 


Bladder 


0.0 


Melanoma M14 


0.0 


Trachea 


0.0 


Melanoma LOX 
IMVI 


0.0 


Kidney 


0.0 . 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney (fetal) 


0.0 







CNS_neurodegeneration_vl.0 Summary: Ag2992 Expression of the NOV46d gene 
is low/undetectable in alfsamples on this panel (CTs>35). (Data not shown.) 

Panel 1.1 Summary: Ag513 Expression of the NOV46d gene is limited to two 

5 samples derived from lung cancer and ovarian cancer cell lines (CTs=3 1 -32). Thus, expression 
of this gene could be used to differentiate between these sample and other samples on this 
panel and as a marker to detect the presence of lung and ovarian cancers. Furthermore, 
therapeutic modulation of the expression or function of this gene may be effective in the 
treatment of lung and ovarian cancers. 

0 Panel 1.2 Summary: Ag513 Expression of the NOV46d gene is restricted to samples 

derived from lung cancer, ovarian cancer, and colon cancer cell lines (CTs=3 1 -32). This 
expression profile is in agreement with the expression seen in Panel 1.1. Thus, expression of 
this gene could be used to differentiate between these sample and other samples on this panel 
and as a marker to detect the presence of these cancers. Furthermore, therapeutic modulation 
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of the expression or function of this gene may be effective in the treatment of lung, ovarian, 
and colon cancers. 

Panel 1.3D Summary: Ag2992 Expression of the NOV46d gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 
5 Panel 4D Summary: Ag2992 Expression of the NOV46d gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

NOV46c 

Expression of gene NOV46c was assessed using the primer-probe set Ag2985, 
described in Table AQA. 

1 0 Table AQA. Probe Name Ag2985 



Primers | Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forwardjs 1 - tctgtcatcccatctccaaa-3 1 


20 


285 


1183 


jTET-5 1 -atcctcattcctgtgatgacctttct-3 ■ - 
JTAMRA 


26 


305 


1184 


Reversejs 1 - tcatggcactcagaaagctc-3 1 


20 


346 


• 1185 



CNS_neurodegeneration_vl.O Summary: Ag2985 Expression of the NOV46c gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2985 Expression of the NOV46c gene is low/undetectable in 
15 all samples on this panel (CTs>35). (Data not shown.) 



NOV47 

Expression of gene NOV47 was assessed using the primer-probe set Ag2993, 
described in Table ARA. Results of the RTQ-PCR runs are shown in Tables ARB, ARC and 
ARD. 

20 Table ARA. Probe Name Ag2993 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 1 -gagggttactgctttcacagaa-3 1 


22 


161 


1186 


Probe 


TET-5 ' -tgacttcacatgccataatggcactg-3 1 - 
TAMRA 


26 


211 


1187 


Reverse 


5 ' -ctccctgtagatggacttgct-3 • 


21 


239 


1188 



Table ARB. CNS_neurodegeneration_vl.O 



Tissue Name 1 Rel. Exp.(%) Ag2993, \ Tissue Name } Rel. Exp.(%) Ag2993, 
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Run 211009463 




Run 211009463 


AD 1 Hippo 


11.9 


Control (Path) 3 
x cmpurdi i^ix 


10.8 


AD 2 Hippo 


43.2 


V^OnLiOl ^JTain ^ *t 

Temporal Ctx 


55.5 


AD 3 Hippo 


5.3 


AD 1 Occipital Ctx 


38.4 


AD 4 Hippo 


15.2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


7.1 


AD 6 Hinno 


H 7 


/vu 4 uccipitai cix 


11 O 

33.2 


Control 2 Hippo 


25.0 


AD 5 Occipital Ctx 


21.6 


Control 4 rlippo 


18.4 


AD 6 Occipital Ctx 


16.0 


Control (Path) 3 
Hippo 


5.3 


Control 1 Occipital 
Ctx 


2.2 


AD 1 Temporal Ctx 


2.9 


Control 2 Occipital 
Ctx 


39.2 


AD 2 Temporal Ctx 


42.3 


Control 3 Occipital 
Ctx 


16.3 


AD 3 Temporal Ctx 


11.9 


Control 4 Occipital 
Ctx 


6.4 


AD 4 Temporal Ctx 


45.4 


Control (Path) 1 
Occipital Ctx 


64.6 


AD 5 Inf Temporal 
Ctx 


86.5 


Control (Path) 2 
Occipital Ctx 


15.8 


AD 5 Sup Temporal 
Ctx 


39.5 


Control (Path) 3 
Occipital Ctx 


3.1 


AD 6 Inf Temporal 

CtX 


24.8 


Control (Path) 4 
Occipital Ctx 


27.9 


AD 6 Sup Temporal 
Ctx 


30.1 


Control 1 Parietal 
Ctx 


15.1 


Control 1 Temporal 
ctx 


8.7 


Control 2 Parietal 
Ctx 


31.9 


Control 2 Temporal 
ctx 


26.6 


Control 3 Parietal 
Ctx 


26.6 


Control 3 Temporal 
Ctx 


13.5 


Control (Path) 1 

i diiCLal \^IX 


13.3 


Control 3 Temporal 
Ctx 


6.2 


Control (Path) 2 
Parietal Ctx 


45.7 


Control (Path) 1 
Temporal Ctx 


38.4 


Control (Path) 3 
Parietal Ctx 


6.7 


Control (Path) 2 
Temporal Ctx 


34.9 


Control (Path) 4 
Parietal Ctx 


36.3 
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Table ARC. Panel 13D 



Tissue Name 


Rel. Exp.(%) Ag2993, 
Kun looziUooo 


Tissue Name 


Rel. Exp.(%) Ag2993, 
Kun iooZ3U3oo 


-r— — • 

Liver adenocarcinoma 




V \ Arson r {-fn+nW 

ivioney (ietai ) 




Pancreas 


U.U 


Kenai ca. /oo-u 


jL\J. 1 


Pancreatic ca. CAP AN 
2 


34.2 


Renal ca. A498 


31.4 


Adrenal gianu 


U.U 


Kenai ca. kat 3^3 


io.3 


Thyroid 


U.U 


D Qnr .| A PT TXT 

Kenai ca. ACHJN 


14.3 


oaiivary gianu - 


HO. / 


Kenai ca. uuoi 


DD.Z 


Pituitary gland 


5.5 


Renal ca. TK-10 


19.5 


Brain (fetal) 


0.0 


Liver 


4.7 


Brain (whole) 


29.1 


Liver (fetal) 


4.0 


Brain (amygdala) 


13.4 


Liver ca. 

(hepatoblast) HepG2 


37.4 


Brain (cerebellum) 


3.3 


Lung 


0.0 


Brain (hippocampus) 


22.5 


Lung (fetal) 


9.9 


Brain (substantia nigra) 


18.7 


Lung ca. (small cell) 
LX-1 


26.4 


Brain (thalamus) 


100.0 


Lung ca. (small cell) 
NCI-H69 


312 


Cerebral Cortex 


19.6 


Lung ca. (s.cell var.) 
SHP-77 


53.6 


Spinal cord 


29.9 


Lung ca. (large 
cell)NCI-H460 


42 


glio/astroU87-MG 


37.1 


Lung ca. (non-sm. 
cell) A549 


11.6 


glio/astroU-118-MG 


24.3 


Lung ca. (non-s.cell) 
NCI-H23 


15.5 


a<?trocvtnmn 

a. oil ii/iiid O W I /OJ 


1 s ft 


Lung ca. (non-s.cell) 
HOP-62 




neurn*" met ^K-lsJ-A^ 


z> i ~J 


Lung ca. (non-s.cl) 
NCI-H522 




astrocvtoma ^1F-S10 




Lung ca. (squam.) 
SW 900 


1ft 7 

1 v. / 


astrocvtoma 




Lung ca. (squam.) 
NCI-H596 


3ft 1 

JU. 1 


glioma SNB- 19 


25.5 


Mammary gland 


12.8 


glioma U251 


7.9 


Breast ca.* (pl.ef) 
MCF-7 


17.4 


glioma SF-295 


3.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


21.8 


Heart (fetal) 


3.3 


Breast ca.* (pl.ef) 
T47D 


15.4 


Heart 


0.0 


Breast ca. BT-549 


16.5 
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Skeletal muscle (fetal) 


3.0 


Breast ca. MDA-N 


13.4 


Skeletal muscle 


21.2 


Ovary 


3.8 


Bone marrow 


2.4 


Ovarian ca. OVCAR- 
3 


12.0 


Thymus 


22.4 


Ovarian ca. OVCAR- 
4 


39.5 


Spleen 


4.6 


Ovarian ca. OVCAR- 
5 


34.6 


Lymph node 


6.3 


Ovarian ca. OVCAR- 

o 
o 


2.8 


Colorectal 


45.1 


Ovarian ca. IGROV- 

i 
i 


12.4 


Stomach 


6.0 


Ovarian ca.* (ascites) 


3.7 


Small intestine 




Uterus 


ft A 

u.u 


Colon ca. SW480 


60.3 


Placenta 


23.8 


Colon ca. 

SW620(SW480met) 


13.2 


Prostate 


5.4 


Colon ca. HT29 


6.0 


Prostate ca.* (bone 
meijru-j 


24.0 


Colon ca.HCT-116 


13.5 


Testis 


11.8 


Colon ca. CaCo-2 


31.6 


Melanoma 
Hs688(A).T 


5.0 


Colon ca. 
tissue(OD03866) 


41.2 


Melanoma* (met) 
Hs688(B).T 


7.6 


Colon ca. HCC-2998 


32.5 


Melanoma UACC-62 


25.7 


Gastric ca * fliver mpf^ 
NCI-N87. 


11.3 


Melanoma M14 


12.8 


Bladder 


45.1 


Melanoma LOX 
IMVI 


6.7 


Trachea 


8.5 


Melanoma* (met) 
SK-MEL-5 


30.1 


Kidney 


3.7 


Adipose 


3.2 



Table ARD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2993, Run 
164315034 


Tissue Name 


Rel. Exp.(%) 
Ag2993, Run 
164315034 


Secondary Thl act 


8.4 


HUVECIL-lbeta 


3.8 


Secondary Th2 act 


11.8 


HUVEC IFN gamma 


4.5 


Secondary Trl act 


14.3 


HUVEC TNF alpha + IFN 
gamma 


4.2 


Secondary Thl rest 


3.3 


HUVEC TNF alpha + IL4 


5.0 


Secondary Th2 rest 


6.0 


HUVEC IL-11 


5.7 
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Secondary Trl rest 


3.8 


Lung Microvascular EC 
none 


3.8 


Primary Thl act 


22.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.3 


Primary Th2 act 


16.0 


Microvascular Dermal EC 
none 


10.2 


Primary Trl act 


30.4 


Microsvasular Dermal EC 

f 1 TV T"I — "i 1 1 | TT T t i 

TNFalpha + IL-lbeta 


5.5 ' 


Primary Thl rest 


39.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


18.0 


Primary Th2 rest 


31.9 


Small airway epithelium 
none 


5.2 


Primary Trl rest 


20.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


25.9 


CD45RA CD4 
lymphocyte act 


10.0 


Coronery artery SMC rest 


4.5 


CD45RO CD4 
lymphocyte act 


24.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.5 


CD8 lymphocyte act 


17.4 


Astrocytes rest 


4.5 


Secondary CD8 
lymphocyte rest 


13.0 


Astrocytes TNFalpha + 
IL-lbeta 


5.0 


Secondary CD8 
lymphocyte act 


10.7 


KU-812 (Basophil) rest 


14.0 


CD4 lymphocyte none 


9.0 


KU-812 (Basophil) 
r iv i/v l onom y c in 


22.4 


2ryThl/Th2yTrl_anti- 
pno^ phi i 


12.5 


CCD1106 (Keratinocytes) 
none 


8.7 


LAK cells rest 


13.3 


PfFH 1 0£ fTf prat inn/ 1 * vt^c^ 

ksK^lj i iuo ^Jveraiinocytes»^ 
TNFalpha + IL-lbeta 


6.1 


T AK ppIIq TT .0 

Lf/\S\. LCI lb \Lj-Z. 


14. 4 


LilvCT CllTnOMo 






13 1 


T iit*\iic Hrlnp\/ 

i^upiio Muiicy 




gamma 


15.8 


NCI-H292 none 


25.3 


LAK cells IL-2+ IL-18 


13.8 


NCI-H292 BL-4 


28.3 


T AK ppIIq 
PMA/ionomycin 


3.9 


NCI-H292IL-9 


312 


NK Cells IL-2 rest 


10.7 


NCI-H292 IL-13 


14.9 


Two Wav N/TT P ^ Hav 
i wu w <xy ivuLvix j KicLy 


1 1 7 


in -nz?z iriN gamma 


1 1 0 


Two Wav MLR 5 dav 


6.8 


HPAEC none 


. 4 9 


Two Way MLR 7 day 


6.0 


HPAEC TNF alpha + IL-1 
beta 


4.7 


PBMC rest 


3.3 


Lung fibroblast none 


2.6 


PBMC PWM 


30.4 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.4 


PBMC PHA-L 


20.9 


Lung fibroblast IL-4 


7.1 
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Ramos (B cell) none 


22.1 


Lung fibroblast IL-9 


6.5 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-13 


4.9 


B lymphocytes PWM 


51.4 


Lung fibroblast IFN 
gamma 


7.1 


B lymphocytes CD40L 

J TT A 

and 1L-4 


24.8 


Dermal fibroblast 
CCD1070 rest 


9.7 


EOL-1 dbcAMP 


5.6 


Dermal fibroblast 
CCD1070TNF alpha 


24.1 


bOL-1 doc AMP 


4.5 


Dermal fibroblast 


3.9 


PA/f A /i nnnm vr* t n 


rrriifwnTT 1 koto 
\^\^u i u /u iLf- 1 oeta 


Dendritic cells none 


7.0 


uciinai nDroDiasi iriN 
gamma 


4.0 


Dendritic cells LPS 


5.7 


Dermal fibroblast IL-4 


5.9 


Dendritic cells anti- 
CD40 


8.7 


IBD Colitis 2 


0.2 


Monocytes rest 


3.0 


IBD Crohn's 


0.0 


Monocytes LPS 


1.6 


Colon 


8.2 


Macrophages rest 


113 


Lung 


7.3 


Macrophages LPS 


5.8 


Thymus 


27.0 


HUVEC none 


6.7 


Kidney 


23.0 


HUVEC starved 


15.6 







CNS_neurodegeneration_vl.0 Summary: Ag2993 This panel does not show 
differential expression of the NOV47 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 13D Summary: Ag2993 Significant expression of the NOV47 gene is 
restricted to the brain, with expression in the thalamus (CT=34.4). This gene also shows strong 
expression in the brain in the previous panel suggesting that this gene product may be involved 
in the normal functioning of the brain. Thus, the protein encoded by this gene may represent a 
1 0 small molecule target for the treatment of neurologic diseases. 

Panel 4D Summary: Ag2993 The NOV47 gene, a Peptidyl Prolyl Cis-Trans 
Isomerase A homolog, is a novel member of the family of receptors for the widely used 
immunosuppressants cyclosporin A and FK506 (see Wang et al., 2001). The NOV47 gene is 
expressed at moderate levels in many of the tissues in this panel and is expressed at a 
1 5 somewhat higher level (CT = 30.3) in ionomycin-stimulated Ramos B lymphocytes. 

Therefore, small molecule drugs that antagonzie the activity of the NOV47 gene product may 
be useful as immunosuppressants to reduce or eliminate the symptoms in patients with 
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autoimmune or inflammatory conditions, such as Crohn's disease, ulcerative colitis, multiple 
sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, 
lupus erythematosus, or psoriasis. 
References: 

5 Wang HC, Kim K, Bakhtiar R, Germanas JP. Structure-activity studies of ground- and 

transition-state analogue inhibitors of cyclophilin. J Med Chem 2001 Aug 2;44(16):2593-600 
Peptidyl-prolyl isomerases (PPIases) are ubiquitous cellular enzymes that play roles in 
cellular signaling and protein folding. In addition, these proteins are the receptors for the 
widely used immunosuppressants cyclosporin A and FK506. We report the first structure- 

10 activity studies of de novo designed inhibitors of cyclophilin, the cellular target of cyclosporin 
A. Our mechanism-based inhibitors were modeled on the ground- and transition-state 
structures of proline-containing peptides, the natural substrates of the enzyme. Both ground- 
state analogues 1 and transition-state analogues 2 were prepared as single enantiomers from L- 
proline following a "self-reproduction of chirality" procedure. The binding affinities of the 

1 5 analogues for the active site of cyclophilin were measured by a fluorescence perturbation 
assay. While the transition-state analogues 2 did not display significant avidity for the active 
site (K(d) = 77 microM for 2b), several ground-state analogues bound to the enzyme with low 
micromolar affinity (K(d) = 1.5 microM for le). These results proclaim that properly designed 
small molecular weight molecules can form strong complexes with cyclophilin and may find 

20 use as probes in cell biology and as therapeutic agents. 

NOV48a 

Expression of gene NOV48a was assessed using the primer-probe set Ag3006, 
described in Table ASA. Results of the RTQ-PCR runs are shown in Tables ASB, ASC, ASD, 
ASEandASF. 

25 Table ASA. Probe Name Ag3006 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gggcttggaatagagaaacct-3 1 


21 


2991 


1189 


Probe 


TET-5 ' -caacttcctcaaagcccaaagccaag-3 ' - 
TAMRA 


26 


3037 


1190 


Reverse 


5 1 -gaagccttgagccttgatttat-3 1 


22 


3069 


1191 



Table ASB. AI comprehensive panePvl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
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110967COPD-F 


27.2 


1 12427 Match Control 

Pcnriacic-F 


72.7 


110980COPD-F 


18.8 


1 12418 Psoriasis-M 


20.2 


110968 COPD-M 


20.0 


1 12723 Match Control 
Psoriasis-M 


19.8 




"71 *7 

71.7 


1 12419 Psonasis-M 


32.8 


110989Emphysema- 
r 


46.0 


1 12424 Match Control 
Psoriasis-M 


22.7 


1 10992 Emphysema- 

p 
r 


18.3 


112420 Psoriasis-M 


74.2 


110993 Emphysema- 
r 


23.8 


112425 Match Control 
Psoriasis-M 


56.6 


1 10994 Emphysema- 
r 


20.7 


1 04689 (MF) OA 
Bone-Backus 


55.1 


110995 Emphysema- 

P 


19.9 


1 04690 (MF)Adj 
"Normal" Bone-Backus 


70.7 


11 0996 Emphysema- 

P 

r 


4.3 


104691 (MF) OA 
Synovium-Backus 


42.0 


110997Asthma-M 


6.4 


1 04692 (B A) OA 
Cartilage-Backus 


50.7 


111001 Asthma-F 


33.4 


104694 (B A) OA 
Bone-Backus 


32.5 


11 1002 Asthma-F 


33.0 


104695 (BA)Adj 
Normal" Bone-Backus 


52.1 


11 1003 Atopic 
Asthma-F 


31.6 


104696 (BA) OA 
Synovium-Backus 


25.7 


11 1004 Atopic 
/\sinnia-r 


28.5 


104700 (SS) OA Bone- 
Backus 


10.4 


111 UU3 Atopic 

Asthma-F 


11.1 


104701 (SS) Adj 
"Normal" Bone-Backus 


42.0 


in uuo /\iopic 
Asthma-F 


5.8 


104702 (SS) OA 
Synovium-Backus 


71.2 


111417Allergy-M 


13.7 


117093 OA Cartilage 
Rep7 


23.8 


1 1 / /viicrgy-jvi 


6.2 


ll 2672 OABoneS 


57.8 


1 1 9^40 Mnrmcil T una 

■4 


3.8 


1 12673 OA Synovium5 


22.7 


1 12357 Normal Lung- 

i 


39.8 


Fluid cells5 ! 


27.5 


1 12354 Normal Lung- 
M 


22.1 


117100 OA Cartilage 
Repl4 


12.3 


1 12374 Crohns-F 


36.9 


112756 OA Bone9 


412 


112389 Match 
Control Crohns-F 


15.6 


112757 OA Synovium9| 


100.0 
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112375 Crohns-F 


20.6 


112758 OA Synovial 
Fluid Cells9 


19.2 


112732 Match 
Control Crohns-F 


17.9 


117125 RA Cartilage 
Rep2 


25.0 


112725 Crohns-M 


6.1 


113492 Bone2 RA 


19.8 


112387 Match 
Control Crohns-M 


14.9 


113493 Synovium2RA 


7.7 


112378 Crohns-M 


4.8 


113494 Syn Fluid Cells 
RA 


14.5 


112390 Match 
Control Crohns-M 


48.0 


113499 Cartilage4RA 


24.1 


112726 Crohns-M 


33.0 


113500 Bone4 RA 


25.9 


112731 Match 
Control Crohns-M 


25.2 


113501 Synovium4RA 


15.4 


112380 Ulcer Col-F 


32.3 


113502 Syn Fluid 
Cells4RA 


7.5 


112734 Match 
Control Ulcer Col-F 


32.5 


113495 Cartilage3RA 


11.8 


112384 Ulcer Col-F 


35.6 


113496 Bone3 RA 


12.6 


112737 Match 
Control Ulcer Col-F 


6.5 


113497 Synovium3RA 


5.6 


112386 Ulcer Col-F 


17.3 


113498 Syn Fluid 
Cells3 RA 


13.9 


1 12738 Match 
Control Ulcer Col-F 


11.3 


117106 Normal 
Cartilage Rep20 


14.2 


112381 Ulcer Col-M 


5.1 


113663 Bone3 Normal 


6.8 


1 12735 Match 
Control Ulcer Col-M 


41.2 


113664 Synovium3 
Normal 


1.1 


112382 Ulcer Col-M 


25.2 


113665 Syn Fluid 
Cells3 Normal 


3.8 


i j/jy4 Matcn 
Control Ulcer Col-M 


11.2 


i m ah > t ,i 

1 i /1U/ JNormal 
Cartilage Rep22 


15.0 


112383 Ulcer Col-M 


23.0 


1 13667 Bone4 Normal 


23.8 


112736 Match 
Control Ulcer Col-M 


13.0 


113668 Synovium4 
Normal 


24.5 


112423 Psoriasis-F 


34.9 


113669 Syn Fluid 
Cells4 Normal 


37.9 



Table ASC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3006, 
Run 165517770 


Tissue Name 


Rel. Exp.(%) Ag3006, 
Run 165517770 


Liver adenocarcinoma 


4.7 


Kidney (fetal) 


1.2 


Pancreas 


0.7 


Renal ca. 786-0 


0.8 


Pancreatic ca. CAPAN 


1.1 


Renal ca. A498 


2.5 
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2 


| 1.0 






Adrenal gland 




Renal ca. RXF 393 


0.9 


Thyroid 


j 1.9 


Renal ca ACHN 




Salivary gland j 2.0 


Renal ca UO-31 


1 7 


Pituitary gland 


7.0 


Renal ca. TK-10 


1 7 


Brain (fetal) 


2.2 


Liver 


0.2 


Brain (whole) 


4.0 


.Liver (ietai ) 


0.7 


jDrdin ^amygaaiaj 


4.6 


Liver ca. 

(tie pato blast) HepUz 


1.8 


Brain (cerebellum) 


2.1 


Lung 


1.2 


Brain (hippocampus) 


3.6 


Lung (fetal) 


0.9 


Brain (substantia nigra) 


3.5 


Lung ca. (small cell) 
LX-1 


3 0 


Brain (thalamus) 


8.0 


Lung ca. (small cell) 
NC1-H69 




Cerebral Cortex 


2.3 


Lung ca. (s.cell var.) 
SHP-77 


2 4 


Spinal cord 


5.2 


Lung ca. (large 
cell)NCI-H460 


6 1 


glio/astro U87-MG 


1.2 


Lung ca. (non-sm. 
cell) A549 




glio/astroU-118-MG 


9.9 


Lung ca. (non-s.cell) 
NCI-H23 




astrocytoma SW1783 


1.2 


Lung ca. (non-s.cell) 
HOP-62 


1 1 

1.1 


neuro*; met SK-N-AS 


8.8 


Lung ca. (non-s.cl) 
NCI-H522 


2.1 


abu ucyioina orOj7 


2.6 


Lung ca. (squam.) 
SW900 


1.3 


astrocytoma SNB-75 


3 7 


Lung ca. (squam.) 
NCI-H596 


2.6 


glioma SNB-19 


2.7 


Mammary gland 


2.1 


glioma U251 


15.3 


Breast ca.* (pl.ef) 
iviwr - / 


0.8 


glioma SF-295 


2.2 


Breast ca. (pl.et) 
MDA-MB-231 


3.1 


Heart (fetal) 


0.2 


breast ca. (pl.ef) 
T47D 


0.7 


Heart 


1.8 


Breast ca. BT-549 


2 7 


Skeletal muscle (fetal) 


4.4 


Breast ca. MDA-N 


1.6 


Skeletal muscle 


100.0 


Ovary 


1.2 


Bone marrow 


1.8 


Ovarian ca. OVCAR- 
i 


1.7 


Thymus 


0.6 ] 


Ovarian ca. OVCAR- 
1 


2.2 
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Spleen 


2.5 


Ovarian ca. OVCAR- 
5 


1.7 


Lymph node 


4.7 


Ovarian ca. OVCAR- 

o 
o 


0.6 


Colorectal 


2.9 


Ovarian ca. IGROV- 
i 

X 


0.9 


Stomach 


3.2 


Ovarian ca.* (ascites) 

QTf f\\T 1 

Olv-vJ V -J 


2.0 


olllall lillCbUIIC 


0 0 


uterus 




Colon ca. SW480 


0.9 


Placenta 


0.8 


Colon ca.* 
SW620(SW480 met) 


1.3 


Prostate 


4.2 


Colon ca. HT29 


0.2 


Prostate ca.* (bone 


1.6 


Colon ca. HCT-116 


2.0 


Testis 


6.3 


Colon ca. CaCo-2 


1.2 


Melanoma 
Hs688(A).T 


0.9 


Colon ca. 
tissue(OD03866) 


0.9 


Melanoma* (met) 
Hs688(B).T 


0.3 


Colon ca. HCC-2998 


2.2 


Melanoma UACC-62 


3.1 


Gastric ca.* (liver met) 
NCI-N87 


3.9 


Melanoma M l 4 


5.0 


Bladder 


0.8 


Melanoma LOX 
IMVI 


0.7 


Trachea 


2.3 


Melanoma* (met) 
SK-MEL-5 


0.8 


Kidney 


4.0 


Adipose 


1.1 



Table ASD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3006, Ron 
163577592 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
163577592 


Normal Colon 


35.8 


Kidney Margin 
8120608 


9.0 | 


CC Well to Mod Diff 
(OD03866) 


5.6 


Kidney Cancer 
8120613 


13.2 


CC Margin (OD03866) 


6.2 


Kidney Margin 
8120614 


16.2 


CC Gr.2 rectosigmoid 
(OD03868) 


5.5 


Kidney Cancer 
9010320 


8.7 


CC Margin (OD03868) 


10.8 


Kidney Margin 
9010321 


20.9 


CCMod Diff(ODO3920) 


9.5 


Normal Uterus 


6.3 


CC Margin (ODO3920) 


19.6 


Uterus Cancer 064011 


9.4 
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CC Gr.2 ascend colon 
(OD03921) 


10.7 


Normal Thyroid 


21.0 


CC Margin (OD03921) 


8.5 


Thyroid Cancer 
064010 


2.5 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


6.1 


Thyroid Cancer 
A302152 


5.6 


Liver Margin (ODO4309) 


0.9 


Thyroid Margin 
A302153 


8.2 


Colon mets to lung 
(OD04451-01) 


6.8 


Normal Breast 


24.7 


Lung Margin (OD04451- 
02) 


2.8 


Breast Cancer 
(OD04566) 


65.1 


Normal Prostate 6546- 1 


66.9 


Breast Cancer 
(OD04590-01) 


70.2 


Prostate Cancer 
(OD04410) 


16.6 


Breast Cancer Mets 
(OD04590-03) 


68.3 


Prostate Margin 
(OD04410) 


18.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


100.0 


Prostate Cancer 
(OD04720-01) 


13.7 


Breast Cancer 064006 


14.7 


Prostate Margin 
(OD04720-02) 


25.5 


Breast Cancer 1024 


33.7 


Normal Lung 061010 


15.1 


Breast Cancer 
9100266 


26.6 


Lung Met to Muscle 
(OD042R6\ 


22.4 


Breast Margin 


15.3 


MiiqpIp lMnrcrin 

(OD04286) 


52.5 


Breast Cancer 
A209073 


16.7 


Lung Malignant Cancer 
(OD03126) 


9.9 


Breast Marein 
A2090734 I 


16.3 


Lung Margin (OD03126) 


11.2 


Normal Liver 


6 8 


Lung Cancer (OD04404) 


3.9 


Liver Cancer 064003 


1.6 


Lung Margin /OD04404^ 

JJU,I 5 AwAMmM gill yV/lyV/^7V/7 J 


8 7 
o. / 


i-ri ver cancer i kjZj 


2.1 


Lung Cancer (OD04565) 


6.5 


Liver Cancer 1026 


2.0 


Lung Margin (OD04565) 


. 6.2 


Liver Cancer 6004-T 


2.3 


Lung Cancer (OD04237- 
01) 


13.3 


Liver Tissue 6004-N 


3.2 


Lune Margin fTYnft4/?'}7- 
02) 


11.4 


Liver Cancer 6005-T 


2.6 


Ocular Mel Met to Liver 
(ODO4310) 


5.2 


Liver Tissue 6005-N 


0.6 


Liver Margin (ODO4310) 


2.0 


Normal Bladder 


9.7 


Melanoma Mets to Lung 
(OD04321) 


6.3 j 


Bladder Cancer 1023 


2.2 
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Lung Margin (OD04321) 


11.5 


Bladder Cancer 
A302173 


4.4 


Normal Kidney 


47.6 


Bladder Cancer 
(OD04718-01) 


10.3 


Kidney Ca, Nuclear grade 
2 (OD04338) 


27.7 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


62.0 


Kidney Margin 
(OD04338) 


43.8 


Normal Ovary 


5.0 


Kidney Ca Nuclear grade 
Ml (OD04339) 


12.4 


Ovarian Cancer 
064008 


17.9 


Kidney Margin 
(OD04339) 


46.7 


Ovarian Cancer 
(OD04768-07) 


20.3 


Kidney Ca, Clear cell 
type (OD04340) 


20.2 


Ovary Margin 
(OD04768-08) 


3.6 


Kidney Margin 
(OD04340) 


40.6 


Normal Stomach 


32.5 


Kidney Ca, Nuclear grade 

o / y"~x /\ j| <^ At r\ \ 

3 (OD04348) 


4.0 


Gastric Cancer 
9060358 


9.6 


Kidney Margin 
(OD04348) 


30.1 


Stomach Margin 
9060359 


8.6 


Kidney Cancer 
(OD04622-01) 


9.1 


Gastric Cancer 
9060395 


12.0 


Kidney Margin 


5.6 


Stomach Margin 


14.3 


Kidney Cancer 
(OD04450-01) 


9.1 


Gastric Cancer 
9060397 


17.6 


Kidney Margin 
(OD04450-03) 


24.0 


Stomach Margin 
9060396 


2.6 


Kidney Cancer 8120607 


18.3 


Gastric Cancer 
064005 


20.7 



Table ASE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag3006, Ran 
170188143 


Tissue Name 


Rel. Exp.(%) 
Ag3006,Run 
170188143 


Daoy- Medulloblastoma 


8.5 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


15.7 


TE671- Medulloblastoma 


7.5 


ES-2- Ovarian clear cell 
carcinoma 


8.3 


D283 Med- 
Medulloblastoma 


31.6 


Ramos- Stimulated with 
PMA/ionomycin 6h 


6.3 


PFSK-1- Primitive 
Neuroectodermal 


21.8 


Ramos- Stimulated with 
PMA/ionomycin 14h 


12.6 


XF-498-CNS 


5.4 


MEG-01- Chronic 


3.5 
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myelogenous leukemia 
^megoKaryooiast ) 




SNB-78- Glioma 


2.5 


Raji- Burkitt's lymphoma 


5.8 


or -zoo- uliODlastoma 


13.4 


Daudi- Burkitt s lymphoma 


15.4 


I 98G- Glioblastoma 


24.5 


U266- B-cell plasmacytoma 


11.4 


CI/' \T CTT 

SK-N-SH- 

Neuroblastoma 

(metastasis) 


14.4 


CA46- Burkitt's lymphoma 


2.4 . 


SF-295- Glioblastoma 


8.0 


RL- non-Hodgkin f s B-cell 
lymphoma 


1.3 


Cerebellum 


5.3 


JM1- pre-B-cell lymphoma 


6.4 


Cerebellum 


3.2 


Jurkat- T cell leukemia 


3.8 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


13.5 


TF-1- Erythroleukemia 


8.0 


DMS-1 14- Small cell 
lung cancer 


9.1 


HUT 78- T-cell lymphoma 


7.8 


DMS-79- Small cell lung 
cancer 


100.0 


U937- Histiocytic lvmohoma 


8 3 


NCI-H146- Small cell 
lung cancer 


6.3 


KU-812- Myelogenous 
leukemia 


5 8 


NCI-H526- Small cell 
lung cancer 


20.6 


769-P- Clear cell renal 
carcinoma 


7.7 


NCI-N417- Small cell 
lung cancer 


21.5 


Caki-2- Clear cell renal 
carcinoma 


13.9 


NCI-H82- Small cell 
lung cancer 


15.2 


SW 839- Clear cell renal 
carcinoma 


0.9 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


18.2 


G401- Wilms' tumor 


3.7 


NCI-H1155- Large cell 
lung cancer 


18.6 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


10.5 


NCI-H1299- Large cell 
lung cancer 


19.2 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


6.3 


NCI-H727- Lung 
carcinoio 


7.2 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


10.3 


NCI-UMC-ll-Lung 
carcinoid 


12.5 


BxPC-3- Pancreatic 
adenocarcinoma 


3 7 


-»X-1- Small cell luntr 

a l^lliuil WWII 

cancer 


10.9 


HPAC- Pancreatic 
adenocarcinoma 


11.0 


Colo-205- Colon cancer 


7.1 


MIA PaCa-2- Pancreatic 
carcinoma 


1.4 


KM 12- Colon cancer 


13.1 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


23.2 


KM20L2- Colon cancer 


1.1 


PANC-1- Pancreatic j 4.7 . 
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(epithelioid ductal carcinoma 




NCI-H716- Colon cancer 


7.2 


T24- Bladder carcinma 
(transitional cell) 


3.5 


SW-48- Colon 
adenocarcinoma 


4.4 


5637- Bladder carcinoma 


4.7 


SW1 116- Colon 
adenocarcinoma 


5.8 


HT-1 197- Bladder carcinoma 


1.5 


LS 174T- Colon 
adenocarcinoma 


3.7 


UM-UC-3- Bladder carcinma 
(transitional cell) 


4.8 


SW-948- Colon 
adenocarcinoma 


0.5 


A204- Rhabdomyosarcoma 


35.4 


SW-480- Colon 
adenocarcinoma 


1.7 


HT-1 080- Fibrosarcoma 


9.0 


NCI-SNU-5- Gastric 
carcinoma 


3.1 


MG-63- Osteosarcoma 


4.3 


KATO III- Gastric 
carcinoma 


24.8 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


13.8 


NCI-SNU-1 6- Gastric 
carcinoma 


6.9 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


7.3 


NCI-SNU-1- Gastric 
carcinoma 


18.0 


A43 1 - Epidermoid carcinoma 


2.8 


RF-1- Gastric 
adenocarcinoma 


4.1 


WM266-4- Melanoma 


5.4 


RF-48- Gastric 
adenocarcinoma 


3.4 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


10.0 


MDA-MB-468- Breast 
adenocarcinoma 


14.5 


NCI-N87- Gastric 
carcinoma 


6.1 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


3.5 


SCC-15- Squamous cell 
carcinoma of tongue 


0.3 


HelaS3- Cervical 
adenocarcinoma 


6.7 


CAL 27- Squamous cell 
carcinoma of tongue 


5.2 



Table ASF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3006,Run 
168033497 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
168033497 


Secondary Thl act 


32.3 


HUVEC IL-lbeta 


9.0 


Secondary Th2 act 


25.5 


HUVEC EFN gamma 


21.9 


Secondary Trl act 


39.0 


HUVEC TNF alpha + EFN 
gamma 


29.9 


Secondary Thl rest 


5.8 


HUVEC TNF alpha + IL4 


26.4 
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Secondary Th2 rest 


7.8 


HUVEC IL-11 


11.0 


Secondary Trl rest 


15.9 


Lung Microvascular EC 
none 


27.7 


Primary Thl act 


38.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


28.3 


Primary Th2 act 


33.7 


Microvascular Dermal EC 
none 


30.8 


Primary Trl act 


47.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


17.4 


Primary Thl rest 


35.8 


Bronchial epithelium 
TNFalpha + ILlbeta 


23.3 


Primary Th2 rest 


25.7 


Small airway epithelium 
none 


12 3 


Primary Trl rest 


23.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


44.4 


CD45RA CD4 
lymphocyte act 


23.7 


Coronery artery SMC rest 


132 


CD45RO CD4 
lymphocyte act 


29.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


10.4 


CD8 lymphocyte act 


33.7 


Astrocytes rest 


11.5 


Secondary CDS 
lymphocyte rest 


37.6 


Astrocytes TNFalpha + 
IL-lbeta 


l/.V 


Secondary CD8 
lymphocyte act 


16.4 


KU-812 ffiasoDhil) rest 


14 S 


CD4 lymphocyte none 


4.8 


KU-812 (Basophil) 
PMA/ionomycin 


22.1 


2ryThl/Th2/Trl_anti- 


15.3 


CCD1 106 (Keratinocytes) 
none 


28.1 


LAK cells rest 


11.2 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


14.8 


i-fcTVlV l/Clla 1JLv~Z, 




Liver cirrhosis 


2.2 


T AK ceU<z TT -7+TT -19 


ZD. J 


Lupus kidney 


5.8 


T AK celk TT -7+TFN 

gamma 


44.1 


NCI-H292 none 


15.4 


LAK cells IL-2+ IL-18 


35.8 


NCI-H292 IL-4 


36.3 


LAK celk 
PMA/ionomycin 


8.7 


NCI-H292 IL-9 


22.8 


NK Cells IL-2 rest 


23.3 


NCI-H292IL-13 


12.7 


Two Way MLR 3 day 


33.0 


NCI-H292 IFN eamma 




Two Way MLR 5 day 


18.3 


HPAEC none 


22.8 


Two Way MLR 7 day 


17.0 


HPAECTNF alpha + IL-1 
beta 


15.9 


PBMCrest 


5.0 


Lung fibroblast none 


27.5 | 


PBMCPWM 


100.0 


Lung fibroblast TNF alpha 
+ IL-lbeta 


14.3 



1151 



WO 02/068649 



PCT/US02/02785 



PBMC PHA-L 


31.2 


Lung fibroblast IL-4 


39.0 


Ramos (B cell) none 


30.4 


Lung fibroblast IL-9 


26.2 


Ramos (B cell) 
ionomycin 


66.4 


Lung fibroblast IL-1 3 


27.9 


B lymphocytes PWM 


96.6 


Lung fibroblast IFN 
gamma 


35.8 


B lymphocytes CD40L 
and IL-4 


32.3 


Dermal fibroblast 
CCD 1070 rest 


31.2 


EOL-1 dbcAMP 


18.7 


Dermal fibroblast 
CCD1070TNF alpha 


49.3 


PAT 1 11 AX jTT* 

EOL-1 dbcAMP 

i ivi/v luiivjiii y will 


15.6 


Dermal fibroblast 
pf r»i met tt .1 Kf»tn 


9.7 


Dendritic cells none 


4.6 


normal ftkrrtKlact TPXf 
LJCTlllcLl lIDrODJoSl JLT IN 

gamma 


9.5 


Dendritic cells LPS 


3.0 


Dermal fibroblast IL-4 


15.7 


Dendritic cells anti- 
CD40 


4.0 


IBD Colitis 2 


6.0 


Monocytes rest 


7.5 


IBD Crohn's 


1.4 


Monocytes LPS 


10.8 


Colon 


25.0 


Macrophages rest 


6.8 


Lung 


7.9 


Macrophages LPS 


3.1 


Thymus 


55.1 


HUVEC none 


27.9 


Kidney 


18.4 


HUVEC starved. 


35.8 







AI_comprehensive panel_vl.0 Summary: Ag3006 The NOV48a gene is a novel 
member of the Phospholipase C family that is expressed at moderate to low levels in numerous 
cell types involved in the immune response in health and disease. In addition, the NOV48a 
5 gene is expressed at higher levels (CTs range 29-32) in samples obtained from patients with 
several autoimmune and inflammatory diseases, particularly a subset of samples from 
osteoarthritic synovium and psoriasis. Therefore, small molecule drugs that antagonzie the 
activity of this gene product may be useful as immunosuppressants to reduce or eliminate the 
symptoms in patients with conditions, such as osteoarthritis, psoriasis, Crohn ! s disease, 

10 ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
emphysema, rheumatoid arthritis, or lupus erythematosus. 

Panel 1.3D Summary: Ag3006 Highest expression of the NOV48a gene, a 
phospholipase C homolog, is seen in skeletal muscle (CT=27.6). Phosphatidylinositol-specific 
phospholipase C (PLC) plays an important role in receptor-mediated signal transduction. In 

15 addition to skeletal muscle, this gene is expressed in heat, liver, adipose, adrenal, thyroid, and 

pancreas. This widespread expression in metabolic tissues suggests that this gene product may 

be involved in cellular regulation of metabolism through interactions with the insulin receptor. 
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A therapeutic modulator to this gene and/or gene-product may be useful in the treatment of 
metabolic diseases that affect skeletal muscle metabolism. This gene and/or gene-product may 
also prove useful in differentiating between fetal and adult forms of skeletal muscle tissue, 
since it is expressed at much higher levels in the adult (CT=27) when compared to expression 
5 in the fetal tissue (CT=32). 

This gene is also expressed at a low level in almost all cancer cell lines in this panel. 
Hence, it is probably required for cell survival and proliferation and therefore, inhibition of 
this gene in cancer can probably be used as therapy. 

In addition, increased expression of PLC delta has been observed in the brains of 
10 Alzheimer's disease patients, indicating a role for this class of enzyme in the disease process. 
Therefore, inhibitors of the NOV48A protein product, by countering this disease associated 
process, may have utility in treating Alzheimer's disease and other neurodegenerative 
disorders. 

References: 

1 5 Tanino H, Shimohama S, Sasaki Y, Sumida Y, Fujimoto S. Increase in phospholipase 

C-deltal protein levels in aluminum-treated rat brains. Biochem Biophys Res Commun 2000 
May 19;271(3):620-5 

The effect of administration of aluminum to rats on the level of three phospholipase C 
(PLC) isozymes (betal, gamma 1, and delta 1) was assessed in a variety of brain tissues. After 

20 exposure to aluminum, a statistically significant increase in malondialdehyde, an index of lipid 
peroxidation, was observed. In addition, there was a significant reduction in the catalytic 
activity of low molecular weight phosphotyrosine phosphatase, which loses its activity during 
oxidative stress. This suggests that oxidative stress is induced in brain tissues exposed to 
aluminum. The protein level of PLC-deltal, but not that of PLC-betal or -gammal, was 

25 significantly increased in brains where oxidative stress had been induced. The total PLC 
activity in aluminum-treated rat brains was significantly higher than that in control brains. 
These results suggest that PLC-deltal protein levels in brain tissues are increased by the 
induction of oxidative stress, giving an explanation for its up-regulation in Alzheimer's 
disease. 

30 Panel 2D Summary: Ag3006 The NOV48A gene is expressed at a low to moderate 

level in most of the tissues on this panel. There is increased expression in ovarian and breast 
cancer compared to normal adjacent tissue. Thus, expression of this gene could potentially be 
used as a diagnostic marker for the presence of cancer. Furthermore, inhibition of this gene in 
ovarian and breast cancer may be useful as a therapeutic treatment Additionally, there is 
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increased expression in normal kidney samples compared to adjacent tumors. Thus, decreased 
expression of this gene could be used as a diagnostic marker for kidney cancer and therapeutic 
modulation of expression of this gene in tumors may be used to treat these cancers. 

Panel 3D Summary: Ag3006 The NOV48a gene is expressed at a low level in almost 
5 all cancer cell lines in this panel with the highest expression in DMS-79 (CT=29.21). This 
ubiquitous pattern of expression suggests that this gene product may be required for cell 
survival and proliferation and inhibition of this gene in cancer may therefore be useful as a 
therapy. 

Panel 4D Summary: Ag3006 The NOV48a gene is ubiquitously expressed among the 
10 samples on this panel, suggesting a role for this protein product in inflammation. Please see 
Alcomprehensive panel_vl.O for further discussion of utility of this gene in inflammation. 
Results from a second experiment with the CG56003-01 gene are not included. The amp plot 
indicates that there were experimental difficulties with this run. 

NOV49 

1 5 Expression of gene NOV49 was assessed using the primer-probe set Ag3003, 

described in Table ATA. Results of the RTQ-PCR runs are shown in Table ATB. 

Table ATA. Probe Name Ag3003 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 1 -catctcgtccaccctacgtt-3 1 


20 


299 


1192 


Probe 


TET-5 1 -cttcagctgctgttgcactcaaggat-3 1 - 
TAMRA 


26 


339 


1193 


Reverse 


5 ■ -ttcaggaagccatagaaactca-3 ' 


22 


366 


1194 



Table ATB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3003, 
Run 166245477 


Tissue Name 


Rel. Exp.(%) Ag3003, 
Run 166245477 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


15.2 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


17.0 


Renal ca.UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca.TK-10 


14.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 



1154 



WO 02/068649 



PCT7US02/02785 



Brain (amygdala) 


0.0 


Liver ca. 

fhenatoblastl Henfi? 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


A A 
0.0 


Lung (ietal) 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-l 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NC1-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


17.7 


Spinal cord 


29.9 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


15.5 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


rrliAmo C\TD 1 Q 

glioma oiNrj-iy 


0.0 


Mammary gland 


52.1 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MPF-7 


100.0 


glioma SF-295 


0.0 


DICdbl Ul. ^pi.Ci J 

MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


T47D 


0.0 


Heart 


0.0 




n n [ 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


oKeicmi muscie 


A A 

u.o 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian HVP AP- 

5 


51.8 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 


0.0 
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oIV-tJ V-j 




- ■ ■ . : 

Small intestine 


U.U 


uterus 


U.U 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


11.2 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


31.2 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


13.3 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 



Panel 1.3D Summary: Ag3003 Expression of the NOV49 gene is restricted to a 
sample derived from a breast cancer cell line (CT=34.7). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel and as a marker to 
detect the presence of breast cancer. Furthermore, therapeutic modulation of the expression or 



5 function of this gene may be effective in the treatment of breast cancer. 

Panel 4D Summary: Ag3003 Expression of the NOV49 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

NOVSOa 

Expression of gene NOV50a was assessed using the primer-probe set Ag3014, 
10 described in Table AUA. 

Table AUA. Probe Name Ag3014 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -gactgagcgttgccttttct-3 1 


20 


1065 


1195 


Probe 


TET-5 1 -agctacctcccaaagcagcctgacct-3 ' - 
TAMRA 


26 


1088 


1196 


Reverse 


5 1 -acaatccctgcacaacgat-3 1 


19 


1138 


1197 



1156 



WO 02/068649 



PCTYUS02/02785 



CNS_neurodegeneration_vl.O Summary: Ag30 1 4 Expression of the NOV50a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 1.3D Summary: Ag3014 Expression of the NOV50a gene is low/undetectable 
5 in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag3014 Expression of the NOVSOa gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

10 NOV53 

Expression of gene NOV53 was assessed using the primer-probe set Ag3008, 
described in Table AVA. Results of the RTQ-PCR runs are shown in Tables AVE, AVC and 
AVD. 

Table AVA. Probe Name Ag3008 

15 



Primers 


Sequences 


j Length 


Start 
Position 


SEQ ZD HO: 


Forward 


5 1 -cttaagctgctgcctatgaatg-3 • 


J 22 


469 


1198 


Probe 


TET-5 1 -atacgggagctacagaccatcatccg-3 ' 
TAMRA 


"26 


496 


1199 


Reverse 


5 » -tcacctctactggctgtcttgt-3 1 


\22 


524 


1200 



Table AVB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3008, 
Run 211010256 


Tissue Name 


Rel. £xp.(%) Ag3008, 
Run 211010256 


AD 1 Hippo 


14.7 


Control (Path) 3 
Temporal Ctx 


6.2 


AD 2 Hippo 


30.6 


Control (Path) 4 
Temporal Ctx 


35.4 


AD 3 Hippo 


6.3 


AD 1 Occipital Ctx 


22.1 


AD 4 Hippo 


8.0 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


84.1 


AD 3 Occipital Ctx 


4.9 


AD 6 Hippo 


57.0 


AD 4 Occipital Ctx 


22.4 


Control 2 Hippo 


28.9 


AD 5 Occipital Ctx 


722 


Control 4 Hippo 


• 22.1 


AD 6 Occipital Ctx 


21.5 


Control (Path) 3 
Hippo 


4.7 


Control 1 Occipital 
Ctx 


4.7 
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AD 1 Temporal Ctx 


26.1 


Control 2 Occipital 
Ctx 


64.6 


AD 2 Temporal Ctx 


30.8 


Control 3 Occipital 
Ctx 


10.7 


AD 3 Temporal Ctx 


5.9 


Control 4 Occipital 
Ctx 


12.2 


AD 4 Temporal Ctx 


17.6 


Control (Path) 1 
Occipital Ctx 


73.7 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


10.5 


AD 5 Sup Temporal 
Ctx 


50.0 


Control (Path) 3 
Occipital Ctx 


3.7 


AD 6 Inf Temporal 
Ctx 


57.0 


Control (Path) 4 
Occipital Ctx 


9.7 


AD 6 Sup Temporal 
Ctx 


54.7 . 


Control 1 Parietal 
Ctx 


5.0 


Control 1 Temporal 
Ctx 


3.9 


Control 2 Parietal 
Ctx 


42.3 


Control 2 Temporal 
Ctx 


46.3 


Control 3 Parietal 
Ctx 


25.5 


Control 3 Temporal 

\^IX 


17.9 


Control (Path) 1 
ranetal Ctx 


95.9 


Control 3 Temporal 
Ctx 


8.0 


Control (Path) 2 
Parietal Ctx 


33.4 


Control (Path) 1 
Temporal Ctx 


58.2 


Control (Path) 3 
Parietal Ctx 


2.7 


Control (Path) 2 
Temporal Ctx 


33.2 


Control (Path) 4 
Parietal Ctx 


32.8 



Table AVC. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) Ag3008, 
Run 167927168 


Tissue Name 


Rel.Exp.(%)Ag3008, 
Run 167927168 


Liver adenocarcinoma 


18.9 


Kidney (fetal) 


42.6 


Pancreas 


7.6 


Renal ca. 786-0 


18.4 


Pancreatic ca. CAPAN 
2 


9.4 


Renal ca. A498 


11.3 


Adrenal gland 


10.8 


Renal ca. RXF 393 


5.7 


Thyroid 


8.1 


Renal ca. ACHN 


5.2 


Salivary gland 


5.9 


Renal ca.UO-31 


9.1 


Pituitary gland 


11.6 


Renal ca.TK-10 


10.3 


Brain (fetal) 


31.9 


Liver 


6.2 


Brain (whole) 


30.1 


Liver (fetal) 


10.7 


Brain (amygdala) 


17.8 


Liver ca. 

(hepatoblast) HepG2 


17.3 
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Brain (cerebellum) 


25.3 


Lung | 5.5 


Brain (hippocampus) 


25.7 


Lung (fetal) | 10.2 


Brain (substantia nigra) 


24.7 


Lung ca. (small cell) 
LX-1 


zl.U 


Brain (thalamus) 


14.1 


Lung ca. (small cell) 
NC1-H69 


1 A 

1.4 


Cerebral Cortex 


26.2 


Lung ca. (s.cell var.) 
SHP-77 


V4.U 


Spinal cord 


10.5 


Lung ca. (large 
cell)NCI-H460 


a n 


glio/astro U87-MG 


26.2 


Lung ca. (non-sm. 
cell) A549 


36.6 


glio/astroU-118-MG 


22.4 


Lung ca. (non-s.cell) 
NCI-H23 


12.7 


astrocytoma SW1783 


82.4 


Lung ca. (non-s.cell) 
HOP-62 


19.1 


neuro*; met SK-N-AS 


20.7 


Lung ca. (non-s.cl) 
NCI-H522 


332 


astrocytoma SF-539 


30.6 


Lung ca. (squam.) 
SW 900 


16.0 


astrocytoma SNB-75 


33.0 


Lung ca. (squam.) 
NCI-H596 




glioma SNB-19 


22.7 


Mammary gland 


13.0 


glioma U251 


100.0 


Breast ca.* (pl.ef) 
MCF-7 


21.8 


glioma SF-295 


38.2 ' 


Breast ca.* (pl.ef) 
MDA-MB-231 


38.7 


Heart (fetal) 


12.5 


Breast ca.* (pl.ef) 
T47D 


21.3 


Heart 


14 7 


Breast ca. BT-549 


15.6 


Skeletal muscle (fetal) 


10.4 


Breast ca. MDA-N 


21.6 


oKeietai muscle 


35.4 


Ovary 


15.3 


Bone marrow 


7.4 


Ovarian ca. OVCAR- 
3 


1 1 < 

11.0 


Thymus 


21.6 


Ovarian ca. OVCAR- 
4 


10.2 


Spleen 


9.1 


Ovarian ca. OVCAR- 

5 


12.4 


Lymph node 


16.8 


Ovarian ca. OVCAR- 
8 


8.5 


Colorectal 


15.0 


Ovarian ca. IGROV- 
1 


6.8 . \ 


Stomach 


17.8 


Ovarian ca.* (ascites) 
SK-OV-3 


58.2 


Small intestine 


4.7 


Uterus 


8.1 
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Colon ca. SW480 


12.5 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


85.9 


Prostate 


7.2 


Colon ca. HT29 


7.0 


Prostate ca.* (bone 


22.8 


Colon ca.HCT-116 


13.3 


Testis 


0.6 


Colon ca. CaCo-2 


32.8 


Melanoma 
Hs688(A).T 


10.1 


Colon ca. 
tissue(OD03866) 


9.3 


Melanoma* (met) 
Hs688(B).T 


8.1 


colon ca. ricc-zyyo 


inn 


Melanoma UACC-62 


22.4 


NCI-N87 


10.7 


Melanoma Ml 4 


6.3 


Bladder 


21.6 


Melanoma LOX 
IMVI 


18.2 


Trachea 


9.3 


Melanoma* (met) 
SK-MEL-5 


12.5 


Kidney 


31.2 


Adipose 


23.0 



Table AVD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3008, Run 
164043360 


Tissue Name 


Rel. Exp.(%) 
Ag3008, Run 
164043360 


Secondary Thl act 


21.0 


HUVEC IL-lbeta 


7.5 


Secondary Th2 act 


23.7 


HUVEC IFN gamma 


10.7 


Secondary Trl act 


23.5 


HUVEC TNF alpha + IFN 
gamma 


6.8 


Secondary Thl rest 


8.2 


HUVEC TNF alpha + IL4 


7.5 


Secondary Th2 rest 


8.8 


HUVEC IL-11 


7.7 


Secondary Trl rest 


9.1 


Lung Microvascular EC 
none 


12.0 


Primary Thl act 


24.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.2 


Primary Th2 act 


21.9 


Microvascular Dermal EC 
none 


17.9 


Primary Trl act 


29.1 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


11.0 


Primary Thl rest 


44.8 


Bronchial epithelium 
TNFalpha + ELlbeta 


16.4 


Primary Th2 rest 


21.3 


Small airway epithelium 
none 


6.0 


Primary Trl rest 


20.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


48.3 


CD45RA CD4 


15.8 


Coronery artery SMC rest 


19.9 
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lymphocyte act 








CD45RO CD4 
lymphocyte act 


30.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


8.0 


CD8 lymphocyte act 


22.8 


Astrocytes rest 


13.4 


Secondary CD8 
lymphocyte rest 


32.5 


Astrocytes TNFalpha + 
EL-lbeta 


8.9 


Secondary CD8 
lymphocyte act 


15.7 


KU-812 (Basophil) rest 


9.5 


CD4 lymphocyte none 


8.8 


KU-812 (Basophil) 
PMA/ionomycin 


33.0 . 


2ry Thl/Th2/Trl_anti- 


17.0 


CCD1 106 (Keratinocytes) 
none 


12.3 


LAK cells rest 


21.3 


iajlh iuo (Reratinocytesj 
TNFalpha + IL-lbeta 


8.6 


JL//\XV UCllb ILt-Z, 


1 Q < 


Liver cirrhosis 


2.1 


L-»/\J\. CCUS 1L»-Z- i 1L»-1Z 


1 n £ 

1 /,0 


Lupus kidney 


1.4 


T AV- /-pile TT -94-TFM 
i-t/viv ecus iJL»"Z^irrN 

gamma 


28.9 


NCI-H292 none 


19.8 


LAK cells IL-2+ IL-18 


16.2 


NCI-H292 IL-4 


25.9 


j->/\js. ceus 
PMA/ionomyc in 


232 


NCI-H292 IL-9 


27.0 


NK Cells IL-2rest 


11.0 


NCI-H292IL-13 


10.7 


iwo way lYijLfix j aay 




JNC1-H2y2 lrN gamma 


1 1 o 

11.8 


i wo w ay iVLLfiN. z> aay 


10.3 


HrAbC none 


6.0 


Two Way MLR 7 day 


9.6 


JrlrAibC IJNr alpha + 1L-1 
beta 


9.0 


PRAAf" 1 r<=»et 
JrUlvll^ ICSL 


1U./ 


Lung fibroblast none 


7.5 


PBMCPWM 


65.1 


Lung fibroblast TNF alpha 
+ 1L-1 beta 


6.7 


PBMC PHA-L 


58.6 


Lung fibroblast IL-4 


22.1 


Ramos (B cell) none 


33.0 


Lung fibroblast IL-9 


18.2 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-13 


14.7 


B lymphocytes PWM 


74.7 


Lung fibroblast IFN 
gamma 


232 


B lymphocytes CD40L 
and IL-4 


23.2 


Dermal fibroblast 
CCD1 070 rest 


18.8 


EOL-1 dbcAMP 


12.2 


Dermal fibroblast 
CCD 1070 TNF aloha 


46.7 


EOL-1 dbcAMP 
PMA/ionomycin 


13.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


8.0 


Dendritic ceils none 


16.7 


Dermal fibroblast IFN 
gamma 


6.7 


Dendritic cells LPS 


13.0 


Dermal fibroblast IL-4 


122 


Dendritic cells anti- 


17.1 


IBD Colitis 2 


2.0 
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CD40 








Monocytes rest 


17.2 


IBD Crohn's 


2.9 


Monocytes LPS 


7.5 


Colon 


15.4 


Macrophages rest 


24.5 


Lung 


14.8 . 


Macrophages LPS 


11.8 


Thymus 


22.2 


HUVEC none 


9.4 


Kidney 


19.8 


HUVEC starved 


18.8 







CNS_neurodegeneration_vLO Summary: Ag3008 This panel does not show 
differential expression of the NOV53 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3008 Highest expression of the NOV53 gene is seen in a 
brain cancer cell line (CT=29). In addition, this gene has low to moderate expression in all the 
cancer cell lines used in this panel. Thus, expression of this gene might be used as a diagnostic 
marker in brain, colon, renal, lung, melanoma and ovarian cancers. 
10 This gene encodes a homo log of uracil phosphoribosyltransferase. This gene has low to 

moderate expression in several endocrine/metabolically-related tissues, including; adipose, 
adrenal, pancreas, liver and skeletal muscle. Therefore, a therapeutic modulator to this gene 
and/or gene-product may prove useful in the treatment of diseases which affect the endocrine 
system. ' 

15 In addition, this gene shows moderate to low levels in the CNS and may be a small 

molecule target for the treatment of neurologic diseases. 

Panel 4D Summary: Ag3008 The NOV53 gene, a uracil phosphoribosyl-transferase 
homolog is expressed at moderate to low levels in numerous cell types involved in the immune 
response. Higher levels of expression are seen in activated B lymphocytes, represented by 

20 ionomycin-activated Ramos (CT=27.6), and pokeweed mitogen-activated B lymphocytes 

(CT=28.02). Therefore, small molecules that antagonize the function of this gene product may 
be useful as therapeutic drugs to reduce or eliminate the symptoms in patients with 
autoimmune and inflammatory diseases in which B cells play a part in the initiation or 
progression of the disease process, such as lupus erythematosus, Crohn's disease, ulcerative 

25 colitis, multiple sclerosis, chronic obstructive pulmonaiy disease, asthma, emphysema, 
rheumatoid arthritis, or psoriasis. 

NOV54a 
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Expression of gene NOV54a was assessed using the primer-probe sets Ag3015 and 
Ag3070, described in Tables AWA and AWB. Results of the RTQ-PCR runs are shown in 
Tables AWC, A WD, AWE and AWF. 



Table AWA. Probe Name Ag3015 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gtgctctcactatccacctcaa-3 1 


22 


1515 


1201 


Probe 


TET-5 1 -cacacatccatctcaagaggaacatt-3 1 - 
TAMRA 


26 


1537. 


1202 


Reverse 


5 1 -ccatacacttccagctctgact-3 1 


22 


1573 


1203 



Table AWB. Probe Name Ag3070 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 ' -gtgctctcactatccacctcaa-3 ' 


22 


1515 


1204 


Probe 


TET-5 1 -cacacatccatctcaagaggaacatt-3 ' - 
TAMRA 


26 


1537 


1205 


Reverse 


5 1 -ccatacacttccagctctgact-3 1 


22 


1573 


1206 



Table AWC. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel.Exp.(%) 
Ag3015, Run 
211010356 


Rel. Exp.(%) 
Ag3070,Run 
208974108 


Tissue 
Name 


Rel. Exp.(%) 
Ag3015, Run 
211010356 


Rel.Exp.(%) 
Ag3070,Run 
208974108 


AD 1 Hippo 


37.1 


22.5 


Control 
(Path) 3 
Temporal 
Ctx 


11.2 


10.5 


AD 2 Hippo 


56.3 


50.0 


Control 
(Path) 4 
Temporal 
Ctx 


50.0 


29.9 


AD 3 Hippo 


24.8 


22.7 


AD 1 

Occipital 

Ctx 


15.5 


13.6 


AD 4 Hippo 


15.6 


8.5 


AD2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


84.7 


54.0 


AD3 

Occipital 

Ctx 


17.7 


9.4 


AD 6 Hippo 


100.0 


66.4 


AD4 

Occipital 

Ctx 


17.3 


11.3 


Control 2 


37.4 


32.5 


AD 5 


29.7 


39.2 
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Hippo 






Occipital 
Ctx 






Control 4 
Hippo 


33.0 


26.1 


AD 6 

Occipital 

Ctx 


45.4 


13.1 


Control (Path) 

3 HlDDO 


15.2 


8.8 


Control 1 
Occipital 
Ctx 


9.7 


8.2 


AD 1 Temporal 
Ctx 


31.2 


17.4 


Control 2 
Occipital 
Ctx 


42.0 


30.4 


AD 2 Temporal 
Ctx 


50.3 


38.2 


Control 3 
Occipital 
Ctx 


15.5 


14.5 


AD 3 Temporal 
Ctx 


17.6 


14.8 


Control 4 
Occipital 
Ctx 


13.8 


9.4 


AD 4 Temporal 
Ctx 


36.1 


23.3 


Control 
(Path) 1 
Occipital 
Ctx 


92.7 


67.8 


AD5Inf 
Temporal Ctx 


87.1 


69.7 


Control 
(Path) 2 
Occipital 
Ctx 


17.7 


11.3 


SupTemporal 
Ctx 


87.1 


53.2 \ 


Control 
(Path) 3 
Occipital 
Ctx 


7.9 


8.9 


AD6Inf 
Temporal Ctx 


60.7 


72.7 


Control 
(Path) 4 
Occipital 
Ctx 


21.8 


14.5 


AD 6 Sup 
Temporal Ctx 


55.9 


35.8 


Control 1 
Parietal Ctx 


16.5 


10.4 


Control 1 
Temporal Ctx 


15.9 


8.0 


Control 2 
Parietal Ctx 


77.9 


48.6 


Control 2 
Temporal Ctx 


56.6 


28.9 


Control 3 
Parietal Ctx 


20 3 


19 3 


Control 3 
Temporal Ctx 


28.3 


17.8 


Control 
(Path) 1 
Parietal Ctx 


76.3 


100.0 


Control 4 
Temporal Ctx 


21.5 


16.7 


Control 
(Path) 2 
Parietal Ctx 


36.1 


21.2 


Control (Path) 
1 Temporal Ctx 


82.4 


70.2 


Control 
(Path) 3 
Parietal Ctx 


7.1 


9.3 
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Control (Path) 
2 Temporal Ctx 


49.7 




Control 






38.7 


(Path) 4 
Parietal Ctx 


41.8 


30.6 



Table AWD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag3015, Run 
167927212 


Rel. Exp.(%) 
Ag3070, Run 
167985243 


Tissue Name 


Rel. Exp.(%) 
Ag3015, Run 
167927212 


Rel. Exp.(%) 
Ag3070, Run 
167985243 


Liver 

adenocarcinoma 


23.7 


21.3 


Kidney (fetal) 


56.3 


64.6 


Pancreas 


7.4 


12 


Renal ca. 786- 

U 


2.3 


2.9 


Pancreatic ca. 
CAP AN 2 


4.5 


2.4 


Renal ca. 
A498 


13.9 


11.7 


Adrenal gland 


6.5 


4.9 


Renal ca. RXF 


14.1 


25.7 


Thyroid 


14.6 


14.6 


Renal ca. 
ACHN 


13.9 


19.8 


Salivary gland 


13.3 


1.4 


Renal ca. UO- 
31 


7.4 


17.8 


Pituitary gland 


6.0 


8.4 


Renal ca. TK- 
10 


12.6 


14.0 


Brain (fetal) 


25.3 


14.6 


Liver 


13.8 


2.4 


Brain (whole) 


21.6 


11.1 


Liver (fetal) 


5.7 


4.9 


Brain (amygdala) 


28.1 


32.3 


Liver ca. 

(hepatoblast) 

HepG2 


3.7 


4.3 


Brain (cerebellum) 


17.9 


17.2 


Lung 


23.7 


7.6 


Brain 

(hippocampus) 


16.8 i 


16.5 


Lung (fetal) 


20.6 


33.0 


Brain (substantia 
nigra) 


18.9 


30.6 


Lung ca. 
(small cell) 
LX-1 


8.8 


15.2 


Brain (thalamus) 


9.1 


2.8 


Lung ca. 
(small cell) 
NCI-H69 


3.5 


10.4 


Cerebral Cortex 


42.6 


36.9 


Lungca. 
(s.cell var.) 
SHP-77 


13.1 


17.3 


Spinal cord 


18.9 


18.4 


Lung ca. (large 
cell)NCI-H460 


4.7 


2.2 


glio/astro U87-MG 


19.8 


37.6 


Lung ca. (non- 
sm. cell) A549 


13.1 


10.2 


glio/astroU-118- 
MG 


11.9 | 


18.3 


Lung ca. (non- 
s.cell) NCI- 


5.5 


18.7 
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H23 






astrocytoma 
SW1783 


5 6 


10.9 


Lung ca. (non- 
s.cell) HOP-62 


7.5 


9.3 


neuro*; met SK-N- 
AS 


3.4 


7.1 


Lung ca. (non- 
s.cl) NCI- 
H522 


18.8 


34.6 


astrocytoma SF- 
539 


6.4 


10.4 


Lung ca. 
(squam.) SW 
900 


2.0 


4.8. 


astrocytoma SNB- 
75 


12.9 


10.1 


Lung ca. 
(squam.) NCI- 
H596 


7.6 


8.2 


glioma SNB- 19 


13.3 


16.0 


Mammary 
gland 


19.5 


3.8 


glioma U251 


' 24.5 


36.9 


Breast ca.* 
(pl.ef)MCF-7 


0.9 


2.6 


glioma SF-295 


13.4 


18.2 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


8.0 


9.7 


Heart (fetal) 


100.0 


100.0 


Breast ca.* 
(plef) T47D 


26.8 


40.6 


Heart 


12.4 


14.0 


Breast ca. BT- 
549 


5.1 


6.1 


Skeletal muscle 
(fetal) 


51.1 


67.8 


Breast ca. 
MDA-N 


11.0 


16.2 


Skeletal muscle 


11/4 

11.4 


1 A 1 

14.1 


Ovary 


50.0 


53.2 


Bone marrow 


28.9 


11.1 


Ovarian ca. 
OVCAR-3 


8.3 


3.3 


Thymus 


53.2 


56.3 


Ovarian ca. 

^VT rx~^ A T* A 

OVCAR-4 


11.0 


18.7 


Spleen 


24.8 


30.6 


Ovarian ca. 
OVCAR-5 


202 


38.4 


Lymph node 


60.7 


47.6 


Ovarian ca. 
OVCAR-8 


5.0 


8.1 


Colorectal 


17.1 


15.5 


Ovarian ca. 
IGROV-1 


4.7 


7.9 


btomacn 


8.9 


9.7 


Ovarian ca.* 
(ascites) SK- 
OV-3 i 


8.5 


1 A 1 

I4.l 


ouiaii intestine 


1 1 0 


1Z.0 


Uterus 






Colon ca. SW480 


17.9 


16.4 


Placenta 


3.4 


5.4 


Colon ca.* 

SW620(SW480 

met) 


21.9 


16.0 


Prostate 


14.3 


13.5 


Colon ca. HT29 


2.4 


2.3 


Prostate ca.* 
(bone met)PC- 


14.6 


28.3 
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3 






Colon ca. HCT- 
116 


3.4 


6.8 


Testis 


6.6 


10.3 


Colon ca. CaCo-2 


11.7 


16.0 


Melanoma 
Hs688(A).T 


12.9 


12.4 


Colon ca. 
tissue(0DO3866) 


9.9 


18.0 


Melanoma* 
(met) 

Hs688(B).T 


10.7 


16.6 


Colon ca. HCC- 
2998 


15.0 


17.8 


Melanoma 
UACC-62 


7.3 


10.4 


Gastric ca.* (liver 

ma i\ VT/^T \T07 

met) NC1-N87 


3.1 


4.9 


Melanoma 
M14 


3.4 


7.4 


Bladder 


10.5 


5.8 


Melanoma 
LOXIMVI 


14.1 


15.7 


Trachea 


13.2 


10.9 


Melanoma* 
(met) SK- 
MEL-5 


2.5 


2.8 


Kidney 


26.1 


15.6 


Adipose 


34.2 


33.0 



Table AWE. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag3070, Run 
173800588 


Tissue Name 


Rel. Exp.(%) 
Ag3070, Run 
173800588 


Normal Colon 


25.0 


Kidney Margin 
(OD04348) 


75.8 


Colon cancer 
(OD06064) 


74.2 


Kidney malignant 
cancer (OD06204B) 


5.0 


Colon Margin 
(OD06064) 


36.3 


Kidney normal adjacent 
tissue (OD06204E) 


11.9 


Colon cancer 
(OD06159) 


4.9 


Kidney Cancer 
(OD04450-01) 


44.4 


Colon Margin 
(OD06159) 


17.4 


Kidney Margin 
(OD04450-03) 


15.8 


Colon cancer 
(OD06297-04) 


1.4 


Kidney Cancer 8120613 


4.0 


Colon Margin 
(OD06297-015) 


18.9 


Kidney Margin 
8120614 


37.4 


CC Gr.2 ascend colon 
(OD03921) 


6.5 


Kidney Cancer 9010320 


19.3 


CC Margin (OD03921) 


16.5 


Kidney Margin 
9010321 


16.6 


Colon cancer metastasis 
(OD06104) 


12.7 


Kidney Cancer 8120607 


33.2 


Lung Margin 
(OD06104) 


34.6 


Kidney Margin 
8120608 


18:4 
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Colon mets to Jung 
(OD0445I-01) 


8.1 


Normal Uterus 


76.3 


Lung Margin 
(OD04451-02) 


52.5 


Uterine Cancer 06401 1 


12.8 


\Trmviol DrA cf~Q t £i 

INormai rrostaie 


10 1 


iNormai inyroia 


11 Q 
1 l.O 


Prostate Cancer 


0.0 


Thyroid Cancer 064010 


7.2 


Prostate Margin 

{KJlJV^tH i\J) 


13.2 


Thyroid Cancer 

A 1AO 1 


37.4 


Normal Ovary 


60.7 


Thyroid Margin 
A302153 


9.7 


Ovarian cancer 
(OD06283-03) 


14.8 


Normal Breast 


100.0 


uvanan Margin 
(OD06283-07) 


24.1 


Breast Cancer 
(OD04566) 


13.4 


Ovarian Cancer 064008 


26.6 


Breast Cancer 1024 


34.6 


Ovarian cancer 
(OD06145) 


252 


Breast Cancer 
(OD04590-01) 


13.3 


Ovarian Margin 
tannzi a^\ 


24.1 


Breast Cancer Mets 
(UDU439U-U3) 


40.6 


Ovarian cancer 
(OD06455-03) 


A A 


Breast Cancer 
Metastasis (UDU4655- 
05) 


O A 

52.9 


Ovarian Margin 
(OD06455-07) 


22.4 


Breast Cancer 064006 


29.3 


Normal Lung 


m o 

jo.y 


JtJreast Cancer 9100266 


28.7 


Invasive poor diff. lung 
aaeno ^uiJU^y'O-ui 


19.1 


Breast Margin 9100265 


35.6 


Lung Margin 


44.1 


Breast Cancer A209073 


10.7 


Lung Malignant Cancer 


18.0 


Breast Margin 


19.5 


T HTlfT \/l O T*fTl n 

l^ung ivicugm 
(OD03126) 


18.3 


Breast cancer 
(OD06083) 


30.4 


T lino* f'onr*^?* 

(OD05014A) 


22.5 


r>reast cancer none 
metastasis (OD06083) 


59.5 


T i in ft AAn rrri n 

i^ung lviargin 
(OD05014B) 


79.6 


Normal Liver 


19.1 


jLrUng cancer ^l/jl-'vOuoI ) 


O < 


.Liver Cancer 1 U26 


17.3 


Lune Marffin 

UUllg 1T1UI gill 

(OD06081) 


17.7 


Liver Cancer 1025 


17.4 


Lung Cancer 
(OD04237-01) 


4.1 


Liver Cancer 6004-T 


27.0 


Lung Margin 
(OD04237-02) 


54.7 


Liver Tissue 6004-N 


8.7 


Ocular Melanoma 


14.3 


Liver Cancer 6005-T 


48.6 
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Ocular Melanoma 

Marcrin H iv£*r^ 

iV_l.CU gill wj y 


9.7 


Liver Tissue 6005-N 


49.3 


Melanoma Metastasis 


4.4 


Liver Cancer 064003 


23.2 


Melanoma Margin 
(Lung) 


60.3 

, „.n . .„..• 


Normal Bladder 

-- -- - - - 


18.7 


Normal Kidney 


in r 

18.6 


Bladder Cancer 1023 


15.3 


Kidney Ca, Nuclear 
grade 2 ((JDU4338) 


47.6 


Bladder Cancer 
A302173 


15.1 


Kidney Margin 
(UDU4J3o) 


26.6 


Normal Stomach 


58.2 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


49.7 


Gastric Cancer 9060397 


9.1 


Kidney Margin 


16.4 


Stomach Margin 


33.0 


Kidney Ca, Clear cell 
type (OD04340) 


11.9 


Gastric Cancer 9060395 


29.9 


Kidney Margin 
(OD04340) 


19.1 


Stomach Margin 
9060394 


65.5 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


7.3 


Gastric Cancer 064005 


19.5 



Table AWF. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag3015, 

Ran 
164043871 


Rel. 
Exp.(%) 
Ag3070, 

Run 
164525657 


Tissue Name 


Rel. 
Exp.(%) 
Ag3015, 

Run 
164043871 


Rel. 
Exp.(%) 
Ag3070, 

Run 
164525657 


Secondary Thl act 


35.4 


35.4 


HUVEC IL-lbeta 


2.6 


1.2 


Secondary Th2 act 


27.9 


21.6 


HUVEC IFN 
gamma 


9.7 


10.5 


Secondary Trl act 


28.9 


33.7 


HUVEC TNF 
alpha + IFN 
gamma 


8.1 


8.1 


Secondary Thl rest 


21.3 


21.5 


HUVEC TNF 
alpha + IL4 


4.8 


7.4 


Secondary Th2 rest 


24.0 


21.2 


HUVEC IL-11 


4.9 


5.6 


Secondary Trl rest 


26.6 


26.6 


Lung 

Microvascular EC 
none 


6.9 


9.2 


Primary Thl act 


23.5 


19.6 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


6.7 


7.5 


Primary Th2 act 


18.3 


20.7 


Microvascular 


12.4 


14.8 
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Dermal EC none 






Primary Trl act 


23.8 


27.5 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


5.5 


7.0 


Primary Thl rest 


48.6 


54.0 


Bronchial 
epithelium 
TNFalpha + . 
ILlbeta 


5.1 


4.8 


Primary Th2 rest 


33.0 


28.7 


Small airway 
epithelium none 


2.0 


2.1 


Primary Trl rest 


26.1 


28.1 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


3.4 


3.0 


CD45RA CD4 
lymphocyte act 


12.7 


14.8 


Coronery artery 
SMC rest 


8.4 


7.1 


CD45ROCD4 
lymphocyte act 


28.9 


32.3 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


4.1 


7.3 


CD8 lymphocyte 
act 


27.7 


42.0 


Astrocytes rest 


5.3 


7.6 


Secondary CD8 
* 

lymphocyte rest 


29.1 


35.8 


Astrocytes 
TNFalpha + IL- 
lbeta 


3.8 


32 


Secondary CD8 
lymphocyte act 


382 


41.8 


KU-812 
(Basophil) rest 


4.0 


4.5 


CD4 lymphocyte 
none 


0.0 


18.0 


KU-812 

(Basophil) 

PMA/ionomycin 


7.1 


10.4 


2ry 

Thi/Thz/Trl anti- 
CD95 CH11 


26.8 


29.7 


CCD1106 

(Keratinocytes) 

none 


8.3 


6.8 


LAK cells rest 


503 


71.2 


CCD1106 
(Keratinocytes) 
i jNraipna t il,- 
lbeta 


4.2 


2.1 


T AK celU IT -? 


JU.l 




L,iver cirrnosis 


5.1 


3.U 


LAK cells IL-2+IL- 

19 


16.6 


29.3 


Lupus kidney 


3.1 


3.3 


LAK cells IL- 
2+IFN gamma 


43.8 


44.8 


NCI-H292 none 


7.5 


5.7 


LAK cells IL-2+ 
IL-18 


26.8 j 


50.0 


NCI-H292 IL-4 


6.5 


8.3 


LAK cells 
PMA/ionomycin 


23.8 


24.8 


NCI-H292 IL-9 


6.0 


8.2 


NK Cells IL-2 rest 


31.4 


35.1 


NCI-H292 IL-13 


3.7 


4.1 



1170 



WO 02/068649 



PCT/US02/02785 



Two Way MLR 3 

UflY 


32.1 


50.3 


NC1-H292 IFN 
gamma 


3.9 


4.5 


Two Way MLR 5 
day 


20.7 


26.2 


HPAEC none 


9.8 


10.2 


Two Way MLR 7 
day 


17.1 


25.9 


HPAEC TNF 
alpha + IL-1 beta 


3.8 


4.5 


PBMC rest 


16.2 


19.1 


Lung fibroblast 
none 


20.0 


23.3 


PBMCPWM 


55.1 


58.6 


Lung fibroblast 
TNF alpha + EL- 1 

ucld 


7.6 


8.0 


PBMC PHA-L 


39.8 


43.5 


Lung fibroblast 

IT -4 


19.9 


24.7 


Ramos (B cell) 


4.8 


5.2 


Lung fibroblast 


15.6 


19.2 


Ramos (B cell) 
ionomycin 


18.9 


25.3 


Lung fibroblast 
IL-13 


15.1 


22.7 


B lymphocytes 
PWM 


73.7 


77.9 


Lung fibroblast 
IFN gamma 


22.8 


20.7 


B lymphocytes 
CD40L and IL-4 


25.2 


37.4 


Dermal fibroblast 
CCD1070 rest 


16.4 


17.6 


EOL-1 dbcAMP 


11.7 


10.5 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


56.6 


67.4 


EOL-1 dbcAMP 
PMA/ionomycin 


14.7 


24.3 


Dermal fibroblast 
CCD1070 IL-1 
beta 


7.4 


9.6 


none 


89.5 


100.0 


Dermal tiDrooiast 
IFN gamma 


13.1 


12.0 


Dendritic cells LPS 


38.2 


•55.9 


Dermal fibroblast 
IL-4 


24.1 


33.7 


Dendritic cells anti- 
CD40 


inn n 


OA f\ 


IdD Colitis 2 


0.9 


1.4 


Monocytes rest 


49.3 


72.2 


IBD Crohn's 


1.4 


1.1 


Monocytes LPS 


15.4 


14.7 


Colon 


10.0 


10.7 


Macrophages rest 


80.7 


87.1 


Lung 


16.5 


15.0 


Macrophages LPS 


24.0 


30.1 


Thymus 


10.3 


11.0 


HUVEC none 


6.9 


7.9 


Kidney 


34.6 


50.3 


HUVEC starved 


12.3 


15.3 









CNS_neurodegeneration_vLO Summary: Ag3015/Ag2070 This panel does not show 
differential expression of the NOV54a gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 
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Panel 1.3D Summary: Ag3015/Ag2070 Two experiments with the same probe and 
primer set produce results that are in excellent agreement, with highest expression in fetal 
heart (CTs= : 29-30). This expression is higher than the expression seen in adult heart (CTs=32- 
33). Thus, expression of this gene could be used to differentiate between the two sources of 
5 this tissue. This gene is expressed in other metabolic tissues including adipose, adrenal, liver, 
pancreas, skeletal muscle and thyroid. This gene encodes a homolog of protein phosphatase 
2C (PP2C), which has been linked to the regulation of hormone-sensitive lipase, the rate- 
limiting enzyme in adipose tissue lipolysis (Eur J Biochem 1987 Oct 15;168(2):399-405). 
PP2C may also play a role in controlling insulin signaling. Therefore, a therapeutic modulator 
1 0 of this gene and/or gene-product may prove useful in the treatment of diseases affecting the 
endocrine system. 

In addition, protein phosphatase 2C plays a role in dopamine and serotonin signaling. 
Specifically, PP2C counters the action of these neurotransmitters on DARPP-32. These 
neurotransmitter systems are the primary targets of drugs that treat schizophrenia and 
1 5 depression. Therefore, agents that inhibit this gene product may have utility in treating these 
disorders. 

References: 

Desdouits F, Siciliano JC, Nairn AC, Greengard P, Girault JA. Dephosphorylation of 
Ser-137 in DARPP-32 by protein phosphatases 2A and 2C: different roles in vitro and in 

20 striatonigral neurons. Biochem J 1998 Feb 15;330 ( Pt 1):21 1-6 

DARPP-32 (dopamine- and cAMP-regulated phosphoprotein, Mr=32000) is highly 
expressed in striatonigral neurons in which its phosphorylation is regulated by several 
neurotransmitters including dopamine and glutamate. DARPP-32 becomes a potent inhibitor 
of protein phosphatase 1 when it is phosphorylated on Thr-34 by cAMP- or cGMP-dependent 

25 protein kinases. DARPP-32 is also phosphorylated on Ser-137 by protein kinase CK1 (CK1), 
in vitro and in vivo. This phosphorylation has an important regulatory role since it inhibits the 
dephosphorylation of Thr-34 by calcineurin in vitro and in striatonigral neurons. Here, we 
show that DARPP-32 phosphorylated by CK1 is a substrate in vitro for protein phosphatases 
2A and 2C, but not protein phosphatase 1 or calcineurin. However, in substantia nigra slices, 

30 dephosphorylation of Ser-137 was markedly sensitive to decreased temperature, and not 
detectably affected by the presence of okadaic acid under conditions in which 
dephosphorylation of Thr-34 by protein phosphatase 2A was inhibited. These results suggest 
that, in neurons, phospho-Ser-137-DARPP-32 is dephosphorylated by protein phosphatase 2C, 
but not 2A. Thus, DARPP-32 appears to be a component of a regulatory cascade of 
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phosphatases in which dephosphorylation of Ser-136 by protein phosphatase 2C facilitates 
dephosphorylation of Thr-34 by calcineurin, removing the cyclic nucleotide-induced inhibition 
of protein phosphatase 1 . 

Overall, expression of this gene is appears to be associated with normal tissues over 
5 cancer cell lines. Thus, expression of this gene could be used to differentiate between normal 
and malignant tissues and potentially as a treatment for cancer. 

Panel 22 Summary: Ag3070 As seen in the previous panel, the NOV54a gene shows 
greater expression in normal tissues than in samples derived from malignant tissue. Thus, 
expression of this gene may be useful in distinguishing the two types of tissue. 

10 Panel 4D Summary: Ag3015/Ag2070 Two experiments with the same probe and 

primer set produce results that are in excellent agreement. The NOV54a gene, a protein 
phosphatase 2C homolog is expressed by T lymphocytes prepared under a number of 
conditions at moderate levels and is expressed at higher levels in treated and untreated 
dendritic cells, monocytes, and macrophages. Dendritic cells and macrophages are powerful 

15 antigen-presenting cells (APC) whose function is pivotal in the initiation and maintenance of 
normal immune responses. Autoimmunity and inflammation may also be reduced by 
suppression of this function. Therefore, small molecule drugs that antagonize the function of 
this gene product may reduce or eliminate the symptoms in patients with several types of 
autoimmune and inflammatory diseases, such as lupus erythematosus, Crohn's disease, 

20 ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
emphysema, rheumatoid arthritis, or psoriasis. 

NOV55 

Expression of gene NOV55 was assessed using the primer-probe set Ag3024, 
described in Table AXA. Results of the RTQ-PCR runs are shown in Tables AXB, AXC, 
25 AXD, AXE, AXF, AXG and AXH. 

Table AXA. Probe Name Ag3024 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -ggtgggctgctataacttgact-3 ' 


22 


1033 


1207 


Probe 


TET-5 1 - tgaaagaaacaccatcctgttgcaga-3 1 - 
TAMRA 


26 


1069 


1208 


Reverse 


5 ' -tgttcttcaggttgttctttgc-3 ' 


22 


1097 


1209 
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Table AXB. AI_comprehensive panel_vl.0 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
248122026 


Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
248122026 


110967 COPD-F 


10.6 


112427 Match Control 
Psoriasis-F 


37.1 


1 10980 COPD-F 


9.5 


1 12418 Psoriasis-M 


6.1 


110968 COPD-M 


10.8 


112723 Match Control 
Psoriasis-M 


0 0 


110977 COPD-M 


16.7 


112419 Psoriasis-M 


16.6 


1 10989 Emphysema- 
F 


44.8 


112424 Match Control 
Psoriasis-M 


17.8 


1 10992 Emphysema- 
F 


3.8 


112420 Psoriasis-M 


100.0 


110993 Emphysema- 
F 


2.6 


112425 Match Control 
Psoriasis-M 


26.2 


1 10994 Emphysema- 
F 


1.0 


104689 (MF) OA 
Bone-Backus 


0.0 


1 10995 Emphysema- 
F 


9.0 


104690 (MF) Adj 
"Normal" Bone-Backus 


0.4 


1 10996 Emphysema- 
F 


1.3 


104691 (MF) OA 
Synovium-Backus 


3.0 


1 10997 Asthma-M 


1.0 


104692 (BA) OA 
Cartilage-Backus 


1.0 


111001 Asthma-F 


33.4 


104694 (BA) OA 
Bone-Backus 


0.0 


11 1002 Asthma-F 


37.9 


104695 (BA) Adj 
"Normal" Bone-Backus 


0.6 


111003 Atopic 
Asthma-F 


41.5 


104696 (BA) OA 
Synovium-Backus 


0.5 


11 1004 Atopic 
Asthma-F 


21.2 


104700 (SS) OA Bone- 
Backus 


0 2 


111005 Atopic 
Asthma-F 


39.0 


104701 (SS)Adj 
"Normal" Bone-Backus 


0 3 


111006 Atopic 
Asthma-F 


8.5 


104702 (SS)OA 
Synovium-Backus 




111417Allergy-M 


26.6 


11 7093 OA Cartilage 
Rep7 


42.0 


112347 Allergy-M 


8.6 i 


112672 OA Bone5 


18.4 


112349 Normal Lung- 
F 


9.2 


112673 OA Synovium5 


8.0 


112357 Normal Lung- 
F 


0.6 


1 12674 OA Synovial 
Fluid cells5 


2.4 


112354 Normal Lung- 
M 


1.3 


117100 OA Cartilage 
Repl4 


5.5 
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1 12374 Crohns-F 


4.0 


112756 OA Bone9 


3.5 


112389 Match 
Control Crohns-F 


1.3 


112757 OA Synovium9 


2.1 


112375 Crohns-F 


4.5 


1 12758 OA Synovial 
Fluid Cells9 


11.9 


112732 Match 
Control Crohns-F 


0.0 


117125 RA Cartilage 
Rep2 


2.5 


11 2725 Crohns-M 


9.9 


113492 Bone2 RA 


1.7 


-ft *« A A f\ mm ~* M' i 1 

112387 Match 
Control Crohns-M 


15.3 


113493 Synovium2RA 


0.0 


1 12378 Crohns-M 


9.7 


113494 Syn Fluid Cells 
RA 


1.6 


112390 Match 
Control Crohns-M 


23.7 


113499 Cartilage4RA 


0.4 


1 12726 Crohns-M 


2.9 


113500 Bone4 RA 


1.3 


1 1 Til 1 M x. 1 

112731 Match 
Control Crohns-M 


1.4 


113501 Synovium4RA 


0.0 


112380 Ulcer Col-F 


8.7 


113502 Syn Fluid 
Cells4 RA 


0.2 


112734 Match 
Control Ulcer Col-F 


0.7 


113495 Cartilage3RA 


0.2 


112384 Ulcer Col-F 


25.3 


113496 Bone3 RA 


0.6 


-« ft mm s% mm ^ M 1 

112737 Match 
Control Ulcer Col-F 


3.0 


1 13497 Synovium3RA 


0.9 


112386 Ulcer Col-F 


3.3 


113498 Syn Fluid 
Cells3 RA 


0.6 


112738 Match 
Control Ulcer Col-F 


0.6 


117106 Normal 
Cartilage Rep20 


14.1 


112381 Ulcer Col-M 


1.8 


113663 Bone3 Normal 


8.5 


112735 Match 
Control Ulcer Col-M 


32.8 


113664 Synovium3 
Normal . 


4.0 


112382 Ulcer Col-M 


1.5 


113665 Syn Fluid 
Cells3 Normal 


5.7 


1 l^j7*r lVlaLCn 

Control Ulcer Col-M 


1.8 


il / iu/ Normal 
Cartilage Rep22 


13.8 


112383 Ulcer Col-M 


6.3 


1 13667 Bone4 Normal : 


17.2 


112736 Match 
Control Ulcer Col-M 


1.1 


113668 Synovium4 
Normal 


14.7 


1 12423 Psoriasis-F 


12.7 


113669 Syn Fluid 
Cells4 Normal 


7.6 



Table AXC. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 


Rel. Exp.(%) 


Tissue 


Rel. Exp.(%) 


Rel. Exp.(%) 


Ag3024, Run 


Ag3024, Run 


Name 


Ag3024, Run 


Ag3024, Run 
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211011006 


233677302 




211011006 


233677302 


AD 1 Hippo 

< 


11.7 


15.4 


Control 
(Path) 3 
Temporal 
Ctx 


1.9 


1.9 


AD 2 Hippo 


23.3 


32.3 


Control 
(Path) 4 
Temporal 
Ctx 


25.2 


41.2 


AD 3 Hippo 


10.5 


13.6 


AD 1 

Occipital 

Ctx 


7.7 


11.7 


AD 4 Hippo 


6.2 


9.9 


AD2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


77.9 


80.7 


AD 3 

Occipital 

Ctx 


2.9 


4.5 


AD 6 Hippo 


46.0 


45.7 


AD 4 

Occipital 

Ctx 


16.6 


27.2 


Control 2 


28.7 


53.6 


AD 5 

Occipital 

Ctx 


10.9 


20.3 


Control 4 


2.8 


2.4 


AD6 

Occipital 

Ctx 


61.1 


51.4 


Control (Path) 

3 l-Tinnn 


4.5 


3.8 


Control 1 
Occipital 
Ctx 


1.3 


1.1 


AD 1 Temporal 
Ctx 


4.6 


7.0 


Control 2 
Occipital 
Ctx 


52.1 


72.7 


AD 2 Temporal 
Ctx 


16.8 


26.1 


Control 3 
Occipital 
Ctx 


9.9 


13.1 


AD 3 Temporal 
Ctx 


2.9 


3.8 


Control 4 
Occipital 
Ctx 


1.6 


1.4 


AD 4 Temporal 
Ctx 


15.8 


19.1 


Control 
(Path)l 
Occipital 
Ctx 


81.2 


87.1 


AD5Inf 
Temporal Ctx 


47.6 


94.6 


Control 
(Path) 2 
Occipital 
Ctx 


12.2 


16.5 
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AD 5 

SupTemporal 
Ctx 


42.9 

• 


46.0 

* 


Control 
(Path) 3 
Occipital 
Ctx 


0.6 


0.6 


AD 6 Inf 
Temporal Ctx 


22.8 


29.9 


Control 
(Path) 4 
Occipital 
Ctx 


11.0 


14.3 


AD 6 Sup 
Temporal Ctx 


24.0 


30.1 


Control 1 
Parietal Ctx 


3.5 


4.0 


Control 1 
Temporal Ctx 


23 


2.8 


Control 2 
Parietal Ctx 


16.3 


16.8 


Control 2 
Temporal Ctx 


JJ.8 




Control 3 
Parietal Ctx 


17 9 


7fi 7 


Control 3 
Temporal Ctx 


13.1 


14.5 


Control 
(Path)l 
Parietal Ctx 


100.0 


100.0 


Control 4 

i empoiai ^ix 


4.4 


4.2 


Control 
(Path) 2 
Parietal Ctx 


293 


38.7 


Control (Path) 
1 Temporal Ctx 


56.3 


68.8 


Control 
(Path) 3 
Parietal Ctx 


1.1 


2.4 


Control (Path) 
2 Temporal Ctx 


47.6 


54.0 


Control 
(Path) 4 
Parietal Ctx 


42.0 


56.6 



Table AXD. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) Ag3024, 
Run 165517896 


Tissue Name 


Rel. Exp.(%) Ag3024, 
Run 165517896 


Liver adenocarcinoma 


0.4 


Kidney (fetal) 


0.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


2.4 


Adrenal gland 


2.6 


Renal ca. RXF 393 


0.0 


Thyroid 


0.7 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


2.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


11.0 


Liver 


0.7 


Brain (whole) 


54.7 


Liver (fetal) 


0.0 


Brain (amygdala) 


27.2 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


12.9 


Lung 


0.0 


Brain (hippocampus) 


36.3 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


5.7 


Lung ca. (small cell) 


0.0 
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LX-1 




Brain (thalamus) 


64.2 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


3.8 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell)A549 


0.2 


glio/astroU-118-MG 


14.7 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


1.7 


Lung ca. (non-s.cell) 
HOP-62 


0.6 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW900 


2.1 


astrocytoma SNB-75 


6.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-1 9 


0.6 


Mammary gland 


0.4 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


11.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.9 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


2.3 


riean 


U.O 


Breast ca. BT-549 


1.0 


Skeletal muscle (fetal) 


1.9 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.4 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.7 


Ovarian ca. OVCAR- 
5 


0.2 


Lymph node 


0.0 


Ovarian ca. OVCAR- 

o 
o 


0.0 


Colorectal 


0.8 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


6.9 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


1.4 


Uterus 


2.2 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.4 
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Colon ca. HT29 


0.0 


r rosiate ca. ^uone 
met)PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


1.7 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


1.0 


Melanoma jmex j 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


U.U 


Melanoma u/\v^-oz 




OaSLTlC Ca. ^llVcr ITICIJ 

NCI-N87 


0.4 


Melanoma Ml 4 


0.0 


Bladder 


0.7 


Melanoma LOX 
IMVI 


0.0 


Trachea 


2.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.6 


Adipose 


0.0 



Table AXE. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
163577593 


Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
163577593 


Normal Colon 


4.7 


Kidney Margin 
8120608 


1.3 


CC WelltoModDiff 
(OD03866) 


5.0 


Kidney Cancer 
8120613 


100.0 


CC Margin (OD03866) 


3.1 


Kidney Margin 
8120614 


5.3 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


0.8 


CC Margin (OD03868) 


03 


Kidney Margin 
9010321 


6.3 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.7 


CC Margin (ODO3920) 


0.5 


Uterus Cancer 064011 


9.4 


CC Gr.2 ascend colon 
(OD03921) 


3.1 


Normal Thyroid 


5.7 


CC Margin (OD03921) 


2.1 


Thyroid Cancer 
064010 


0.3 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


1.8 


Thyroid Cancer 
A302152 


2.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


23.5 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


7.9 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


7.4 
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Normal Prostate 6546-1 


3.7 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


5.2 


Breast Cancer Mets 
(OD04590-03) 


3.6 


Prostate Margin 
(OD04410) 


5.4 


Breast Cancer 

Metastasis 

(OD04655-05) 


3.8 


Prostate Cancer 
(OD04720-01) 


ns 


Breast Cancer 064006 


7.1 


Prostate Margin 
(OD04720-02) 


6.4 


Breast Cancer 1024 


6.2 


Normal Lung 061010 


0.0 


Breast Cancer 

f\ I AAA £■ 

9100266 


27.7 


Lung Met to Muscle 
COD04286^ 


0.6 


Breast Margin 
9100765 


4.3 


Muscle MarPin 

IViUovlW 1VXC11 gill 

(OD04286) 


1.0 


A209073 


30.8 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


3.0 


Lung Margin (OD03126) 


1.6 


Normal Liver 


1.3 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


0.4 


idling iviai gin ^\yLrv fc t 4 Tl/t y 


1 n 


j-»ivcr ^anwcr iuzj 


ft ft 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


T ., M *m /^« _„ — — «» /AHA/1 ^ O "7 

Lung Cancer (OD04237- 
01) 


8.4 


Liver Tissue 6004-N 


1.1 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ucular Mel Met to Liver 
(ODO4310) 


2.5 


Liver Tissue 6005-N 


0.0 


.Liver Margin (UUU4 j j uj 


A A 
U.U 


Normal Bladder 


2.8 


Melanoma Mets to Lung 
(OD04321) 


0.4 


Bladder Cancer 1023 


0,0 


Lung Margin (OD0432 1) 


0.5 


Bladder Cancer 
A302173 


2.2 


Normal Kidney 


23.7 


Bladder Cancer 
(OD04718-01) 


0.7 


Kidney Ca, Nuclear grade 
2 (OD04338) 


12 


Bladder Normal 
Adjacent (OD0471 8- 

Vj) 


. 0.4 


Kidney Margin 
(OD04338) 


10.6 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


1.4 


Kidney Margin 
(OD04339) 


7.7 


Ovarian Cancer 
(OD04768-07) 


0.0 
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Kidney Ca, Clear cell 
type (OD04340) 


9.3 


Ovary Margin 
(OD04768-08) 


3.3 


Kidney Margin 
(OD04340) 


17.2 


Normal Stomach 


17.8 


Kidney Ca, Nuclear grade 
3 (OD04348) 


2.3 


Gastric Cancer 
9060358 


1.0 


Kidney Margin 
(OD04348) 


5.1 


Stomach Margin 
9060359 


13.6 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


7.5 


Kidney Margin 

(ULJU4022-UJ) 


2.8 


Stomach Margin 
9060394 


4.1 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


13.4 


Kidney Margin 
(OD04450-03) 


17.8 


Stomach Margin 
9060396 


6.7 


Kidney Cancer 8120607 


1.7 


Gastric Cancer 
064005 


\2 



Table AXF. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag3024,Run 
164886426 


Tissue Name 


Rel.Exp.(%) 
Ag3024, Run 
164886426 


Daoy- Medulloblastoma 


1.6 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.3 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.6 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.6 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.4 


Daudi- Burkitfs lymphoma 


0.0 


T98G- Glioblastoma 


14.1 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.2 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.4 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


15.3 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


5.4 


Jurkat- T cell leukemia 


02 


NCI-H292- 


0.0 


TF-1- Erythroleukemia 


0.0 
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Mucoepidermoid lung 
carcinoma 








DMS-114- Small ceil 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


3.3 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


57.4 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


14.4 


Caki-2- Clear cell renal 
carcinoma 


32.8 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


0.2 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms* tumor 


0.0 


NCI-H1155- Large cell 
lung cancer 


2.9 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Urge cell 
lung cancer 


0.3 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727-Lung 
carcinoid 


15.9 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


1.5 


NCI-UMC-ll-Lung 
carcinoid 


1.2 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.3 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 1 2- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 

* • 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.8 


NCI-H716- Colon cancer 


100.0 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 

{iH<*Tirtr»nrr»inomsi 
auciiuuai L/lllUil la 


0.0 


HT-1197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


1.1 


HT-1080- Fibrosarcoma 


0.3 
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NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


4.6 


NCI-SNU-16- Gastric 
carcinoma 


0.4 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.2 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma' 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


1.3 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.2 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table AXG. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
162427416 


Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
162427416 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


31.4 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


14.9 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


8.8 


Secondary Th2 rest 


4.4 


HUVEC IL-11 


11.4 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


10.1 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.7 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


11.3 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


3.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


5.3 ! 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


3.7 
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Primary Trl rest 


0.0 


Oillall aliWdy CfJlLUvltUUl 

TNFalpha + IL-lbeta 


66.0 


lymphocyte act 


0.0 


Coronery artery SMC rest 


21.0 


lymphocyte act 


0.0 


i^oronery artery oiviv^ 
TNFalpha + IL-lbeta 


10.4 


Li/5 lympnocyie aci 




/vbtrocyLCS tcdl 


0 0 


Secondary CD8 
lympnocyie rest 


0.0 


Astrocytes TNFalpha + 

TT -1 ViP»ta 

jj-r- 1 Dcia 


0.0 


Secondary CD8 
lympnocyie aci 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


1<TTT 81? rRflcnnhiH 
JVU-O LZ. \DabOpilll) 

PMA/ionomycin 


0.0 


2ryThl/Th2/Trl_anti- 
CD95 CHI 1 


0.0 


CCD1106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


17.7 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


5.6 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


3.2 


LAK cells IL-2+ IL-1 8 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


10.4 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


6.3 


Two Way MLR 7 day 


p.o 


HPAEC TNF alpha + IL-1 
beta 


3.2 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramnc {XX ne*\W nnnp 
l\alIlUi> ^JJ LCll^ I1U11C 




T lino fiKroHlnct TT -0 

Idling lllH v/Uldol JUL* y 


0 0 

w.v 


Ramos (B cell) 

i r\Vk r\vn vp i n 
lunuiiijciii 


4.7 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 

gaining 


0.0 


B lvmnhocvtes CD40L 
andIL-4 


0.0 


Dermal fibroblast 

CCD1070rest 


0.0 


EOL-1 dbcAMP . 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


4.1 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 


6.1 
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gamma 




Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


8.7 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


7.7 


Monocytes LPS 


0.0 


Colon 


24.1 


Macrophages rest 


0.0 


Lung 


14.5 


Macrophages LPS 


5.3 


Thymus 


100.0 


HUVEC none 


24.7 


Kidney 


o.o 


HUVEC starved 


11.8 







Table AXH. Panel CNS_1 



Tissue Name 


ReLExp,(%)Ag3024, 
Run 171694538 


Tissue Name 


ReL Exp.(%)Ag3024, 
Run 1716Q4^8 

XVII U J. / lU71^JO 


BA4 Control 


38.2 


BA17PSP 


27.7 


dA4 control/ 


15.1 


BA17 roP2 


10.7 


BA4 

Aizneinner sz 


7.7 


Sub Nigra Control 


11.7 


BA4 Parkinson ! s 


60.7 


Sub Nigra Control2 


20.4 


BA4 

Parkinson's2 


85.3 


Sub Nigra 
Alzheimer's2 


6.4 


BA4 

Huntington's 


44.1 


Sub Nigra 
Parkinson f s2 


29.5 


Huntington's2 


8.7 


Sub Nigra 
Huntington's 


28.9 


BA4 PSP 


7.7 


Sub Nigra 
Huntington's2 


20.7 


BA4PSP2 


39.5 


Sub Nigra PSP2 


1.5 


BA4 Depression 


2.1 


Sub Nigra 
Depression 


0.8 


BA4 

Depression2 


5.8 


Sub Nigra 
Depression2 


4.7 


BA7 Control 


66.0 


Glob Palladus 
Control 


3.9 


BA7 Control2 


52.5 


Glob Palladus 
ControI2 


6.9 


BA7 

Alzheimer's2 


8.8 


Glob Palladus 
Alzheimer's 


5.5 


BA7 Parkinson's 


20.4 


Glob Palladus 
Alzheimer^ 


o 

2.8 


BA7 

Parkinson f s2 


64.6 


Glob Palladus 
Parkinson's ; 


48.6 


BA7 

Huntington's 


55.1 


Glob Palladus 
Parkinson's2 


7.1 
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Df\ 1 

Huntington f s2 


54.7 


Glob Palladus PSP 


3.2 


DA 7 OCp 

DJ\/ ror 


J /.o 


C,\nh Pallarinc PW? 
VjlUD rdlldULlS r jrZ 


1 0 
J.u 


BA7 PSP2 


26.4 


Glob Palladus 
L/eprcssion 


2.1 


BA7 Depression 


7.7 


Temp Pole Control 


14.8 


BA9 Control 


31.4 


Temp Pole Control2 


57.0 


BA9 Control2 


100.0 


Temp Pole 
Alzheimer's 


6.1 


BA9 Alzheimer's 


4.8 


Temp Pole 
Alzheimer's2 


3.4 


BA9 

Alt' 1 *•> 

Alzheimer's2 


14.5 


Temp Pole 
Parkinson's 


23.2 


BA9 Parkinson's 


31.6 


Temp Pole 
Parkinson's2 


30.1 


BA9 

Parkinson^ 


56.3 


Temp Pole 
Huntington's 


41.8 


BA9 

riuniiiigion s 


53.2 


Temp Pole PSP 


3.5 


BA9 

riuniingion sz 


14.3 


Temp Pole PSP2 


3.3 


BA9PSP 


17.7 


Temp Pole 
uepress lonz 


1.9 


RAO P^P9 




v^ing vjyr control 


O/.o 


BA9 Depression 


9.0 


Cing Gyr Control2 


38.4 


BA9 

Depression2 


• 7.4 


Cing Gyr 
Alzheimer's 


7.4 


BA1 7 Control 


51.8 


Cing Gyr 
Alzheimer's2 


12.2 


BA17 Controlz 


54.0 


Cing Gyr Parkinson's 


15.4 


BA17 

Alzheimer's2 


2.7 


Cing Gyr 
Parkinson's2 


39.5 


BA17 
Parkinson's 


23.2 


Cing Gyr 
Huntingtons 


48.0 


BA17 

Parkinson's2 


53.6 


Cing Gyr 
Huntingtons2 


14.2 


BA17 | 
xiuniuigion s 


41.2 


Cing Gyr PSP 


13.9 


BA17 

Huntington's2 


9.5 


Cing Gyr PSP2 


5.0 


BA17 
Depression 


6.0 


Cing Gyr Depression 


5.7 


BA17 

Depression2 


14.4 


Cing Gyr 
Depression2 


9.7 
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AI_comprehensive panel_vl.O Summary: Ag3024 The NOV55 gene is found at low 
but significant levels in lung tissue from COPD, emphysema and asthma patients. This 
expression is consistent with panel 4D which shows expression in small airway epithelium. 
Therefore, this gene could be a marker or a target for lung inflammatory diseases. 
5 CNS_neurodegeneration_vl.O Summary: Ag3024 Results of two experiments with 

the same probe and primer set confirm expression of the NOV55 gene in the brain. Please see 
Panel 1 .3D for discussion of utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3024 Expression of the NOV55 gene, a heparin sulfate 
proteoglycan homolog, is highly brain preferential and suggests a role for this gene product in 

10 CNS processes. Heparin sulfate proteoglycans (HSPGs)are a component of amyloid plaques in 
Alzheimer's disease. The interaction of apoE with HSPGs has also been implicated in the 
pathogenesis of Alzheimer's disease and may play a role in neuronal repair. apoE has an 
HSPG-binding site highly complementary to heparan sulfates rich in N- and O-sulfo groups. 
Therefore, enzymes that influence the structure of HSPGs, such as the putative protein product 

1 5 of the NOV55 gene, may influence protein agregation and the functional processes underlying 
Alzheimer's disease. Thus, agents that target and modulate the activity of this gene product 
may be effective in the treatment of neurodegenerative diseases including Alzheimer's disease. 
This gene is also expressed in breast and brain cancer cell lines at low but significant levels. 
Therefore, the expression of this gene could be of use as a marker for breast and brain cancer. 

20 In addition, therapeutic inhibition of the activity of the product of this gene, through the use of 
antibodies or small molecule drugs, may be useful in the therapy of brain and breast cancer. 
References: 

Libeu CP, Lund-Katz S, Phillips MC, Wehrli S, Hernaiz MJ, Capila I, Linhardt RJ, 
Raffai RL, Newhouse YM, Zhou F, Weisgraber KH. New insights into the heparan sulfate 
25 proteoglycan-binding activity of apolipoprotein E. J Biol Chem 2001 Oct 19;276(42):39138- 
44 

Defective binding of apolipoprotein E (apoE) to heparan sulfate proteoglycans 
(HSPGs) is associated with increased risk of atherosclerosis due to inefficient clearance of ' 
lipoprotein remnants by the liver. The interaction of apoE with HSPGs has also been 
30 implicated in the pathogenesis of Alzheimer's disease and may play a role in neuronal repair. 
To identify which residues in the heparin-binding site of apoE and which structural elements 
of heparan sulfate interact, we used a variety of approaches, including glycosaminoglycan 
specificity assays, (13)C nuclear magnetic resonance, and heparin affinity chromatography. 
The formation of the high affinity complex required Arg-142, Lys-143, Arg-145, Lys-146, and 
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Arg-147 from apoE and N- and 6-O-sulfo groups of the glucosamine units from the heparin 
fragment. As shown by molecular modeling, using a high affinity binding octasaccharide 
fragment of heparin, these findings are consistent with a binding mode in which five 
saccharide residues of fully sulfated heparan sulfate lie in a shallow groove of the alpha-helix 
5 that contains the HSPG-binding site (helix 4 of the four-helix bundle of the 22-kDa fragment). 
This groove is lined with residues Arg-136, Ser-139, His-140, Arg-142, Lys-143, Arg-145, 
Lys-146, and Arg-147. In the model, all of these residues make direct contact with either the 
2-O-sulfo groups of the iduronic acid monosaccharides or the N- and 6-O-sulfo groups of the 
glucosamine sulfate monosaccharides. This model indicates that apoE has an HSPG-binding 

10 site highly complementary to heparan sulfate rich in N- and O-sulfo groups such as that found 
in the liver and the brain. 

Inoue S. Basement membrane and beta amyloid fibrillogenesis in Alzheimer's disease. 
IntRevCytol 2001;210:121-61 

High-resolution ultrastructural and immunohistochemical studies revealed that in situ 

15 beta amyloid fibrils of Alzheimer's disease were made up of a core consisting of a solid 

column of amyloid P component (AP) arid associated chondroitin sulfate proteoglycan, and a 
heparan sulfate proteoglycan surface layer with externally associated fine filaments of beta 
protein. The main body of beta amyloid fibrils closely resembled that of microfibrils. 
Abundant microfibrils were reported to be present at the basement membrane of capillaries 

20 with "leaky" blood-urine or blood-air barriers. Similarly, abundant microfibril-like beta 
amyloid fibrils are formed at the microvascular basement membrane in cerebrovascular 
amyloid angiopathy with altered blood-brain barrier. Since AP is an indispensable major 
component of microfibrils and microfibril-like structures, the formation of microfibrils may 
depend on, among other factors, the availability of AP. Thus, in beta amyloid fibrillogenesis 

25 fibrils may be built around AP which continuously leaks out from circulation into vascular 
basement membrane, and beta amyloid fibrils may be regarded as pathologically altered 
basement membrane-associated microfibrils. With no source of AP around them, senile plaque 
fibrils may also be derived from perivascular amyloid. 

Panel 2D Summary: Ag3024 The NOV55 gene is expressed at low but significant 

30 levels in most of the samples on this panel, with highest expression in a kidney cancer sample 
(CT=30.6). Significant levels of expression are also seen in samples derived from breast and 
gastric cancer samples. 

Therefore, expression of this gene could be of use as a marker for breast and gastric 
cancer. In addition, therapeutic inhibition of the activity of the product of this gene, through 
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the use of antibodies or small molecule drugs, may be useful in the therapy of breast and 
gastric cancer. 

Panel 3D Summary: Ag3024 The NOV55 gene is expressed at low but significant 
levels in cell lines from a renal carcinoma, colon cancer, glioblastoma and three lung cancer 
5 lines. Thus, this gene could be a marker as well as a target for inhibition in these cancers. 

Panel 4D Summary: Ag3024 The NOV55 gene, a heparin Sulfate 6-Sulfotransferase 
3 homolog, is expressed at low but significant levels in thymus and small airway epithelium 
treated with TNFalpha + IL-lbeta (CTs=34). Thus, the NOV55 gene product may be a marker 
for thymus or activated small airway epithelium. 
1 0 Panel CNSJ Summary: Ag3024 Expression of the NOV55 gene in this panel 

confirms the presence of this gene product in the brainl. Please see Panel 1 .3D for discussion 
of utility of this gene in the central nervous system. 

NOV56a and NOV56b 

Expression of gene NOV56a and variant NOV56b was assessed using the primer-probe 
1 5 sets Ag3027 and Agl 1 69, described in Tables AYA and AYB. Results of the RTQ-PCR runs 
are shown in Tables AYC, A YD, AYE, AYF, AYG, AYH and AYI. 



Table AYA. Probe Name Ag3027 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 ' -aaaaccagatttggagttcgtt-3 1 


22 


355 


1210 


Probe 


TET-5 ' -cttgaaatgtcctcaccacaactgat-3 1 - 
TAMRA 


26 


377 


1211 


Reverse 


5 ' -tccagatagatggtggaatcag-3 1 


22 


425 


1212 



Table AYB. Probe Name Agll69 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -aaaaccagatttggagttcgtt-3 ■ 


22 


355 


1213 


Probe 


TET-5 ■ -cttgaaatgtcctcaccacaactgat-3 ■ - 
TAMRA 


26 


377 


1214 


Reverse 


5' - tccagatagatggtggaatcag-3 1 


22 


425 


1215 



20 



Table AYC. General screening panel vl. 5 



Tissue Name 


Rel. Exp.(%) Ag3027, 
Run 228714682 


Tissue Name 


Rel. Exp.(%) Ag3027, 
Run 228714682 


Adipose 


2.3 


Renal ca. TK-10 


10.4 


Melanoma* 


1.4 


Bladder 


li.o 
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Hs688(A).T 








Melanoma* 
Hs688(B).T 


1.0 


Gastric ca. (liver met.) 
NCI-N87 


0.6 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


7.4 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


43.8 


Melanoma* bK- 
MEL-5 

-IV 1 is X.J J 


1.9 


Colon ca. SW480 


6.2 


Sauamous cell 
carcinoma SCC-4 


0.0 


met) SW620 


9.9 


Testis Pool 


0.7 


Colon ca. HT29 


2.0 


Prostate ca * fhone 
met) PC-3 


0.7 


Colon ca.HCT-1 16 


0.5 


Prostate Pool 


3 8 


P'nlnn rz\ 

V^LIlUil Cel. V_/aV^U~Z. 


7^ 1 
1 J.J. 


Placenta 


0.0 


Colon cancer tissue 


■ 21.8 


Uterus Pool 


0.4 


Colon ca. SW1116 


0.7 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


2.6 


Ovarian ca. SK-OV- 

3 


29.9 


Colon ca. SW-48 


0.3 


Ovarian ca. 
OVCAK-4 


0.8 


Colon Pool 


2.6 


Ovarian ca. 
OVCAR-5 


11.0 


Small Intestine Pool 


0.8 


Ovarian ca. IGROV- 
1 


6.9 


Stomach Pool 


2.0 


Ovarian ca 

V/ TU1 lull vU* 

OVCAR-8 


1.1 


Bone Marrow Pool 


0.0 


Ovary 


0:8 


Fetal Heart 


1.4 


Breast ca. MCF-7 


0.2 


Heart Pool 


0.5 


Breast ca. MDA- 
MB-231 


0.5 


Lymph Node Pool 


1.4 


Breast ca. BT 549 


0.5 


Fetal Skeletal Muscle 


1.1 


Breast ca. T47D 


0.4 




u.u 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Jl*04pf D/\AI 

oreasi rooi 




Thymus Pool 


0.3 


Trachea 


0.3 


CNS cancer (glio/astro) 

T TOT "fc jry~> 

U87-MG 


4.7 


Lung 


0.0 


l^ino uant/cr ^gno/ astro; 
U-118-MG 


1.6 


Fetal Lung 


2.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lungca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.5 


Lung ca. LX-1 


100.0 


CNS cancer (astro) 


4.1 
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OXTR 7« 
OlND-/ J 




Lungca. NCI-H146 


0.3 


SNB-19 


13.8 


Lung ca. SHP-77 


0.0 


v^/iNo Cancer ^giioj or- 
295 


3.6 


Lung ca. A549 


1.4 


Brain (Amygdala) Pool 


0.2 




i ft 


Drain ^cercDciiumj 


U. I 




? f\ 
z.o 


orain \ieiaij 


ft o 
u.z 


Lungca. NCI-H460 


1.2 


Brain (Hippocampus) 
Pool 


0.3 


Lung ca. HOP-62 


0.2 


Cerebral Cortex Pool 


0.5 


Lung ca. NCI-H522 


0.7 


Brain (Substantia nigra) 
Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.4 


Fetal Liver 


3.2 


Brain (whole) 


0.6 


Liver ca. HepG2 


0.5 . 


Spinal Cord Pool 


0.3 


Kidney Pool 


1.8 


Adrenal Gland 


1.1 


Fetal Kidney 


29.9 


Pituitary gland Pool 


0.3 


Renal ca. 786-0 


1.8 


Salivary Gland 


0.2 


Renal ca. A498 


0.7 


Thyroid (female) 


0.1 


Renal ca. ACHN 


3.2 


Pancreatic ca. CAPAN2 


3.1 


Renal ca. UO-31 


3.2 


Pancreas Pool 


4.4 



Table A YD. Panel 12 



Tissue Name 


Rel. Exp.(%) 
Agll69, Run 
129128191 


Rel. Exp.(%) 
Agll69,Run 
129656838 


Tissue Name 


Rel. Exp.(%) 
Agll69,Run 
129128191 


Rel. Exp.(%) 
Agll69,Run 
129656838 


Endothelial cells 


0.0 


24.3 


Renal ca. 786- 
0 


0.2 


10.5 


Heart (Fetal) 


0.0 


57.0 


Renal ca. A498 


5.5 


10.9 


Pancreas 


0.0 


0.0 


Renal ca. RXF 
393 


0.9 


14.9 


Pancreatic ca. 
CAP AN 2 


02 


0.0 


Renal ca. 
ACHN 


1.5 


22.8 


Adrenal Gland 


0.0 


7.0 


Renal ca. UO- 
31 


5.6 


12.3 


Thyroid 


0.1 


0.1 


Renal ca. TK- 
10 


11.3 


1.7 


Salivary gland 


0.1 


6.2 


Liver 


0.0 


12.9 


Pituitary gland 


0.0 


0.3 


Liver (fetal) 


0.0 


1.4 


Brain (fetal) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.2 


4.5 
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Brain (whole) 


0.0 


4.5 


Lung 


0.0 


14.9 


Brain 

(amygdala) 




ft 1 


T iino ( "fptnl^ 


0 0 


22 2 


Brain 

(cerebellum) 


0.0 


0.3 


Lung ca. 
(small cell) 
LX-1 


27.0 


74.2 


Brain 

(hippocampus) 


0.0 


14.4 , 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Brain (thalamus) 


0.0 


1.2 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.0 


Cerebral Cortex 


0.0 


15.5 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


Spinal cord 


0.0 


22.4 


Lung ca. (non- 
sm.cell) A549 


0.9 


0.0 


glio/astro U87- 
MG 


0.2 


0.0 


Lung ca. (non- 

s.celI)NCI- 

H23 


0.3 


0.0 


glio/astro U- 
118-MG 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.1 


0.6 


astrocytoma 
SW1783 


0 0 


0 0 


Lung ca. (non- 
s.d)NCI-H522 


0 1 


0 1 


11 Will \J , Hid OXv 

N-AS 


0j0 


0.0 


Lung ca. 
(squam.) SW 
900 


6.3 


0.0 


astrocytoma SF- 
539 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


astrocytoma 
SNB-75 


0.0 


0.0 


Mammary 
gland 


100.0 


100.0 


glioma SNB-19 


0.4 


0.6 


Breast ca * 
(pI.ef)MCF-7 


0.0 


4.0 


glioma U251 


0.1 


1.7 


Breast ca.* 
(pl.ef) MDA- 
MB -231 


0.0 


0.0 


glioma SF-295 


0.5 


0.0 


Breast ca. (pi. 
ef) T47D 


0.0 


1.4 


Heart 


0.0 


58.2 


Breast ca. BT- 
549 


0.0 


0.1 


Skeletal Muscle 


0.0 


9.3 


Breast ca. 
MDA-N 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


Ovary 


0.0 


14.0 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


5.2 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


15.3 
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Lymph node ] 0.0 


3.0 


Ovarian ca. 
OVCAR-5 


50.7 


15.7 


Colorectal 
Tissue 


0.1 


0.0 


Ovarian ca. 
OVCAR-8 


1.5 


0.1 


OlAJllldVli 


8.4 


23.0 


Ovarian ca. 
IGROV-1 


43.5 


17.0 


Small intestine 




0.2 


10.0 


Ovarian ca. 
(ascites) SK- 
OV-3 


13.5 


17.0 


Colon ca. 
SW480 


0.6 


0.0 


Uterus 


0.2 


7.7 


Colon ca.* 
SW620 (SW480 
met) 


7.5 


0.4 


Placenta 


0.0 


75.8 


Colon ca. HT29 


0.0 


0.0 


Prostate 


12.1 


13.7 


116 


0.0 


0.0 


Prostate ca.* 
(bone met) PC- 
3 


0.0 


10.4 


Colon ca. CaCo- 
2 


34.4 


11.8 


Testis 


0.6 


0.2 


Colon ca. Tissue 
(OD03866) 


11.4 


15 9 


Melanoma 
Hs688(A).T 


u.u 


0.0 


2998 


2.0 


0.0 


Melanoma* 
(met) 

Hs688(B).T 


0.2 


0.0 


Gastric ca.* 
(liver met) NCI- 
N87 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Bladder 


5.3 


11.3 


Melanoma 
M14 


0.0 


0.0 


Trachea 


0.0 


. 2.6 


Melanoma 
LOXIMVI . 


0.0 


0.0 


Kidney 


4.5 


14.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney (fetal) 


14.3 


31.2 






Table AYE. Panel 13D 


Tissue Name 


Rel. Exp.(%; 
Agll69, Run 
165518394 


> Rel. Exp.(%) 
i Ag3027, Run 
165519993 


Tissue Name 


Rel. Exp.(%) 
Agll69,Run 
165518394 


Rel. Exp.(%) 
Ag3027, Run 
165519993 


Liver 

adenocarcinoma 


0.5 


0.4 


Kidney (fetal) 


0.7 


0.0 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.5 


0.6 
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Pancreatic ca. 
CAPAN 2 


0.7 


0.4 


Renal ca. 

A A AO 

A498 


0.9 


1.6 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 

393 


0.0 


0.9 


Thyroid 


0.0 


0.0 


Renal ca. 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 

1 0 


2.1 


0.4 


Brain (fetal) 


0.0 


0.0 


Liver 


0.0 


0.0 


Brain (whole) 


0.0 


0.0 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 

. — . _ 


0.0 

.. 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


0 0 


0 4 


T itncr ( fi*tan 


0 0 


0 0 


Brain (substantia 


0.3 


0.0 


Lung ca. 
(small cell) 
LX-1 


21.5 


12.0 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Cerebral Cortex 


0.0 


0.5 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.3 


Lung ca. (large 
cell)NCI-H460 


0.0 


1.3 


glio/astro U87-MG 


0.0 


0.3 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.0 


glio/astro U-118- 
MG 


0.0 


0.3 


Lung ca. (non- 

s.cell)NCI- 

H23 


0.7 


0.3 


astrocytoma 
SW1783 ' 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0 0 


neuro*; met SK-N- 
AS 


0.0 | 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


0.3 


0.0 


Lung ca. 
(squam.) SW 
900 


2.8 


0.0 


astrocytoma SNB- 
75 


1.9 


0.9 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.4 


Mammary 
gland 


100.0 


100.0 
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glioma U251 


0.4 


0.4 


Breast ca.* 
(pl.ef) MCF-7 


0.5 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 

1V1D-Z J l 


0.0 


0.3 


Heart (fetal) 


0.0 


0.0 


jDieast ca. 
(pl.ef) T47D 


0.0 


0.0 


Heart 


0.0 


0.0 


rJicaSI Ca. Dl" 

549 


0.3 


0.0 


QliTP* 1 <^to 1 m 1 1 cr 1 r> 
OJVClCUll IIlUdL«lC 

(fetal) 


0.0 


0.0 


Breast ca. 
MDA-N 


0.0 


0.0 




u.u 


u.u 


Ovary 


A A 
0.0 


A A 

0.0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 

\J V \^/VK- J 


0.0 


0.0 


Thymus 


0.9 


0.0 


Ovarian ca. 

UVtAK-4 


0.4 


0.0 


Spleen 


0.7 


0.0 


Ovarian ca. 
OVCAR-5 


0.9 


1.4 


Lymph node 


0.0 


0.0 


Ovarian ca. 

fWr/~* ado 
UVLAK-5 


0.0 


0.0 


Colorectal 


0.9 


0.2 


Ovarian ca. 
IGROV-l 


1.0 


2.3 


OlvJlllaL-Jl 


i n 


U.o 


Ovarian ca.* 
(ascites) bK- 
OV-3 


1 A 

1.4 


1.8 


Small intestine 


0.7 


0.0 


Uterus 


0.5 


0.0 


Colon ca. SW480 


0.8 


0.4 


Placenta 


0.0 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.6 


0.4 


Prostate 


0.3 


0.5 


Colon ca. HT29 


0.0 


0.8 


Prostate ca.* 
(bone met)PC- 

3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.9 


0.3 


Testis ; 


0.8 


1.2 


Colon ca. CaCo-2 


6.7 


8.1 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


2.9 


2.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.4 


0.3 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


0.3 


0.0 


Melanoma 
M14 


0.0 


0.0 


Bladder 


1.7 


0.4 


Melanoma 
LOXIMVI 


0.0 


0.0 
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Trachea 


0.0 


jMelanoma* 
0.0 (met)SK- 
jMEL-5 


0.0 


0.0 


Kidney 


2.7 


1.3 jAdipose 


0.7 


0.0 



Table AYF. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3027,Run 
163577594 


Tissue Name 


Rel. Exp.(%) 
Ag3027, Run 
163577594 


Normal Colon 


14.6 


Kidney Margin 
8120608 


5.9 


CC Well to Mod Diff 
(OD03866) 


15.1 


Kidney Cancer 
8120613 


34.4 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


5.7 


CC Gr.2 rectosigmoid 
(OD03868) 


15.5 


Kidney Cancer 
9010320 


10.5 


CC Margin (OD03868) 


0.7 


Kidney Margin 
9010321 


25.0 


CC Mod Diff(ODO3920) 


64.2 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


2.2 


Uterus Cancer 0640 11 


18.4 


CC Gr.2 ascend colon 
(OD03921) 


1 1 0 
1 l.Z 


iNoimai inyroiu 


1.0 


I vial in \\Jxs\jD 1 1 


. l.o 


Thyroid Cancer 
064010 


1 .0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


16.0 


Thyroid Cancer 

AjUZI JZ 


0.0 


T iver Margin fODO^OQ^ 


ft ^ 1 


Thyroid Margin 
A302153 


n n 


Colon mets to lung 
(OD04451-01) 


6.8 


Normal Breast 


28.7 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.5 


Normal Prostate 6546-1 


14.9 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


29.7 


Breast Cancer Mets 
(OD04590-03) 


3.7 


Prostate Margin 
(OD04410) 


19.1 


Breast Cancer 

Metastasis 

(OD04655-05) 


1.7 


Prostate Cancer 
(OD04720-01) 


4.5 


Breast Cancer 064006 


0.4 


Prostate Margin 
(OD04720-02) 


3.0 


Breast Cancer 1024 


55.1 
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Normal Lung 061010 


0.5 


Breast Cancer 
9 i 00266 


1.9 


Lung Met to Muscle 
(OD042R6} 


0.0 


Breast Margin 

7 1 \3\jJL\jO 


8.2 


(OD04286) 


0.9 


r>reaSL cancer 
A209073 


18.3 


Lung Malignant Cancer 
(OD03126) 


11.0 


Breast Margin 
A2090734 


34.6 


Lung Margin (OD03126) 


1.1 


Normal Liver 

A 1 W1111UI A w VI 


0 0 


Lung Cancer (OD04404) 


1.8 


Liver Cancer 064003 


15.0 1 


T liner Marain {CYnC\dd()A\ 

JLrUUg XVICU. gill \VAL/U*T*Tl/ t T ) 


ft d 


T {trot* PnM^ar 1 flO ^ 

.Liver cancer i \jz j 


A A 
0.0 


Lung Cancer (OD04565) 


0.5 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


2.1 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


3.3 


Liver Tissue 6004-N 


1.1 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


2.6 


Ocular Mel Met to Liver 
(ODO4310) 


1.6 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


13.5 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


32.5 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


2.6 


Normal Kidney 


14.6 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


3.3 


Bladder Normal 
Adjacent (OD047 18- 
03) 


1.0 


Kidney Margin 
(OD04338) 


5.1 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/z (0004339) 


7.9 


Ovarian Cancer 
064008 


1.8 


Kidney Margin 
(UJJU4339) 


27.0 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (UDU4340) 


3.4 


Ovary Margin 
(OD04768-08) 


0.5 


Kidney Margin 


61.6 


Normal Stomach 


45.7 


Kidney Ca, Nuclear grade 
3(OD04348) 


0.4 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


84.1 


Stomach Margin 
9060359 


8.2 


Kidney Cancer 
(OD04622-01) 


3.5 


Gastric Cancer 
9060395 


6.4 
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Kidney Margin 


1.4 


Stomach Margin 


12.9 


Kidney Cancer 
(OD04450-01) 


39.2 


Gastric Cancer 
9060397 


42.9 


Kidney Margin 
(OD04450-03) 


51.8 


Stomach Margin 
9060396 


31.4 


Kidney Cancer 8120607 


4.0 


Gastric Cancer 
064005 


100.0 



Table AYG. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Agll69,Run 
164038616 


Tissue Name 


Rel. Exp.(%) 
Agll69,Run 
164038616 


Daoy- Medulloblastoma 


1.9 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-l-Pnmitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498-CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 

^niCgUKaT y UDlaSL J 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


1.8 


TF- 1 - Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


100.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


2.6 


769-P- Clear cell renal 
carcinoma 


1.0 
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NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


2.3 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


3.7 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


3.5 


NCI-H727-Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


2.3 


NCI-UMC-ll-Lung 
carcinoid 


1.7 


BxPC-3- Pancreatic 
adenocarcinoma 


L2 


LX-1- Small cell lung 
cancer 


20.6 


HPAC- Pancreatic 
adenocarcinoma 


3.3 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM12- Colon cancer 


123 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


2.6 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


6.0 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


2.6 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


25.5 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


10.7 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1 080- Fibrosarcoma 


1.0 


NCI-SNU-5- Gastric 
carcinoma 


1.0 


MG-63- Osteosarcoma 


0.0 


KATOIII- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-16- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


2.0 


NCI-SNU-1- Gastric 
carcinoma 


29.7 


A431- Epidermoid carcinoma 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 
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RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


3.5 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 

oniY»innmci 

od.ii,inuiiid 


0.0 


SCC-4- Squamous cell 
carcinoma oi tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table AYH. Panel 4D 



Tissue 
Name 


Rel. 
Exp.(%) 
Agll69, 

Run 
139591349 


Rel. 
Exp.(%) 
Agll69, 

Run 
145735616 


Rel. 
Exp.(%) 
Ag3027, 

Run 
162426723 


Tissue 
Name 


Rel. 
Exp.(%) 
Agll69, 

Run 
139591349 


Rel. 

Exp.(%) 
Agll69, 

Run 
145735616 


Rel. 

Exp.(%) 
Ag3027, 
Run 
162426723 


Secondary 
Thl act 


0.0 


0.0 


0.0 


HUVEC 
IL-lbeta 


0.0 


0.0 


0.0 


Secondary 
Th2act 


0.0 


0.0 


0.0 


HUVEC 

IFN 

gamma 


0.0 


0.0 


0.7 


Secondary 
Trl act 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
+ IFN 
gamma 


0.0 


0.0 


0.0 


Secondary 
Thl rest 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
+ IL4 


0.4 


0.0 


0.0 


Secondary 
Th2rest 


0.0 


0.0 


0.0 


HUVEC 
IL-11 


0.0 


0.0 


0.0 


Secondary 
Trl rest 


0.0 


0.0 


0.0 


Lung 

Microvasc 

ularEC 

none 


0.4 


0.0 


0.0 


Primary 
Thl act 


0.0 


0.0 


0.0 


Lung 

Microvasc 
ularEC 
TNFalpha 
+ IL-lbeta 


0.9 


0.0 


0.0 


Primary 
Th2act 


0.0 


0.0 


0.0 


Microvasc 
ular 

Dermal EC 
none 


0.5 


0.0 


3.1 
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Primary 
Trl act 


0.0 


0.0 


0.0 


Microsvas 
ular 

Dermal EC 
TNFalpha 
+ IL-lbeta 


0.0 


0.0 


0.0 


Primary 
Thl rest 


0.0 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha 
+ ILlbeta 


1.0 


0.0 


5.1 


Primary 
Th2 rest 


0.0 


0.0 


0.0 


Small 
airway 
epithelium 
none 


0.0 


0.0 


2.0 


Primary 
Trl rest 


0.0 


0.0 


0.0 


Small 
airway 
epithelium 
TNFalpha 
+ IL-lbeta 


0.0 


1.3 


1.9 


CD45RA 
CD4 

lymphocyt 
e act 


0.0 


0.0 


0.0 


Coronery 
artery 
SMC rest 


0.0 


0.0 


0.0 


CD45RO 
CD4 

lymphocyt 
e act 


0.0 


0.0 


0.0 


Coronery 
artery 
SMC 
TNFalpha 
+ IL-lbeta 


0.0 


0.0 


o.o 


CD8 

lymphocyt 
e act 


0.0 


0.0 


0.0 


Astrocytes 
rest 


0.0 


0.0 


0.0 


Secondary 
CD8 

lymphocyt 
erest 


0.0 


0.0 


0.0 


Astrocytes 
TNFalpha 
+ IL-lbeta 


1.4 


3.1 


0.6 


Secondary 
CD8 

lymphocyt 
e act 


0.0 


0.0 


0.0 


KU-812 

(Basophil) 

rest 


0.0 


0.0 


0.0 


CD4 

lymphocyt 
enone 


0.0 


0.0 


0.0 


KU-812 
(Basophil) 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


2ry 

Thl/Th2/T 
rl anti- 
CD95 
CHll 


0.0 


0.0 


0.0 


CCD1106 
(Keratinoc 
ytes) none 


0.0 


0.0 


0.0 


LAK cells 
rest 


0.0 


0.0 


0.0 


CCD1106 
(Keratinoc 


0.0 


0.0 


0.0 
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ytes) 

TNFalpha 
+ IL-lbeta 








LAK ceils 
IL-2 


0.0 


0 0 


0 0 


Liver 
cirrhosis 


2 1 


1 3 


3 1 


LAK cells 

IL-2+1L- 

12 


0.0 


0.0 


0.0 


Lupus 


5.3 


3.7 


4.8 


LAK cells 
IL-2+IFN 
gamma 


0.0 


0.0 


0.0 


NCI-H292 


0.9 


0.0 


2.2 


LAK cells 
IL-2+ IL- 
18 


0.0 


0.0 


0.0 


NCI-H292 
IL-4 


0.0 


1.2 


0.7 


LAK cells 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


NCT-H292 
IL-9 


0.3 


0.0 


0.9 


NK Ceils 
IL-2 rest 


0 0 


ft ft 


ft ft 


NCI-H292 
IL-13 


ft ft 


ft ft 

u.v 


n 7 


Two Way 
MLR 3 
day 


0.0 


0.0 


0.0 


NCI-H292 

IFN 

gamma 


0.0 


3.7 


0.0 


Two Way 
MLR 5 " 
day 


0.0 


0.0 


0.0 


HPAEC 

nunc 


36.1 


0.0 


1.1 


Two Way 
MLR 7 
day 


0.0 


0.0 


1.1 


HPAEC 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


0.0 


Lung 

fibroblast 

none 


0.0 


0.0 


0.0 


PBMC 
PWM 


0.0 


1.4 


0.0 


Lung 
fibroblast 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.0 


PBMC 
PHA-L 


0.5 


0.0 


1.0 


Lung 

fibroblast 

IL-4 


0.0 


0.0 


0.0 


Ramos (B 
cell) none 


0.0 


1.8 


0.0 


Lung 

fibroblast 

IL-9 


0.0 


0.0 


2.0 


Ramos (B 
cell) 

ionomycin 


0.5 


0.0 


0.0 


Lung 

fibroblast 

IL-13 


0.0 


0.0 


1.0 


B 

lymphocyt 
esPWM 


0.0 


0.0 


1.3 


Lung 
fibroblast 
IFN 
gamma 


0.5 


0.0 


0.0 
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B 

lymphocyt 
es CD40L 
and IL-4 


0.0 


0.0 


0.0 


Dermal 
fibroblast 
CCD1070 
rest 


0.0 


0.0 


7 

0.0 


EOL-1 
dbcAMP 


0.0 


0.0 


0.0 


Dermal 
fibroblast 
CCD1070 
TNF alpha 


0.0 


0.0 


0.0 


EOL-1 
dbcAMP 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


Dermal 
fibroblast 
CCD1070 
IL-1 beta 


0.3 


0.0 


0.0 


Dendritic 
cells none 


2.1 


3.5 

■ — - — 


1.1 


Dermal 
fibroblast 
EFN 
gamma 


0.0 


0.0 


0.0 


Dendritic 
cells LPS 


0.0 


0.0 


0.0 


Dermal 

fibroblast 

IL-4 


0.0 


0.0 


0.0 


Dendritic 
cells anti- 

UU4U 


0.0 


0.0 


0.0 


IBD 
Colitis 2 


1.3 


3.6 


2.4 


Monocytes 
rest 


0.0 


0.0 


0.0 


IBD 
Crohn's 


04 


n n 


1 A 
l.U 


Monocytes 

T DC 


0.0 


0.0 


0.0 


Colon 


O. 1 


JZ..J 




Macropha 
ges rest 


0.0 


0.0 


4.8 


Lung 


0.5 


0.0 


0.0 


Macropha 
ges LPS 


0.0 


0.0 


1.8 


Thymus 


100.0 


100.0 


100.0 


HUVEC 
none 


0.0 


0.0 


0.0 


Kidney 


0.9 


0.0 


0.0 


HUVEC 
starved 


0.0 


0.0 


1.3 











Table AYI. Panel 5 Islet 



Tissue Name 


Rel. Exp.(%) 
Ag3027, Run 
225051163 


Tissue Name 


1 Rel.Exp.(%) 
Ag3027,Run 
225051163 


97457_Patient- 
02go_adipose 


0.0 


94709_Donor 2 AM - A_adipose 


9.5 


97476_Patient- 
07sk_skeletal muscle 


3.2 


94710_Donor 2 AM - B_adipose 


0.0 


97477_Patient- 
07ut uterus 


15.0 


9471 l_Donor 2 AM - C_adipose 


5.3 
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97478_Patient- 
07pl_placenta 


5.4 


94712_Donor 2 AD - A_adipose 


11.8 


991 67_Bayer Patient 1 


0.0 


94713_Donor 2 AD - B_adipose 


8.7 


97482_Patient- 
08ut_uterus 


0.0 


94714_Donor 2 AD - C_adipose 


9.3 


97483_Patient- 
08pl_placenta 


4.2 


94742_Donor 3 U - 
A_Mesenchymal Stem Cells 


0.0 


97486_Patient- 
09sk_skeletal muscle 


0.0 


94743_Donor3U- 
BJMesenchymal Stem Cells 


0.0 


97487_Patient- 
09ut_uterus 


0.0 


94730_Donor 3 AM - A_adipose 


8.0 


97488_Patient- 
09pl_placenta 


0.0 


94731 JDonor 3 AM - B_adipose 


0.0 


97492_Patient- 
10ut_uterus 


0.0 


94732JDonor 3 AM - C_adipose 


8.1 


97493_Patient- 
10pl_placenta 


0.0 


94733JDonor 3 AD - A_adipose 


4.9 


97495_Patient- 
llgo_adipose 


8.1 


94734 Donor 3 AD - B_adipose 


13.3 


97496_Patient- 

1 lsk_skeletal muscle 


0.0 


94735_Donor 3 AD - C_adipose 


5.1 


97497_Patient- 
llut uterus 


0.0 


77 1 38 JLi ver_HepG2untreated 


0.0 


97498_Patient- 
llpl_placenta 


0.0 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


97500_Patient- 
12go_adipose 


3.5 


81735_Small Intestine 


22.7 


97501_Patient- 

1 2sk_skeletal muscle 


0.0 


72409 JCidney_ProximaI 
Convoluted Tubule 


100.0 


97502_Patient- 
12ut uterus 


0.0 


82685_Small intestine Duodenum 


6.1 


97503_Patient- 
12pl_placenta 


0 0 


90650_Adrenal_Adrenocortical 
adenoma 


0 0 

v/.V 


94721_Donor2U- 
A_Mesenchymal Stem 
Cells 


0.0 


72410_Kidney_HRCE 


74.7 


94722_Donor2U- 
B Mesenchymal Stem 
Cells 


0.0 


72411JCidney_HRE 


18.2 


94723_Donor 2 U - 
C Mesenchymal Stem 
Cells 


0.0 


73139JtJterusJJterine smooth 
muscle cells 


0.0 



Generaljscreening_panel_vl.5 Summary: Ag3027 Expression of the NOV56a gene 
is highest in a sample derived from a lung cancer (CT=30.5). Significant expression is also 
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seen in samples derived from colon cancer and ovarian cancer. Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of lung, colon, and breast cancers. Furthermore, therapeutic 
modulation of the expression or function of this gene may be effective in the treatment of lung, 
5 colon and breast cancers. 

While expression of this gene is seen predominantly in cancer cell lines, significant 
expression is also seen in fetal kidney (CT=3L8). Furthermore, expression is higher in fetal 
kidney than in adult kidney (CT=35.8). Thus, expression of this gene could be used to 
differentiate between adult and fetal kidney. In addition, the expression in fetal kidney 
1 0 suggests that this gene product may be involved in the development of the kidney. Therefore, 
therapeutic modulation of the expression or function of this gene may be useful in treating 
disease of the kidney. 

Panel 1.2 Summary: Agl 169 Results from one experiment, Run 129128191, with the 
NOV56a gene are in agreement with Results in Panel 1.3D and 
15 General j>creeningj>anel_vL5. A second run, Run 129656838, produces disparate results. 

/ Panel 1.3D Summary: Agl 169/Ag3027 Two experiments with the same probe and 
primer both show highest expression of the gene NOV56a in the mammary gland (CTs=3 1). 
Low, but significant levels of expression are also seen in a lung cancer cell line. Thus, 
expression of this gene may be used to differentiate between these samples and other samples 
20 on this panel. 

Panel 2D Summary: Ag3027 Highest expression of the NOV56a gene is seen in a 
gastric cancer. Significant expression is also seen in breast cancer, colon cancer and normal 
kidney. Thus, expression of this gene could be used to differentiate between these samples and 
other samples on this panel and as a marker for the presence of breast, colon and kidney 

25 cancers. A second experiment with the probe/primer set Agl 169 is not included. The amp plot 
indicates that there were experimental difficulties with this run. 

Panel 3D Summary: Agl 169 Expression of the NOV56a gene is restricted to samples 
derived from lung and gastric cancer cell lines (CTs=33-35). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 

30 marker to detect the presence of lung and gastric cancers. Furthermore, therapeutic modulation 
of the expression or function of this gene may be effective in the treatment of lung and gastric 
cancers. 

Panel 4D Summary: Agll69/Ag3027 Two experiments with the same probe and 
primer set show expression of the NOV56a gene limited to the thymus (CTs=32-33). Thus, 
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expression of this gene could be used as a marker for thymic tissue. Furthermore, this 
restricted expression suggests that this gene product may play an important role in T cell 
development. Therefore, small molecule therapeutics, or antibody therapeutics designed 
against the protein encoded for by this gene could be utilized to modulate immune function (T 
5 cell development) and be important for organ transplant, AIDS treatment or post 
chemotherapy immune reconstitution. 

Panel 5 Islet Summary: Ag3027 Expression of the NOV56a gene is restricted to a 
sample derived from the kidney (CT=34.9). This expression is consistent with expression in 
Panel 1 .3D. Thus, expression of this gene could be used as a marker for kidney tissue. 

10 NOV57 

Expression of gene NOV57 was assessed using the primer-probe sets Ag303 1, 
Agl301b and Agl415, described in Tables AZA, AZB and AZC. Results of the RTQ-PCR 
runs are shown in Tables AZD, AZE, AZF and AZG. 

Table AZA. Probe Name Ag3031 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -aaaaggtgatgtctggagcat-3 ' 


21 


616 


1216 


Probe 


TET-5 1 -tgtatgtcatgctctgtgccagccta-3 1 - 
TAMRA 


26 


648 


1217 


Reverse 


5 ' -gatgtctgtgtcgtcaaaagg-3 * 


21 


674 


1218 



15 

Table AZB. Probe Name Agl301b 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -aaaaggtgatgtctggagcat-3 1 


21 


616 


1219 


Probe . 


TET-5 ' -tgtatgtcatgctctgtgccagccta-3 ' - 
TAMRA 


26 


648 


1220 


Reverse 


5 ' -gatgtctgtgtcgtcaaaagg-3 ' 


21 


674 


1221 



Table AZC. Probe Name Agl415 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -aaaaggtgatgtctggagcat-3 1 


21 


616 


1222 


Probe 


TET-5 1 -tgtatgtcatgctctgtgccagccta-3 1 - 
TAMRA 


26 


648 


1223 


Reverse 


5 ■ -gatgtctgtgtcgtcaaaagg-3 1 


21 


674 


1224 



1206 



WO 02/068649 



PCT/US02/02785 



Table AZD. CNSjaeurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3031, Run 
211011868 

X X \J X X OvO 


Rel. Exp.(%) 
Ag3031,Run 


Tissue 
Name 


Rel. Exp.(%) 
Ag3031,Run 


Rel. Exp.(%) 
Ag3031,Run 
ZZ540/445 


AD 1 Hippo 


23.8 


46.7 


Control 
(Path) 3 
i empoiai 
Ctx 


6.0 


10.2 


AD 2 Hippo 


21.2 


21.9 


Control 
(Path) 4 
i cmponii 
Ctx 


26.4 


28.1 


AD 3 Hippo 


14.2 


1 A »U 


AD 1 

VsCCipilal 

Ctx 




A 1 O 

41.5 


AD 4 Hippo 


7.4 


10.5 


AD 2 

uccipnai 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


94.0 


94.0 


An 7 

Occipital 
Ctx 


17.9 


17.4 


AD 6 Hippo 


53.6 


61.6 


AT} A 

Occipital 
Ctx 


12.9 


17.3 


Control 2 
Hippo 


12.7 


17.0 


Occipital 
Ctx 


35.1 


20.9 


Control 4 
Hippo 


14.6 


15.2 


ah a 

AU o 

Occipital 
Ctx 


20.9 


. . 27.5 


Control (Path) 
3 Hippo 


22.2 


72 


i^oniroj i 
Occipital 
Ctx 


4.9 


4.9 


AD 1 Temporal 
Ctx 


42.3 


42.9 


Pnntml 9 

VsUIIllUl z« 

Occipital 
Ctx 


46.7 


32.1 


AD 2 Temporal 


78.5 


29.9 


Control 3 
Occipital 
Ctx 


17.8 


192 


AD 3 Temporal 
Ctx 


17.9 


20.4 


Control 4 
Occipital 
Ctx 


13.2 


12.1 


AD 4 Temporal 
Ctx 


30.1 


31 2 


Control 
(Path) 1 
Occipital 
Ctx 


49.3 


59.9 
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AD 5 Inf 
Temporal Ctx 


100.0 


94.6 


Control 
(Path) 2 
Occipital 
Ctx 


13.6 


8.8 


AD 5 

SupTemporal 


59.9 


52.5 


Control 
(Path) 3 
Occipital 
Ctx 


6.1 


4.7 


AD 6 Inf 
Temporal Ctx 


75.3 


95.9 


Control 
(Path) 4 
Occipital 
Ctx 


31.2 


21.2 


AD 6 Sup 
Temporal Ctx 


94.6 


100.0 


Control 1 
Parietal Ctx 


10.3 


11.1 


Control 1 
Temporal Ctx 


8.5 


7.3 . 


Control 2 
Parietal Ctx 


56.6 


60.7 


Control 2 
Temporal Ctx 






Control 3 
Parietal Ctx 


15.4 


15.0 


Control 3 
lemporai utx 


13.0 


15.3 


Control 
(Path)l 
Parietal Ctx 


34.2 


41.8 


Control 4 
Temporal Ctx 


12.8 


16.6 


Control 
(Path) 2 
Parietal Ctx 


15.4 


20.3 


Control (Path) 
1 Temporal Ctx 


26.8 


31.0 


Control 
(Path) 3 
Parietal Ctx 


8.4 


7.0 


Control (Path) 
2 Temporal Ctx 


25.5 


17.8 


Control 
(Path) 4 
Parietal Ctx 


44.1 


31.6 



Table AZE. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Agl301b, 

Run 
165528224 


Rel. Exp.(%) 
Ag3031,Run 
167961982 


Tissue Name 


Rel. Exp.(%) 
Agl301b, 

Run 
165528224 


Rel. Exp.(%) 
Ag3031,Run 
167961982 


Liver 

adenocarcinoma 


18.4 


323 


Kidney (fetal) 


10.7 


66.0 


Pancreas 


4.7 


11.9 


Renal ca. 786- 
0 


15.3 


25.5 


Pancreatic ca. 
CAPAN2 


4.9 


4.9 


Renal ca. 
A498 


11.9 


12.5 


Adrenal gland 


23.0 


11.0 


Renal ca. RXF 
393 


14.9 


18.0 


Thyroid 


7.1 


8.5 


Renal ca. 
ACHN 


8.0 


10.5 
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Salivary gland 


10.1 


13.7 


Renal ca. UO- 
31 


6.0 


1.1 


Pituitary gland 


18.2 


9.2 


Renal ca.TK- 
i ft 


7.4 


18.6 


Brain (fetal) 


24.0 


42.3 


Liver 


11.9 


17.8 


Brain (whole) 


43.8 


19.6 


Liver (fetal) 


20.6 


14.7 


Brain (amygdala) 


20.4 


9.6 


Liver ca. 

(hepatoblast) 

HepG2 


22.2 


28.7 


Brain (cerebellum) 


50.7 


50.3 


Lung 


17.0 


22.8 


Brain 

(hippocampus) 


18 7 


8 1 


juung ^leiai ) 


1U.4 


26.4 


Brain (substantia 
nigra) 


8.4 


8.9 


Lung ca. 
(small cell) 

[Lx-i 


9.1 


17.4 


Brain (thalamus) 


20.0 


10.6 


Lung ca. 
(small cell) 
NCI-H69 


1.5 


4.0 


Cerebral Cortex 


5.3 


7.2 


Lung ca. 
(sxell var.) 
SHP-77 


18.4 


57.4 


Spinal cord 


9.6 


7.0 


Lung ca. 
(large 

cell)NCI-H460 


50.3 


4.2 


glio/astroU87-MG 


12.1 


16.5 


Lung ca. (non- 
sm. cell) A549 


8.0 


28.1 


glio/astro U-118- 
MG 


15.5 


13.4 


Lung ca. (non- 

sxell)NCI- 

H23 


10.2 


18.2 


astrocytoma 
SW1783 


6.9 


1 1 1 

11.1 


Lung ca. (non- 
sxell) HOP-62 




33. i) 


neuro*; met SK-N- 
AS 


11.4 


10.9 


Lung ca. (non- 

s.cl)NCI- 

H522 


9.7 


23.5 


astrocytoma SF- 
539 


12.5 


25.5 


Lung ca. 
(squam.) SW 
900 


4.0 


8.5 


astrocytoma SNB- 
75 


10.9 


18.9 


Lung ca. 
(squam.) NCI- 
H596 


1.6 


4.8 


glioma SNB-19 


32.1 


25.3 


Mammary 
gland 


36.9 


22.5 


glioma U251 


100.0 


100.0 


Breast ca.* 
(pl.ef)MCF-7 


5.4 


6.7 


glioma SF-295 


18.4 


46.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


16.7 


7.9 
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Heart (fetal) 


2.3 


6.2 


Breast ca.* 
(pl.ef)T47D 


13.2- 


46.7 


Heart 


8.4 


8.2 


Breast ca. BT- 
549 


9.9 


6.6 


Skeletal muscle 
(fetal) 


6.3 


24.1 


Breast ca. 
MDA-N 


1.6 


11.9 


Skeletal muscle 


20.2 


7.9 


Ovary 


3.0 


5.1 


Bone marrow 


21.8 


252 


Ovarian ca. 
OVCAR-3 


5.1 


12.0 


Thymus 


18.2 


38.4 


Ovarian ca. 
OVCAR-4 


4.7 


8.8 


Spleen 


26.6 


14.0 


Ovarian ca. 
OVCAR-5 


8.3 


42.3 


Lymph node 


42.6 


23.7 


Ovarian ca. 

./-"VI 7/~S . T> £% 

OVCAR-8 


2.8 


0.9 


Colorectal 


16.8 


16.8 


Ovarian ca. 
IGROV-1 


2.6 


11.7 


Stomach 


37 A 


5.9 


Ovarian ca.* 
(ascites) SK- 
OV-3 


10.0 


29.9 


Small intestine 


36.6 


142 


Uterus 


38.4 


17.4 


Colon ca. SW480 


3.7 


4.5 


Placenta 


11.1 


9.4 


Colon ca.* 

SW620(SW480 

met) 


3.8 


24.0 


Prostate 


18.6 


8.5 j 


Colon ca. HT29 


1.6 


4.6 


Prostate ca.* 
(bone met)PC- 
3 


6.9 


14.1 


Colon ca. HCT- 
116 


3.0 


9.9 


Testis 


45.1 


26.6 


Colon ca. CaCo-2 


2.1 


5.1 


Melanoma 
Hs688(A).T 


0.8 


1.2 


Colon ca. 
tissue(OD03866) 


5.4 


3.6 


Melanoma* 
(met) 

Hs688(B).T 


3.8 


3.4 


Colon ca. HCC- 
2998 


6.2 


15.6 


Melanoma 
UACC-62 


4.6 


8.0 


Gastric ca.* (liver 
mef) NCI-N87 


20.0 


13.3 


Melanoma 

Lvll'r 


28.5 


9.9 


Bladder 


8.4 


12.2 


Melanoma 
LOXMVI 


0.7 


8.1 


Trachea 


13.6 


6.4 


Melanoma* 
(met) SK- 
MEL-5 


2.2 


2.9 


Kidney 


10.4 


23.8 


Adipose 


8.1 


12.3 
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Table AZF. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Agl301b, Run 
173859869 


Tissue Name 


Rel. Exp.(%) 
Ag 130 lb, Run 
173859869 


Normal Colon 


39.2 


Kidney Margin 
(OD04348) 


79.0 


Colon cancer 
(OD06064) 


6.3 


Kidney malignant 
cancer (OD06204B) 


6.0 


Colon Margin 
(OD06064) 


13.0 


Kidney normal adjacent 
tissue (OD06204E) 


15.4 


Colon cancer 
(OD06159) 


0.0 


Kidney Cancer 
(OD04450-01) 


24.5 


Colon Margin 
(OD06159) 


27.0 


Kidney Margin 
(OD04450-03) 


23.7 


Colon cancer 
(OD06297-04) 


2.5 


Kidney Cancer 
8120613 


3.0 


Colon Margin 
(OD06297-015) 


39.8 


Kidney Margin 
8120614 


38.2 


CC Gr.2 ascend colon 
(OD03921) 


3.8 


Kidney Cancer 
9010320 


8.0 


CC Margin (OD03921) 


4.4 


Kidney Margin 
9010321 


10.8 


Colon cancer metastasis 
(OD06104) 


I*t.J> 


Kidney Cancer 
8120607 


1 ^ o 


Lung Margin 
(OD06104) 


17 7 


Kidney Margin 
8120608 


J.O 


Colon mets to lung 
(OD04451-01) 


ft ft 


\lArm oil Iton i c 

iNormai uterus 


H 7 


Lung Margin 
(OD04451-02) 


1 ft 7 


u lenne i^ancer uo^fu 1 1 


01 0 


Normal Prostate 


30.6 


Normal Thyroid 


2.9 1 


Prostate Cancer 
(OD04410) 


7.6 


Thyroid Cancer 064010 


4.4 


Prostate Margin 
(OD04410) 


Jjj .J 


Thyroid Cancer 
A302152 


7Q 0 


Normal Ovarv 


ft 


Thyroid Margin 
A302153 


7 % 


Ovarian cancer 
(OD06283-03) 


8.1 


Normal Breast 


37 6 


Ovarian Margin 
(OD06283-07) 


34.4 


Breast Cancer 
(OD04566) 


21.6 


Ovarian Cancer 064008 


46.3 


Breast Cancer 1024 


100.0 j 


Ovarian cancer 
(OD06145) 


34.4 


Breast Cancer 
(OD04590-01) 


25.2 


Ovarian Margin 
(OD06145) 


52.9 


Breast Cancer Mets 
(OD04590-03) 


35.6 
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Ovarian cancer 
(OD06455-03) 


11.6 


Breast Cancer 
Metastasis (OD04655- 
05) 


45.1 


Ovarian Margin 
(OD06455-07) 


18.8 


Breast Cancer 064006 


21.2 


Normal Lung 


22.8 


Breast Cancer 9100266 


26.6 


Invasive poor diff. lung 
adeno (ODO4945-01 


18.7 


Breast Margin 9100265 


19.3 


Lung Margin 
(ODO4945-03) 


13.9 


Breast Cancer A209073 


6.1 


Lung Malignant Cancer 
(OD03126) 


11.7 


Breast Margin 
A2090734 


35.8 


Lung Margin 
(OD03126) 


12.2 


Breast cancer 
(OD06083) 


49.3 


Lung Cancer 
(OD05014A) 


9.0 


Breast cancer node 
metastasis (OD06083) 


25.9 


Lung Margin 
(OD05014B) 


35.4 


Normal Liver 


36.3 


Lung cancer (OD06081) 


23.7 


Liver Cancer 1026 


2.5 


Lung Margin 
(OD06081) 


27.9 


Liver Cancer 1 025 


45.7 


Lung Cancer 
(OD04237-01) 


13.3 


Liver Cancer 6004-T 


30.1 


Lung Margin 
(OD04237-02) 


28.5 


Liver Tissue 6004-N 


27.7 


Ocular Melanoma 

XVtCiaSiaSlS 


11.5 


Liver Cancer 6005-T 


6.7 


Ocular Melanoma 

lvialglll vci y 


27.0 


Liver Tissue 6005-N 


17.4 


Melanoma Metastasis 


21.6 


Liver Cancer 064003 


32.3 | 


Melanoma Margin 
(Lung) 


14.7 


Normal Bladder 


13.2 


Normal Kidney 


17.8 


Bladder Cancer 1023 


23.0 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


50.3 


Bladder Cancer 

A A yv A -a MM 

A302173 


20.0 


Kidney Margin 
(OD04338) 


24.3 


Normal Stomach 


89.5 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


62.0 


Gastric Cancer 
9060397 


5.7 


Kidney Margin 
(OD04339) 


16.3 


Stomach Margin 
9060396 


17.7 


Kidney Ca, Clear cell 
type (OD04340) 


7.3 


Gastric Cancer 
9060395 


19.6 


Kidney Margin 
(OD04340) 


10.7 


Stomach Margin 
9060394 


42.6 


Kidney Ca, Nuclear 


4.6 


Gastric Cancer 064005 


7.5 ! 
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grade 3 (OD04348) 



Table AZG. Panel 4D 



Tissue 
Name 


Rel. 
Exp.(%) 
Ael301b 

Run 
138983163 


Rel. 
Exp.(%) 

Run 

138642033 


Rel. 
Exp.(%) 

Run 

162426783 


Tissue 
Name 


Rel. 
Exp.(%) 

A rrl 1ft1K 
AgloUl D, 

Run 
138983163 


Rel. 
Exp.(%) 

Agl415, 

Run 
138642033 


Rel. 
Exp.(%) 

AgJlWl, 

Run 
162426783 


Secondary 
Thl act 


17.0 


25.3 


13.8 


IL-lbeta 


11.5 


13.0 


11.3 


Secondary 
Th2 act 


22.7 


18.9 


20.0 


IFN 
gamma 


28.3 


24.7 


20.2 


Secondary 
Trl act 


32.3 


26.4 


20.0 


HUVEC 

+ IFN 
gamma 


3.2 


6.6 


12.3 


Secondary 
Thl rest 


22.2 


20.3 


12.0 


HUVEC 
TNF alpha 

-4- TT A 


11.7 


9.7 


9.7 


Secondary 
Th2 rest 


42.6 


37.9 


21.0 


HUVEC 

il-u 


10.1 


8.2 


11.6 


occonQary 
Trl rest 


27.9 


27.5 


26.8 


Lung 
Microvasc 
ular EC 
none 


33.9 


28.7 


25.2 


Primary 
Thl act 






i n i 

19.1 


Lung 
Microvasc 
ular EC 
TNFalpha 
+ IL-lbeta 


23.2 


19.1 


24.8 


i riiuary 
Th2act 


28.1 


41.2 


13.3 


Microvasc 
ular 

Dermal EC 
none 


41.8 


45.1 


25.7 


Primary 
Trl act 


49.0 


52.5 


20.9 


ular 

Dermal EC 
TNFalpha 
+ IL-lbeta 


27.5 


40.3 


22.4 


Primary 
Thl rest 


80.7 


87.1 


80.7 


Bronchial 
epithelium 
TNFalpha 
+ ILlbeta 


20.3 


32.3 


22.1. 


Primary 
Th2 rest 


67.4 


68.8 


64.2 


Small 

airway 

epithelium 


7.3 


3.3 


4.8 
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none 








Primary 
Trl rest 


46.0 


50.3 


54.7 


Small 
airway 
epithelium 
TNFalpha 
+ IL-lbeta 


34.4 


35.1 


26.8 


CD45RA 
CD4 

lymphocyt 
eact 


12.9 


8.0 


10. J 


Coronery 
artery 
SMC rest 


/.y 


7.9 


12.3 


CD4 

lymphocyt 
eact 


33.4 


44.8 


22.4 


Coronery 
artery 
SMC 
TNFalpha 
+ IL-lbeta 


7.6 


10.7 


8.1 


CD8 

lymphocyt 
e act 


22.8 


23.0 


18.4 


Astrocytes 
rest 


7.9 


10.3 


12.7 


Secondary 
CD8 

lymphocyt 
e rest 


22.2 


25.5 


25.0 


Astrocytes 
TNFalpha 
+ IL-lbeta 


8.7 


5.8 


13.3 


Secondary 
CD8 

lymphocyt 
e act 


19.3 


21.8 


22.5 


KU-812 

(Basophil) 

rest 


40.1 


35.4 


48.0 


CD4 

lympnocyi 
enone 


41.5 


42.3 


34.0 


KU-812 
(Basophil) 
PMA/iono 
mycin 


57.8 


61.1 


78.5 


2ry 

Thl/Th2/T 
rl anti- 
CD95 
CH11 


65.1 


54.7 


41.2 


CCDH06 
(Keratinoc 
ytes) none 


3.8 


8.0 


6.6 


LAK cells 
rest 


28.7 


37.1 


32.3 


CCDH06 
(Keratinoc 
ytes) 

TNFalpha 
+ IL-lbeta 


27.4 


24.5 


5.3 


LAK cells 

TT O 

1L-2 


38.7 


49.0 


36.3 


Liver 
cirrhosis 


17.0 


9.4 


5.0 


LAK cells 

IL-2+IL- 

12 


26.8 


27.4 


26.8 


Lupus 
kidney 


24.1 


23.8 


9.5 


LAK cells 
IL-2+IFN 
gamma 


43.5 


45.4 


42.9 


NCI-H292 
none 


38.7 


49.3 


45.4 
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LAK cells 

IL-2+IL- 

18 


26.2 


25.2 


35.6 . 


NCI-H292 
IL-4 


58.6 


51.4 


46.7 


LAK cells 
PMA/iono 
mycin 


8.3 


8.6 


3.3 


NCI-H292 
IL-9 


56.3 


46.0 


54.3 


NK Cells 
IL-2 rest 


28.7 


35.8 


32 8 


NCI-H292 
IL-13 


JV/it 




91 1 


Two Way 
MLR 3 
day 


42.3 


49.0 


46.3 


NCI-H292 

IFN 

gamma 


16.7 


28.3 


29.1 


Two Way 
MLR 5 
day 


17.9 


16.2 


13.5 


HPAEC 


15.7 


26.4 


19.1 


Two Way 
MLR 7 * 
day 


14.7 


12.6 


14.4 


HPAEC 
TNF alpha 
+ IL-1 beta 


23.8 


32.5 


27.9 


PBMC rest 


21.9 


29.9 


19.2 


Lung 

fibroblast 

none 


13.0 


11.0 


11.9 


PBMC 
PWM 


66.9 


53.2 


50.3 


Lung 
fibroblast 
TNF alpha 
+ IL-1 beta 


8.7 


7.2 


13.2 


PBMC 
PHA-L 


35.6 


46.7 


23.3 


Lung 

fibroblast 

IL-4 


5.6 


12.0 


10.5 


Ramos (B 
cell) none 


23.5 


33.2 


16.5 


Lung \ 

fibroblast 

IL-9 


8.5 


7.4 


15.3 


Ramos (B 
cell) 

ionomycin 


53.2 


53.2 


49.3 


Lung 

fibroblast 

IL-13 


20.3 


16.3 


8.9 


lymphocyt 
esPWM 


29.9 


36.3 


34.2 


Lung 
fibroblast 
IFN 
gamma 


11.1 


10.4 


13.0 


B 

lymphocyt 
es CD40L 
andIL-4 


33.2 


36.6 


35.8 


Dermal 
fibroblast 
CCD1070 
rest 


47.0 


11.3 


13.8 


EOL-1 
dbcAMP 


18.9 


14.8 


12.8 


Dermal 
fibroblast 
CCD1070 
TNF alpha 


45.7 


53.6 


55.1 


EOL-1 

dbcAMP 

PMA/iono 


30.4 


29.1 


25.5 


Dermal 

fibroblast 

CCD1070 


16.8 


17.3 


15.4 
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mycin 








IL-1 beta 








Dendritic 
cells none 


14.9 


10.0 


13.7 


Dermal 
fibroblast 
IFN 
gamma 


5.8 


8.8 


6.5 


Dendritic 
cells LPS 


15.7 


5.8 


8.7 


Dermal 

fibroblast 

IL-4 


17.6 


20.0 


13.6 


Dendritic 
cells anti- 
CD40 


12.7 


17.7 


15.2 


IBD 
Colitis 2 


2.0 


1.3 


1.1 


Monocytes 
rest 


35.6 


27.5 


36.6 


IBD 

Crohn's 


3.0 


1.4 


1.4 


Monocytes 
LPS 


34.6 


43.8 


25.0 


Colon 


33.0 


26.2 


34.6 


Macropha 
ges rest 


19.2 


16.6 


17.2 


Lung 


6.8 


13.5 


7.5 


Macropha 
ges LPS 


17.9 


16.2 


6.7 


Thymus 


100.0 


100.0 


55.5 


HUVEC 
none 


9.8 


13.2 


17.3 


Kidney 


87.1 


79.0 


100.0 


HUVEC 
starved 


27.2 


27.5 


28.3 









CNS_neurodegeneration_vl.O Summary: Ag303 1 Two experiments with the same 
probe and primer set produce results that are in excellent agreement. The NOV57 gene, a 
kinase homolog, is expressed more highly in the temporal cortex of brains from Alzheimer's 
5 disease patients than in the temporal cortex of normal brains unaffected by Alzheimer's 
disease. Kinases have been shown to play a role in the pathogenesis of Alzheimer's disease. 
The dysregulation of this kinase, NOV57, indicates an active role for this pathway in disease 
pathogenesis. Thus, inhibitors of this gene product, by modulating this pathway, may have 
utility in the treatment of Alzheimer's disease and other neurodegenerative diseases. 
10 References: 

Morishima Y, Gotoh Y, Zieg J, Barrett T, Takano H, Flavell R, Davis RJ, Shirasaki Y, 
Greenberg ME. Beta-amyloid induces neuronal apoptosis via a mechanism that involves the c- 
Jun N-terminal kinase pathway and the induction of Fas ligand. J Neurosci 2001 Oct 
l;21(19):7551-60 

1 5 Elevated levels of beta-Amyloid (Abeta) are present in the brains of individuals with 

either the sporadic or familial form of Alzheimer's disease (AD), and the deposition of Abeta 
within the senile plaques that are a hallmark of AD is thought to be a primary cause of the 
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cognitive dysfunction that occurs in AD. Recent evidence suggests that Abeta induces 
neuronal apoptosis in the brain and in primary neuronal cultures, and that this Abeta-induced 
neuronal death may be responsible in part for the cognitive decline found in AD patients. In 
this study we have characterized one mechanism by which Abeta induces neuronal death. We 
5 found that in cortical neurons exposed to Abeta, activated c-Jun N-terminal kinase (JNK) is 
required for the phosphorylation and activation of the c-Jun transcription factor, which in turn 
stimulates the transcription of several key target genes, including the death inducer Fas ligand. 
The binding of Fas ligand to its receptor Fas then induces a cascade of events that lead to 
caspase activation and ultimately cell death. By analyzing the effects of mutations in each of 
10 the components of the JNK-c-Jun-Fas ligand-Fas pathway, we demonstrate that this pathway 
plays a critical role in mediating Abeta-induced death of cultured neurons. These findings 
raise the possibility that the JNK pathway may also contribute to Abeta-dependent death in 
AD patients 

Panel 1.3D Summary: Agl301b/Ag3031 Two experiments with the same probe and 
1 5 primer set produce results that are in excellent agreement, with highest expression of the 
NOV57 gene in a brain cancer cell line (015=29-30). Overall, this gene is expressed at 
moderate to low levels in all the samples in this panel. 

This gene has low to moderate expression in several endocrine/metabolic-related 
tissues, including adipose, pancreas, liver, skeletal muscle and thyroid. Thus, a therapeutic 
20 modulator to this gene and/or gene-product may be useful in the treatment of diseases which 
affect the endocrine system. 

Panel 2.2 Summary: Agl301b The NOV57 gene is expressed in breast cancer at a 
moderate level. It is also expressed at a higher level in normal gastric, prostate and colon 
tissues compared to the adjacent tumors. Hence, inhibition of this drug might be used for 
25 treatment of breast cancer. It could also be used as a diagnostic marker for gastric, prostate and 
colon cancers. 

Panel 4D Summary: Agl301b/Agl415/Ag3031 Three experiments with the same 
probe and primer sets produce results that are in excellent agreement, with highest expression 
of the NOV57 gene in the thymus and kidney. This gene is also expressed at higher levels in 
30 resting Thl and Th2 lymphocytes than in activated Thl and Th2 lymphocytes. Therefore, 
small molecule agonists of the gene product may be useful as therapeutics to reduce the 
activation of Thl and Th2 cells and thus reduce symptoms in patients with autoimmune and 
inflammatory diseases, such as Crohn's disease, ulcerative colitis, multiple sclerosis, chronic 
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obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, lupus 
erythematosus, or psoriasis. 

NOV58a and NOV58b: Gap Junction Beta-5 (connexin) 

Expression of gene NOV58a and variant NOV58b was assessed using the primer-probe 
set Ag2914, described in Table BAA. 

Table BAA. Probe Name Ag2914 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -aacactgtggactgcttcatct-3 • 


22 


517 


1225 


Probe 


TET-5 • -ccaaacccactgagaagacgatcttca- 
3 ' -TAMRA 


27 


539 


1226 


Reverse 


5 1 -atacacaagcatgaggtgatga-3 1 


22 


578 


1227 



CNSjneurodegeneration_vl.O Summary: Ag2914 The amp plot indicates that there 
are experimental difficulties with this run (data not shown). 
10 Panel 1.3D Summary: Ag2914 Expression of this gene is low/undetectable (CTs >35) 

across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag2914 Expression of this gene is low/undetectable (CTs >35) 
across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2914 The amp plot indicates that there are experimental 
15 difficulties with this run (data not shown). 

BB. CG56633-01: TRANSLATION INITIATION FACTOR 5 

Expression of gene CG56633-01 was assessed using the primer-probe set Ag2900, 
described in Table BBA. Results of the RTQ-PCR runs are shown in Tables BBB, BBC, BBD 
and BBE. 

20 Table BBA. Probe Name Ag2900 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gctaagttccttgatgcttctg-3 1 


22 


184 


1228 


Probe 


TET-5 1 -caaaacttgattaccgtcgatgtgca-3 1 - 
TAMRA 


26 


209 


1229 


Reverse 


5 1 -ccaccagaatgtcaaagagtgt-3 1 


22 


238 


1230 



Table BBB. CNS_neurodegeneration_vl.O 
Tissue Name | Rel. Exp.(%) Ag29(Kh 



Tissue Name 



Rel. Exp.(%) Ag2900, 
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Run 206485415 




Run 206485415 


AD 1 Hippo 


12.9 


Control (Path) 3 

i. l/illLJUlCll VIA 


7.0 


AD 2 Hippo 


33.9 


Control fPatM A 

Temporal Ctx 

* 


18.6 


AD 3 Hippo 


7.4 


AD 1 Occipital Ctx 


14.5 


AD 4 Hippo 


11.2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


68.8 


AD 3 Occipital Ctx 


8.0 


AD 6 Hippo 


61.6 




1 fx £ 

10.0 


Control 2 Hippo 


36.1 


AD 5 Occipital Ctx 


36.3 


r\rttrr\\ A T-Iiri-nr* 

v^unirui h nippo 




AD 6 Occipital Ctx 


25.7 


Control (Path) 3 

nippu 


7.7 


Control 1 Occipital 
Ctx 


6.8 


AD 1 Temporal Ctx 


19.6 


Control 2 Occipital 
Ctx 


65.5 


AD 2 Temporal Ctx 


27.5 


Control 3 Occipital 
Ctx 


14.6 


AD 3 Temporal Ctx 


jControl 4 Occipital 
ICtx 


12.3 


AD 4 Temporal Ctx 


176 Control (Path) 1 
[Occipital Ctx 


100.0 


AD 5 Inf Temporal 

CtX 


59.0 


Control (Path) 2 
Occipital Ctx 


6.5 


AD 5 Sup Temporal 

PtY 


42.0 


Control (Path) 3 
Occipital Ctx 


5.1 


AD 6 Inf Temporal 


65.1 


Control (Path) 4 
Occipital Ctx 


13.2 


AD 6 Sup Temporal 
Ctx 


48.3 


Control 1 Parietal 
Ctx 


8.0 


Control 1 Temporal 
Ctx 


8.0 


Control 2 Parietal 
Ctx 


24.5 


Control 2 Temporal 
Ctx 


52.9 


Control 3 Parietal 
Ctx 


18.3 


Control 3 Temporal 
Ctx 


18.2 


Control (Path) 1 
Parietal Ctx 


86.5 


Control 3 Temporal 
Ctx 


9.0 


Control (Path) 2 
Parietal Ctx 


17.4 


Control (Path) 1 
Temporal Ctx 


57.0 


Control (Path) 3 
Parietal Ctx 


9.4 


Control (Path) 2 
Temporal Ctx 


38.2 


Control (Path) 4 
Parietal Ctx 


31.6 
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Table BBC. Panel 1.3D 



Tissue Name 


ReL Exp.(%) Ag2900, 


Tissue Name 


ReL Exp.(%) Ag2900, 

Pun 150QQ£7« 


T .ivpr aHpnnnflrcinnma 


22 7 


JVlUIlCjr ^IClaly 


14 1 

1*T. 1 


x cuiLfi cab 


1 R 

1 .o 


xvcndl Ca. /OO-U 


1^1 

IDA 


Pflnprpatip on PAPAW 

2 


2.8 


Renal ca. A498 


23.7 


■TTUlvllCti gicuiu 


ZZ. 1 


tvenai ca. rat jyj 


z. / 


TTivrniH 




ivenai ca. /w^riiN 


J.U 




9 0 


ixcnai ca. uu-j i 


0./ 


Pituitary gland 


5.8 


Renal ca.TK-10 


3.6 


Brain (fetal) 


10.7 


Liver 


6.0 


Brain (whole) 


12.3 


Liver (fetal) 


12.5 


Brain (amygdala) 


8.4 


Liver ca. 

(hepatoblast) HepG2 


11.8 


Brain (cerebellum) 


9.3 


Lung 


15.0 


Brain (hippocampus) 


61.6 


Lung (fetal) 


6.9 


Brain (substantia nigra) 


5.1 


Lung ca. (small cell) 
LX-1 


11.1 


Brain (thalamus) 


10.4 


Lung ca. (small cell) 
NCI-H69 


12.3 


Cerebral Cortex 


22.4 


Lung ca. (s.cell var.) 
SHP-77 


30.4 


Spinal cord 


4.7 


Lung ca. (large 
cell)NCI-H460 


28.3 


glio/astro U87-MG 


26.1 


Lung ca. (non-sm. 
cell) A549 


15.7 


glio/astroU-118-MG 


94.0 


Lung ca. (non-s.cell) 
NC1-H23 


15.0 


astrocytoma SW1783 


16.5 


Lung ca. (non-s.cell) 
HOP-62 ! 


4 2 


neuro*; met SK-N-AS 


50.0 


Lung ca. (non-s.cl) 
NCI-H522 


y.D 


astrocytoma SF-539 


9.5 


Lung ca. (squam.) 
SW900 


R 4 


astrocytoma SNB-75 


10 9 


Lung ca. (squam.) 
NCI-H596 


2 R 


glioma SNB-19 


10.2 


Mammary gland 


10.1 


glioma U25l 


3.3 


Breast ca.* (pl.ef) 
MCF-7 


21.8 


glioma SF-295 


2 9 


Breast ca.* (pl.ef) 
MDA-MB-231 


100.0 


Heart (fetal) 


2.1 


Breast ca.* (pl.ef) 
T47D 


5.3 


Heart 


3.1 


Breast ca. BT-549 


40.9 
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Skeletal muscle (fetal) 


3.0 


Breast ca. MDA-N 


12.0 


Skeletal muscle 


2.1 


Ovary 


3.0 


Bone marrow 


8.1 


Ovarian ca. OVCAR- 
3 


7.3 


Thymus 


4.0 


Ovarian ca. OVCAR- 
4 


0.9 


Spleen 


5.3 


Ovarian ca. OVCAR- 
5 


5.9 


Lymph node 


4.0 


Ovarian ca. OVCAR- 

o 
o 


8.3 


Colorectal 


7.6 


Ovarian ca. IGROV- 
i 


2.1 


Stomach 


2.7 


Ovarian ca.* (ascites) 

oJv-vJ V -j 


11.3 


omau lniesiine 




uxerus 




Colon ca. SW480 


11.0 


Placenta 


12.6 


Colon ca.* 
SW620(SW480 met) 


7.2 


Prostate 


4.3 


Colon ca. HT29 


7.7 


Prostate ca.* (bone 
metjru-j 


24.7 


Colon ca.HCT-1 16 


23.0 


Testis 


7.7 


Colon ca. CaCo-2 


15.7 


Melanoma 
Hs688(A).T 


11.3 


Colon ca. 
tissue(OD03866) 


12.8 


Melanoma* (met) 
Hs688(B).T 


5.3 


Colon ca. HCC-2998 


33.4 


Melanoma UACC-62 


4.2 


Gastric ca.* (liver met) 
NCI-N87 


21.9 


Melanoma M 14 


4.4 


Bladder 


11.8 


Melanoma LOX 
IMVI 


28.7 


Trachea 


10.2 


Melanoma* (met) 
SK-MEL-5 


17.9 


Kidney ' 


4.3 jAdipose 


13.1 


Table BBD. Panel 2D 


Tissue Name 


Rel. Exp.(%) 
Ag2900, Run 
159996787 


Tissue Name 


Rel. Exp.(%) 
Ag2900,Run 
159996787 


Normal Colon 


71.2 


Kidney Margin 
8120608 


3.0 


CC WelltoModDiff 
(OD03866) 


24.5 


Kidney Cancer 
8120613 


2.0 


CC Margin (OD03866) 


20.6 


Kidney Margin 
8120614 


2.5 
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CC Gt2 rectosigmoid 


47.3 


Kidney Cancer 

7U I UJZU 


7.1 


CC Margin (OD03868) 


6.0 


Kidney Margin 


5.0 


CCMod Diff(ODO3920) 


39.2 


Normal Uterus 


5.3 


CC Margin (ODO3920) 


21.0 


Uterus Cancer 06401 1 


16.5 


CC Gr.2 ascend colon 
(OD03921) 


69.3 


Normal Thyroid 


8.2 


CC Margin (OD03921) 


16.0 


Thyroid Cancer 
064010 


15.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


41.8 


Thyroid Cancer 
A302152 


7.7 


Liver Margin (ODO4309) 


40.6 


Thyroid Margin 
A302153 


11.0 


Colon mets to lung 
(OD04451-01) 


14.3 


Normal Breast 


12.7 


Lung Margin (OD04451- 
02) 


15.0 


Breast Cancer 
(OD04566) 


21.3 


Normal Prostate 6546-1 


11.7 


Breast Cancer 
(OD04590-01) 


63.7 


Prostate Cancer 
(OD04410) 


40.1 


Breast Cancer Mets 
(OD04590-03) 


48.0 


Prostate Margin 
(OD04410) 


32.3 • 


Breast Cancer 

Metastasis 

(OD04655-05) 


32.1 


Prostate Cancer 
(OD04720-01) 


26.4 


Breast Cancer 064006 


21.8 


Prostate Margin 
(OD04720-02) 


35.8 


Breast Cancer 1024 


6.3 


Normal Lung 061010 


46.7 


Breast Cancer 
9100266 


39.0 


Lung Met to Muscle 
(OD04286^ 


34.6 


Breast Margin 


14.4 


iTiuovlv iricu £11 J 

(OD04286) 


12.4 


DLGclbL IsalZwCr 

A209073 


37.1 


Lung Malignant Cancer 
(OD03126) 


18.8 


Breast Margin 
A2090734 


14.4 


Lung Margin (OD03126) 


16.7 


Normal Liver 


14.1 


Lung Cancer (OD04404) 


20.9 


Liver Cancer 064003 


20.2 


Lung Margin (OD04404) 


15.3 


Liver Cancer 1025 


7.8 


Lung Cancer (OD04565) 


15.8 


Liver Cancer 1026 


5.6 


Lung Margin (OD04565) 


10.3 


Liver Cancer 6004-T 


6.0 


Lung Cancer (OD04237- 
01) 


29.1 


Liver Tissue 6004-N 


.6.6 
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Lung Margin (OD04237- 
02) 


29.5 


Liver Cancer 6005-T 


6.6 


Ocular Mel Met to Liver 
(ODO4310) 


17.0 


Liver Tissue 6005-N 


6.7 


Liver Margin (OD043 1 0) 


23.8 


Normal Bladder 


54.7 


Melanoma Mets to Lung 
(OD04321) 


16.7 


Bladder Cancer 1023 


7.6 


Lung Margin (OD04321) 


20.6 


Bladder Cancer 
A302173 


23.2 


Normal Kidney 


16.6 


Bladder Cancer 
(OD04718-01) 


100.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


15.8 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


30.4 


Kidney Margin 
(OD04338) 


11.5 


Normal Ovarv 


2.3 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


10.2 


Ovarian Cancer 
064008 


30 6 


Kidney Margin 
(OD04339) 


15.9 


Ovarian Cancer 
(OD04768-07) 


37 6 


Kidney Ca, Clear cell 
type (OD04340) 


28.7 


Ovary Margin 
(OD04768-08) 


14 8 


Kidney Margin 
(OD04340) 


15.5 


Normal Stomach 


17 9 


Kidney Ca, Nuclear grade 
3 (OD04348) 


12.3 


Gastric Cancer 
9060358 


10 0 

1 v.v/ 


Kidney Margin 
(OD04348) 


16.0 


Stomach Margin 
9060359 


23.0 


Kidney Cancer 
(OD04622-01) 


12.5 


Gastric Cancer 
9060395 


35.8 


Kidney Margin 


3.6 


Stomach Margin 
9060394 


27.5 


Kidney Cancer 
(OD04450-01) 


9.5 


Gastric Cancer 
9060397 


66.0 


Kidney Margin 
(OD04450-03) 


13.0 


Stomach Margin 
9060396 


18.0 


Kidney Cancer 8120607 


3.1 


Gastric Cancer 
064005 


62.0 



Table BBE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2900,Run 
159996820 


Tissue Name 


Rel. Exp.(%) 
Ag2900,Run 
159996820 


Secondary Thl act 


68.3" i 


HUVEC IL-lbeta 


10.5 


Secondary Th2 act 


67.8 


HUVEC IFN gamma 


15.5 
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Secondary Trl act 


732 


HUVEC TNF alpha + IFN 


16.0 


Secondary Thl rest 


8.2 


HUVEC TNF alpha + IL4 


18.8 


Secondary Th2 rest 


1 O /* 

18.6 


HUVEC IL-ll 


13.1 


Secondary Trl rest 


10.1 


Lung Microvascular EC 
none 


18.8 


Primary Thl act 


75.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


25.2 


Primary Th2 act 


60.7 


Microvascular Dermal EC 
none 


30.8 


Primary Trl act 


74.7 


Microsvasular Dermal EC 
TNFalpha + EL- 1 beta 


27.9 


Primary Thl rest 


42.0 


Bronchial epithelium 
TNFalpha + ELlbeta 


16.8 


Primary Th2 rest 


29.3 


Small airway epithelium 
none 


11.7 


Primary Trl rest 


16.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


79.0 


CD45RA CD4 
lymphocyte act 


34.4 


Coronery artery SMC rest 


192 


CD45RO CD4 
lymphocyte act 


53.6 


Coronery artery SMC 
TNFalpha + IL-lbeta 


7.6 


CD8 lymphocyte act 


35.8 


Astrocytes rest 


8.6 


Secondary CD8 
lymphocyte rest 


. 39.0 


Astrocytes TNFalpha + 
IL-lbeta 


7.2 


Secondary CD8 
lymphocyte act 


33.9 


KU-812 (Basophil) rest 


7.8 


CD4 lymphocyte none 


6.2 


KU-812 (Basophil) 
r lvi/v lonumy cm 


40.6 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 

\~*XJ y x I X 


11.7 


CCDl 106 (Keratinocytes) 

nunc 


30.4 


LAK cells rest 


252 


TNFalpha + IL-lbeta 


13.6 


LAK cells IL-2 


23 8 


T ivpr f*irrhr*QiQ 


3 o i 


LAK cells IL-2+IL-12 


22 5 i 




1 3 


LAK cells IL-2+IFN 
gamma 


363 


NCI-H292none 


87.1 


LAK cells IL-2+ IL-18 


29.7 


NCI-H292 IL-4 


91.4 


LAK cells 
PMA/ionomycin 


47.0 


NCI-H292 IL-9 


92.0 


NK Cells IL-2 rest 


13.7 


NCI-H292 IL-13 


39.8 


Two Way MLR 3 day 


15.7 


NCI-H292 IFN gamma 


47.0 


Two Way MLR 5 day 


15.6 


HPAEC none 


18.0 


Two Way MLR 7 day 


12.6 


HPAEC TNF alpha + IL-1 
beta 


26.1 



1224 



WO 02/068649 



PCT/US02/02785 



PBMC rest 


7.5 


T Jinp" fibroblast nonp 


11 1 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


16.8 


PBMC PHA-L 


42.6 


Lung fibroblast IL-4 


62.4 


ivamos (d ceiiy none 


I /.U 


Lung fibroblast IL-9 


39.5 


Ramos (B cell) 
lonomycin 


54.0 


Lung fibroblast IL- 13 


35.4 


B lymphocytes PWM 


73.7 


Lung fibroblast IFN 
gamma 


70.2 


B lymphocytes CD40L 

and IJL/-H 1 


12.9 


Dermal fibroblast 
CCD 1070 rest 


49.3 


EOL-l dbcAMP 


9.5 


Dermal fibroblast 
CCD 1070 TNF alpha 


92.7 


PMA/ionomycin 


22.8 


Dermal fibroblast 


29.5 


Dendritic cells none 


13.7 


Dermal fihrnhl*i<:t TFTsJ 

gamma 


27.7 


Dendritic cells LPS 


15.0 


Dermal fibroblast IL-4 


41.5 


Dendritic cells anti- 
CD40 


19.1 


IBD Colitis 2 


1.1 


Monocytes rest 


10.7 


IBD Crohn's 


2.6 


Monocytes LPS 


4.3 


Colon 


14.8 


Macrophages rest 


26.1 


Lung 


21.5 


Macrophages LPS 


15.3 


Thymus 


25.0 


HUVEC none 


30.8 


Kidney 


37.4 


HUVEC starved 


34.6 







CNS_neurodegeneration_vl.O Summary: Ag2900 This panel confirms the 
expression of this gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's diseased 
postmortem brains and those of non-demented controls in this experiment. Please see Panel 
1.3D for a discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 

Panel 13D Summary: Ag2900 The CG56633-01 gene is expressed at moderate levels 
in the cancer cell lines in this panel, with highest expression in a breast cancer cell line 
(CT=27). Expression of this gene could potentially be used as a diagnostic marker of cell 
proliferation and hence as a diagnostic marker for cancer. 

This gene also has moderate levels of expression in adipose, liver, heart, skeletal 
muscle, adrenal, pituitary, thyroid and pancreas. Therefore, therapeutic modulation of this 
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gene product may be a treatment for endocrine and metabolic diseases, including obesity and 
Types 1 and 2 diabetes. 

Overall, this gene, a translation initiation factor homolog, exhibits brain-preferential 
expression, particularly in the hippocampus, a structure critical for learning and memory. The 
5 processes of learning and memory are subject to regulation by mechanisms of translational and 
transcriptional control, including the regulation elongation factor phosphoiylation by the 
memory-mediating NMDA receptor. The hippocampus-preferential expression of this gene 
suggests that it plays a role in translationally-mediated learning and memory processes. 
Therefore, agents that modulate the activity and function of this gene product may have utility 
10 in treating CNS disorders involving memory deficits, including Alzheimer's disease and aging. 

References: 

Scheetz AJ, Nairn AC, Constantine-Paton M. N-methyl-D-aspartate receptor activation 
and visual activity induce elongation factor-2 phosphorylation in amphibian tecta: a role forN- 
methyl-D-aspartate receptors in controlling protein synthesis, Proc Natl Acad Sci U S A 1997 

15 Dec23;94(26):14770-5 

N-methyl-D-aspartate receptor (NMDAR) activation has been implicated in forms of 
synaptic plasticity involving long-term changes in neuronal structure, function, or protein 
expression. Transcriptional alterations have been correlated with NMDAR-mediated synaptic 
plasticity, but the problem of rapidly targeting new proteins to particular synapses is unsolved. 

20 One potential solution is synapse-specific protein translation, which is suggested by dendritic 
localization of numerous transcripts and subsynaptic polyribosomes. We report here a 
mechanism by which NMDAR activation at synapses may control this protein synthetic 
machinery. In intact tadpole tecta, NMDAR activation leads to phosphorylation of a subset of 
proteins, one of which we now identify as the eukaryotic translation elongation factor 2 

25 (eEF2). Phosphorylation of eEF2 halts protein synthesis and may prepare cells to translate a 
new set of mRNAs. We show that NMDAR activation-induced eEF2 phosphorylation is 
widespread in tadpole tecta. In contrast, in adult tecta, where synaptic plasticity is reduced, 
this phosphorylation is restricted to short dendritic regions that process binocular information. 
Biochemical and anatomical evidence shows that this NMDAR activation-induced eEF2 

30 phosphorylation is localized to subsynaptic sites. Moreover, eEF2 phosphorylation is induced 
by visual stimulation, and NMDAR blockade before stimulation eliminates this effect. Thus, 
NMDAR activation, which is known to mediate synaptic changes in the developing frog, 
could produce local postsynaptic alterations in protein synthesis by inducing eEF2 
phosphorylation. 
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Panel 2D Summary: Ag2900 The CG56633-01 gene is expressed at increased levels 
in colon, breast and bladder cancers compared to the normal adjacent tissue samples. 
Therefore, expression of this gene could be of use as a marker for these cancers. 

Panel 4D Summary: Ag2900 The CG56633-01 gene is expressed in a number of 
5 preparations of activated T lymphocytes at levels greater than in resting T cells. Therefore, 
small molecule antagonists of the CG56633-01 gene product may reduce T cell activation and 
thus reduce or eliminate the symptoms in patients with autoimmune and inflammatory 
diseases, such as Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive 
pulmonary disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or 
10 psoriasis. 

NOV60aandNOV60b 

Expression of gene NOV60a and variant NOV60b was assessed using the primer-probe 
sets Ag041b and Ag41, described in Tables BCA and BCB. Results of the RTQ-PCR runs are 
shown in Tables BCC, BCD, BCE, BCF, BCG, BCH and BCL 

1 5 Table BCA. Probe Name Ag041b 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gtagtaggtgcgcgtggtcat-3 * 


21 


486 


1231 


Probe 


TET-5 1 -accatagccgggcagcgcatg-3 ' -TAMRA 


21 


455 


1232 


Reverse 


5' -caacggagacaactgcttcaac-3 1 


22 


431 


1233 



Table BCB. Probe Name Ag41 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gtaggtgcgcgtggtcatg-3 • 


19 


485 


1234 


Probe 


TET-5 ' -ccatgcgctgcccggctatg-3 ' -TAMRA 


20 


454 


1235 


Reverse 


5 ' -cctacaacggagacaactgcttc-3 » 


23 


427 


1236 



Table BCC. CNS_neurodegeneratioii_vl.O 



Tissue Name 


Rel. Exp.(%) Ag041b, 
Run 206231412 


Tissue Name 


Rel.Exp.(%)Ag041b, 
Run 206231412 


AD 1 Hippo 


20.7 


Control (Path) 3 
Temporal Ctx 


9.7 


AD 2 Hippo 


44.1 


Control (Path) 4 
Temporal Ctx 


70.2 


AD 3 Hippo 


9.5 


AD 1 Occipital Ctx 


17.9 


AD 4 Hippo 


17.4 


AD 2 Occipital Ctx 


0.0 
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(Missing) 




AD 5 Hippo 


97.9 


AD 3 Occipital Ctx 


5.8 


AH 6 Hinnn 




1WJ H L/CCip I uil I^IX 


R 

J7.0 


Control 2 Hippo 


66.0 


AD 5 Occipital Ctx 


76.8 


Control 4 Hippo 


18.4 


AD 6 Occipital Ctx 


20.0 


Control (Path) 3 
Hippo 


11.3 


Control 1 Occipital 
Ctx 


5.3 


AD 1 Temporal Ctx 


11.3 


Control 2 Occipital 
Ctx 


82.4 


AD 2 Temporal Ctx 


54.3 


Control 3 Occipital 
Ctx 


36.9 


AD 3 Temporal Ctx 


9.5 


Control 4 Occipital 
Ctx 


12.1 


AD 4 Temporal Ctx 


39.8 


Control (Path) 1 
Occipital Ctx 


95.3 


AD 5 Inf Temporal 
Ctx 


80.7 


Control (Path) 2 
Occipital Ctx 


33.4 


AD 5 Sup Temporal 
Ctx 


57.8 


Control (Path) 3 
Occipital Ctx 


4.4 


AD 6 Inf Temporal 
Ctx 


52.1 


Control (Path) 4 
Occipital Ctx 


53.6 


AD 6 Sup Temporal 
Ctx 


54.3 


Control 1 Parietal 
Ctx 


9.3 


Control 1 Temporal 
Ctx 


13.8 


Control 2 Parietal 
Ctx 


49.0 


Control 2 Temporal 
Ctx 


66.0 


Control 3 Parietal 
Ctx 


34.6 


Control 3 Temporal 

PtY 


36.3 


Control (Path) 1 
ranetai ctx 


100.0 


Control 3 Temporal 
Ctx 


19.8 


Control (Path) 2 
Parietal Ctx 


44.4 


Control (Path) 1 
Temporal Ctx 


85.3 


Control (Path) 3 
Parietal Ctx 


6.9 


Control (Path) 2 
Temporal Ctx 


73.7 


Control (Path) 4 
Parietal Ctx 


722 



Table BCD. Panel 1 



Tissue Name 


Rel. Exp.(%) 
Ag41, Ron 
97804013 


Rel. Exp.(%) 
Ag41, Run 
97807227 


Tissue Name 


Rel. Exp.(%) 
Ag41, Run 
97804013 


Rel. Exp.(%) 
Ag41, Run 
97807227 


Endothelial cells 


0.0 


0.2 


Renal ca 786-0 


0.0 


0.4 


Endothelial cells 
(treated) 


0.0 


0.3 


Renal ca. A498 


11.6 


32.8 


Pancreas 


0.1 


4.2 


Renal ca. RXF 


3.6 


1.0 J 
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Pancreatic ca. 
CAP AN 2 


0.0 


0.5 


Renal ca. 


0.0 


1.1 


Adrenal gland 


1.2 


15.3 


Renal ca. UO- 
31 


0.0 


0.9 


Thyroid 


0.0 


4.4 


Renal ca. TK- 

1 V 


0.0 


0.2 


Salivarv tHand 


0 1 


6 5 


T ll/Af 

Lsl VCI 


0 0 


O.J 


Pituitary gland 


0.2 


6.0 


Liver (fetal) 


0.1 


3.3 


Brain (fetal) 


0.0 


3.7 


Liver ca. 
(hepatoblast) 


0.2 


5.6 


Brain (whole) 


0.0 


35.4 


Lung 


0.0 


1.8 


Brain (amygdala) 


6.4 


60.7 


Lung (fetal) 


0.0 


1.1 


Brain 

(cerebellum) 


4.2 


25.5 


Lung ca. (small 
cell) LX-1 


0.0 


1.0 


Brain 

(hippocampus) 


6.1 


54.0 


Lung ca. (small 
cell) NCI-H69 


0.0 


0.7 


Brain (substantia 
nigra) 


3.4 


35.6 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.2 


Brain (thalamus) 


11.1 


100.0 


Lung ca. (large 
ceIl)NCI-H460 


15.9 


2.7 


Brain 

(hypothalamus) 


0.2 


6.3 


Lung ca. (non- 
sm. cell) A549 


0.0 


1.4 


Spinal cord 


4.1 


14.0 


Lung ca. (non- 
s.cell) NCI-H23 


5.7 


14.7 


glio/astro U87- 
MG 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.4 


glio/astro U-l 18- 
MG 


0.0 


0.2 


Lung ca. (non- 
s.cl)NCI-H522 


0 1 




astrocytoma 
SW1783 


0.0 


0.4 


Lung ca. 
(squam.) SW 
900 


0.3 


6.9 


neuro*; met SK- 
N-AS 


0.0 


0.3 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.4 


astrocytoma SF- 
539 


0.0 


0.8 


Mammary 
gland 


100.0 


10.5 


astrocytoma SNB- 
75 


0.3 


4.7 


Breast ca.* 


0.0 


0.0 


glioma SNB-1 9 


0.3 


2.3 


Breast ca.* 
(plef) MDA- 
MB-231 


0.0 


0.2 


glioma U251 


7.6 


0.5 


Breast ca * (pi. 
ef) T47D 


0.8 


7.4 


glioma SF-295 0.0 


0.8 


Breast ca. BT- 


0.0 


0.1 
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549 






Heart 


6.8 


49.0 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


0.5 


6.0 


Ovary 




li.U 


Bone marrow 


0.0 


1.0 


Ovarian ca. 

rwrr* ad q 


0.0 


1.0 


Thymus 


0.0 


0.9 


Ovarian ca. 

rwrr* a t> a 
UVCAK-4 


0.0 


0.4 




0 0 


2 5 


Ovarian ca. 
OVCAR-5 


0.4 


7.1 


Lymph node 


0.0 




Ovarian ca. 

rv^/P AD O 


0.0 


0.2 


Colon (ascending) 

_ I1MI _ .... 


0.0 

... _ lr „ 1 _ I 


1.7 


Ovarian ca. 
IGROV-1 


0.0 


0.5 


Stomach 


4.6 


3.2 


Ovarian ca. 
(ascites) bK- 
OV-3 


3.8 


0.7 


Small intestine 


0.1 


3.4 


Uterus 


0.7 


11.7 


Colon ca. SW480 


0.0 


0.3 


Placenta 


0.9 


9.9 


Colon ca.* 
SW620 (SW480 
met) 


0.0 


0.6 


Prostate 


0.3 


7.7 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.1 


Colon ca. HCT- 
116 


0 0 

V/.V/ 


0 8 


Testis 


0.1 


3.0 


Colon ca CaCo-2 


0 0 


0 5 


Melanoma 
Hs688(A).T 


0.0 


0.9 


Colon ca. HCT-15 


0.0 


0.4 


Melanoma* 
(met) 

iisooo(r>). 1 


0.0 


2.3 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.3 


Gastric ca. * (liver 
met)NCI-N87 


0.0 


0.3 


Melanoma Ml 4 


0.0 


0.1 


Bladder 


0.0 


2.5 


Melanoma 


0.0 


0.0 


Trachea 


0.0 


2.0 


Melanoma* 
(met) SK-MEL- 
5 


0.0 


0.1 


Kidney 


1.0 


11.6 


Melanoma SK- 
MEL-28 


0.1 


1.5 


Kidney (fetal) 


0.0 


3.4 









.1230 



WO 02/068649 



PCT/US02/02785 



Table BCE. Panel 1.1 



Tissue Name 


Rel. Exp.(%) Ag041b, 
Run 109666937 


Tissue Name 


ReL Exp.(%) Ag041b, 
Run 109666937 


Adrenal gland 


11.5 


Renal ca. UO-31 


1.3 


Bladder 


2.4 


Renal ca. RXF 393 


0.4 


Brain (amygdala) 


25.2 


Liver 


9.9 


Brain (cerebellum) 


57.4 


Liver (fetal) 


2.9 


Brain fh5ntw*.jim«iic^ 


77 A 


Liver ca. 

(hepatoblast) HepG2 


2.7 


Brain (substantia 
nigra) 


57 A 


Lung 


1.6 


Brain (thalamus) 


47.6 


Lung (fetal) 


3.3 


Cerebral Cortex 


100.0 


Lung ca. (non-s.cell) 
HOP-62 


0.3 


Brain (fetal) 


5.5 


Lung ca. (large 
cell)NCI-H460 


1.4 


Brain (whole) 


71.2 


Lung ca. (non-s.cell) 
NCI-H23 


6.2 


glio/astroU-118-MG 


0.2 


Lung ca. (non-sxl) 
NCI-H522 


9.2 


astrocytoma SF-539 


1.1 


Lung ca. (non-sm. 
cell) A549 


1.7 


astrocytoma SNB-75 


5.7 


Lung ca. (s.cell var.) 
SHP-77 


0.2 


a^trorvtnmn <\W1 19.1 

UO LI \J\^y IKJlUtX o w 1 / OJ 


u.o 


Lung ca. (small cell) 
LX-1 


1.4 


glioma U251 


0.2 


Lung ca. (small cell) 
NCI-H69 


u.o 


glioma SF-295 


1.2 


Lung ca. (squam.) 
SW 900 


5.9 


glioma SNB-19 


2.8 


Lung ca. (squam.) 
NCI-H596 


0.5 


glio/astro U87-MG 


0.0 


Lymph node 


1.4 


neuro*; met SK-N-AS 


0.2 


Spleen 


2.5 


Mammary gland 


6.3 


Thymus 


0.5 


Breast ca. BT-549 


0.0 


Ovary 


6.7 j 


Breast ca. MDA-N 


0.0 


Ovarian ca. IGROV- 
1 


0.5 


Breast ca.* (plef) 
T47D 


8.0 


Ovarian ca. 
OVCAR-3 


1.1 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


Ovarian ca. 
OVCAR-4 


0.2 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.3 


Ovarian ca. 
OVCAR-5 


9.8 


Small intestine 


6.7 


Ovarian ca. 


0.0 
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OVCAR-8 




Colorectal 


0.5 


Ovarian ca * 
(ascites) SK-OV-3 


0.7 


Colon ca. HT29 


0.0 


Pancreas 


11.5 


Colon ca. CaCo-2 


0.9 


Pancreatic ca 

CAP AN 2 


0.4 


Colon ca. HCT-15 


0 1 


Pituitflrv planH 


6 6 


Colon ca. HCT-116 


0.4 


Placenta 

X Idvvll L4J. 


9.5 


Colon ca. HCC-2998 


0.0 


Prostate 


7.0 


Colon ca. SW480 


0.1 


Prostate ca.* (bone 
mef> PC-3 


0.0 


Colon ca.* SW620 
(SW480 met) 


1.3 


Salivary gland 


6.2 


Stomach 


8.0 


Trachea 


2.3 


Gastric ca. (liver met) 
NCI-N87 


0.3 


Spinal cord 


14.3 


Heart 


92.7 


Testis 


3.1 


Skeletal muscle (FetaH 


4.9 


Thvrnid 


6 1 


Skeletal muscle 


16.8 


Uterus 


4.7 


Endothelial cells 


i n 


Melanoma Ml 4 


0.0 


Heart (Fetal) j 


33.9 


Melanoma LOX 
IMV1 


0.0 


Kidney 


16.8 


Melanoma UACC- 
52 


0.1 


Kidney (fetal) 


3.1 


Melanoma SK-MEL- 
28 


0.7 


Renal ca. 786-0 


0.2 


IVJLClallUIIlCl ^IIICIJ 

SK-MEL-5 


0.0 


Renal ca. A498 


30.8 


Melanoma 
Hs688(A).T 


1.1 


Renal ca. ACHN 


1.2 


Melanoma* (met) 
Hs688(B).T 


1.5 


Renal ca. TK-10 


0.2 







Table BCF. Panel 1.3D 



Tissue Name 


Rel. £xp.(%) 
Ag041b, Run 
150010102 


Tissue Name 


Rel. 

Exp.(%) Ag041b, 
Run 150010102 


Liver 
adenocarcinoma 


0.0 


Kidney (fetal) 


0.5 


Pancreas 


0.6 


Renal ca. 786-0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.1 


Renal ca. A498 


11.0 


Adrenal gland 


2.0 


Renal ca. RXF 


0.1. 
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Thyroid 


l.O 


Renal ca. ACHN 


0.2 


Salivary gland 


0.6 


Renal ca. UO-3 1 


0.3 


Pituitarv £?land 


0.8 


Renal ca. TK-10 


U.U 


Brain (fetal) 


0.5 


Liver 


1.0 


Brain (whole) 


25.0 


Liver (fetal) 


0.8 


Brain (amygdala) 


25.3 


Liver ca. 
(hepatoblast) HepG2 


0.6 


Brain 
(cerebellum) 


2.7 


Lung 


0.6 


Brain 
(hippocampus) 


100.0 


Lung (fetal) 


0.9 


Brain (substantia 

nigra) 


3.5 


Lung ca. (small 
cell)LX-l 


0.1 


Brain (thalamus) 


20.7 


Lung ca. (small 
cell)NCI-H69 


0.2 


Cerebral Cortex 


63.3 


Lung ca. (s.cell 
var.) SHP-77 


0.1 


Spinal cord 


3.6 


Lung ca. (large 
celI)NCI-H460 


0.1 


glio/astro U87- 

MO 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.2 


glio/astro U-l 18- 

MCj 


0.1 


Lung ca. (non- 
s.cell)NCI-H23 


3.4 


astrocytoma 

CY171 TOO 


0 1 


Lung ca. (non- 
s.cell)HOP-62 


0.1 


neuro*; met SK- 

N-AS 


0 0 


Lung ca. (non- 
s.cl) NCI-H522 


1.2 


astrocytoma SF- 

539 


0.1 


Lung ca. 
(squam.) SW 900 


0.5 


astrocytoma 
SNB-75 


0.8 


Lung ca. 
(squam.) NCI-H596 


0.0 


glioma bNU-ly 


0.4 


Mammary gland 


1.2 


! glioma U25l 


0.1 


Breast ca.* 
(pl.ef) MCF-7 


I 

0.0 


glioma SF-295 


0.4 


Breast ca.* 
(pl.ef) MDA-MB-231 


0.1 


Heart (fetal) 


21.9 


Breast ca.* 
(plef) T47D 


1.0 


Heart 


3.7 


Breast ca. BT- 

549 


0.0 


Skeletal muscle 

(fetal) 


10.7 


Breast ca. MDA- 

N 


0.0 


Skeletal muscle 


0.2 


Ovary 


8.6 


Bone marrow 


0.2 


Ovarian ca. 


0.1 
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OVCAR-3 




Thymus 


0.1 


Ovarian ca. 
OVCAR-4 


0.1 


Spleen 


1.7 


Ovarian ca. 
OVCAR-5 


0.8 


Lymph node 


0.4 


Ovarian ca. 
OVCAR-8 


0.0 


Colorectal. 


1.7 


Ovarian ca. 
IGROV-1 


0.1 


Stomach 


1.5 


Ovarian ca.* 
(ascites) SK-OV-3 


0.1 


Small intestine 


1.6 


Uterus 


1.9 


Colon ca. 

SW480 


0.2 


Placenta 


1.7 


Colon ca.* 
SW620(SW480 met) 


0.2 


Prostate 


0.9 


Colon ca. HT29 


0.0 


Prostate ca.* 
(bone met)PC-3 


0.0 


Colon ca. HCT- 

116 


0.1 


Testis 


0.8 


Colon ca. CaCo- 

2 


0.1 


Melanoma 
Hs688(A).T 


1.3 


Colon ca. 
tissue(OD03866) 


0.3 


Melanoma* 
(met) Hs688(B).T 


0.7 


Colon ca. HCC- 

2998 


0.0 


Melanoma 
UACC-62 


0.0 


oasinc ca. 
(liver met) NCI-N87 


0.0 


Melanoma Ml 4 


0.0 


Bladder 


0.6 


Melanoma LOX 

IMVI 


0.0 


Trachea 


0.7 


Melanoma* 
(met) SK-MEL-5 


0.0 


Kidney 


0.7 


Adipose 


1.6 j 



Table BCG. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag041b,Run 
157096248 


Rel. Exp.(%) 
Ag41, Run 
157938256 


Tissue Name 


Rel. £xp.(%) 
Ag041b,Run 
157096248 


Rel. Exp.(%) 
Ag41, Run 
157938256 


Normal Colon 


15.7 


21.5 


Kidney 
Margin 
8120608 


9.2 


12.9 


CC Well to Mod 
Diff(OD03866) 


4.4 


2.9 


Kidney Cancer 
8120613 


4.9 


7.2 


CC Margin 
(OD03866) 


6.3 


7.3 


Kidney 
Margin 


14.4 


16.6 
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8120614 






CC Gr.2 

rectosigmoid 

(OD03868) 


3.3 


2.3 


rviuney dancer 
9010320 


63.7 


. 69.3 


CC Margin 
(OD03868) 


8.2 


8.6 


Kidney 
Margin 
9010321 


33.0 


42.3 


CC Mod Diff 
(ODO3920) 


4.9 


6.9 


Normal Uterus 


8.5 


14.9 


CC Margin 
(ODO3920) 


8.8 


11.7 


Uterus Cancer 
064011 


16.2 


14.3 


CC Gr.2 ascend 
colon 

(OD03921) 


2.4 * 


1.9 


IN Ol Midi 

Thyroid 


5.7 


8.4 


CC Margin 
(OD03921) 


3 4 


4 1 


Thyroid 
Cancer 064010 


ft A 


10 1 
IZ.l 


CC from Partial 

Hepatectomy 

(ODO4309)Mets 


2.1 


7.1 


Thyroid 

Cancer 

A302152 


8.5 


9.9 


(ODO4309) 


15.1 


24.3 


Thyroid 

Margin 

A302153 


9.2 


12.6 


Colon mets to 
lung (OD04451- 
01) 


0.8 


1.2 


Normal Breast 


24.3 


42.3 


Lung Margin 
(OD04451-02) 


2.3 


3.1 


Breast Cancer 
(OD04566) 


1.8 


2.1 


Normal Prostate 
6546-1 


0 7 


o n 


Breast Cancer 
(OD04590-01) 






Prn<;tatp f^nc.pr 

(OD04410) 


13.6 


12.4 


Breast Cancer 
Mets 

(OD04590-03) 


8.5 


9.9 


Prostate Margin 
(OD04410) 


18.3 


19.3 


Breast Cancer 

Metastasis 

(OD04655-05) 


8.8 


11.7 


Prostate Cancer 
(OD04720-01) 


19.6 


28.1 


Breast Cancer 
064006 


10.3 


12.9 


Prostate Margin 
(OD04720-02) 


33.4 


36.9 


Breast Cancer 
1024 


15.4 


16.2 


Normal Lung 


7.2 


10.9 


Breast Cancer 
9100266 


3.8 


5.7 


Lung Met to 

Muscle 

(OD04286) 


0.7 


1.2 


Breast Margin 
9100265 


4.0 


6.3 


Muscle Margin 
(OD04286) 


7.9 


9.7 


Breast Cancer 
A209073 


4.4 


7.2 


Lung Malignant 


3.0 


3.7 


Breast Margin 8.2 


8.4 
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Cancer 
(OD03126) 






A2090734 






Lung Margin 
(OD03126) 


8.2 


12.9 


Normal Liver 


21.2 


20.3 


Lung Cancer 
(OD04404) 


6.3 


9.9 


Liver Cancer 
064003 


1.8 


1.4 


Lung Margin 
(OD04404) 


14.8 


15.5 


Liver Cancer 
1025 


25.5 


35.8 


Lung Cancer 
(OD04565) 


12.0 


21.6 


Liver Cancer 
1026 


23.3 


25.5 


Lung Margin 
(OD04565) 


7.5 


5.7 


Liver Cancer 
6004-T 


64.6 


60.3 


Lung Cancer 
(OD04237-01) 


3.9 


4.5 


Liver Tissue 
6004-N 


19.8 


23.0 


Lung Margin 
(OD04237-02) 


6.0 


8.9 


Liver Cancer 
6005-T 


33.2 


35.8 


Ocular Mel Met 
to Liver 
(ODO4310) 


4.0 


4.9 


T ivpr Ticciif* 

i~tl VCl 1 looUC 

6005-N 


17.4 


22.8 


Liver Margin 
(ODO4310) 


11 7 

11./ 


17 7 


Normal 
Bladder 


\J.J 




Melanoma Mets 
to Lung 
(OD04321) 


0.3 


0.3 


Bladder 


1.1 


2.3 


Lung Margin 
(OD04321) 


9.7 


15.5 


Bladder 

Cancer 

A302173 


0.3 


1.0 


Normal Kidney 


18.4 


28.1 


Bladder 
Cancer 

(OD04718-01) 


3.7 


3.7 


Kidney Ca, 
Nuclear grade 2 
(OD04338) 


6.9 


6.7 


Bladder 
Normal 
Adjacent 
(OD04718-03) 


15 4 


15 4 


Kidney Margin 
(OD04338) 


11.1 


21.2 


Normal Ovarv 


15 4 




Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


13.5 


14.3 


Ovarian 
Cancer 06400R 


22.7 


32.3 


Kidney Margin 
(OD04339) 


30.4 


39.0 


Ovarian 
Cancer 

(OD04768-07) 


100.0 


100.0 


Kidney Ca, Clear 
cell type 
(OD04340) 


9.7 


10.1 


Ovary Margin 
(OD04768-08) 


25.5 


25.7 


Kidney Margin 
(OD04340) 


31.2 


33.0 


Normal 
Stomach 


13.9 


16.7 
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Kidney Ca, 
Nuclear grade 3 
(OD04348) 


3.1 


3.3 


vjasinc v^ancer 
9060358 


3.5 


3.9 


Kidney Margin 
(OD04348) 


8.4 


12.7 


Stomach 

Margin 

9060359 


3.0 


3.7 


Kidney Cancer 
(OD04622-01) 


3.6 


4.0 


Gastric Cancer 
9060395 


6.7 


6.3 


Kidnev Margin 
(OD04622-03) 


4.5 


4.7 


Stomach 

Margin 

9060394 


6.1 


7.8 


Jvianey cancer 
(OD04450-01) 


1.6 


0.7 


Gastric Cancer 
9060397 


9.2 


13.8 


Kidney Margin 
(OD04450-03) 


9.9 


13.9 


Stomach 

Margin 

9060396 


3.3 


3.0 


Kidney Cancer 
8120607 


2.4 


1.8 


Gastric Cancer 
064005 


4.7 


5.0 



Table BCH. Panel 3D 



Tissue Name 


Rel. 
Exp.(%) 
Ag041b, 

Run 
156897045 


Rel. 
Exp.(%) 
Ag41, Run 
157938257 


Tissue Name 


Rel. 
Exp.(%) 
Ag041b, 

Run 
156897045 


Rel. 
Exp.(%) 
Ag41, Run 
157938257 


Daoy- 

Medulloblastoma 


0.4 


0.2 


Ca Ski- Cervical 
epidermoid carcinoma 
(metastasis) 


0.0 


0.0 


TE671- 

Medulloblastoma 


0.3 


0.2 


ES-2- Ovarian clear 
cell carcinoma 


0.0 


0.0 


D283 Med- 
Medulloblastoma 


0.1 


03 


Ramos- Stimulated 
with PMA/ionomycin 
6h 


0.0 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.1 


0.5 


Ramos- Stimulated 
with PMA/ionomycin 
14h 


0.0 


0.0 


XF-498-CNS 


1.8 


2.7 


MEG-01- Chronic 
myelogenous 
leukemia 
(megokaryoblast) 


0.0 


0.3 


SNB-78- Glioma 


0.0 


0.0 


Raji-Burkitt's 
lymphoma 


0.0 


0.0 


SF-268- 
Glioblastoma 


0.4 


0.7 


Daudi- Burkitt's 
lymphoma 


0.3 


0.0 


T98G- 

Glioblastoma 


1.9 


2.0 


U266-B-cell 
plasmacytoma 


0.0 


0.0 
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SK-N-SH- 

Neuroblastoma 

(metastasis) 


2.3 


2.9 


CA46- Burkitt's 
lymphoma 


0.1 


0.0 


SF-295- 
Glioblastoma 


0.5 


1.1 


RL- non-Hodgkin's 
B-cell lymphoma 


0.0 


0.0 


Cerebellum 


.30.8 


55.9 


JMl-pre-B-cell 
lymphoma 


0.0 


0.0 


Cerebellum 


32.3 


60.3 


Jurkat- T cell 
leukemia 


0.0 


0.0 


NCI-H292- 
Mucoepidermoid 
lung carcinoma 


2.1 


1.7 


TF-1- 

Erythroleukemia 


0.0 


0.1 


DMS-114- Small 
cell lung cancer 


6.0 


6.4 


HUT78-T-cell 
lymphoma 


0.0 


0.0 


DMS-79- Small 
cell lung cancer 


100.0 


100.0 


U937- Histiocytic 
lymphoma 


0.2 


0.0 


NCI-H146- Small 
cell lung cancer 


0.0 


0.3 


KU-812- 

Myelogenous 

leukemia 


0.8 


0.8 


NCI-H526- Small 
cell lung cancer 


2.2 


3.8 


769-P- Clear cell 
renal carcinoma 


0.5 


0.1 


NCI-N417- Small 
cell lung cancer 


2 7 


4 5 


Caki-2- Clear cell 
renal carcinoma 


1 4 


1 4 


NCI-H82- Small 
cell lung cancer 


0.1 


0.3 


SW 839- Clear cell 
renal carcinoma 


0.0 


0.0 


NCI-H157- 
Squamous cell 
lung cancer 
(metastasis) 


0.0 


0.0 


G401- Wilms 1 tumor 


0.1 


0.0 


NCI-H1155- 
Large cell lung 
cancer 


2.1 


. 2.6 


Hs766T- Pancreatic 
carcinoma (LN 
metastasis) 


1.1 


1.2 


NCI-H1299- 
Large cell lung 
cancer 


0.1 


0.3 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


7.5 


13.0 


NCI-H727-Lung 
carcinoid 


0.0 


0.0 


SU86.86- Pancreatic 
carcinoma (liver 
metastasis) 


1.9 


2.9 


NCI-UMC-11- 
Lung carcinoid 


0.1 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


5.0 


7.0 


LX-1- Small cell 
lung cancer 


1.3 


1.4 


HPAC- Pancreatic 
adenocarcinoma 


0.4 


0.4 


Colo-205- Colon 
cancer 


0.0 


0.0 


MIA PaCa-2- 
Pancreatic carcinoma 


3.6 


4.0 


KM12- Colon 
cancer 


0.0 


0.0 


CFPAC-1- Pancreatic 
ductal 

adenocarcinoma 


1.6 


2.0 
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KM90T 9- Pnlnn 

cancer 


0.0 


0.1 


lPANC-1- Pancreatic 
epithelioid ductal 
carcinoma 


20.3 


22.1 


NCI-H716- Colon 
cancer 


0.1 


0.0 


T24- Bladder 
carcinma (transitional 
cell) 


0.4 


0.4 


SW-48- Colon 
adenocarcinoma 


0.1 


0.0 


5637- Bladder 
carcinoma 


1.2 


0.7 


SW1116- Colon 
adenocarcinoma 


0.0 


0.0 


HT-1 197- Bladder 
carcinoma 


0.0 


0.2 


LS 174T- Colon 
adenocarcinoma 


0.3 


0.1 . 


UM-UC-3- Bladder 
carcinma (transitional 
cell) 


0.1 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


0.0 


A204- 

Rhabdomyosarcoma 


2.3 


3.5 


SW-480- Colon 
adenocarcinoma 


0.0 


0.0 


HT-1080- 
Fibrosarcoma 


0.2 


0.4 


NCI-SNU-5- 
Gastric carcinoma 




u.o 


MG-63- 
Osteosarcoma 


9.3 


20.9 


carcinoma 


0.4 


0.4 


SK-LMS-1- 

Leiomyosarcoma 

(vulva) 


0.1 


0.4 


NCI-SNIM6- 
Gastric carcinoma 


0.1 


0.0 


SJRH30- 

Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


0.0 


NCI-SNU-1- 
Gastric carcinoma 


0.2 


0.0 


A431- Epidermoid 
carcinoma 


0.4 


0.3 


RF-1- Gastric 
adenocarcinoma 


VJ.J 


u.u 


WM266-4- 
Melanoma 


0.6 


0.5 


RF-48- Gastric 

adenocaremnrnn 


0.0 


0.0 


DU 145- Prostate 
carcinoma (brain 
metastasis) 


0.0 


0.0 


MKN-45- Gastric 
carcinoma 


0.7 


0.4 


MDA-MB-468- 
Breast 

adenocarcinoma 


26.4 


30.4 


NCI-N87- Gastric 
carcinoma 


0.2 


0.1 


SCC-4- Squamous 
cell carcinoma of 
tongue 


0.2 


0.0 


OVCAR-5- 
Ovarian carcinoma 


1.7 


1.7 


SCC-9- Squamous 
cell carcinoma of 
tongue 


0.2 


0.2 j 


RL95-2- Uterine 
carcinoma 


5.2 


9.1 


SCC-15- Squamous 
cell carcinoma of 
tongue 


0.6 


0.3 


HelaS3- Cervical 
adenocarcinoma 


0.0 


0.0 


CAL 27- Squamous 
cell carcinoma of 
tongue 


0.0 


0.1 
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Table BCI. Panel 4D 



llSaUC 11 4111 C 


ReL Exp.(%) 

AoOdlh Pun 

146087302 


l laaiic ii dine 


Rel. Exp.(%) 

Aa041h Run 

AgutlU) XV (111 

146087302 


SpponHarv Thl art 


2 1 


HUVFC TT -lheta 


1 6 


Secondary Th2 act 


0.9 


HUVEC IFN gamma 


11.0 


Secondary Trl act 


0.9 


HUVEC TNF alpha + IFN 
gamma 


5.5 


Secondary Thl rest 


1-2 


HUVEC TNF alpha + IL4 


2.9 


Secondary Th2 rest 


2.7 


HUVEC IL-11 


5.8 


Secondary Trl rest 


3.0 


Lung Microvascular EC 
none 


2.7 


Primary Thl act 


0.9 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.8 


Primary Th2 act 


1.6 


Microvascular Dermal EC 
none 


2.4 


Primary Trl act 


2.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


2.0 


Primary Thl rest 


2.7 


Bronchial epithelium 

1 1 tv 1 I » flt > TT 1 1 A- 

TNFalpha + ILlbeta 


7.5 


Primary Th2 rest 


2.7 


Small airway epithelium 
none 


2.3 


Primary Trl rest 


0.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


6.2 


CD45RA CD4 
lymphocyte act 


2.4 


Coronery artery SMC rest 


6.6 


CD45RO CD4 
lymphocyte act 


3.9 


Coronery artery SMC 
TNr alpha + IL-Ioeta 


10.9 


CD8 lymphocyte act 


0.7 


Astrocytes rest 


10.2 


Secondary CD8 
lymphocyte rest 


2.0 


Astrocytes TNFalpha + 

TT 1 U^+« 

1L- 1 beta 


13.0 


oeconaary tUo 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


2.3 


CD4 lymphocyte none 


1.9 


JS.U-51Z jDasopniij 

PMA/ionomycin 


2.5 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


2.2 


LAK cells rest 


3.3 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.8 


LAK cells IL-2 


1.3 


Liver cirrhosis 


3.9 


LAK cells DL-2+IL-12 


1.3 


Lupus kidney 


5.6 


LAK cells IL-2+IFN 
gamma 


2.2 


NCI-H292 none 


5.3 
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LAKcellsIL-2+IL-18 


0.8 


NC1-H292 IL-4 


6.2 


LAK cells 
PMA/ionomycin 


1.3 


NC1-H292 1L-9 


4.9 


NK Cells IL-2 rest 


5.1 


NC1-H292 IL-13 


9.2 


Two Wav MI R 3 dav 

i "w vt a j i vxx-/xv j. UuV 


1 8 


NCI-H292 TFN samma 


7 1 

/ • X 


Two Wav MT R S Hav 


0 2 


TTPAFO none 


5 6 

J.V7 


Two Way MLR 7 day 


0.4 


HP AFC TNF alnha + TT -1 
beta 


1.1 


PBMC rest 


0 9 


T una fibroblast nntip 


74 2 


PBMC PWM 


1.6 


Lung fibroblast TNF alpha 
+ IL-1 beta 


33.4 


PBMC PHA-L 


0.3 


Lung fibroblast IL-4 


100.0 


Kamos (t> cell) none 


n A 
U.U 


Lung fibroblast 1L-9 


62.9 


Ramos (B cell) 
ionomycin 


1.3 


Lung fibroblast IL-13 


51.4 


B lymphocytes PWM 


,0.8 


Lung fibroblast IFN 
gamma 


84.1 


B lymphocytes CD40L 
and IL-4 


0.4 


Dermal fibroblast 
CCD 1 070 rest 


6.7 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


6.7 


EOL-1 dbcAMP 

PMf A /i nn ntn vr* i n 

X IVJLrTL/ iUliv/lIljr V^lll 


0.0 


Dermal fibroblast 


15.3 


Dendritic cells none 


122 


Dermal fihrnhlact TFN 

gamma 


40.6 


Dendritic cells LPS 


7.7 


Dermal fibroblast IL-4 


83.5 


Dendritic cells anti- 
CD40 


122 


IBD Colitis 2 


3.9 


Monocytes rest 


6.3 


IBD Crohn's 


4.5 


Monocytes LPS 


0.7 


Colon 


20.4 


Macrophages rest 


6.7 


Lung 


11.5 


Macrophages LPS 


0.9 


Thymus 


41.5 


HUVEC none 


5.3 


Kidney 


4.1 


HUVEC starved 


6.3 







CNS_neurodegeneration_vl.O Summary: Ag041b This panel does not show 
differential expression of the NOV60a gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1 Summary: Ag41 Two experiments with the same probe and primer set 
produce results that are in reasonable agreement, with highest expression of the NOV60a gene 
in the mammary gland and the brain. Overall, this gene appears to express in normal tissues at 

1241 



WO 02/068649 



PCT/US02/02785 



higher levels than in cancer cell lines. This gene encodes a lynxl homolog. Lynxl is an 
endogenous toxin-like modulator of nicotinic acetylcholine receptors in the mammalian CNS. 
Activation of nicotinic receptors is associated with positive effect on schizophrenia and 
alzheimer's disease. Therefore, agents that block Ag41 action in the CNS are likely to have 
5 utility in the treatment of these, and related, disorders. 

This gene also has high levels of expression in pancreas, adrenal, thyroid, pituitary, 
heart, skeletal muscle and liver. Therefore, therapeutic modulation of this gene product may be 
a treatment for endocrine and metabolic diseases, including obesity and Types 1 and 2 
diabetes. Please note that two additional experiments with the same probe and primer set show 
10 low/undetectable levels of expression (CTs>35). (Data not shown.) The results indicate that 
there is a possibility of a probe failure. 

References: 

Miwa JM, Ibanez-Tallon I, Crabtree GW, Sanchez R, Sali A, Role LW, Heintz N. 
lynxl, an endogenous toxin-like modulator of nicotinic acetylcholine receptors in the 

15 mammalian CNS. Neuron 1999 May;23(l): 105-14 

Elapid snake venom neurotoxins exert their effects through high-affinity interactions 
with specific neurotransmitter receptors. A novel murine gene, lynxl, is highly expressed in 
the brain and contains the cysteine-rich motif characteristic of this class of neurotoxins. 
Primary sequence and gene structure analyses reveal an evolutionary relationship between 

20 lynxl and the Ly-6/neurotoxin gene family, lynxl is expressed in large projection neurons in 
the hippocampus, cortex, and cerebellum. In cerebellar neurons, lynxl protein is localized to a 
specific subdomain including the soma and proximal dendrites, lynxl binding to brain sections 
correlates with the distribution of nAChRs, and application of lynxl to Xenopus oocytes 
expressing nAChRs results in an increase in acetylcholine-evoked macroscopic currents. 

25 These results identify lynxl as a novel protein modulator for nAChRs in vitro, which could 
have important implications in the regulation of cholinergic function in vivo. 

Panel 1.1 Summary: Ag041b The NOV60a gene is expressed in most cell lines and 
normal tissues with a significantly higher level of expression in normal brain and heart 
compared to cancer cell lines on this panel. The results in this panel are consistent with 

30 expression in Panel 1. Please see Panel 1 for further discussion of utility of this gene in 
metabolic and ens diseases and cancer. 

Panel 1.3D Summary: Ag041b Highest expression of the NOV60a gene is seen in the 
brain. Overall, this gene is expressed in most cell lines and normal tissues with a significantly 
higher level of expression in heart in addition to brain when compared to cancer cell lines on 
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this panel. Please see Panel 1 for discussion of utility of this gene in the central nervous 
system. 

Among metabolic tissues, this gene has a low level of expression in adipose, adult and 
fetal liver, adrenal, pituitary, fetal skeletal muscle, fetal and adult heart, thyroid and pancreas. 
5 Therefore, modulation of this gene product may be a treatment for endocrine and metabolic 
diseases, including obesity and Types 1 and 2 diabetes. In addition, this gene differentially 
expressed in fetal (CT value = 29) vs adult skeletal muscle (CT value = 35) and may be useful 
for the identification of the two sources of this tissue. 

Panel 2D Summary: Ag041b/Ag41 The expression of the NOV60a gene was assessed 
10 in two independent runs on this panel with good concordance between runs. This protein is a 
good diagnostic marker and target in ovarian, renal and liver cancer as the cancer expressed 
this gene at a higher level than the normal adjacent tissue. 

Panel 3D Summary: Ag041b/Ag41 Two experiments show expression of the 
NOV60a gene in cell lines derived from brain, lung, ovarian, renal, pancreatic, breast and 
1 5 osteosarcoma. Therefore, expression of this gene could be used as a diagnostic marker for the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or function 
of this gene may be effective in the treatment of brain, lung, ovarian, renal, pancreatic, breast 
and osteosarcoma cancers. 

Panel 4D Summary: Ag041b The NOV60a gene, a lynxl homolog is expressed at 
20 moderate levels in untreated lung fibroblasts, lung fibroblasts activated with IL-4, IL-9 or IFN 
gamma, and dermal fibroblasts activated with IL-4 (CTs=30). Therefore, small molecules or 
therapeutic antibodies that antagonize the function of the NOV60a gene product may be useful 
to reduce or eliminate the symptoms in patients with chronic obstructive pulmonary disease, 
asthma, emphysema, or psoriasis. 

'\ 

25 NOV61: Adlican-Iike 

Expression of gene NOV61 was assessed using the primer-probe sets Ag2933, Ag3370 
and Ag3837, described in Tables BDA, BDB and BDC. 

Table BDA. Probe Name Ag2933 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -caccaccactaagccagaac-3 f 


20 


4011 


1237 


Probe 


TET-5 ' -ttctcagtccaagaacatctcaaatatgt- 
3 ' -TAMRA 


29 


4031 


1238 


Reverse 


5 1 -ggattccccatgtaattcaag-3 1 


21 


4083 


1239 



1243 
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Table BDB. Probe Name Ag3370 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -agctggattcttccaaacaga-3 1 


2i 


1327 


1240 


Probe 


TET-5 ' -tcacatgtatacatgctgccaaatgg-3 1 - 
TAMRA 


26 


1375 


1241 


Reverse 


5 ' -acctttgggatggaaagagtt-3 1 


21 


1401 


1242 



Table BDC. Probe Name Ag3837 



Primers! Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs • -acgagcttgaggatgtggat-3 1 


20 


3725 


1243 


p obe |TET-5 ' -ttttgtcctctgtgacagtctccaca-3 1 - 
[TAMRA 


26 


3758 


1244 


Reverse|5 * -gcttcttcctggtgaaatgg-3 1 


20 


3784 


1245 



5 CNS_neurodegeneration_vl.O Summary: Ag2933/Ag3370 Expression of this gene 

is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Generalscreeningpanel vl.4 Summary: Ag3370/Ag3837 The amp plots suggest 
that there were experimental difficulties with these runs (data not shown). 

Panel 1 JD Summary: Ag2933 Expression of this gene is low/undetectable (CTs > 
10 35) across all of the samples on this panel (data not shown). . 

Panel 2D Summary: Ag2933 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2933/Ag3370 Expression of this gene is low/undetectable 
(CTs > 35) across all of the samples on this panel (data not shown). 

1 5 BE. CG56781-01 : NETJROPSIN PRECURSOR 

Expression of gene CG56781-01 was assessed using the primer-probe sets Ag3019 and 
Ag4966, described in Tables BEA and BEB. 

Table BEA. Probe Name Ag3019 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -aggatctgagcctgtgttcag-3 » 


21 


714 


1246 


Probe 


TET-5 1 -cggagacccgctgtctacactaacgt-3 * - 
TAMRA 


26 


739 


1247 


Reverse 


5 1 ~ttcaatccactccaggtagtca-3 1 


22 


768 


1248 



20 Table BEB. Probe Name Ag4966 

Primers} Sequences | Length | Start | SEQ ID Ko7] 
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Position 




Forward 


5 ■ -ccaccctcttcctcagag-3 ' 


18 


7 


1249 


Probe 


TET-5 1 -caccctgtgcaatccagccgtg-3 ' - 
TAMRA 


22 


44 


1250 


Reverse 


5 1 -acacctgcccacgctc-3 1 


16 


89 


1251 



CNSjQeurodegeneration_vl.O Summary: Ag3019 The amp plot suggests that there 
were experimental difficulties with this run in one sample (data not shown). Given the lack of 
expression of this gene on the other panels the expression detected in the occipital cortex is 
5 likely artifactual. 

Panel 1.3D Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 4.1D Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 
0 Panel 4D Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). 

Panel 5 Islet Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

NOV63 

5 Expression of gene NOV63 was assessed using the primer-probe sets Ag226 1 and 

Ag3035, described in Tables BFA and BFB. Results of the RTQ-PCR runs are shown in 
Tables BFC, BFD, BFE and BFF. 



Table BFA, Probe Name Ag2261 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggatgactcgcctagcttct-3 » 


20 


882 


1252 


Probe 


TET-5 1 -gccgtaggtgccaccgtgagaag-3 ■ - 
TAMRA 


23 


935 


1253 


Reverse 


5 1 -agcagatgctctcgcagtt-3 » 


19 


958 


1254 



Table BFB. Probe Name Ag3035 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID MO: 


Forward 


5 1 -acagcagcaagttcgtcaag-3 ' 


20 


527 


1255 


Probe 


TET-5 • -agacggtcaagcaaggatctgcgag-3 ' - 
TAMRA 


25 


559 


1256 


Reverse 


5 1 -cacgaggttgttgtggaagt-3 » 


20 


593 


1257 
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Table BFC. Panel L3D 



Tissue 
Name 


Rel. 

Ag2261, 

Run 
150631675 


Rel. 
Fxn (%\ 
Ag2261, 

Run 
152887692 


Rel. 

Evn (%\ 

JO A JJ.^ /O y 

Ag3035, 

Run 
167597764 


Tissue 
Name 


Rel. 

Ag2261, 

Run 
150631675 


Rel. 
Exn (%\ 
Ag2261, 

Run 
152887692 


Rel. 
Exn (%\ 
Ag3035, 

Run 
167597764 


Liver 

adenocarci 

noma 


22.4 


19.6 


71.2 


Kidney 
(fetal) 


2.1 


0.0 


2.7 


Pancreas 


3.9 


2.5 


"™""-- ,e " 
2.8 


Renal ca. 
786-0 


0.0 


0.0 


0.0 


Pancreatic 
ca. 

CAP AN 2 


5.3 


3.5 


9.5 


Renal ca. 
A498 


10.2 


5.3 


9.2 


Adrenal 
gland 


2.1 


0.6 


2.0 


Renal ca. 
RXF393 


0.0 


0.0 


0.0 


Thyroid 


7.0 


9.8 


3.9 


Renal ca. 
ACHN 


0.0 


2.2 


0.0 


Salivary 
gland 


1.9 


2.1 


4.2 


Renal ca. 
UO-31 


0.0 


0.0 


0.0 


Pituitary 
gland 


1.0 


2.2 


6.7 


Renal ca. 
TX-10 


0.0 


0.0 


0.0 


Brain 
(fetal) 


6.S 


4.9. 


10.8 


Liver 


0.0 


0.0 


0.0 


Brain 
(whole) 


4.8 


3.0 


1.4 


Liver 
(fetal) 


7.6 


0.0 


0.0 


Brain 

(amygdala 

) 


4.6 


5.3 . 


1.5 


TJver csi 

(hepatobla 
st) HepG2 


0.0 


0.0 


0.0 


Brain 

(cerebellu 

m) 


1.6 


1.6 


2.0 


Lung 


14.3 


15.8 


9.2 


Brain 

(hippocam 

pus) 


7.5 


11.3 


0.6 


Lung 
(fetal) 


15.1 


15.4 


7.4 


Brain 

(substantia 

nigra) 


12 


2.6 


1.3 


Lune ca 
(small cell) 
LX-1 


1.6 


0.0 


0.0 


Brain 
(thalamus) 


2.5 


1.7 


2.6 


Lung ca. 
(small cell) 
NCI-H69 


29.5 


19.1 


31.2 


Cerebral 
Cortex 


0.0 


0.0 


5.0 


Lung ca. 
(s.cell var.) 
SHP-77 


11.0 


5.1 


37.4 


Spinal 
cord 


1.7 


2.1 


2.7 


Lung ca. 

(large 

cell)NCI- 


0.0 


0.0 


' 0.0 
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H460 








glio/astro 
U87-MG 


0.0 


0.0 


0.0 


Lung ca. 
(non-sm. 
cell) A549 


0.0 


1.2 


1.6 


glio/astro 
U-118-MG 


55.1 


50.3 


42.9 


Lung ca. 
(non- 
s.cell) 
NCI-H23 


0.0 


1.3 


0.8 


astrocytom 
a SW1783 


0.0 


7.5 


0.0 


Lung ca. 
(non- 
s.cell) 
HOP-62 


0.0 


1.7 


0.0 


neuro*; 

metSK-N- 

AS 


0.0 


0.0 


0.7 


Lung ca. 

(non-s.cl) 

NCI-H522 


8.0 


8.3 


7.3 


a SF-539 


1.9 


4.7 


9.9 


Lung ca. 
(squam.) 
SW 900 


4.0 


0.0 


1.8 


a SNB-75 


2.0 


4.9 


6.9 


Lung ca. 
(squam.) 
NCI-H596 


15.8 


10.2 


58.2 


glioma 
SNB-19 


6.7 


2.4 


3.7 


Mammary 
gland 


7.2 


4.1 


4.4 


glioma 
U251 


2.1 


4.5 


6.8 


Breast ca.* 

(pl.ef) 

MCF-7 


1.7 


3.4 


7.3 


glioma SF- 
295 


10 0 


u.o 




Breast ca.* 
(pl.ef) 
MDA- 
MB-231 


23.2 


19.6 


19.2 


Heart 
(fetal) 


11.1 


9.9 


38.2 


Jreast ca.* 

(pl.ef) 

T47D 


4.3 


5.8 


21.8 


Heart • 


4.9 


6.0 


15.2 


Breast ca. 
BT-549 


0.0 


4.2 


2.2 


Skeletal 

muscle 

(fetal) 


100.0 


100.0 


85.3 


rsreasi ca. 
MDA-N 


0.0 


0.0 


0.0 


Skeletal 
muscle 


5 5 






Ovary 


3.6 


3.1 


8.1 


Bone 
marrow 


0.0 


0.0 


0.7 


Ovarian 
ca. 

OVCAR-3 


.1.1' 


1.0 


5.6 


Thymus 


10.0 


3.9 


6.4 


Ovarian 
ca. 

OVCAR-4 


0.0 


0.0 


0.7 


Spleen 


3.8 4.2 


1.6 


Ovarian 


0.0 


0.0 


11.5 
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V 











ca. 

OVCAR-5 








Lymph 
node 


5.0 


1.1 


1.4 


Ovarian 
ca. 

OVCAR-8 


1.3 


4.3 


4.1 


Colorectal 


3.4 


5.4 


6.8 


Ovarian 
ca. 

IGROV-1 


0.0 


0.0 


8.1 


Stomach 


6.0 


15.4 


3.1 


Ovarian 
ca.* 

(ascites) 
SK-OV-3 


7.5 


16.0 


100.0 


Small 
intestine 


15.9 


18.7 


2.3 


Uterus 


17.8 


15.1 


9.9 


Colon ca. 
SW480 


24.3 


15.3 


11.6 


Placenta 


4.6 


8.2 


2.1 


Colon ca * 
SW620(S 
W480 met) 


0.0 


0.0 


2.1 


Prostate 


3.6 


5.3 


0.6 


Colon ca. 
HT29 


0.0 


0.0 


0.0 


Prostate 
ca.* (bone 
met)PCr3 


1.7 


1.5 


6.1 


Colon ca. 
HCT-116 


3.8 


0.6 


3.3 


Testis 


21.9 


14.6 


1.6 


Colon ca. 
CaCo-2 


0.0 


0.8 


0.3 


Melanoma 
Hs688(A). 
T 


3.1 


4.7 


1.4 


Colon ca. 
tissue(OD 
03866) 


2.3 


0.0 


1.6 


Melanoma 
* (met) 
Hs688(B). 
T 


0.4 


1.3 


0.0 


Colon ca. 
HCC-2998 


0.0 


0.0 


1.6 


Melanoma 
UACC-62 


0.0 


0.0 


0.0 


Gastric 
ca.* (liver 
met) NCI- 
N87 


16.7 


14.9 


15.3 


Melanoma 
M14 


0.0 


0.0 


0.0 


Bladder 


1.0 


5.1 


J.U 


Melanoma 

LUX 
IMVI 


A A 
U.U 


A A 
0.0 


A A 
U.U 


Trachea 


24.3 


33.7 


5.7 


Melanoma 
* (met) 
SK-MEL- 
5 


0.0 


2.0 


0.7 


Kidney 


0.0 


0.0 


0.0 


Adipose 


6.7 


7.2 


21.2 
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Table BFD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2261, Run 
150811744 


Rel. Exp.(%) 
Ag2261,Run 
152887693 


Tissue Name 


Rel. Exp.(%) 
Ag2261, Run 
150811744 


Rel. Exp.(%) 
Ag2261, Run 
152887693 


Normal Colon 


19.1 


19.8 


Kidney 
Margin 
8120608 


2.4 


0.0 


CC Well to Mod 
Diff(OD03866) 


0.0 


5.8 


Kidney Cancer 
8120613 


14.6 


7.3 


CC Margin 
(OD03866) 


19.5 


12.5 


Kidnev 
Margin 
8120614 


4.8 


1.5 


CC Gr.2 

rectosigmoid 

(OD03868) 


3.8 


1.4 


Kidney Cancer 

C\l\ 1 AO'Nrt 

9010320 


0.0 


0.0 


CC Margin 


2.6 


5.1 


Kidney 
Margin 
9010321 


0.0 


0.0 


CCModDiff 
(ODO3920) 


6.0 


2.9 


Normal Uterus 


9.7 


2.8 


CC Margin 
(ODO3920) 


23.8 


6.4 


Uterus Cancer 
064011 


85.9 


41.5 


CC Gr2 ascend 
colon 

(OD03921) 


9.3 


22 


Normal 
Thyroid 




7 1 


CC Margin 
(OD03921) 


16.8 


11.7 


Thyroid 

Cancer 

064010 


0 0 


OA) 


CC from Partial 
Hepatectomy 
(ODO4309) 
Mets 


2.4 


0.0 


Thyroid 

Cancer 

A302152 


1.9 


1.2 


Liver Margin 


2.6 


0.0 


Thyroid 

Margin 

A302153 


2.6 


2.8 


Colon mets to 

lung(OD04451- 

01) 


7.9 


4.5 


Normal Breast 


16.2 


2.7 


Lung Margin 
(OD04451-02) 


11.3 


12 9 


Breast Cancer 
(OD04566) 


7R S 1 
to. J t 


Zy.f 


Normal Prostate 
6546-1 


6.3 


2.6 


Breast Cancer 
(OD04590-01) 


37.6 


23.8 


Prostate Cancer 
(OD04410) 


17.8 


7.3 


Breast Cancer 
Mets 

(OD04590-03) 


100.0 


24.5 


Prostate Margin 
(OD04410) 


10.7 


7.4 


Breast Cancer 
Metastasis 


94.0 


45.4 
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//mt\a a s~ r r f\r\ 

(OD04655-05) 






Prostate Cancer 
(OD04720-01) 


4.7 


4.4 


Breast Cancer 
064006 


25.7 


24.8 


Prostate Margin 
(OD04720-02) 


13.9 


5.6 


Breast Cancer 
1024 


23.2 


7.1 


Normal Lung 
061010 


36.6 


14.3 


Breast Cancer 
9100266 


33.0 


7.5 


Lung Met to 

Muscle 

(OD04286) 


1.0 


0.0 


Breast Marffin 
9100265 


7.6 


7.6 


Muscle Margin 
(OD04286) 


31.0 


38.2 


Breast Cancer 
A209073 


13.9 


0.9 


Lung Malignant 

Cancer 

(UDUJIZo) 


81.8 


100.0 


Breast Margin 
A2090734 


2.5 


0.0 


Lung Margin 
(OD03126) 


35.8 


18.2 


Normal Liver 


0.0 


0.0 


Lung Cancer 
(OD04404) 


57.0 


39.5 


Liver Cancer 
064003 


0.0 


0.0 


Lung Margin 
(OD04404) 


9.4 


11.8 


Liver Cancer 
1025 


4.8 


1.7 


Lung Cancer 
(OD04565) 


37.1 


42.0 


Liver Cancer 
1026 


7.1 


0.0 


Lung Margin 
(OD04565) 


22.7 


9.3 


Liver Cancer 
6004-T 


4.8 


0.0 


Lung Cancer 
(OD04237-01) 


5.3 


6.4 


Liver Tissue 
6004-N 


4.4 


1.8 


Lung Margin 
(OD04237-02) 


78.5 


32.8 


Liver Cancer 
6005-T 


0.0 


6.0 


Ocular Mel Met 
to Liver 
(ODO4310) 


0.0 


0.0 


Liver Tissue 
6005-N 


0.0 


1.8 


Liver Margin 
(ODO4310) 


2.4 


0.0 


Normal 
Bladder 


2.4 


3.0 


Melanoma Mets 
to Lung 
(OD04321) 


13.0 


0.0 


Bladder 
Cancer 1023 


8.5 


4.9 


Lung Margin 
(OD04321) 


96.6 


50.0 


Bladder 

Cancer 

A302173 


17.0 


11.8 


Normal Kidney 


0.0 


0.0 


Bladder 
Cancer 

(OD04718-01) 


10.0 


5.7 


Kidney Ca, 
Nuclear grade 2 
(OD04338) 


0.0 


O.o 


Bladder 
Normal 
Adjacent 
(OD04718-03) 


19.3 


27.5 
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Kidney Margin 
(OD04338) 


4 0 


4 6 


TsJrvrmfil Ovarv 
liwiiiidi v/v at y 




17 4 


Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


0.0 


3.3 


Ovarian 

Cancer 

064008 


37.9 


2.1 


Kidnev Margin 
(OD04339) 


18.7 


0.0 


Ovarian 
Cancer 

(OD04768-07) 


18.4 


3.7 


Kidney Ca, Clear 
cell type 
(OD04340) 


8.8 


11.7 


w v cu y iviai gin 

(OD04768-08) 


28.3 


12.2 


Kidney Margin 
(OD04340) 


0 0 


20 


Normal 
Stomach 


48 3 


17 % 


Kidney Ca, 
Nuclear grade 3 
(OD04348) 


3.5 


4.0 


VJuoUiv ^allvvl 

9060358 


0.0 


0.0 . 


Kidnev Marcnn 

Xv.lUAlWJr 1VJ.CU gill 

(OD04348) 


2.0 


1.7 


Stomach 

Margin 

9060359 


9.9 


3.0 


Kidney Cancer 
(OD04622-01) 


9.3 


0.0 


Gastric Cancer 
9060395 


20.7 


10.4 


Kidnev lvtarmn 

(OD04622-03) 


0.0 


6.3 


Stomach 

Margin 

9060394 


10.0 


12.2 


Kidney Cancer 
(OD04450-01) 


0.0 


0.0 


Gastric Cancer 
9060397 


8.7 


1.5 


Kidney Margin 
(OD04450-03) 


0.0 


0.0 


Stomach 

Margin 

9060396 


7.5 


62 


Kidney Cancer 
8120607 


0.0 


0.7 


Gastric Cancer 
064005 


10.7 


4.8 



Table BFE. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag3035,Run 
190944495 


Tissue Name 


Rel. Exp.(%) 
Ag3035, Run 
190944495 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


1.7 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.8 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.2 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.6 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


1.1 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


2.7 


Primary Thl act 


0.0 


Lung Microvascular EC 


0.6 
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ffTi T"I — » 1 1 ■ TT 1 L a. 

TNFalpha + IL-lbeta 




Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


3.8 


Primary Trl act 


0.0 


Microsvasular Dermal EC 

rpvTT" 1 1 t TT It. 

TNFalpha + IL-lbeta 


1.2 


Primary Thl rest 


0.0 


Bronchial epithelium 

i-i-rv tr< i f i tt 11 i 

TNFalpha + ELI beta 


3.7 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


1-9 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


4.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery arteiy SMC rest 


0.2 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


2.4 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


1.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
x ivi/v ionomycin 


2.1 


2ry Thl/Th2/Trl_anti- 


0.0 


CCD1 106 (Keratinocytes) 
nunc 


22.2 


LAK cells rest 


0.0 


PPHI 1A£ fit f*r at in rtn\/tf*c\ 

k^ksU i iuo ^ivcr<iiinouyicd^ 
TNFalpha + IL-lbeta 


18.8 


Ju/\rv isCUo iij-z* 


ft n 


T r»1T*T*Vl/\C1C 


fi 7 


T relic: TT -7+TT -19 


v.U 


iN\^i-n^7^ none 


n 4 


T AK rplk TT -94-TF7M 

gamma 


0.0 


NCI-H292IL-4 


1.5 


T AK celk TT -9+ TT -1 R 


0 0 


MrT-H909 TT -0 

1n^1-.T3j£j/Z. lutV 




T AK cells 
PMA/ionomycin 


11.0 


NCI-H292 IL-13 


1.4 


NK Cells IL-2 rest 


0 0 


NCI-H292 TFN pamma 




Two Wav Ml R 3 dav 


0 0 

V/.vr 


XXXT\Xj\^ nunc 


1 1 


Two Way MLR 5 day 


0.0 


HPAFC TNF alnha 4- TT -1 
beta 


0.5 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


6.2 | 


PBMC rest 


0.0 


Lune fibroblast TNF aloha 
+ IL-lbeta 


2.1 


PBMC PWM 


0.5 


Lung fibroblast IL-4 


4.2 


PBMC PHA-L 


0.4 


Lung fibroblast IL-9 


8.3 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


4.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


8.1 
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B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.4 


i> iympnocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 


0.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 

PfT>1070 TT -1 hpfa 


2.9 


EOL-1 dbcAMP 
PMA/ionomycin 


1.0 


L/ CI Ilia L IlurOUiabL IrlN 

gamma 


5.8 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


17.2 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


4.8 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


1.0 


Monocytes rest 


0.0 


Neutrophils rest 


2.2 


Monocytes LPS 


0.6 


Colon 


2.6 


Macrophages rest 


0.0 


Lung 


8.8 


Macrophages LPS 


0.0 


Thymus 


17.1 


HUVEC none 


2.4 


Kidney 


100.0 


HUVEC starved 


8.8 







Table BFF. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2261, 

Run 
152887762 


Rel. 
Exp.(%) 
Ag3035, 

Run 
165242424 


Tissue Name 


Rel. 
Exp.(%) 
Ag2261, 

Run 
152887762 


Rel. 
Exp.(%) 
Ag3035, 

Run 
165242424 


Secondary Thl act 


6.0 


2.1 


HUVEC IL-1 beta 


0.0 


1.7 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


3.7 


11.5 


Secondary Trl act 


0.0 


4.2 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


3.1 


Secondary Thl rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


4.3 


5.1 


Secondary Th2 rest 


0.0 


2.3 


HUVEC IL-1 1 


4.0 


11.2 


Secondary Trl rest 


0.0 


0.0 


Lung 
Microvascular EC 
none 


7.2 


8.1 


Primary Thl act 


0.0 


0.0 


Lung 
Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular j 
Dermal EC none 


8.4 


14.5 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 


0.0 


2.2 
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TNFalpha + IL- 
Ibeta 






Primary Thl rest 


0.0 


.0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


0.0 


16.3 


Primarv Th2 re<*t 

x l mini j x iu< iwov 


00 


0.0 


Small airway 
epithelium none 


5.9 


18.8 


Primarv Trl rest 

X 1 I111U1 J X. A X I Vwi 


0.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


24.3 


58.6 


CD45RA CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC rest 


0.0 


2.0 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


0.0 


0.0 


Astrocytes rest 


3.3 


13.5 


Secondary CD8 
lymphocyte rest 


0.0 


0.7 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


8.6 


Secondary CD8 
lymphocyte act 


1 6 


00 


KU-812 
(Basophil) rest 


0 0 


0 0 

Vr.Vr 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 
(Basophil) 
PMA/ionomycin 


0.0 


9.7 


2ry 

Thl/Th2/Trl anti- 
CD95CH11 


0.0 


1.4 


CCD1106 
(Keratinocytes) 
none 


47.3 


100.0 


LAK cells rest 


3.5 


0.0 


CCD1106 
(Keratinocytes) 
1 Nr alpha + IL- 
lbeta ~ 


9.0 


53.6 


JAW __t1 _ XT 

LAK cells ILrZ 


0.0 


0.0 


Liver cirrhosis 


32.8 


9.4 


LAK cells IL-2+IL- 
12 


0.0 

— — ■ 


0.0 


Lupus kidney 


0.0 


1.6 


LAK Cells 1L- 

2+IFN gamma 


0.0 


4.0 


NCI-H292none 


3.8 


3.4 


TAT/" „_1 1 „ TT 1 

LAK cells 1L-2+ 
IL-18 


0.0 


0.0 


NCI-H292IL-4 


8.0 


19.5 


LAK cells 
PMA/ionomycin 


26.1 


50.7 


NCI-H292 IL-9 


0.0 


42 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292IL-13 


13.8 


7.0 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 IFN 
gamma 


16.2 


5.7 


Two Way MLR 5 


0.0 


0.0 


HPAEC none 


6.7 


. 30.1 
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day 












Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


Lung fibroblast 
none 


7.6 


42.0 


PBMC PWM 


0.0 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


3.1 


6.3 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 

TT A 


4.3 


34.2 j 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 

TT ft 

1L-9 


12.7 


27.5 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-13 


6.8 


19.9 


B lymphocytes 
PWM 


0.0 


0.0 


Lung fibroblast 
EFN gamma 


30.4 


51.1 


B lymphocytes 
CD40Land BL-4 


3.1 


0.0 


Dermal fibroblast 
CCD1070rest 


0 0 


1 R 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD1070TNF 
alpha 


5.2 


19.6 


EOL-1 dbcAMP 
PMA/ionomycin 


3.5 


2.7 


Dermal fibroblast 
CCD1070 IL-1 
beta 


0.0 


2.0 


i-/cnuruic ecus 
none 


0.0 


0.0 


Dermal fibroblast 

11 li galllllla. 


28.5 


32.1 


Dendritic cells LPS 


0.0 


0.0 


Dprmal fiHrnKliict 

IL-4 


42.9 


91.4 


Dendritic cells anti- 
CD40 


A A 

0.0 


0.0 


IBD Colitis 2 


2.2 


5.5 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


3.1 


9.6 


Monocytes LPS 


0.0 


0.0 


Colon 


100.0 


58.6 


Macrophages rest 


0.0 


0.0 


Lung 


36.3 


. 26.1 


Macrophages LPS 


0.0 


0.0 


Thymus 


0.0 


0.0 


HUVEC none 


0.0 


17.7 


Kidney 


4.0 


33.0 


HUVEC starved 


17.4 


51.1 









Panel 1 3D Summary: Ag2261 The NOV63 gene is expressed at moderate levels in a 
number of metabolic tissues, with highest overall expression seen in fetal skeletal muscle 
(CTs=30.4-3L8). The higher levels of expression in fetal skeletal muscle when compared to 
adult skeletal muscle suggests that the protein product encoded by the 88091010JEXT gene 
may be useful in treating muscular dystrophy, Lesch-Nyhan syndrome, myasthenia gravis and 
other conditions that result in weak or dystrophic muscle. This gene is also expressed in 
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adipose, thyroid and heart. Since biologic cross-talk between adipose and thyroid is a 
component of some forms of obesity, this gene product may be a protein therapeutic for the 
treatment of metabolic disease, including obesity and Type 2 diabetes. 

Ag3035 This probe/primer set recognizes a distinct portion of this gene and shows a 
5 distinctive expression pattern when compared to Ag2261 . This observation may indicate that 
the probe/primer sets can distinguish splice variants of this gene. In contrast to the results 
obtained with Ag 2261, expression of this gene is highest in an ovarian cancer cell line (CT = 
30.6). As is the case for Ag2261, expression of this gene using Ag3035 also shows relatively 
high levels in fetal skeletal muscle. However, in addition, Ag3035 shows increased levels of 

10 this gene in adult skeletal muscle as well as in adult and fetal heart. Most other expression is 
similar using both probe/primer sets. Please see Ag2261 for additional information. 

Panel 2D Summary: Ag2261 The expression of this gene was assessed in two 
independent runs on panel 2D. This is consistently expressed in samples of breast cancer, 
uterine cancer and lung cancer when compared to their respective normal adjacent tissue 

15 controls. Thus, the expression of this gene could be used to distinguish breast cancer, lung 
cancer or uterine cancer from their normal tissues. Moreover, therapeutic modulation of this 
gene, through the use of small molecule drugs, antibodies or protein therapeutics might be of 
use in the treatment of breast, lung or uterine cancer. 

Panel 4.1D Summary: Ag3035 This probe/primer set recognizes a distinct portion of 

20 this gene and shows a distinctive expression pattern when compared to Ag2261 in Panel 4D. 
This observation may indicate that the probe/primer sets can distinguish splice variants of this 
gene. In contrast to the results obtained with Ag2261, expression of this gene is highest in 
kidney (CT = 30.6). Most other expression is similar using both probe/primer sets. The 
NOV63 gene, a WNT-14 homolog is also expressed at moderate to low levels in several 

25 unstimulated or cytokine-activated keratinocyte and lung and dermal fibroblast preparations 
(CT range 29-34). Thus, the NOV63 gene may be useful as a protein therapeutic that reduces 
or eliminates the symptoms of chronic obstructive pulmonary disease, asthma, emphysema, or 
psoriasis. In addition, due to its known effects on development of vertebrate joints, the protein 
encoded by the NOV63 gene may also reduce or eliminate the symptoms of osetoarthritis (See 

30 Hartmann and Tabin, 2001). 
References: 

Christine Hartmann and Clifford J. Tabin Wnt-14 Plays a Pivotal Role in Inducing 
Synovial Joint Formation in the Developing Appendicular Skeleton Cell, Vol 104, 341-351, 
February 2001 



1256 



WO 02/068649 



PCT/US02/02785 



The long bones of the vertebrate appendicular skeleton arise from initially continuous 
condensations of mesenchymal cells that subsequently segment and cavitate to form discrete 
elements separated by synovial joints. Little is known, however, about the molecular 
mechanisms of joint formation. We present evidence that Wnt-14 plays a central role in 
5 initiating synovial joint formation in the chick limb. Wnt-14 is expressed in joint-forming 

regions prior to the segmentation of the cartilage elements, and local misexpression of Wnt-14 
induces morphological and molecular changes characteristic of the first steps of joint 
formation. Induction of an ectopic joint-like region by Wnt-14 suppresses the formation of the 
immediately adjacent endogenous joint, potentially providing insight into the spacing of joints. 

10 Panel 4D Summary: Ag2261 The NOV63 transcript is expressed at low levels in 

colon (CT=33.5). Low but significant levels of expression are also found in the lung, 
keratinocytes and dermal fibroblast. Thus, this transcript could be used as a marker for thymic, 
lung and skin tissues. The putative Wnt-14 molecule encoded by this transcript may play an 
important role in the normal homeostasis of these tissues. Therapeutics designed with the 

15 protein encoded by this transcript could be important for maintaining or restoring normal 
function to these organs during inflammation. 

NOV64 

Expression of gene NOV64 was assessed using the primer-probe set Ag3043, 
described in Table BGA. Results of the RTQ-PCR runs are shown in Tables BGB, 
20 BGC and BGD. 



Table BGA. Probe Name Ag3043 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 » -cctgtatgaggaagtcgatgag-3 1 


22 


868 


1258 


Probe 


TET-5 1 -aggtcattcacgtcccctctcctg-3 1 - 
TAMRA 


24 


900 


1259 


Reverse 


5 ■ -gatacgagtccgtcttcctttc-3 ' 


22 


932 


1260 



Table BGB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 211012232 


Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 211012232 


AD 1 Hippo 


23.0 


Control (Path) 3 
Temporal Ctx 


10.2 


AD 2 Hippo 


32.8 


Control (Path) 4 
Temporal Ctx 


28.3 


AD 3 Hippo 


12.3 


AD 1 Occipital Ctx 


20.2 



1257 



WO 02/068649 PCT/US02/02785 



AD 4 Hippo 


1L2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


88.9 


AD 3 Occipital Ctx 


13.1 


AD 6 T-Tinnn 


73 2 


AD 4 Ocrinital Ctx 

i wi/i iji ten via 


26 2 


Control 2 Hippo 


31.6 


AD 5 Occipital Ctx 


47.6 


control 4 Hippo 


15.0 


AD 6 Occipital Ctx 


27.5 


Control (Path) 3 
Hippo 


13.1 


Control 1 Occipital 
Ctx 


8.7 


AD 1 Temporal Ctx 


32.3 


Control 2 Occipital 
Ctx 


57.0 


AD 2 Temporal Ctx 


44.4 


Control 3 Occipital 
Ctx 


25.5 


AD 3 Temporal Ctx 


15.4 


Control 4 Occipital 
Ctx 


11.9 


AD 4 Temporal Ctx 


29.1 


Control (Path) I 
Occipital Ctx 


68.8 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


14.1 


AD 5 Sup Temporal 
Ctx 


54.3 


Control (Path) 3 
Occipital Ctx 


6.7 


AD 6 Inf Temporal 
Ctx 


72.7 


Control (Path) 4 
Occipital Ctx 


17.3 


AD 6 Sup Temporal 
Ctx 


58.2 


Control 1 Parietal 
Ctx 


12.9 


Control 1 Temporal 
ctx 


12.0 


Control 2 Parietal 
Ctx 


47.0 


Control 2 Temporal 
ctx 


37.1 


Control 3 Parietal 
Ctx 


19.6 


Control 3 Temporal 
Ctx 


21.0 


Control (Path) 1 

Parietal Ptv 


62.4 


Control 3 Temporal 
Ctx 


13.5 


Control (Path) 2 
Parietal Ctx 


24.8 


Control (Path) 1 
Temporal Ctx 


56.6 


Control (Path) 3 
Parietal Ctx 


6.0 


Control (Path) 2 
Temporal Ctx 


40.3 


Control (Path) 4 
Parietal Ctx 


46.3 



Table BGC. Panel 1. 3D 



Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 167963717 


Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 167963717 


Liver adenocarcinoma 


79.0 


Kidney (fetal) 


21.6 


Pancreas 


4.3 


Renal ca. 786-0 


192 


Pancreatic ca. CAPAN 
2 


16.3 


Renal ca. A498 


18.4 
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Adrenal gland 


5.9 


Renal ca. RXF 393 


31.2 


Thyroid 


2.8 


Renal ca. ACHN 


19.2 


Salivary gland 


2.2 


Renal ca.UO-31 


46.0 


Pituitary gland 


3.3 


Renal ca.TK- 10 


18.2 


Brain (fetal) 


7.4 


Liver 


3.5 


Drain ^wnuicj 


q 7 


l-jvcr ^leiai ) 


1 1 A 


Brain (amygdala) 


6.9 


Liver ca. 
^nepaxoDiasi ) xicpvjz 


18.8 


Brain (cerebellum) 


12.9 


Lung 


3.9 


Brain (hippocampus) 


6.6 


Lung (fetal) 


13.0 


Brain (substantia nigra) 


8.2 


Lung ca. (small cell) 
LX-1 


19.5 


Brain (thalamus) 


4.6 


Lung ca. (small cell) 
NCI-H69 


11.0 


Cerebral Cortex 


10.5 


Lung ca. (s.cell var.) 
SHP-77 


55.5 


Spinal cord 


5.7 


Lung ca. (large 
cell)NCI-H460 


3.3 


glio/astro U87-MG 


40.1 


Lung ca. (non-sm. 
cell) A549 


25.2 


glio/astroU-118-MG 


50.3 


Lung ca. (non-s.cell) 
NCI-H23 


15.0 


astrocytoma SW1783 


29.1 


Lung ca. (non-s.cell) 
HOP-62 


13.2 


neuro*; met SK-N-AS 


11.7 


Lung ca. (non-s.cl) 
NCI-H522 


12.0 


! astrocytoma SF-539 


23.0 


Lung ca. (squam.) 
SW 900 


27.9 


astrocytoma SNB-75 


44.4 


Lung ca. (squam.) 
NCI-H596 


22.4 


glioma SNB-19 


15.9 


Mammary gland 


4.6 


glioma U25l 


17.3 


Breast ca.* (pl.ef) 

lV/fPF 7 


23.3 


glioma SF-295 


28.9 


Dreasi ca. yji.cij 
MDA-MB-231 


46.7 


Heart (fetal) 


20.7 


jjreast ca. ^pi.ei/ 
T47D 


21.5 


ncai I 


9 9 


£>reasi ca. d i o**y 


1 J.Z 


Skeletal muscle ( fetal) 


29.3 


Breast ca. MDA-N 


42 9 


Skeletal muscle 


37.4 


Ovary 


6.1 


Bone marrow 


4.9 


Ovarian ca. OVCAR- 
3 


10.4 


Thymus 


11.0 


Ovarian ca. OVCAR- 
4 


19.5 


Spleen 


7.0 | 


Ovarian ca. OVCAR- 


39.5 
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5 




Lymph node 


19.3 


Ovarian ca. OVCAR- 
o 


6.3 


Colorectal 


12.9 


Ovarian ca. IGROV- 
i 


6.5 


Stomach 


3.7 


Ovarian ca.* (ascites) 

uI\"\J V - J 


100.0 


onlall 1I11CM1I1C 






J.O 


Colon ca. SW480 


14.2 


Placenta 


3.8 


Colon ca.* 
SW620(SW480 met) 


45.4 


Prostate 


2.9 


Colon ca. HT29 


12.7 


Prostate ca.* (bone 
iiicijrL'O 


40.9 


Colon ca.HCT-1 16 


25.9 


Testis 


2.8 


Colon ca. CaCo-2 


25.2 


Melanoma 
Hs688(A).T 


6.8 


Colon ca. 
tissue(OD03866) 


15.4 


Melanoma* (met) 
Hs688(B).T 


7.7 


Colon ca. HCC-2998 


13.3 


Melanoma UACC-62 


17.3 


Gastric ca.* (liver met) 
NCI-N87 


10.4 


Melanoma M14 


11.0 


Bladder 


13.8 


Melanoma LOX 
IMVI 


43.5 


Trachea 


3.0 


Melanoma* (met) 
SK-MEL-5 


27.5 


Kidney 


9.5 


Adipose 


7.3 



Table BGD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3043, Ran 
164315037 


Tissue Name 


Rel. Exp.(%) 
Ag3043, Run 
164315037 


Secondary Thl act 


38.4 


HUVEC IL-lbeta 


20.6 


Secondary Th2 act 


38.7 


HUVEC IFN gamma 


22.5 


Secondary Trl act 


40.3 


HUVEC TNF alpha + IFN 
gamma 


28.7 


Secondary Thl rest 


10.2 


HUVEC TNF alpha + IL4 


40.6 


Secondary Th2 rest 


18.2 


HUVEC IL-11 


18.4 | 


Secondary Trl rest 


16.3 


Lung Microvascular EC 
none 


23.3 


Primary Thl act 


47.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


26.1 


Primary Th2 act 


34.6 


Microvascular Dermal EC 
none 


47.0 


Primary Trl act 


46.3 


Microsvasular Dermal EC 


28.5 



1260 



WO 02/068649 



PCT/US02/02785 







TNFalpha + IL-1 beta 




Primary Thl rest 


55.9 


Bronchial epithelium 
TNFalpha + ILlbeta 


54.7 


Primary Th2 rest 


28.5 


Small airway epithelium 
none 


29.7 


Primary Trl rest 


33.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


89.5 


/^*1~V A ^T"V A /lit M 

CD45RA CD4 
lymphocyte act 


31.4 


Coronery artery SMC rest 


33.0 


CD45RO CD4 
lymphocyte act 


39.8 


Coronery artery SMC 
TNFalpha + IL-1 beta 


14.2 


CD8 lymphocyte act 


47.0 


Astrocytes rest 


10.6 


Secondary CD8 
lymphocyte rest 


37.6 


Astrocytes TNFalpha + 
IL-lbeta 


8.2 


Secondary CD8 
lymphocyte act 


25.3 


KU-812 (Basophil) rest 


20.4 


CD4 lymphocyte none 


9.5 


KU-812 (Basophil) 
PMA/ionomycin 


49.7 


2ry Thl/Th2/Tr l_anti- 


21.2 


CCD 1 106 (Keratinocytes) 
none 


47.3 


LAK cells rest 


43.8 


CCDl lUo (Keratinocytes) 
TNFalpha + IL-1 beta 


25.3 


jU/xW. CcllS LLj-Z 




Liver cirrhosis 


A O 

4.8 






Lupus kidney 


5.0 


JLr/\iv ecus ijLf~^ • lr IN 
gamma 


52.5 


NCI-H292 none 


46.7 


LAK cells IL-2+ IL-18 


42.9 


NCI-H292 IL-4 


62.9 


jL/\rv ecus 
PMA/ionomycin 


15.6 


NCI-H292 IL-9 


70.7 


NK Cells IL-2 rest 


29.3 


NCI-H292IL-13 


33.9 


Two Wav "Ml P 3 Hav 




XTr^T OOOO TT7XT sro™™» 

INLJ-Hzyz iriN gamma 


35.6 


Two Wav MT R S rtav 

i wu vv ay ivii_/iv j uay 




LTD A VIC* nnna 

rir /vbL. none 


1 1 .3 


Two Way MLR 7 day 


16.2 


rlrAbC 1 JNr alpha + 1L-1 
beta 


29.5 


PR MP r*»Qt 


10 0 

IU.7 


i^ung iioroDiasi none 


zo.z 


PBMC PWM 


91.4 


Lung fibroblast TNF alpha 

1 TT 1 L a f a 

* 1JL- 1 oeta 


17.8 


PBMC PHA-L 


43.5 


Lung fibroblast IL-4 


40.9 


ivainua \d mil) none 




JLung tibroblast lL-y 


40.6 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-13 


20.9 


B lymphocytes PWM 


94.6 


Lung fibroblast IFN 
gamma 


41.8 


B lymphocytes CD40L 
and IL-4 


39.0 


Dermal fibroblast 
CCD1070 rest 


56.3 
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EOL-l dbcAMP 


24.7 


Dermal fibroblast 
CCD1070 TNF alpha 


84.7 


EOL-l d be AMP 

i^ivi/A/ lUiiuiii y L/iii 


40.9 


Dermal fibroblast 

fYT>1f)7A IT -1 hf»ta 


25.0 


Dendritic cells none 


34.2 


uci nidi iiurouiobi iriN 
gamma 


29.3 


Dendritic cells LPS 


38.7 


Dermal fibroblast IL-4 


42.6 


Dendritic cells anti- 
CD40 


48.0 


IBD Colitis 2 


2:0 


Monocytes rest 


17.9 


IBD Crohn's 


1.8 


Monocytes LPS 


19.1 


Colon 


17.3 


Macrophages rest 


40.6 


Lung 


16.8 


Macrophages LPS 


18.4 


Thymus 


17.7 


HUVEC none 


37.4 


Kidney 


29.1 


HUVEC starved 


65.1 







CNS_neurodegeneration_vl.O Summary: Ag3043 There is an association with a 
statistical confidence of 0.1 between increased expression of the NOV64 gene in the temporal 
cortex and Alzheimer's disease. This gene encodes a homolog of dipeptidyl peptidase, which 
5 belongs to a known class of markers of T cell activation in Multiple Sclerosis. This indicates 
that inhibitors of this gene product may have utility in treatment of this disease. A dipeptidyl 
peptidase is also dysregulated in Huntington's disease. Our finding of increased expression of 
this gene product in the temporal cortex of Alzheimer's disease patients indicates that there 
may be a wider utility of inhibitors of the protein encoded by this gene, including the treatment 
1 0 of neurodegenerative diseases such as Huntington's and Alzheimer's, as well as Multiple 
Sclerosis. 

References: 

Khoury SJ, Guttmann CR, Orav EJ* Kikinis R, Jolesz FA, Weiner HL. Changes in 
activated T cells in the blood correlate with disease activity in multiple sclerosis. Arch Neurol 
15 2000 Aug;57(8):l 183-9 

OBJECTIVE: To determine whether changes in activation markers on peripheral blood 
T cells correlate with disease activity in patients with multiple sclerosis. DESIGN: In a 
prospective longitudinal study during 1 year, we analyzed the change in percentage of 
activated T lymphocytes in the peripheral blood of 40 patients with multiple sclerosis in 
20 relation to clinical findings and changes on brain magnetic resonance imaging (MRI) scans. 
The patients underwent repeated imaging of the brain (mean number of MRIs for each patient, 
22) at the time blood samples were obtained as well as at monthly neurological examinations, 
and at the time of scoring on the Kurtzke Expanded Disability Status Scale (EDSS) and 
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ambulation index scale. RESULTS: A change in the percentage of cells expressing the 
activation markers interleukin 2 receptor (CD25), class II major histocompatibility complex 
(MHC) (13) or surface dipeptidyl peptidase (CD26) correlated significantly with a change in 
lesion volume or a change in number of gadolinium-enhancing lesions as detected on MRI. 
5 Changes in CD25( +) cells and in CD4(+) cells expressing class II MHC also correlated with 
changes in disability as measured by EDSS in patients with relapsing-remitting disease, and 
changes in CD4(+)CD25(+) cells correlated with the occurrence of attacks in patients with 
relapsing-remitting disease. These correlations are dependent on measurement of changes 
between time points sampled at 1- or 2-week intervals. CONCLUSION: There is a linkage 

1 0 between peripheral T-lymphocyte activation as measured by cell surface markers and disease 
activity in patients with multiple sclerosis. 

Mantle D, Falkous G, Ishiura S, Perry RH, Perry EK- Comparison of cathepsin 
protease activities in brain tissue from normal cases and cases with Alzheimer's disease, Lewy 
body dementia, Parkinson's disease and Huntington's disease. J Neurol Sci 1995 Jul;13 1(1):65- 

15 70 

Recent evidence, based upon immunocytochemical and histochemical analysis of brain 
cortical tissue from alzheimer's disease patients, has suggested that altered activity and/or 
distribution of the lysosomal proteases cathepsins B and D may be implicated in the abnormal 
protein processing pathway resulting in formation of the neurotoxic amyloid A4 peptide, 

20 characteristic of this neurodegenerative disorder. We have therefore compared, via 

biochemical assay techniques using conventional or specially synthesised (corresponding to 
protein cleavage points of relevant to A4 peptide formation) fluorogenic substrates, the levels 
of activity of the lysosomal proteases cathepsins B, D, H and L, and dipeptidyl 
aminopeptidases I and II in frontal cortex (grey/white matter) from control and Alzheimer's 

25 disease patients. For comparative purposes, activity levels of the above enzymes were also 
determined in frontal cortex tissue from cases with Lewy body dementia and Parkinson's 
disease, and in caudate tissue from control and Huntington's disease cases. There was no 
significant difference in activity for any protease types in tissue from control cases and cases 
with Alzheimer's disease, Lewy body dementia or Parkinson's disease, with the exception of 

30 reduced dipeptidyl aminopeptidase II activity in Lewy body dementia and Parkinson's cases. 
We have therefore been unable to confirm a potential role for lysosomal cathepsins in the 
characteristic neurodegeneration associated with Alzheimer's disease; however the finding of 
significant increases in activity of dipeptidyl aminopeptidase II, cathepsin H and cathepsin D 
specifically in cases with Huntington's disease is of particular note. We therefore suggest the 

1263 



WO 02/068649 



PCT/US02/02785 

r 



potential role of the latter enzymes in the pathogenesis of Huntington's disease requires further 
investigation 

Panel 1.3D Summary: Ag3043 Highest expression of the NOV64 gene is seen in an 
ovarian cancer cell line (CT=26.2). This gene is expressed at moderate levels in all the cancer 
5 cell lines in this panel. Thus, this is a potential target for small molecule inhibitor drugs in 
cancer. 

This gene also has moderate levels of expression in pancreas, adrenal, thyroid, 
pituitary, heart, skeletal muscle, liver and adipose. Therefore, this gene product may be a small 
molecule target for the treatment of metabolic and endocrine diseases, including obesity and 
10 Types 1 and 2 diabetes. 

In addition, this gene is expressed in the central nervous system. Please see 
CNS_neurodegeneration_vl .0 for discussion of utility of this gene in the central nervous 
system. 

Panel 4D Summary: Ag3043 The NOV64 gene is expressed in a number of cells and 
15 tissues of immunological importance, especially in activated B cells, T cells, dendritic cells, 
and activated lung and skin fibroblasts. Therefore, small molecule antagonists that block the 
function of the NOV64 gene product may reduce or eliminate the symptoms of a wide range 
of autoimmune and inflammatory diseases, including Crohn's disease, ulcerative colitis, 
multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, rheumatoid 
20 arthritis, lupus erythematosus, or psoriasis. 

NOV65a and NOV65b 

Expression of gene NOV65a and variant NOV65b was assessed using the primer-probe 
sets Ag3020 and Ag2968, described in Tables BHA and BHB. Results of the RTQ-PCR runs 
are shown in Tables BHC, BHD, BHE, BHF and BHG. 

25 Table BHA. Probe Name Ag3020 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggaatcacccacattctgaat-3 1 


21 


193 


1261 


Probe 


TET-5 1 -cgtttacactggccccgaattctaca-3 ' - 
TAMRA 


26 


231 


1262 


Reverse 


5 1 -cctctacacccaggtactggat-3 ' 


22 


268 


1263 


Table BHB. Probe Name Ag2968 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggaatcacccacattctgaat-3 1 


21 


193 


1264 


Probe 


TET-5 ' -cgtttacactggccccgaattctaca-3 ' - 


26 


231 


1265 
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|tamra j 


Reversej5 1 -cctctacacccaggtactggat-3 • 


J22 


]268 


12S6 



Table BHC. General_screeningj>anel_vL4 



Tissue Name 


Run 221998694 


Tissue Name 


Kel. Exp.(%) AgiOzO, 
Run 221998694 




n a 


Kenal ca. 1 K- 1 U 


A A 


iviciaJ IUI 1 1 a 

Hs688(A).T 


0.0 


Bladder 


0.1 


xyicianumd 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca K ATO TTI 

VIOOUlv vQi IVril V/ 111 


0 0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


02 


Squamous cell 
carcinoma SCC-4 


0.0 


met)SW620 


0.0 


Testis Pool 


0.0 


Pnlnn ra 




Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.2 


Colon ca. CaCo-2 


0.0 


J IdCCIlld 


n 1 


Colon cancer tissue 


31.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 

\J V V>/vtvO 


0.0 


Colon ca. Colo-205 


100.0 


Ovarian ca ^-OV- 

3 


0.3 


Colon ca. SW-48 


5.7 


Ovarian pa 

OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.1 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
1 


18.9 


Stnmar^h PaaI 




Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.1 


Fetal Heart 


12.7 


Breast ca. MCF-7 


0.0 


Heart Pool 


4.9 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.1 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


23.3 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


20.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 
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Trachea 


0.3 


CNS cancer (glio/astro) 
U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) 
U-H8-MG 


0.1 


Fetal Lung 


0.2 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.7 


Lung ca. LX-1 


12.7 


CNS cancer (astro) 

QXTO-7S 


0.1 


Lungca.NCI-H146 


0.1 


v^INO CallCCI ^gllU^ 

SNB-19 


15.9 


Lung ca. SHP-77 


0.0 


l^ino Cancer ^gnuy or* 
295 L 


0.5 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.1 




ft ft 


Drain \cercDviiuiiij 


u.u 




ft ft 


Drain rfptaH 
131 all l ^lciaij 


V. 1 


Lungca.NCI-H460 


0.0 


a rain ^nippucaiiipuDj 
Pool 


0.1 


Lung ca. HOP-62 


0.1 


Cerebral Cortex Pool 


0.1 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) 
Pool 


0.1 


Liver 


0.0 


Brain (Thalamus) Pool 


0.1 


Fetal Liver 


0.1 


Brain (whole) 


0.1 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.1 


Adrenal Gland 


0.0 


Fetal Kidney 


0.1 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.1 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.4 


Renal ca.UO-31 


6.6 


Pancreas Pool 


0.0 



Table BHD. Panel 1JD 



Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


Rel. Exp.(%) 
Ag3020, Run 
167819114 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


Rel. Exp.(%) 
Ag3020, Run 
167819114 


Liver 
adenocarcinoma 


0.0 


0.1 


Kidney (fetal) 


0.0 


0.3 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.1 


0.2 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 
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Thyroid 


0.3 


1.6 


Renal ca. 


0.0 


0.0 


Salivary gland 


0.8 


0.6 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 

10 


0.0 


0.0 


Brain (fetal) 


0.0 


0.1 


Liver 


0.0 


0.0 


Brain (whole) 


0.4 


1.0 


Liver (fetal) 


0.5 


0.0 


Brain (amygdala) 


0.2 


1.0 


Liver ca. 
(hepatoblast) 
HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.5 


Brain 
(hippocampus) 


0 0 


0 0 


T liner rfptflH 


0 0 


0 0 


Brain (substantia 


0.2 


0.1 


Lung ca. 
(small cell) 
LX-1 


10.7 


16.2 


Brain (thalamus) 


0.0 


0.0 


Lungca. . 
(small cell) 
NCI-H69 


0.0 


0.5 


Cerebral Cortex 


0.4 


0.1 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.0 


02 


elio/astro U-118- 
MG 


0.0 


0.1 


Lung ca. (non- 
s.cell)NCI- 
H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0 1 


0 1 


neuro*;met SK-N- 
AS 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


12.1 


8.4 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.3 


Lung ca. 
(squam.) NCI- 


0.1 


0.4 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.2 


0.2 


glioma U251 


0.4 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SF-295 


0.3 


0.3 


Breast ca.* 
(pl.ef) MDA- 


0.0 


0.0 
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MB-231 






Heart (fetal) 


7.4 


26.8 


Breast ca.* 
(pl.ef)T47D 


0.0 


0.0 


Heart 


29.9 


35.4 


breast ca. b 1 - 
549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


. 10.8 


33.9 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


100.0 


1 AA A 
100.0 


Ovary 


A A 
0.0 


A 1 

0.1 


Bone marrow 


0.1 


0.6 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


0.1 


0.1 


Ovarian ca. 
OVCAK-4 


0.0 


0.0 


Snleen 


0 0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.6 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


0.2 


0.2 


Ovarian ca. 
IGROV-1 


26.2 


26.2 


Stomach 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.2 


1.0 


Small intestine 


0.0 


0.0 


Uterus 


0.0 


0.0 


Colon ca. SW480 


0.0 


0.2 


Placenta 


1.0 


0.0 


Colon ca.* 
SW620(SW480 
met) 


1.6 


6.1 


Prostate 


0.2 


0.1 


Colon ca.HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


0.2 


0.2 


Colon ca CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


21.9 


30.6 


Melanoma* 
(met) 

HSOoo(b).l 


0.0 


0.0 


Colon ca.HCC- 
2998 


0.0 


0.5 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


0.0 


0.0 


Melanoma 
Ml & 

JV11H 


0.0 


0.0 


Bladder 


0.1 


0.0 


Melanoma 
LOXIMVI 


0.0 


0.0 


Trachea 


0.8 


0.6 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney ! 


0.0 


0.0 ! 


Adipose 


1.1 


1.7 I 
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Table BHE. Panel 3D 



Tissue Name 


ReL Exp.(%) 
Ag2968, Run 
170188142 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
170188142 


Hnnv- K4^Hti11nH!fl<5tfMTifl 

1/ttUY iVlwUUllwUlvlOlwllia 


0 0 

1/ all 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


2.2 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
^me goicaryo oiasiy 


0.0 


oiNi>- /o- vjiioma 


n n 


"Doit l5 1 ifL^i 4+ 'c nfmnnAtno 

ivaji- ourKiit s xympnoma 


u.u 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-l-Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small celt lung 
cancer 


0.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


00 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


0.0 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


0.0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms 1 tumor 


0.0 


NCI-H1155- Large cell 


0.0 


Hs766T- Pancreatic 


0.0 
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lung cancer 




carcinoma (LN metastasis) 




NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


4.4 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


100.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


4.3 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS174T- Colon 
adenocarcinoma 


10.1 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


1.2 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-l 6- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.7 


NCI-SNU-l- Gastric 
carcinoma 


0.1 


A431- Epidermoid carcinoma 


0.0 


RF-l- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 

aucauvai \*iii\jllia 


0.0 


DU 145- Prostate carcinoma 
^ orain meiasiasis ) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 
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RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 


Table BHF. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3020,Run 
164528102 


Tissue Name 


Rel. Exp.(%) 
Ag3020, Run 
164528102 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th2 act 


a A 
U.U 


Microvascular Dermal EC 
none 


O 0 


Primary Irl act 


A A 

u.u 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0 ft 


Primary Tnl rest 


A A 
U.U 


Bronchial epithelium 
TNFalpha + ILlbeta 


0 0 
U.U 


Primary l nz rest 


A A 
U.U 


Small airway epithelium 
none 


O ft 

U.U 


rnmary in rest 


A A 
U.U 


Small airway epithelium 
TNFalpha + IL-lbeta 


ft ft 

U.U 


CD45RA CD4 
lymphocyte act 




Cnrnnerv arterv SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


65.1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


21.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


10.7 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD1 1 06 (Keratinocytes) 
none 


0.0 


LAK cells rest 


13.3 


CCD1106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 
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T AK celk IT -? 


0 0 


1 iver cirrhosk 

L/l Vtl Will JIWOIO 


100.0 


I AK rplk TT -9+TT -1? 


0 0 


T iimic HHnev 

L/UUUO IViVIllVyjr 


0.0 


T AK relk TT -2+TFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


T AK celk 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


8.0 


NCI-H292 IL-13 


0.0 


Two Wflv MT P Hpv 
i wu way ivu^iv j nay 


o o 

v.v 


TsirT-H?Q? TFN cramma 


0 0 

v.VI 


Two Wav MT R S Hnv 


U.v 


HP AFP nnnp 
nrAEv^ nunc 


0 0 


Two Way MLR 7 day 


10.6 


FTPAFP TNF alnha + TT -1 

beta 


11.2 




0 0 


T una fiHrfihlnct tinnf* 
idling uuiuuidoi nunc 


0 0 


PBMCPWM 


0.0 


Lung fibroblast TNF alpha 

4- TT -1 h^ffl 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


TV A 

0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

J TT A 

and IL-4 


9.9 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


LOL-1 dbcAMP 

PM A/ionomvrin 


0.0 


Dermal fibroblast 
ppdio70 tt -i hpta 


0.0 


Dendritic cells none 


23.8 


normal fihrnhlflct TFN 

jLsCIlJICll JJUHJUIaoL JX1N 

gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


44.4 


Macrophages rest 


0.0 


Lung 


26.6 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Table BHG. Panel 5D 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag3020, Run 


Tissue Name 


Ag3020, Run 




172171108 




172171108 
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97457_Patient- 
02go_adipose 


0.2 


94709 Donor 2 AM - A adipose 


0.0 


97476JPatient- 
07sk_skeletal muscle 


8 4 


94710 Donor 2 AM - B adipose 


0.0 


97477_Patient- 
07ut_uterus 


0 0 


94711 Donor 2 AM -C adioose 


0.0 


97478_Patient- 
07pl_placenta 


1 3 


9471 2 Donor 2 AD - A adioose 


3.0 


97481_Patient- 
08sk_skeletal muscle 


12 6 


9471 3 Donor 2 AD - B adioose 


0.0 


97482 J>atient- 
08ut_uterus 


0 0 


94714 Donor 2 AD -C adioose 


0.0 


97483_Patient- 
08pl_placenta 


i 00 

\J»\J 


94742_Donor 3 U - • 
A_Mesenchymal Stem Cells 


0.0 


97486_Patient- 
09sk skeletal muscle 


12 9 


94743JDonor 3 U - 
BJVtesenchymal Stem Cells 


0.0 


97487 JPatient- 
09ut_uterus 


0 7 


Q4730 Donor 3 AM - A adinose 


0.0 


97488_Patient- 
09pl_placenta 


0 2 


Q4731 Dnnnr 3 AM - R adinnse 


00 


97492_Patient- 
10ut_uterus 


0 2 


04739 Donor 3 AM - C adinose 


0.0 


97493_Patient- 
10pl_placenta 


1 1 

1.1 


047^3 Donor 3 AD - A ad i nose 

ZrtfJJ JL/UUlFi J illy n _ au, r Ui3 ^ 


0.0 


97495_Patient- 
llgo_adipose 


0 0 


047*34 Donor 3 AD - R ad i nose 


0.0 


97496_Patient- 
1 lsk_skeletal muscle 


53 6 


94735 Donor 3 AD - C adioose 


0.0 


97497_Patient- 
llut_uterus 


0 3 


77138 1 iver HenG2untreated 


0.0 


97498_Patient- 
llpl_placenta 


23 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


97500_Patient- 
12go_adipose 


0.4 


81735 Small Intestine 


0.0 


97501_Patient- 
12sk_skeletal muscle 


100.0 


72409 JCidney_Proximal 
Convoluted Tubule 


0.0 


97502 JPatient- 
12ut_uterus 


0.0 


82685_Small intestine_Duodenum 


0.0 


97503_Patient- 
12pljplacenta 


l ^ 

u 


90650_Adrenal_Adrenocortical 
adenoma 


ft 9 


94721_Donor2U- 
A Mesenchymal Stem 
Cells 


0.0 


72410JCidney_HRCE 


0.3 


94722_Donor2U- 
B Mesenchymal Stem 
Cells 


0.0 


72411JCidney_HRE 


0.0 
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94723_Donor2U- 
C Mesenchymal Stem 
Cells 


0.0 


73 1 39_Uterus_Uterine smooth 
muscle cells 


0.0 



General^screening^panel^vl^ Summary: Ag3020 The NOV65a gene is expressed 
in brain, colon, lung and ovarian cancer cell lines with highest expression in a colon cancer 
cell line Colo-205 (CT=24.37). This suggests that this gene can be used as a diagnostic marker 
5 for these types of cancer . Furthermore, inhibition of the protein using small molecule drugs 
could potentially be useful for the treatment of brain, colon, lung and ovarian cancer. 

In addition, this gene has low expression in adipose and high expression in adult and 
fetal heart and skeletal muscle. Thus, this protein phosphatase may be a small molecule target 
for the treatment of obesity, Type 2 diabetes and cardiac and skeletal muscle disease. 

10 Panel 1.3D Summary: Ag2968/Ag3020 Results from two experiments using identical 

probe/primer sets are in excellent agreement. Expression of the NOV65a gene is highest in 
adult skeletal muscle (CTs = 26-28). Significant but somewhat lower expression is also seen in 
fetal skeletal muscle and adult/fetal heart. Thus, expression of this gene may be used to 
distinguish these samples from the other samples on this panel. 

15 This gene is also expressed in brain, colon, lung and ovarian cancer cell lines, 

consistent with General_screeningj>anel_vl.4. This suggests that this gene can be used as a 
diagnostic marker for these types of cancer and inhibition of the protein using small molecule 
drugs can be used for the treatment of brain, colon, lung and ovarian cancer. 

Panel 3D Summary: Ag2968 Expression of the NOV65a gene is highest in colon 

20 cancer cell line Colo-205 (CT = 25.6). In addition, significant expression of this gene is seen 
in two other colon cancer cell lines. Thus, expression of this gene may be used to distinguish 
these colon cancer cell lines from the other samples on this panel. Moreover, therapeutic 
modulation of the activity of this gene or its protein product, using small molecules, antibodies 
or protein therapeutics, may be of benefit in the treatment of colon cancer. 

25 Panel 4D Summary: Ag3020 Expression of the NOV65a gene is highest in a liver 

cirrhosis sample (CT = 33.3). Furthermore, expression of this gene is not detected in normai 
liver in Panels 1.3D or 1.4, suggesting that its expression is unique to liver cirrhosis. This gene 
encodes a putative protein phosphatase; therefore, antibodies or small molecule therapeutics 
could reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 

30 protein could also be used for the diagnosis of liver cirrhosis. Low levels of expression are 
also seen in colon and resting astrocytes. 
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Ag2968 Expression of this gene is low/undetectable (CTs > 35) across all of the 
samples on this panel (data not shown). 

Panel 5D Summary: Ag3020 Expression of the NOV65a gene is primarily restricted 
to samples from skeletal muscle. This specific expression is in agreement with the results in 
5 Panels 1.3D and 1.4. Thus, expression of this gene could be used to differentiate between 
skeletal muscle and other samples on this panel, and as a marker of skeletal muscle. Results 
from one experiment with the probe and primer set Ag2968 are not included. The amp plot 
indicates that there were experimental difficulties with this run* 

NOV66 

10 Expression of gene NOV66 was assessed using the primer-probe set Ag2913, 

described in Table BIA. Results of the RTQ-PCR runs are shown in Tables BIB, BIC and 
BED. 



Table BIA. Probe Name Ag2913 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tttgtggcttgatggcttt-3 1 


1*9 


956 


1267 


Probe 


TET-5 1 -ttcctttccgcatttcctatgtgaat-3 ' - 
TAMRA . 


26 


977 


1268 


Reverse 


5 ' -ttccagttaaaggcataacgaa-3 • 


22 


1012 


1269 



15 Table BIB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2913, 
Run 157366466 


Tissue Name 


Rel. Exp.(%) Ag2913, 
Run 157366466 


Liver adenocarcinoma 


1.2 


JtCidney (fetal) 


4.3 


Pancreas 


0.4 


Renal ca. 786-0 


0.6 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. A498 


0.6 


Adrenal gland 


2.3 


Renal ca. RXF 393 


0.2 


Thyroid 


0.7 


Renal ca. ACHN 


0.0 


Salivary gland 


8.3 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.3 


Renal ca.TK-10 


0.0 


Brain (fetal) 


26.4 


Liver 


2.0 


Brain (whole) 


6.4 


Liver (fetal) 


3.5 


Brain (amygdala) 


18.4 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


. 8.6 


Lung 


9.2 


Brain (hippocampus) 


100.0 


Lung (fetal) 


4.7 


Brain (substantia nigra) 


2.6 


Lung ca. (small cell) 


0.6 
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LX-1 




* Brain (thalamus) 


9.5 


Lung ca. (small cell) 
NCI-H69 


2.1 


Cerebral Cortex 


3L2 


Lung ca. (s.cell var.) 
SHP-77 


3.8 


Spinal cord 


1.0 


Lung ca. (large 
cell)NCI-H460 


1.5 


glio/astro U87-MG 


1.2 


Lung ca. (non-sm. 
cell) A549 


2.2 


glio/astro U-l 1 8-MG 


18.9 


Lung ca. (non-s.cell) 
NCI-H23 


7.2 


astrocytoma SW1783 


2.8 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


8.2 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


2.5 


Lung ca. (squam.) 
SW 900 


0.9 


astrocytoma SNB-75 


0.7 


Lung ca. (squam.) 
NCI-H596 


0.8 


glioma SNB-19 


1.6 


Mammary gland 


5.7 


glioma U251 


0.4 


Breast ca.* (pl.ef) 
MCF-7 


2.2 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


3.1 


Heart (fetal) 


0.1 


Breast ca.* (pl.et) 
T47D 


1.4 


rleart 


0.4 


Breast ca. BT-549 


2.6 


Skeletal muscle (fetal) 


0.4 


Breast ca. MDA-N 


1.8 


Skeletal muscle 


0.5 


Ovary 


0.0 


Bone marrow 


8.5 


Ovarian ca. OVCAR- 
3 


0.9 


Thymus 


2.6 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


7.7 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


9.1 


Ovarian ca. OVCAR- 

o 
O 


1.0 


Colorectal 


8.4 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


0.9 


Ovarian ca.* (ascites) 
SK-OV-3 


0.9 


Small intestine 


11.2 


Uterus 


1.2 


Colon ca. SW480 


0.0 


Placenta 


1.7 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


1.9 
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Colon ca. HT29 


0.0 


Prostate ca.* (bone 


0.0 


Colon ca. HCT-116 


0.1 


Testis 


2.8 


Colon ca. CaCo-2 


7.7 


Melanoma 
Hs688(A).T 


0.2 


Colon ca. 
tissue(OD03866) 


1.1 


Melanoma* (met) 
Hs688(B).T 


0.8 


Colon ca. HCC-2998 


7.9 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


92 


Melanoma M14 


0.7 


Bladder 


6.4 


Melanoma LOX 
IMVI 


1.5 


Trachea 


1.4 


Melanoma* (met) 
SK-MEL-5 


1.3 


Kidney 


1.4 


Adipose 


2.3 


Table BIC. Panel 2D 


Tissue Name 


Rel Exn (%\ 
Ag2913, Run 
157366467 


Tissue Name 


Rf>l Fvn (*A\ 

Ag2913, Run 
157366467 


Normal Colon 


32.5 


Kidney Margin 
8120608 


0.0 


CC WelltoModDiff 
(OD03866) 


11.2 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


8.7 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


13.5 


Kidney Cancer 
9010320 


0.0 


CC Margin (OD03868) 


9.5 


Kidney Margin 
9010321 


0.0 


CC Mod Diff (ODO3920) 


35.4 • 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


57.4 


Uterus Cancer 06401 1 


1.6 


CC Gr.2 ascend colon 
(OD03921) 


30.8 


Normal Thyroid 


11.8 


CC Margin (OD03921) 


6.4 


Thyroid Cancer 
064010 


0.3 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


59.0 


Liver Margin (ODO4309) 


5.3 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.4 


Breast Cancer 
(OD04566) 


0.0 
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Normal Prostate 6546-1 


1. 6 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


21.8 


Breast Cancer Mets 
(OD04590-03) 


0.4 


Prostate Margin . 
(OD04410) 


19.6 


Breast Cancer 

Metastasis 
(OD04655-05) 


1.3 


Prostate Cancer 
(OD04720-01) 


16.2 


Breast Cancer 064006 


2.4 


Prostate Margin 
(OD04720-02) 


18.9 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


73.7 


Breast Cancer 
91 00266 


10.7 


Lung Met to Muscle 


4.0 


Breast Margin 


2.0 


(OD04286) 


0.0 


A209073 


2.4 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


1.9 


Lung Margin (OD03126) 


2.1 


Normal Liver 


2.6 


Lung Cancer (OD04404) 


0.4" 


Liver Cancer 064003 


4.5 


i^ung ividrgin \ \JU\} i r L k\} L \ j 


n 0 
u.u 


j-«iver v^anccr iuzj 


u.u 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


2.5 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01). 


52 


Liver Tissue 6004-N 


1.5 


Lung Margin (OD04237- 
02) 


1-9 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.4 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 10) 


2.7 


Normal Bladder 


18.6 


Melanoma Mets to Lung 
(OD04321) 


0.7 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


12.6 


Normal Kidney 


0.0 


Bladder Cancer 
(OD04718-01) 


6.3 


Kidney Ca, Nuclear grade 
2 (OD04338) 


0.0 


Bladder Normal 
Adjacent (OD04718- 


7.3 


Kidney Margin 
(OD04338) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


0.5 


Ovarian Cancer 
(OD04768-07) 


0.0 
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Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


2.0 


Kidney Margin 
(OD04348) 


1.0 


Stomach Margin 
9060359 


13.1 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


11.2 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


15.9 


Kidney Cancer 
(OD04450-01) 


0.5 


Gastric Cancer 
9060397 


41.2 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


5.6 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


100.0 



Table BID. Panel 4D 



Tissue Name 


Rel. Exp;(%) 
Ag2913,Run 
157366468 


Tissue Name 


Rel.Exp.(%) 
Ag2913,Run 
157366468 


Secondary Thl act 


14.9 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


27.7 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


47.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


18.2 


HUVEC TNF alpha + IL4 


0.9 


Secondary Th2 rest 


19.6 


HUVEC IL-11 


1.3 


Secondary Trl rest 


13.9 


Lung Microvascular EC 
none 


9.2 


Primary Thl act 


61.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


9.1 


Primary Th2 act 


56.3 


Microvascular Dermal EC 
none 


10.4 


Primary Trl act 


41.5 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


7.8 


Primary Thl rest 


100.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


02 


Primary Th2 rest 


46.0 


Small airway epithelium 
none 


0.5 


Primary Trl rest 


16.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


2.6 


CD45RA CD4 
lymphocyte act 


9.9 


Coronery artery SMC rest 


0.1 
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CD45ROCD4 j _ 
lymphocyte act j 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act j 15.5 


Astrocytes rest 


0.6 


Secondary CD8 
lymphocyte rest 


17.9 


Astrocytes TNFalpha + IL- 
lbeta 


0.5 


Secondary CD8 
lymphocyte act 


16.6 


KU-812 (BasoDhiD rest 


4.6 


CD4 lymphocyte none 


16.2 


KU-812 (Basophil) 
PMA/ionomycin 


18.0 


2ryThl/Th2/Trl_anti- 
CDbO Cnl I 


29.9 


CCD1 106 (Keratinocytes) 
none 


7.2 


LAK cells rest 


28.3 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.2 


! LAK celk IT -2 


IK 0 


Liver cirrhosis 


O 1 

8.1 


LAK cells IL-2+IL-12 


6.0 


Lupus kidney 


O A 

3.4 


T A I/" ~~ 1 1 ~ TT i TTTwT 

LAK cells IL-2+IFN 


6.8 


NCI-H292 none 


4.3 


LAK cells IL-2+IL- 18 


3.5 


NCI-H292 IL-4 


2.1 


T A V 11 _ 

LAK cells 

PMA/innnmvrin 


8.8 


NCI-H292 IL-9 


1.8 


NK Celte 11 -2 re<:t 




NCI-H292IL-13 


0.2 


Two Way MLR 3 day 


12.4 


NCI-H292 1 FN gamma 


0.5 


1 wo Way MLR 5 day 


6.9 


TTT» A T" , /~» 

HPAEC none 


2.6 


Two Way MLR 7 day 


5.1 


HPAEC TNF alpha + IL-1 
beta 


6.3 


PBMC rest 


4.9 


Lung fibroblast none 


9.9 


PBMC PWM 


37.4 


Lung fibroblast TNF alpha 
+ 1L-1 beta 


2.5 


PBMC PHA-L 


12.9 


Lung fibroblast IL-4 


12.7 


Ramos (B cell) none 


11.3 


Lung fibroblast IL-9 


8.7 


Ramos (B cell) 
ionomycin 


13.7 


Lung fibroblast IL-13 


12.1 


B lymphocytes PWM 


1.1 


Lung fibroblast IFN 
gamma 


9.3 


B lymphocytes CD40L 

J TT A 

and JLL-4 


0.5 


Dermal fibroblast 
CCD1070rest 


9.9 


EOL-1 dbcAMP 


0.4 


Dermal fibroblast 
CCD1070 TNF alpha 


29.3 


EOL-1 dbcAMP 
PMA/ionomycin 


0.9 


Dermal fibroblast 
CCD1070IL-1 beta 


0.7 


Dendritic cells none 


0.7 


Dermal fibroblast IFN 
gamma 


1.4 


Dendritic cells LPS 


1.5 


Dermal fibroblast IL-4 


5.1 


Dendritic cells anti- 
CD40 


4.7 


IBD Colitis 2 


16.6 
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Monocytes rest 


5.6 


IBD Crohn's 


7.9 


Monocytes LPS 


0.3 


Colon 


24.7 


Macrophages rest 


3.6 


Lung 1 


13.4 


Macrophages LPS 


0.4 


Thymus 


72.7 


HUVEC none 


1.2 


Kidney 


65.1 


HUVEC starved 


1.2 







CNSjneurodegeneration_vl.O Summary: Ag2913 No significant expression 
detected. Potential failed chemistry reaction or bad probe/primer set (data not shown). 
Panel 1.3D Summary: Ag2913 The NOV66 gene represents a novel G-protein 
5 coupled receptor (GPCR) with expression in the brain. The GPCR family of receptors contains 
a large number of neurotransmitter receptors, including the dopamine, serotonin, a and b- 
adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic glutamate 
receptors. GPCRs are excellent drug targets in various neurologic and psychiatric diseases. All 
antipsychotics have been shown to act at the dopamine D2 receptor; similarly novel 

1 0 antipsychotics also act at the serotonergic receptor, and often the muscarinic and adrenergic 
receptors as well. While the majority of antidepressants can be classified as selective serotonin 
reuptake inhibitors, blockade of the 5-HT1A and a2 adrenergic receptors increases the effects 
of these drugs. The GPCRs are also of use as drug targets in the treatment of stroke. Blockade 
of the glutamate receptors may decrease the neuronal death resulting from excitotoxicity; 

15 further more the purinergic receptors have also been implicated as drug targets in the treatment 
of cerebral ischemia. The b-adrenergic receptors have been implicated in the treatment of 
ADHD with Ritalin, while the a-adrenergic receptors have been implicated in memory. 
Therefore this gene may be of use as a small molecule target for the treatment of any of the 
described diseases. 

20 In addition, this gene is expressed in clusters of cell lines derived from lung cancer and 

colon cancer. Thus, expression of this gene could be used to differentiate between these 
sample and other samples on this panel and as a marker to detect the presence of colon and 
lung cancers. Furthermore, therapeutic modulation of the expression or function of this gene 
may be effect! ve in the treatment of lung and colon cancers. 

25 References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep; 1 34(1 ):68-77 
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1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 
5 antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 

10 p.o.) reduced the total immobility time in the tail suspension test 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15-60 mg kg(-l)) were effective in mice previously screened for having 
high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 

15 selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 

20 elicited by SCH 58261 (1 0 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 

stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

25 Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 

Psychiatry 200 1;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 

30 which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT1 A receptors, to SSRI treatment decouples the 
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feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 
connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alphal-adrenoceptor- 
5 mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 

1 0 neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 
Drugs 1999Nov;8(ll):1837-1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 

1 5 many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 
channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 

20 devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 

25 receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 

30 Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
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either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
5 administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 2D Summary: Ag2913 The NOV66 gene is a diagnostic marker for gastric 
thyroid and bladder cancer and a target for therapeutic intervention in gastric, thyroid and 
bladder cancer through the use of antibodies or small molecule drugs. This is based on the 
10 expression profile of this gene that shows higher expression in some gastric, thyroid and 
bladder cancer samples compared to normal tissues. 

Panel 4D Summary: Ag2913 The NOV66 gene, an olfactory receptor homolog is 
expressed at moderate levels in activated and resting T lymphocytes (CT range 30.13-32.98). 
Small molecules or therapeutic antibodies that antagonize the function of the NOV66 gene 
1 5 prodcut may reduce or eliminate the symptoms of autoimmune and inflammatory diseases, 
including Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

Panel CNS_1 Summary: Ag2913 No significant expression detected. Potential 
probe/primer failure (data not shown). 

20 NOV67 



Expression of gene NOV67 was assessed using the primer-probe set Ag2951, 
described in Table BJA. Results of the RTQ-PCR runs are shown in Tables BJB and BJC. 

Table BJA- Probe Name Ag2951 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -acctctcacttgtggacatctg~3 1 


22 


243 


1270 


Probe 


TET-5 1 -tacacctccagcagggtccctcagat-3 1 - 
TAMRA 


26 


266 


1271 


Reverse: 


5 ' -ggcaaaggagatggtctttct-3 1 


21 


314 


1272 



25 Table BJB. AI_comprehensive panel_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag2951, Run 
248065290 


Tissue Name 


Rel.Exp.(%) 
Ag2951,Run 
248065290 


1 10967 COPD-F 


3.1 


112427 Match Control 
Psoriasis-F 


49.3 
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110980 COPD-F 


4.9 


112418 Psoriasis-M : 


13.6 


110968 COPD-M 


2.4 


1 12723 Match Control 
Psoriasis-M 


0.0 


110977 COPD-M 


45.7 


1 12419 Psonasis-M 


20.2 


1 10989 Emphysema- 
F 


25.9 


112424 Match Control 
Psoriasis-M 


9.5 


1 10992 Emphysema- 
F 


33.9 


112420 Psoriasis-M 


93.3 


110993 Emphysema- 
F 


5.8 


112425 Match Control 
Psoriasis-M 


32.8 


110994 Emphysema- 
F 


0.0 


104689 (MF)OA 
Bone-Backus 


0.0 


110995 Emphysema- 
F 


55.5 


104690 (MF)Adj 
"Normal" Bone-Backus 


0.0 


110996 Emphysema- 
F 


7.0 


104691 (MF)OA 
Synovium-Backus 


0.0 


110997 Asthma-M 


12.8 


104692 (BA) OA 
Cartilage-Backus 


0.0 


1 11001 Asthma-F 


18.6 


104694 (BA) OA 
Bone-Backus 


0.0 


111002Asthma-F 


27.7 


104695 (BA)Adj 
"Normal" Bone-Backus 


5.5 


11 1003 Atopic 
Asthma-F 


16.7 


104696 (BA) OA 
Synovium-B ackus 


10.2 


11 1004 Atopic 
Asthma-F 


37.4 


104700 (SS) OA Bone- 
Backus 


7.1 


111005 Atopic 
Asthma-F 


31.6 


104701 (SS) Adj 
"Normal" Bone-Backus 


14.3 


11 1006 Atopic 
Asthma-F 


1.8 


104702 (SS)OA 
Synovium-Backus 


28.3 


111417AUergy-M 


20.4 


117093 OA Cartilage 
Rep7 


65.5 


112347 Allergy-M 


2-5 


112672 OA BoneS 


26.6 


1 12349 Normal Lung- 
F 


0.0 


112673 OA Synoviums 


21.6 


1 12357 Normal Lung- 
F 


12.8 


1 12674 OA Synovial 
Fluid cells5 


9.3 


1 12354 Normal Lung- 
M 


4.5 


117100 OA Cartilage 
Repl4 


2.9 


1 12374 Crohns-F 


16.3 


1 12756 OA Bone9 


7.3 


112389 Match 
Control Crohns-F 


15.0 


112757 OA Synovium9 


3.3 


112375 Crohns-F 


7.5 


112758 OA Synovial 
Fluid Cells9 


12.9 


112732 Match 
Control Crohns-F 


41.5 


117125 RA Cartilage 
Rep2 


0.0 
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112725 Crohns-M 


2.0 


1 13492 Bohe2RA 


8.5 


112387 Match 
Control Crohns-M 


5.3 


113493 Synovium2RA 


1.9 


112378 Crohns-M 


4.0 


1 13494 Syn Fluid Cells 
RA 


7.6 


112390 Match 
Control Crohns-M 


61.6 


113499 Cartilage4RA 


13.7 


112726 Crohns-M 


21.3 


113500 Bone4RA 


8.3 


112731 Match 
Control Crohns-M 


14.6 


113501 Synovium4RA 


20.4 


112380 Ulcer Col-F 


18.3 


113502 Syn Fluid 
Cells4RA 


6.5 


112734 Match 
Control Ulcer Col-F 


_ 100.0 


113495 Cartilage3RA 


11.7 


1 12384 Ulcer Col-F 


60.7 


113496 Bone3RA 


4.4 


112737 Match 
Control Ulcer Col-F 


22.1 


1 13497 Synovium3RA 


3.0 


112386 Ulcer Col-F 


4.6 


113498 Syn Fluid 
Cells3 RA 


17.3 


112738 Match 
Control Ulcer Col-F 


3.7 


117106 Normal 
Cartilage Rep20 


11.4 


112381 Ulcer Col-M 


0.0 


113663 Bone3 Normal 


0.0 


112735 Match 
Control Ulcer Col-M 


15.4 


113664 Synovium3 
Normal 


0.0 


112382 Ulcer Col-M 


58.2 


113665 Syn Fluid 
Cells3 Normal 


1.3 


112394 Match 
Control Ulcer Col-M 


4.2 


117107 Normal 
Cartilage Rep22 


0.0. 


112383 Ulcer Col-M 


63.3 


1 13667 Bone4 Normal 


39.5 


112736 Match 
Control Ulcer Col-M 


12:i 


113668 Synovium4 
Normal 


25.5. 


112423 Psoriasis-F 


26.2 


113669 Syn Fluid 
Cells4 Normal 


26.1 



Table BJC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2951, Run 
164403342 


Tissue Name 


Rel. Exp.(%) 
Ag2951, Run 
164403342 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.8 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 
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Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


1.3 


Bronchial epithelium 
TNFalpha + ILlbeta 


1.0 


Primary Th2 rest 


7.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


17.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


5.2 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


1.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


6.6 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


1.0 


CD4 lymphocyte none 


13.3 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2iyThl/Th2/Trl_anti- 


0.9 


CCD1 106 (Keratinocytes) 
none 


1.2 


LAK cells rest 


11.8 


CCD! lOo (Jveratmocytes) 
TNFalpha + IL-lbeta 


0.0 


1j/viv ecus iJL»~z. 


u.u 


Liver cirrhosis 


7.7 


i-ATvJTV. eCllb 1JU~Z i^IL»- 1 Z 


u.u 


Lupus kidney 


A A 

0.0 


j-f-rtjx uciid ii_f /, t ir in 

gamma 


3.2 


NCI-H292 none 


2.4 


LAK cells IL-2+ IL-1 8 


4.6 


NCI-H292 IL-4 


1.9 


T AK celk 

PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Wav MT P 1 Hnv 
l ww yv ay ivii_/iv _> tidy 


u.u 


jNCi-rizyz irvi gamma 


A A 
0.0 


Two Way MLR 5 day 


1.6 


HPAEC none 


0 0 
V/.v 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


7.6 


Lung fibroblast none 


0.0 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


5.8 


Lung fibroblast IL-4 


0.0 
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Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


1.7 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


1.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

J TT A 

and IL-4 


5.2 


Dermal fibroblast 
CCD 1 070 rest 


- 0.0 


EOL-1 dbcAMP 


19.9 


Dermal fibroblast 
CCD1070 TNF alpha 


1.4 


EOL-1 dbcAMP 

p\/f A /inn nmvrin 


7.8 


Dermal fibroblast 
rrni07ft tt -1 h^tn 


0.0 


Dendritic cells none 


9.2 


Dermal fiHrnhlnot fFTSl 

gamma 


0.0 


Dendritic cells LPS 


2.9 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


3.0 


IBD Colitis 2 


1.8 


Monocytes rest 


26.8 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


2.5 


Macrophages rest 


2.4 


Lung 


1.2 


Macrophages LPS 


0.0 


Thymus 


0.8 


HUVEC none 


0.0 


Kidney 


8.7 


HUVEC starved 


0.0 







AI_comprehensive panel_vl.0 Summary: Ag2951 Highest expression of the NOV67 
gene is seen in normal tissue adjacent to colon from an ulcerative colitis patient (CT=33). 
Thus, expression of this gene could be used to distinguish this sample from other samples on 
5 this panel. Please see Panel 4D for further discussion of utility of this gene in inflammation. 

CNS_neurodegeneration_vl.O Summary: Ag2951 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 13D Summary: Ag2951 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 
10 Panel 4.1D Summary: Ag295 1 Expression of this gene is low/undetectable (CTs > 

35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2951 The NOV67 gene is expressed at a moderate level 
(CT=32.78) in pokeweed mitogen-stimulated peripheral blood leukocytes, consisting primarily 
of activated B lymphocytes. Small molecule antagonists or therapeutic antibody antagonists 
15 that block the function of the CG56571-gene product may be useful in several autoimmune 
and inflammatory diseases in which activated B cells can play major roles as sources of 
autoantibody-producing cells and as powerful antigen-presenting cells, including, but not 
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limited to, Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

NOV69a and NOV69b 

Expression of gene NOV69a and variant NOV69b was assessed using the primer-probe 
5 sets Ag2460 and Ag349, described in Tables BKA and BKB. Results of the RTQ-PCR runs 
are shown in Tables BKC, BKD, BKE and BKR 



Table BKA. Probe Name Ag2460 



Primers 


Sequences 


1 Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -tcatagcagtcccgaggaa-3 ' 


[19 


89 


1273 1 


Probe 


TET-5 1 -tcactattgccttaatctcatgocga-3 »- 
TAMRA 


26 


125 


1274 


Reverse 


5 ' -ttctcaagggtctccacatg-3 ■ j20 


151 


1275 




Table BKB. Probe Name Ag349 




Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gggaaagccacagactcgaa-3 ' 


20 


289 


1276 


Probe 


TET-5 ' -cttctaccacagccagagtggcaggaact- 
3 1 -TAMRA 


29 


255 


1277 


Reverse 


5 ' -acccgagcctgtgaagtcct-3 ' 


20' 


231 


1278 



Table BKC. Panel 1 



Tissue Name 


Rel. Exp.(%) Ag349, 
Run 97804233 


Tissue Name 


Rel.Exp.(%)Ag349, 
Run 97804233 


Endothelial cells 


0.0 


Renal ca. 786-0 


0.0 


Endothelial cells 
(treated) 


0.0 


Renal ca. A498 


0.2 


Pancreas 


0.0 


Renal ca. RXF 393 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. ACHN 


0.0 


Adrenal gland 


0.0 


Renal ca.UO-31 


0.0 


Thyroid 


0.5 


Renal ca. TK-10 


0.0 


Salivary gland 


25.5 


Liver 


0.0 


Pituitary gland 


0.0 


Liver (fetal) 


0.0 


Brain (fetal) 


0.0 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (whole) 


0.0 


Lung 


0.0 


Brain (amygdala) 


0.0 


Lung (fetal) 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. (small cell) 


0.0 
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TV! 




Brain (hippocampus) 


0.0 


Lung ca. (small cell) 


1.5 


Brain (substantia nigra) 


0.0 


Lung ca. (s.cell var.) 
Mir-// 


0.0 


Brain (thalamus) j 0.0 


Lung ca. (large 
ceil)NCl-ri4oU 


1.3 


Brain (hypothalamus) 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


Spinal cord 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


glio/astro U87-MG 


59.0 


Lung ca. (non-s.cell) 
ri(Jl J -o2 


0.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (squam.) SW 
900 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (squam.) 
NC1-H596 


0.0 


astrocytoma orojy 


U.U 


Mammary gland 


0.0 


astrocytoma SNB-75 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SNB-1 9 


3.1 


MDA-MB-231 


0.0 


glioma U251 


0.0 


Breast ca * (vA ef\ 

XJl W/U*9L WCX. (Ul. 1 

T47D 


0.0 


glioma 


fi fi 


Breast ca RT-S4Q 


00 


Heart 

JL J.WU1 t 


0 0 


Breast ca. MDA-N 


0.0 


Skplptal tniiQf*1p 


fi 0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

5 


0.0 


Thymus 


66.4 


uvarian ca. UVUAK- 
4 


0.0 


Spleen 


0.1 


Uvarian ca. UVCAK- 
5 


1.5 


Lymph node 


3.1 


Ovarian ca OVCAR- 
8 


0.0 


Colon (ascending) 


29.9 


Ovarian ca IGROV-1 

v_y V Ul lull wU. AVJlVV V X 


0 0 


Stomach 


77.4 


Ovarian ca. (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* SW620 
(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 


o.b 
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metj ri^o 




Colon ca. HCT-116 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colonca.HCT-15 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca. * (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LUX 
IMVI 


0.0 


Trachea 


15.5 


Melanoma* (met) SK- 
MEL-5 


0.0 


Kidney 


0.0 


Melanoma SK-MEL- 
28 


100.0 


Kidney (fetal) 


0.0 







Table BKD. Panel 13D 



Tissue Name 


Rel. Exp.(%) Ag2460, 
Run 157914666 


Tissue Name 


Rel. Exp.(%) Ag2460, 
Run 157914666 


Liver adenocarcinoma 


0.0 | 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


9.6 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


5.6 


Renal ca. ACHN 


0.0 


Salivary gland 


19.8 


Renal ca.UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


2.2 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


45.7 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astroU87-MG 


2.4 


Lung ca. (non-sm. 
cell)A549 


0.0 
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glio/astro U-118-MG 


2.5 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


100.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


T47D 


0.0 


Heart 


0 0 


Dlvdol v/d. XJ 1 J*t7 


0 0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


61.1 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


8.4 


Ovarian ca. OVCAR- 

o 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


77.4 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


T JtenK 


00 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


2.3 


Colon ca. HT29 


0.0 


TV. - ale /I- _ _ 

Prostate ca. (bone 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca.HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 


0.0 


Melanoma Ml4 


0.0 
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NCI-N87 








Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


59.5 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 i 



Table BKE. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2460,Run 
157914720 


Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914720 


Normal Colon 


0.0 


Kidney Margin 
8120608 


0.0 


CC Well to Mod Diff 
(OD03866) 


u.u 


Kidney Cancer 
8120613 


0.0 


CC Ma rain ^nnn^/I^ 


u.u 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


u.u 


Kidney Cancer 
9010320 


0.0 


lviarglll \ \JLJ\J DoOo) 


u.u 


Kidney Margin 
9010321 


0.0 


CC Mod DifF(ODO3920) 


0.0 


Normal Uterus 


o.o 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.9 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.2 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


0.6 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 
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Prostate Margin 
(OD04720-02) 


0.0 


| Breast Cancer 1024 


0.0 


Normal Lung 061010 


0.4 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 

\\^S \J\J J 


0.0 


Breast Margin 
910026*5 


0.0 


Muscle Margin 

lrlUOvlw IVJLCll gill 

(OD04286) 


0.0 


A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


0.0 


Lung Margin (OD03126) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


100.0 


Liver Cancer 064003 


0.0 


Lune Marein fOD04404"l 

X^Ullg IvlCU gill ^VJL/UTTU" I 


00 




ft 1 


Lung Cancer (OD04565) 


0.5 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (UD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (UD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 10) 


A A 

0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


0.0 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


0.0 


Bladder Normal 
Adjacent (OD04718- 

AO \ 

03) 


0.0 


Kidney Margin 

[kJUUHj Do) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 

1 n {C\T}(\A?.1Q\ 
1/Z [yJlJXJHDjy) 


0.0 


Ovarian Cancer 
Uo4UUo 


0.0 


Kidney Margin j 

{yJlJKj^Djy) 


0.0 


Ovarian Cancer 
(UD04/O5-07) 


0.0 


Kidney Ca, Clear cell 
tvne (OT)04MCf\ 


0.0 


Ovary Margin 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer ! 
9060358 


0.0 


Kidney Margin 
(OD04348) 


0.0 


Stomach Margin 
9060359 


0.0 
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Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 


0.0 


Stomach Margin 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.6 



Table BKF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914794 


Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914794 


secondary inl act 


0.4 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.5 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


2.3 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.5 


HUVEC IL-ll 


0.0 


Secondary Trl rest 


3.7 


Lung Microvascular EC 
none 


0 0 


Primary Thl act 


0.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


1.1 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


1.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


15.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


9.2 


Small airway epithelium 
none 


2.1 


Primary Trl rest 


4.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


0.9 


Coronery artery SMC rest 


0.0 


CD45ROCD4 
lymphocyte act 


1.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


1.1 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.5 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.5 


KU-812 (Basophil) rest 


0.0 
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CD4 lymphocyte none 


1.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2iyThl/Th2/Trl_anti- 
CDyj Cxil 1 


2.6 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.7 


CCU 1 10b (Keratinocytes) 
TNFalpha + IL-lbeta 


0.5 


TAT/" TT O 

LAK cells LL-2 


A A 

0.0 


Liver cirrhosis 


o.u 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


T AT/ TT 0_i_TE7vT 

LAK cells lL-2+IrN 
pamma 


0.5 


NCI-H292 none 


0.3 


LAKcellsIL-2+IL-l8 


0.0 


NCI-H292 IL-4 


0.6 


LAK ceils 
PMA/ionomvcin 

x ivjujl/ iwiujui y will 


0.0 


NCI-H292 IL-9 


1.1 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


0.0 


1 wo Way MLK 3 day 


A A 
0.0 


XT/^T TT/>AO TT7XT - . _ _ _ _ _ 

NCI-H292 IFN gamma 


A A 

0.0 


lwo Way MLK 5 day 


A A 
0.0 


HPAEC none 


A A 

0.0 


Two Way MLR 7 day 


0.8 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


2.1 


Lung fibroblast none 


A A 

0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


2.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lune fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


1.8 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


1.6 


Dermal fibroblast 
CCD1 070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.8 


- EOL-1 dbcAMP 
riVLAV ionomy c in 


0.0 


Dermal fibroblast 
uijljio/o ll-i beta 


0.0 


Dendritic cells none 


0.0 


Jjermai tioro blast LrN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


ED Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


1.1 


Colon 


0.0 


Macrophages rest 


0.6 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


9.9 


HUVEC starved 


0.0 
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CNS_neurodegeneration_vLO Summary: Ag2460 Expression of the NOV69a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown). 

Panel 1 Summary: Ag349 Highest expression of the NOV69a gene is seen in a 
melanoma cell line (CT=28.7). Thus, expression of this gene could be used to differentiate 
5 between this sample and other samples on this panel. There is also significant expression in 
thymus. Please see Panel 4D for discussion of utility of this gene in autoimmunity. 

Panel 1.3D Summary: Ag2460 Expression of the NOV69a gene is limited to a few 
samples that are all derived from normal tissue. Significant levels of expression are seen in 
mammary gland, trachea, stomach, thymus, and spinal cord. Thus, expression of this gene can 
1 0 be used to differentiate between these samples and other samples on this panel. 

Panel 2D Summary: Ag2460 Expression of the NOV69a gene is limited to a few 
samples, with highest expression in a lung cancer (CT=27.5). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of lung cancer. Furthermore, therapeutic modulation of the 
1 5 expression or function of this gene may be effective in the treatment of lung cancer. 

Panel 4D Summary: Ag2460 The NOV69a gene encodes a homolog of the IL-1 
epsilon. Interleukin 1 (IL-1) is a member of a large family of cytokines, which modulates 
immune and inflammatory responses. IL-1 molecules such as IL-lalpha, -beta, -delta, - 
gamma, and ILl-receptor agonist (IL-lra) are typically secreted by macrophages, 
20 mononuclear cells, epithelial and endothelial cells. IL-1 molecules are first produced as 

precursors of about 30 kDa and do not contain a signal sequence. The IL-1 precursors are then 
proteolytically cleaved into their secreted active forms (-1 7 kDa). Their immunomodulatory 
functions are mediated by two IL-1 receptors, which are members of the immunoglobulin 
superfamily. The biological functions of IL-1 include: activation of vascular endothelial cells 
25 to secrete IL-6, increase leukocyte adhesion and activate mononuclear phagocytes that activate 
inflammatory leukocytes; tissue destruction, and fever. Given the biological potency of the IL- 
1 family of proteins, a need exists to identify new members of this family as well as 
understand the biological function of its members. The high levels of expression of this gene 
in small airway epithelium activated by treatment with TNF-alpha + IL-1 beta(CT=28.9) 
30 indicate that CG56136-01 may play a substantial role in mediating inflammation in the lung. 
Thus, therapeutic targeting of CG56136-01 with a monoclonal antibody is anticipated to limit 
or block the extent of inflammation potential and thus the symptoms, caused by pro- 
inflammatory cytokines such as DL-1 epsilon, when these cytokines are induced in allergic, 
asthma and COPD patients. 
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References: 

SmithJXE., Renshaw,B.R.-, Ketchem,R.R., Kubin,M., Garka,K.E. and Sims,J.E.Four 
new members expand the interleukin-1 superfamily J. Biol. Chem. 275 (2), 1 169-1 175 (2000) 
Abstract: We report here the cloning and characterization of four new members of the 
5 interleukin-1 (IL-1) family (FILldelta, FILlepsilon, FILlzeta, and FILleta, with FIL1 

standing for "Family of EL-1"). The novel genes demonstrate significant sequence similarity to 
IL-lalpha, IL-lbeta, IL-lra, and IL-1 8, and in addition maintain a conserved exon-intron 
arrangement that is shared with the previously known members of the family. Protein structure 
modeling also suggests that the FIL1 genes are related to IL-lbeta and IL-lra. The novel genes 
10 form a cluster with the IL-ls on the long arm of human chromosome 2. 

NOV71 

Expression of gene NOV71 was assessed using the primer-probe set Ag3049, 
described in Table BLA. Results of the RTQ-PCR runs are shown in Tables BLB, BLC, BLD 
and BLE. 

1 5 Table BLA. Probe Name Ag3049 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gctggcgatatttaatgaattg-3 1 


22 


696 


1279 


Probe 


TET-5 1 -catgcagacgtggatctttacgcact-3 1 - 
TAMRA 


26 


718 


1280 


Reverse 


5 ' -agtacaatggcaacagcatcat-3 ■ 


22 


767 


1281 



Table BLB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 209823735 


Tissue Name 


Rel.Exp.(%)Ag3049, 
Run 209823735 


AD 1 Hippo 


10.2 


Control (Path) 3 
Temporal Ctx 


5.9 


AD 2 Hippo 


33.7 


Control (Path) 4 
Temporal Ctx 


28.1 


AD 3 Hippo 


9.3 


AD 1 Occipital Ctx 


142 


AD 4 Hippo 


7.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


99.3 


AD 3 Occipital Ctx 


5.8 


AD 6 Hippo 


46.3 


AD 4 Occipital Ctx 


16.6 


Control 2 Hippo 


44.1 


AD 5 Occipital Ctx 


17.1 


Control 4 Hippo 


6.0 


AD 6 Occipital Ctx 


63.7 


Control (Path) 3 
Hippo 


7.6 


Control 1 Occipital 
Ctx 


2.2 
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AD 1 Temporal Ctx 


13.0 


Control 2 Occipital 
Ctx 


68.3 


AD 2 Temporal Ctx 


33.4 


Control 3 Occipital 
-Ctx 


11.5 


AD 3 Temporal Ctx 


6.8 


Control 4 Occipital 
Ctx 


5.5 


AD 4 Temporal Ctx 


16.5 


Control (Path) 1 
Occipital Ctx 


77.9 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


8.6 


AD 5 SupTemporal 
Ctx 


48.0 


Control (Path) 3 
Occipital Ctx 


2.1 


AD 6 Inf Temporal 
Ctx 


37.9 


Control (Path) 4 
Occipital Ctx 


12.9 


AD 6 Sup Temporal 
Ctx 


41.5 


Control 1 Parietal 
Ctx 


4.1 


Control 1 Temporal 
Ctx 


6.3 


Control 2 Parietal 
Ctx 


29 9 


Control 2 Temporal 
Ctx 


62.4 


Control 3 Parietal 
Ctx 


180 


Control 3 Temporal 
Ctx 


14.2 


Control (Path) 1 
Parietal Ctx 


97.9 


Control 4 Temporal 
Ctx 


7.7 


Control (Path) 2 
Parietal Ctx 


19.3 


Control (Path) 1 
Temporal Ctx 


64.6 


Control (Path) 3 
Parietal Ctx 


6.3 


Control (Path) 2 
Temporal Ctx 


29.9 


Control (Path) 4 
Parietal Ctx 


42.6 



Table BLC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 167972763 


Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 167972763 


Liver adenocarcinoma 


37.6 


Kidney (fetal) 


7.5 


Pancreas 


1.8 


Renal ca. 786-0 


4.7 


Pancreatic ca. CAPAN 
2 


8.0 


Renal ca. A498 


8.7 


Adrenal gland 


2.1 


Renal ca. RXF 393 


28.3 


Thyroid 


3.0 


Renal ca. ACHN 


6.3 


Salivary gland 


1.5 


Renal ca.UO-31 


36.9 


Pituitary gland 


8.5 


Renal ca. TK-10 


9.5 


Brain (fetal) 


10.4 


Liver 


0.2 


Brain (whole) 


9.0 


Liver (fetal) 


1.7 


Brain (amygdala) 


18.7 


Liver ca. 
(hepatoblast) HepG2 


6.3 
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Brain (cerebellum) 


32.5 


Lung 


0.4 


Brain (hippocampus) 


14.0 


Lung (fetal) 


2.0 


Brain (substantia nigra) 


6.0 


Lung ca. (small cell) 

T V 1 

LX-1 


5.4 


Brain (thalamus) 


2.9 


Lung ca. (small cell) 
IMC1-H69 


3.6 


Cerebral Cortex 


30.1 


Lung ca. (s.cell var.) 
SHP-77 


15.9 


! Spinal cord 


3.4 


Lung ca. (large 
cell)NCI-H460 


1.7 


glio/astro U87-MG 


27.4 


Lung ca. (non-sm. 
cell) A549 


28.9 


glio/astroU-118-MG 


16.0 


Lung ca. (non-s.cell) 
NCI-H23 


19.6 


astrocytoma SW1783 


40.3 


Lung ca. (non-s.cell) 
HOP-62 


18.3 


neuro*; met SK-N-AS 


0.6 


Lung ca. (non-s.cl) 
NCI-H522 


23.5 


astrocytoma SF-539 


2.9 


Lung ca. (squam.) 
SW 900 


15.4 


astrocytoma SNB-75 


26.8 


Lung ca. (squam.) 
NCI-H596 


4.0 


glioma bfstj- iv 


A A 

34.4 


Mammary gland 


1.1 


glioma U251 


100.0 


Breast ca.* (pl.ef) 
MCF-7 


37.4 


glioma SF-295 


39.2 


MDA-MB-231 


25.7 


Heart (fetal) 


1.3 


ftrpaot na * (r\\ f*f^ 

T47D 


55.5 


Heart 


\2 


Rrea<tf ca RT-S4Q 


7 4 
/ .*t 


Skeletal muscle (fetal) 


2.0 


Breast ca. MDA-N 


2.0 


oKeieiai muscie 


y.i 


Ovary 


2.7 


Bone marrow 


0.3 


Ovarian ca. OVCAR- 

i 


2.6 


Thymus 


3.0 


Ovarian ca. OVCAR- 
4 


17.4 


Spleen 


2.2 


Ovarian ca. OVCAR- 

5 


50.7 


Lymph node 


5.0 


Ovarian ca. OVCAR- 
8 


1.1 


Colorectal 


1.9 


Ovarian ca. IGROV- 
1 


52 


Stomach j 


3.3 


Ovarian ca.* (ascites) 
SK-OV-3 j 


24.7 


Small intestine 


0.2 | Uterus 


0.1 ! 
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Colon ca. SW480 


7.9 


Placenta 


0.2 


Colon ca.* 
SW620(SW480 met) 


30.4 


Prostate 


3.7 


Colon ca. HT29 


8.7 


Prostate ca.* (bone 
met)PC-3 


91.4 


Colon ca. HCT-116 


12.4 


Testis 


0.5 


Colon ca. CaCo-2 


18.7 


Melanoma 
Hs688(A).T 


6.0 


Colon ca. 
tissue(OD03866) 


3.7 


Melanoma* (met) 
Hs688(B).T 


10.1 


Colon ca. HCC-2998 


8.9 


Melanoma UACC-62 


11.0 


Gastric ca.* (liver met) 
1 NCI-N87 


8.7 


Melanoma M14 


0.4 


Bladder 


10.7 


Melanoma LOX 
IMVI 


17.8 


Trachea 


1.0 


Melanoma* (met) 
SK-MEL-5 


1.7 


Kidney 


22 


Adipose 


3.9 



Table BLD. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag3049,Run 
174441445 


Tissue Name 


Rel. Exp.(%) 
Ag3049, Run 
174441445 


Normal Colon 


11.7 


Kidney Margin 
(OD04348) 


64.6 


Coloncancer 
(OD06064) 


57.4 


Kidney malignant 
cancer (OD06204B) 


10.5 


Colon Margin 
(OD06064) 


3.4 


Kidney normal adjacent 
tissue (OD06204E) 


10.1 


Colon cancer 
(OD06159) 


12.1 


Kidney Cancer 
(OD04450-01) 


24.1 


Colon Margin 
(OD06159) 


1.8 


Kidney Margin 
(OD04450-03) 


21.6 


Colon cancer 
(OD06297-04) 


7.1 


Kidney Cancer 8120613 


9.9 


Colon Margin 
(OD06297-015) 


14.7 


Kidney Margin 
8120614 


13.6 


CC Gr.2 ascend colon 
(OD03921) 


5.1 


Kidney Cancer 9010320 


10.3 


CC Margin (OD03921) 


5.4 


Kidney Margin 
9010321 


11.3 


Colon cancer metastasis 
(OD06104) 


1.8 


Kidney Cancer 8120607 


30.6 


Lung Margin 
(OD06104) 


14.8 


Kidney Margin 
8120608 


112 
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Colon mets to lung 
(OD04451-01) 


25.5 


Normal Uterus 


2.0 


Lung Margin 
(OD04451-02) 


51.4 


Uterine Cancer 064011 


6.8 


Normal Prostate 


28.9 


Normal Thyroid 


O.J 


Prostate Cancer 
(OD04410) 


13.4 


Thyroid Cancer 064010 


14.2 


Prostate Margin 
(OD04410) 


20.3 


Thyroid Cancer 
A302152 


51.4 


Normal Ovary 


7.9 


Thyroid Margin 
A302153 


8.5 


Ovarian cancer 
(OD06283-03) 


56.3 


Normal Breast 


53.6 


Ovarian Margin 
(OD06283-07) 


13.6 


Breast Cancer 
(OD04566) 


6.9 


Ovarian Cancer 064008 


24.1 


Breast Cancer 1024 


100.0 


Ovarian cancer 
(OD06145) 


31.2 


Breast Cancer 
(OD04590-01) 


9.5 


Ovarian Margin 

//VTVAlTI AC\ 

(OD06145) 


14.0 


Breast Cancer Mets 
(OD04590-03) 


24.3 


Ovarian cancer 
(OD06455-03) 


1 A A 

10.4 


Breast Cancer 
Metastasis (OD04655- 
05) 


65.1 


Ovarian Margin 
(OD06455-07) 


1.0 


Breast Cancer 064006 


26.2 


Normal Lung 


..... 1 1 o 

11.8 


Breast Cancer 9100266 


24.5 


Invasive poor cliff, lung 
adeno (UDU4945-U1 


10.7 


Breast Margin 9100265 


25.0 


Lung Margin 


6.2 


Breast Cancer A209073 


19.6 


Lung Malignant Cancer 

(VUvj iZo) 


5.9 


Breast Margin 
A2U90734 


58.6 


Lung Margin 
(OD03126) 


3.3 


Breast cancer 
(OD06083) 


79.0 


Lung Cancer 
(OD05014A) 


42.3 


Breast cancer node 
metastasis (OD06083) 


40.1 


Lung Margin 
(OD05014B) 


13.0 


Normal Liver 


31.0 ' 


Lung cancer (UJJliouol) 


13.1 


Liver Cancer 1026 


IOC 

12.5 


l^ung lvidrgin 
(OD06081) 


8.1 


Liver Cancer 1025 


25.5 


Lung Cancer 
(OD04237-01) 


35.8 


Liver Cancer 6004-T 


14.8 


Lung Margin 
(OD04237-02) 


14.7 


Liver Tissue 6004-N 


4.7 


Ocular Melanoma 


0.0 


Liver Cancer 6005-T 


25.9 
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\Af*t-n eta c i Q 








Ocular Melanoma 


8.5 


Liver Tissue 6005-N 


70.7 


Melanoma Metastasis 


4.3 


Liver Cancer 064003 


14.5 


Melanoma Margin 
(Lung) 


6.5 


Normal Bladder 


32.3 


Normal Kidney 


16.5 


Bladder Cancer 1023 


0.0 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


41 .2 


Bladder Cancer 

A *> 1 "7") 

A302173 


51.1 


Kidney Margin 
(OD04338) 


10.8 


Normal Stomach 


72.7 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


45.7 


Gastric Cancer 9060397 


4.7 


Kidney Margin 


8.2 


Stomach Margin 


40.6 


Kidney Ca, Clear cell 
type (OD04340) 


18.4 


Gastric Cancer 9060395 


10.4 


Kidney Margin 
(OD04340) 


7.4 


Stomach Margin 
9060394 


57.8 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


17.9 


Gastric Cancer 064005 


24.7 



Table BLE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3049, Run 
164334396 


Tissue Name 


Rel. Exp.(%) 
Ag3049,Run 
164334396 


Secondary Thl act 


9.4 


HUVECIL-lbeta 


1.0 


Secondary Th2 act 


7.8 


HUVEC IFN gamma 


3.1 


Secondary Trl act 


8.1 


HUVEC TNF alpha + IFN 
gamma 


2.1 


Secondary Thl rest 


0.3 


HUVEC TNF alpha + IL4 


1.0 


Secondary Th2 rest 


2.0 


HUVEC IL-11 


0.7 


Secondary Trl rest 


2.1 


Lung Microvascular EC 
none 


0.3 


Primary Thl act 


13.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.2 


Primary Th2 act 


13.5 


Microvascular Dermal EC 
none 


0.5 


Primary Trl act 


162 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.1 


Primary Thl rest 


3.8 


Bronchial epithelium 
TNFalpha + ILlbeta 


4.5 


Primary Th2 rest 


3.7 


Small airway epithelium 
none 


2.6 
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Primary Trl rest 


1.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


8.5 


CD45RA CD4 
lymphocyte act 


6.6 


Coronery artery SMC rest 


11.8 


CD45KU LD4 
lymphocyte act 


7.8 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.8 


CD8 lymphocyte act 


5.6 


Astrocytes rest 


3.8 


Secondary CD8 
lymphocyte rest 


4.0 


Astrocytes TNFalpha + 
IL-lbeta 


5.4 


Secondary CD8 
lymphocyte act 


9.2 


KU-812 (Basophil) rest 


1.5 


CD4 lymphocyte none 


0.6 


KU-812 (Basophil) 

T^\yf A /irwir\mvr»in 
x irLr\j IvIllUllljr 


10.4 


2iyThl/Th2/Trl_anti- 
CD95 CH11 


3.7 


CCDl 106 (Keratinocytes) 


3.7 


LAK cells rest 


9.3 


TNFalpha + IL-lbeta 


4.5 


T AK celk TT -2 

L/AIV l/Vlla UL/-Z. 




i^ivcr wirriiUdib 


ft ^ 


T AK celk TT -2+TT -1? 


A 6 

*t.O 


T untie lri/inA\/ 

L»upub Kiuncy 


ft 1 


LAK celk TT -2+IFN 
gamma 


9.6 


NCI-H292 none 


9.5 


LAK cells IL-2+ IL-18 


9.5 


NCI-H292 IL-4 


15.1 i 


T AK celk 
PMA/ionomycin 


4.7 


NCI-H292 IL-9 


12.5 


NK Cells IL-2 rest 


1.4 


NCI-H292 IL-13 


7.7 


Two Wav MT R 3 dav 

i mi/ v » ay i vi i jix j vie* y 


J. VI 


rNV^l-ITXJJx. Irli gaill Ilia 


o n 
y.v 


Two Wav MT R S dav 


1 7 


HP AFP nnnp 

nr/vDv^ nunc 




Two Way MLR 7 day 


2.3 


HP AFP TNF alnha + TT -1 

beta 


1.1 


PBMC rest 


i 1 


T line fihmhlact nnnp 




PBMC PWM 


17.6 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.2 


PBMC PHA-L 


10.8 


Lung fibroblast IL-4 


5.0 . 


Kamos (x> ceil ) none 


Zi.z 


Lung fibroblast IL-9 


7.5 


Ramos (B cell) 
lonomycin 


100.0 | 


Lung fibroblast IL-13 


3.9 


B lymphocytes PWM 


40.9 


Lung fibroblast IFN 
gamma 


14.6 


and IL-4 


252 


L/eimal IlDrODlaSI 

CCD 1070 rest 


9.4 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCDl 070 TNF alpha 


14.8 


EOL-1 dbcAMP 
PMA/ionomycin 


0.4 


Dermal fibroblast 
CCD1070 IL-1 beta 


6.0 


Dendritic cells none 


7.7 


Dermal fibroblast IFN 


10.2 
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gamma 




Dendritic cells LPS 


10.8 


Dermal fibroblast IL-4 


9.2 


Dendritic cells anti- 
CD40 


13.1 


IBD Colitis 2 


0.7 


Monocytes rest 


0.9 


IBD Crohn's 


0.1 


Monocytes LPS 


0.3 


Colon 


1.1 


Macrophages rest 


19.9 


Lung 


1.1 


Macrophages LPS 


4.3 


Thymus 


3.7 


HUVEC none 


2.7 


Kidney 


7.4 


HUVEC starved 


2.9 







CNS_neurodegeneration_yl.O Summary: Ag3049 This panel does not show 
differential expression of the NOV71 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3049 The NOV71 gene is expressed at moderate to high 
leves in the cancer cell lines in this panel with the highest expression shown by a glioma cell 
line U251 (CT=26.6). Because normal tissues show a lower level of expression of this gene, 
expression of this gene migh be used as a diagnostic marker for cancer. Furthermore, 
10 therapeutics designed using antibodies and small molecule inhibitors of this gene may be 
effective in the treatment of cancer. 

Among tissues with metabolic function, this gene has moderate levels of expression in 
pancreas, adrenal, thyroid, pituitary, heart, skeletal muscle and adipose. Therefore, this gene 
product may be a small molecule target for the treatment of endocrine and metabolic diseases, 
15 including obesity, and Types 1 and 2 diabetes. 

In addition, moderate expression of this gene in the CNS suggests a role for this gene 
product in brain processes. Inhibition of SODIUM/HYDROGEN EXCHANGER function in 
the brain is associated with the activity of several enzymes known to play a positive role in 
cell survival and learning and memory, such as PKA and PKC. Therefore, inhibitors of the 
20 protein encoded by this gene may have utility in mimicking the potentially therapeutic action 
of these enzymes in the treatment of neurodegenerative diseases including Alzheimer's and 
Parkinson's diseases, as well as in memory loss due to aging. 

References: 

Am J Physiol Cell Physiol 2001 Oct;281(4):Cl 146-Acute inhibition of brain-specific 
25 Na(+)/H(+) exchanger isoform 5 by protein kinases A and C and cell shrinkage. Attaphitaya S, 
Nehrke K, Melvin JE. 
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Little is known of the functional properties of the mammalian, brain-specific 
Na(+)/H(+) exchanger isoform 5 (NHE5). Rat NHE5 was stably expressed in NHE-deficient 
PS 120 cells, and its activity was characterized using the fluorescent pH-sensitive dye 2\7- 
bis(2-carboxyethyl)-5(6)-carboxyfluorescein. NHE5 was insensitive to ethylisopropyl 
5 amiloride. The transport kinetics displayed a simple Michaelis-Menten relationship for 
extracellular Na(+) (apparent K(Na) = 27 +/- 5 mM) and a Hill coefficient near 3 for the 
intracellular proton concentration with a half-maximal activity at an intracellular pH of 6.93 
+/- 0.03. NHE5 activity was inhibited by acute exposure to 8-bromo-cAMP or forskolin 
(which increases intracellular cAMP by activating adenylate cyclase). The kinase inhibitor H- 

1 0 89 reversed this inhibition, suggesting that regulation by cAMP involves a protein kinase A 
(PKA)-dependent process. In contrast, 8-bromo-cGMP did not have a significant effect on 
activity. The protein kinase C (PKC) activator phorbol 12-myristrate 13-acetate inhibited 
NHE5, and the PKC antagonist chelerythrine chloride blunted this effect. Activity was also 
inhibited by hyperosmotic-induced cell shrinkage but was unaffected by a hyposmotic 

1 5 challenge. These results demonstrate that rat brain NHE5 is downregulated by activation of 
PKA and PKC and by cell shrinkage, important regulators of neuronal cell function. 

Panel 2.2 Summary: Ag3049 The NOV71 gene can be used as a diagnostic marker 
for stomach, breast, lung, ovarian and some colon cancers as expression in the normal adjacent 
tissue and the tumor tissue differs. Antibodies and small molecule inhibitors designed with this 

20 gene product may also be used for therapy in breast, lung, ovarian and some colon cancers. 

Panel 4D Summary: Ag3049 The NOV71 gene, a sodium/hydrogen Exchanger 
homolog is expressed at a high level in Ramos (B cell) activated with monomycin (CT=24.72), 
and at a moderate to high level in other activated B cell preparations. Therefore, small 
molecule antagonists or therapeutic antibody antagonists that block the function of the NOV71 

25 gene product may be useful in several autoimmune and inflammatory diseases in which 

activated B cells can play major roles as sources of autoantibody-producing cells and also as 
powerful antigen-presenting cells, including, but not limited to, Crohn's disease, ulcerative 
colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, 
rheumatoid arthritis, lupus erythematosus, or psoriasis. 

30 NOV72: UBIQUTnN-SPECIFIC PROTEASE 

. Expression of gene NOV72 was assessed using the primer-probe set Ag3050, 
described in Table BMA. Results of the RTQ-PCR runs are shown in Tables BMB, BMC, 
BMD and BME. 
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Table BMA. Probe Name Ag3050 



Primers j Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs ' -tgaccaggtattaaccatggaa-3 1 


22 


799 


1282 


„ ^ StET-5 1 -ttactgctgcagggacatgctctcct-3 1 - 

Probe 1 

JTAMRA • 


26 


823 


1283 


Reversejs ' -taggcaaaggtctctttgtcaa-3 1 


22 


852 


1284 



Table BMB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 211012446 


x issue r> a me 


Rel. Exp.(%) Ag3050, 
Run 211012446 


AfV 1 rjm„ A 

A±J 1 Jllppo 




Control (Path) 3 
Temporal Ctx 




J\L) Z rilppo 


00 1 


Control (Path) 4 
Temporal Ctx 


70 1 

1 


AD 3 Hippo 


3.0 


AD 1 Occipital Ctx 


15.0 


AD 4 Hippo 


11.9 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


59.0 


AD 3 Occipital Ctx 


7.1 


AD 6 Hippo 


51.1 


AD 4 Occipital Ctx 


22.2 


Control 2 Hippo 


17.6 


AD 5 Occipital Ctx 


24.7 


vunirui t mppu 


6 1 


AD f* Ocrinital Ctx 


22 4 


Control (Path) 3 


7.0 


Control 1 Occipital 
Ctx 

VIA 


2.9 


AD 1 Temporal Ctx 


17.4 


Control 2 Occipital 
Ctx 

VIA 


39.0 


AD 2 Temporal Ctx 


23.5 


Control 3 Occipital 
Ctx 


16.8 


AD 3 Temporal Ctx 


7.9 


Control 4 Occipital 
Ctx 


6.0 


AD 4 Temporal Ctx 


27.5 


Control (Path) 1 
Occipital Ctx 


51.4 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


122 


AD 5 Sup Temporal 
Ctx 


34.9 


Control (Path) 3 
Occipital Ctx 


2.4 


AD 6 Inf Temporal 
Ctx 


54.7 


Control (Path) 4 
Occipital Ctx 


15.9 


AD 6 Sup Temporal 
Ctx 


60.3 


Control 1 Parietal 
Ctx 


8.4 


Control 1 Temporal 
Ctx 


3.7 


Control 2 Parietal 
Ctx 


36.3 • 


Control 2 Temporal 
Ctx 


20.3 


Control 3 Parietal 
Ctx 


15.9 


Control 3 Temporal 


9.7 


Control (Path) 1 


39.0 



1307 



WO 02/068649 



PCT/US02/02785 



CTX 




ranetal Ctx 




Control 3 Temporal 
Ctx 


7.2 


Control (Path) 2 
Parietal Ctx 


16.2 


Control (Path) 1 
Temporal Ctx 


46.7 


Control (Path) 3 
Parietal Ctx 


2.2 


Control (Path) 2 
Temporal Ctx 


34.9 


Control (Path) 4 
Parietal Ctx 


36.1 



Table BMC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 167985384 


Tissue Name 


Rel.Exp.(%) Ag3050, 
Run 167985384 


Liver adenocarcinoma 


10.1 


Kidney (fetal) 


28.7 


Pancreas 


5.7 


Renal ca. 786-0 


14.9 


Pancreatic ca. CAPAN 
2 


2.5 


Renal ca. A498 


6.6 


Adrenal gland 


2.5 


Renal ca. RXF 393 


8.8 


Thvroid 


3 7 


Renal ca APT-TN 


5 4 


Salivary gland 


1.5 


Renal ca. UO-31 


6.2 


Pituitary gland 


6.8 


Renal ca. iK-10 


• 6.5 


Brain (fetal) 


100.0 


Liver 


2.2 


Brain (whole) 


23.3 


Liver (fetal) 


6.4 


Brain (amygdala) 


22.2 


Liver ca. 
(hepatoblast) HepG2 


11.3 


Brain (cerebellum) 


88.9 


Lung 


0.8 


Brain (hippocampus) 


19.5 


Lung (fetal) 


15.1 




10 3 


Lung ca. (small cell) 
LX-1 


14 4 


Brain (thalamus) 


7.7 


Lung ca. (small cell) 
NCI-H69 


18.4 


Cerebral Cortex 


12.1 


Lung ca. (s.cell var.) 
SHP-77 


43.2 


Spinal cord 


11.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


4.6 


Lung ca. (non-sm. 
cell) A549 


5.2 1 


glio/astroU-118-MG 


18.9 


Lung ca. (non-s.cell) 
NCI-H23 


7.4 


astrocytoma SW1783 


9.4 


Lung ca. (non-s.cell) 
HOP-62 


7.2 


neuro*; met SK-N-AS 


15.7 


Lung ca. (non-s.cl) 
NCI-H522 


21.0 


astrocytoma SF-539 


17.1 


Lung ca. (squam.) 
SW 900 


7.7 


astrocytoma SNB-75 


12.2 


Lung ca. (squam.) 


60.7 
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NCI-H596 


glioma SNB-19 


11.9 


Mammary gland 


1.3 


glioma U251 


15.7 


Breast ca.* (pl.ef) 
MLr-7 


14.5 


glioma SF-295 


16.4 


Breast ca.* (pl.ef) 

■\jfT\A A ZD 1 

MDA-Mr>-2 J 1 . 


24.7 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 

147JD 


21.8 






Breast ca. BT-549 


7.3 


Skeletal muscle (fetal) 


8.1 


Breast ca. MDA-N 


13.5 


Skeletal muscle 


11.0 


Ovary 


4.0 


Bone marrow 


4.5 


Ovarian ca. OVCAR- 
3 


2.2 


Thymus 


9.6 


Ovarian ca. OVCAR- 
4 


4.7 


Spleen 


5.4 


Ovarian ca. OVCAR- 
5 


117 

11./ 


Lymph node 


14.6 


Ovarian ca. OVCAR- 
8 


4.9 


Colorectal 


5.6 


Ovarian ca. IGROV- 
1 


2.2 


Stomach 


2.0 


Ovarian ca.* (ascites) 
SK-OV-3 


24.3 


0111d.il llllGollllC 


i ft 
1 .o 


Uterus 


6.9 


Colon ca. SW480 


5.1 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


22.2 


Prostate 


5 9 


Colon ca. HT29 


4.9 


Prostate ca.* (bone 
met)PC-3 


4.8 


Colon ca. HCT-1 16 


9.3 


Testis 


11.2 


Colon ca. CaCo-2 


18.2 


Melanoma 
Hs688(A).T 


1.7 


colon ca. 
tissue(OD03866) 


2.4 


Melanoma* (met) 
Hs688(B).T 


0.0 


i^oion ca. ricc-zyyo 


1/11 

10.1 


Melanoma UACC-62 


4.0 


NCI-N87 


9.8 


Melanoma M14 


2.1 


Bladder 


6.3 


Melanoma LOX 
IMV1 


5.8 


Trachea 


2.2 


Melanoma* (met) 
SK-MEL-5 


2.1 


Kidney 


6.7 


Adipose 


4.3 
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Table BMD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3050, Run 
164317257 


Tissue Name 


Rel. Exp.(%) 
Ag3050, Run 
164317257 


Secondary Thl act 


21.0 


HUVEC IL-lbeta 


5.3 


Secondary Th2 act 


22.4 


HUVEC IFN gamma 


6.3 


Secondary Trl act 


23.3 


HUVEC TNF alpha + IFN 
gamma 


11.6 


Secondary Thl rest 


4.6 


HUVEC TNF alpha + IL4 


7.5 


Secondary Th2 rest 


7.9 


HUVEC IL-11 


2.6 


Secondary Trl rest 


9.8 


Lung Microvascular EC 
none 


9.5 


Primary Thl act 


24.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


9.9 


Primary Th2 act 


13.5 


Microvascular Dermal EC 
none 


13.4 


Primary Trl act 


21.6 


Microsvasular Dermal EC 
TNFalpha* IL-lbeta 


11.7 


Primary Thl rest 


36.6 


Bronchial epithelium 
TNFalpha + IL1 beta 


21.9 


Primarv Th? rpst 

r I lilldi jr I ILL. IGaL 


17 f\ 


Small airway epithelium 
none 


u.u 


Primatv Trl rpct 

XllJUCUJr 111 Jbdl 


1Q 6 


Small airway epithelium 
TNFalpha* IL-lbeta 


1 S 7 


CD45RA CD4 
lymphocyte act 


7 1 






CD45RO CD4 
lymphocyte act 


14 5 


• Coronery artery SMC 
TNFalpha + IL-lbeta 


1 4 


CD8 lymphocyte act 


7.7 


Astrocytes rest 


11.8 


Secondary CD8 
lymphocyte rest 


17.2 


Astrocytes TNFalpha + 
IL-lbeta 


1.6 


Secondary CD8 
lymphocyte act 


13.9 


KU-812 fBasonhiH rest 


15 6 


CD4 lymphocyte none 


2.1 


KU-812 (Basophil) 
PMA/ionomycin 


40.1 


2ryThl/Th2/Trl_anti- 
CD95 CHI l 


17.0 


CCD1 106 (Keratinocytes) 
none 


11.0 


LAK cells rest 


16.7 


lAJJl lUo (Keratinocytes) 
TNFalpha + IL-lbeta 


4.6 


LAK cells IL-2 


15.1 


Liver cirrhosis 


5.3 


LAK cells H-2+IL-12 


12.6 


Lupus kidney 


1.6 


LAK cells IL-2+IFN 
gamma 


15.3 


NCI-H292 none 


17.1 


LAK cells IL-2+IL- 18 


14.9 


NCI-H292 IL-4 


15.3 


LAK cells 


7.2 


NCI-H292 IL-9 


15.5 
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rMA/ionornycin 








NK Cells IL-2rest 


14.3 


NCI-H292 IL-13 


5.2 


Two Way MLR 3 day 


7.5 


NCI-H292 IFN gamma 


9.3 


Two Way MLR 5 day 


7.0 


HPAEC none 


9.7 


Two Way MLR 7 day 


6.0 


HPAEC TNF alpha + IL- 1 
beta 


13.9 


PBMC rest 


3.4 


Lung fibroblast none 


5.3 


PBMC PWM 


38.4 


Lung fibroblast TNF alpha 
+ IL-l beta 


5.7 


PBMC PHA-L 


14.5 


Lung fibroblast IL-4 


7.7 


Ramos (B cell) none 


24.8 


Lung fibroblast IL-9 


7.0 


Ramos (B cell) 
ionomycin 


48 0 

to.u 


T una fihmhljict' TT -1 3 


3.8 


x> lynipnocyica r wivi 


*\& 7 

J*T. / 


Lung fibroblast IFN 
gamma 


104 

I v. » 


B lymphocytes CD40L 
and IL-4 


HO .Z 


Dermal fibroblast 
CCD1070 rest 


12 6 


EOL-1 dbcAMP 


10.7 


Dermal fibroblast 
CCD1070 TNF alpha 


30.1 


EOL-1 dbcAMP 
PMA/ionomycin 


10.8 


Dermal fibroblast 
CCD1070 IL-l beta 


2.1 


Dendritic cells none 


6.9 


Dermal fibroblast IFN 
gamma 


4.5 


Dendritic ceus Lro 


J.O 


1 \amm a 1 ti nrrvnlon'f TT /I 

jjermai iiDroDiasi il-h 


1 J.O 


Dendritic cells anti- 
CD40 


5.5 


IBD Colitis 2 


0.9 


Monocytes rest 


9.9 


IBD Crohn's 


1.2 


Monocytes LPS 


5.0 


Colon 


3.2 


Macrophages rest 


10.4 


Lung 


2.6 


Macrophages LPS 


2.0 


Thymus 


8.6 


HUVEC none 


8.1 


Kidney 


100.0 


HUVEC starved 


23.7 







Table BME. Panel CNS_1 



Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 171694540 


Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 171694540 


BA4 Control 


20.7 


BA17 PSP 


30.4 


BA4 Control 


35.8 


BA17 PSP2 


15.7 


BA4 
Alzheimer's2 


0.0 


Sub Nigra Control 


38.2 


BA4 Parkinson's 


45.7 


Sub Nigra Control2 


14.3 


BA4 
Parkinson's2 


59.9 


Sub Nigra 
Alzheimer's2 


10.4 
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BA4 
Huntington's 


24.7 


Sub Nigra 
Parkinson's2 


46.7 


BA4 
Huntington's^ 


4.8 


Sub Nigra 
Huntington's 


49.0 


BA4 PSP 


31.4 


Sub Nigra 
Huntington's2 


11.6 


BA4 PSP2 


32.3 


Sub Nigra PSP2 


6.7 


BA4 Depression 


1.3 


Sub Nigra 
Depression 


9.2 


BA4 
Depression2 


17.3 


Sub Nigra 
Depression2 


15.2 


BA7 Control 


38.2 


Glob Palladus 
Control 


23.2 


BA7 Control2 


1.2 


Glob Palladus 
Control2 


7.5 


BA7 
Alzheimer ! s2 


5.4 


Glob Palladus 
Alzheimer's 


6.1 


BA7 Parkinson's 


43.5 


Glob Palladus 
Alzheimer's2 


0.0 


BA7 
Parkinson's2 


57.8 


Glob Palladus 
Parkinson's 


100.0 


BA7 
Huntington's 


43.8 


Glob Palladus 
Parkinson's2 


20.2 


dA/ 
Huntington's2 


59.5 


Glob Palladus PSP 


0.0 


DA? DCD 


42 .o 


CjiOD rail ad US rbrZ 


3.0 


BA7 PSP2 


31.9 


Glob Palladus 
Depression 


16.4 


BA7 Depression 


12.6 


Temp Pole Control 


18.9 


BA9 Control 


26.2 


Temp Pole Control2 


45.7 


BA9 Control2 


19.8 


Temp Pole 
Alzheimer's 


2.6 


BA9 Alzheimer's 


6.6 


Temp Pole 
Alzheimer , s2 


10.0 


BA9 
Alzheimer^ 


13.5 


Temp Pole 
Parkinson's 


22.8 


BA9 Parkinson's 


32.1 


Temp Pole 
Parkinson's2 


16.7 


BA9 
Parkinson's2 


30.1 


Temp Pole 
Huntington's 


34.9 


BA9 
Huntington's 


38.2 


Temp Pole PSP 


5.8 


BA9 
Huntington's2 


31.6 


Temp Pole PSP2 


10.7 


BA9 PSP 


17.6 


Temp Pole 
Depression2 


24.3 
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BA9 PSP2 


2.0 


v^ing oyr control 


62 9 


BA9 Depression 


20.3 


Cing Gyr Control2 


13.4 

* 


BA9 
Depression2 


11.9 


Cing Gyr 
Alzheimer's 


23.7 


BA17 Control 


65.5 


Cing Gyr 
Alzheimer r s2 


11.7 


BA17Control2 


31.4 


Cing Gyr Parkinson's 


36.6 


BA17 
Alzheimer's2 


11.0 


Cing Gyr 
Parkinson's2 


67.4 


BA17 
Parkinson's 


62.9 


Cing Gyr 
Huntington's 


89.5 


BA17 
Parkinson's2 


59.9 


Cing Gyr 
Huntington's2 


26.4 


BA17 
Huntington's 


. 72.2 


Cing Gyr PSP 


11.6 


BA17 
Huntington's2 


38.7 


Cing Gyr PSP2 


2.0 


BA17 
Depression 


12.2 


Cing Gyr Depression 


13.0 


BA17 
Depression2 


42.9 


Cing Gyr 
Depression2 


20.0 



CNS_neurodegeneration_vl.O Summary: Ag3050 This panel does not show 

differential expression of the NOV72 gene in Alzheimer's disease. However, this expression 

profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 

5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3050 The NOV72 gene exhibits brain-preferential 

expression and is a member of a family of proteins that mediates ubiquitin-mediated protein 

degradation. Misprocessing of proteins involved in ubiquitin-mediated protein degradation is 

thought to be the cause of many neurodegenerative disorders such as Parkinson's disease, as 

1 0 well as those resulting from CAG repeat expansion genes, such as Huntingtin's disease. 

Therefore, therapeutic modulation of the expression or function of this gene may affect the 

protein degradation dysfunction seen in these diseases. 

In addition, this gene is expressed at a slightly higher level in cancer cell lines 

compared to the normal lung, ovary, breast, and colon samples on this panel. This suggests 

1 5 that expression of this gene could be used as a diagnostic marker of cancer. Furthermore, 

inhibition of this gene product using small molecule drugs may be useful in the treatment of 

cancer in these tissues. 

Among tissues with metabolic function, this gene is has a low level of expression in 

pancreas, thyroid, pituitary, heart, skeletal muscle, and adipose. This gene product may be a 
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small molecule target for the treatment of metabolic and endocrine diseases, including obesity 
and Types 1 and 2 diabetes. 

Panel 4D Summary: Ag3050 The NOV72 gene is expressed at moderate to low levels 
(CT=29-34) in a wide range of cell types and tissues of significance in the immune response in 
health and disease, Highest expression of this gene is seen in kidney tissue (CT=29.36). 
Therefore, targeting of this gene product with a small molecule drug or antibody therapeutic 
may modulate the functions of cells of the immune system as well as resident tissue cells and 
lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as asthma, allergies, inflammatory bowel disease, lupus 
erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 

Panel CNSJI Summary: Ag3050 This panel confirms expression of the NOV72 gene 
in the brain. Please see Panel 1.3D for discussion of utility of this gene in the central nervous 
system. 



NOV73 



15 Expression of gene NOV73 was assessed using the primer-probe set Ag3030, 

described in Table BNA. 

Table BNA. Probe Name Ag3030 



Primers 


. Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tgaaattcagaaccaggaaatg-3 ■ 


22 


838 


1285 


Probe 


TET-5 1 -aaagagtgcttagcaggcacctccct-3 ' - 
TAMRA 


26 


872 


1286 


Reverse 


5' -aaccctggcaatatgattcata-3 » 


22 


914 


1287 



CNS_neurodegeneration_vl.O Summary: Ag3030 Expression of this gene is 
20 low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 

Panel 13D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 
34.5) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
25 Panel 3D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
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Panel 5 Islet Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

NOV74 

Expression of gene NOV74 was assessed using the primer-probe set Ag3016, 
5 described in Table BOA. Results of the RTQ-PCR runs are shown in Tables BOB, BOC, BOD 
and BOE. 



Table BOA. Probe Name Ag3016 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 
Probe 


5 1 -atctcagtgacctgctctcaga-3 • 


22 


83 


1288 


TET-5 1 -cagtggctgctacagcctcccaag~3 1 - 
TAMRA 


24 


108 


1289 


Reverse 


5 1 -aaatcttcagggtgacctcatt-3 1 


22 


142 


1290 



Table BOB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 209820675 


Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 209820675 


AD 1 Hippo 


4.6 


Control (Path) 3 
Temporal Ctx 


3.1 


AD 2 Hippo 


7.9 


Control (Path) 4 
Temporal Ctx 


33.9 


AD 3 Hippo 


2.9 


AD 1 Occipital Ctx 


17.9 


AD 4 Hippo 


3.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


7.5 


AD 6 Hippo 


8.5 


AD 4 Occipital Ctx 


12.5 


Control 2 Hippo 


5.4 


AD 5 Occipital Ctx 


11.7 


Control 4 Hippo 


1.8 


AD 6 Occipital Ctx 


12.9 


Control (Path) 3 
Hippo 


1.5 


Control 1 Occipital 
Ctx 


3.7 


AD 1 Temporal Ctx 


10.8 . 


Control 2 Occipital 
Ctx 


18.8 


AD 2 Temporal Ctx 


20.4 


Control 3 Occipital 
Ctx 


0.7 


AD 3 Temporal Ctx 


3.5 


Control 4 Occipital 
Ctx 


5.8 


AD 4 Temporal Ctx 


22.2 


Control (Path) 1 
Occipital Ctx 


50.3 


AD 5 Inf Temporal 
Ctx 


60.7 


Control (Path) 2 
Occipital Ctx 


23.3 


AD 5 Sup Temporal 


17.7 


Control (Path) 3 


1.5 
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Ctx 




Occipital Ctx 




AD 6 Inf Temporal 
Ctx 


23.7 


Control (Path) 4 
Occipital Ctx 


36.6 


AD 6 Sup Temporal 
Ctx 


23.7 


Control 1 Parietal 
Ctx 


6.1 


Control 1 Temporal 
Ctx 


5.6 


Control 2 Parietal 
Ctx 


28.7 


Control 2 Temporal 
Ctx 


5.9 


Control 3 Parietal 
Ctx 


21.3 


Control 3 Temporal 
ctx 


11.1 


Control (Path) 1 
Parietal C_tx 


49.3 


Control 3 Temporal 
Ctx 


6.3 


Control (Path) 2 
Parietal Ctx 


27.0 


Control (Path) 1 
Temporal Ctx 


45.4 


Control (Path) 3 
Parietal Ctx 


5.6 


Control (Path) 2 
Temporal Ctx 


36.9 


Control (Path) 4 
Parietal Ctx 


39.0 



Table BOC. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 167819111 


Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 167819111 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


9.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


o.o 


Salivary gland 


0.0 


Renal ca.UO-31 


0.0 


Pituitary gland 


o.o 


Renal ca. TK-10 


0.0 


Brain (fetal) 


99.3 


Liver 


0.0 


Brain (whole) 


96.6 


Liver (fetal) 


0.0 


Brain (amygdala) 


21.5 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


12.0 


Lung 


0.0 


Brain (hippocampus) 


24.7 


Lung (fetal) 


6.2 


Brain (substantia nigra) 


15.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


8.2 


Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


6.9 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


46.0 


Lung ca. (non-sm. 


. 10.9 
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cell) A549 




elio/astro U-l 18-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


6.0 


astrocytoma SW1783 


6.3 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS. 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


7.1 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-1 9 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
ML-r-/ j 


0.0 


glioma SF-295 


32.5 


Breast ca. (pi.et) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


812 


TT J 

Heart 


A A 

0.0 


Breast ca. Bi-54y 


O A 
U.U 


Skeletal muscle (fetal) 


10.6 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Rone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thvmus 


7.0 


Ovarian ca. OVCAR- 
4 


4.5 


Snleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


63 


Ovarian ca. OVCAR- 

O 

o 


100.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 


72.7 


Small intestine 


u.u 


uterus 


A 0 
U.U 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 

lllCv )x \s J 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


15.4 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


19.5 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


57.0 
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vjooltic Ca. ^iivcr rnci^ 

NCI-N87 


0.0 


Melanoma M14 


26.8 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


19.8 


Kidney 


0.0 


Adipose 


0.0 



Table BOD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3016, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3016, Run 

1 4% A AH A*) 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


6.2 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondarv Trl rest 


0 0 


Lung Microvascular EC 
none 


0 0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFaIpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha* IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 

1 iSr alpHa i ILjI DC la 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45ROCD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


13.7 


Secondary CDS 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl_anti- 


0.0 


CCD1 106 (Keratinocytes) 


0.0 
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none 




LAK cells rest 


0.0 


TNFalpha + IL-lbeta 


0.0 


I AK celk TT -? 




T h/^t* r* i rrn c i c 




LAK celk TT -2-hTT -1? 




T iinnc HHr»F k \/ 

i^upub rwiuiicy 


a n 

U.U 


gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


T AK rf»11c 
L»/\rv L/C lib 

PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2rest 


0.0 


NCI-H292IL-13 


0.0 


i wvj w ay ivjuL/iv j uay 




>JpT OOOO TTTNJ era mm a 

lNv^i-rizyz iriN gamma 


a a 

U.U 




u.u 


jTLr/vcv^ none 


U.U 


Two Way MLR 7 day 


0.0 


T4PAFP TNJF alrthn -1- TT 1 

nT/\±j\-s i iNr aipna t ij^-i 
beta 


0.0 


rn>iviv_^ real 


a n 

U.U 


jLimg noroDiasi none 


A A 
U.U 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 

4- TT 1 Kjato 

• oeia 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 

• * 

lonomycin 


0.0 


Lung fibroblast IL- 13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

J TT A 

and EL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 

A /i Anom vpin 

l ivxiT/ IwJllslUYw ill 


0.0 


Dermal fibroblast 
^uju/u iLt'i oeia 


0.0 


Dendritic cells none 


0.0 


L/enndi iiorooiasi iriN 
gamma 


7.6 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


6.0 


Dendritic cells anti- 
CD^ 


0.0 


IBD Colitis 2 


5.6 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


6.7 


Macrophages LPS 


0.0 


Thymus 


4.3 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 
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Table BOE. Panel CNS_1 



Tissue Name 


Rel. E.xp.(%) Ag3016, 
Run 171688428 


Tissue Name 


ReL Exp.(%) Ag3016, 
Run 171688428 


BA4 Control 


9.9 


BA17PSP 


28.5 


BA4 Control2 


12.0 


BA17PSP2 


17.8 


BA4 
Alzheimer's2 


2 9 


^liiK Wiorj* fVintml 


S 4 


BA4 Parkinson's 


28.9 


Sub Nigra Control2 


0.0 


BA4 
Parkinson's2 


35 1 


Sub Nigra 
Alzheimer's2 


7 7 


BA4 
Huntington's 


12.2 


Sub Nigra 
Parkinson f s2 


IS 1 


BA4 
Huntington's2 


8.3 


Sub Nigra 
Huntington's 


1 S 


BA4 PSP 


1 1 4 


Sub Nigra 
Huntington's? 


n o 
u.u 


BA4PSP2 


17.9 


Sub Nigra PSP2 


0.0 


BA4 Depression 


9.7 


Sub Nigra 
Depression 


5.2 


BA4 
Depression2 


11.7 


Sub Nigra 
Depression2 


14.8 


BA7 Control 


31.4 


Glob Palladus 
Control 


5.0 


BA7 Control2 


11.7 


Glob Palladus 
Control2 


0.0 


BA7 
Alzheimer f s2 


20.2 


Glob Palladus 
Alzheimer's 


0.0 


BA7 Parkinson's 


15.3 


Glob Palladus 
Alzheimer's2 


in 6 


BA7 
Parkinson's2 


59.0 


Glob Palladus 
Parkinson's 




BA7 
Huntington's 


432 


Glob Palladus 
Parkinson s2 


0.0 


BA7 
Huntington s2 


46.0 


Glob Palladus PSP 


0.0 


BA7PSP 


37.6 


Glob Palladus PSP2 


5.0 


BA7PSP2 


21.9 


Glob Palladus 
uepression 


5.7 


BA7 Depression 


21.9 


Temp Pole Control 


5.9 


BA9 Control 


14.8 


Temp Pole Control2 


0.0 


BA9Control2 . 


24.5 


Temp Pole 
Alzheimer's 


0.0 


BA9 Alzheimer's 


10.2 


Temp Pole 
Alzheimer's2 


9.4 


BA9 


12.2 


Temp Pole 


30.4 
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All • ^ t^^> 

Alzheimers2 




Parkinsons 




BA9 Parkinson's 


52.1 


Temp Pole 
Parkinson's2 


37.1 


BA9 
Parkinson f s2 


45.4 


Temp Pole 
Huntington's 


11.5 


BA9 

XxUlllIllgUJll o 


20.2 


Temp Pole PSP 


19.3 


BA9 

JLlUllllllgLVJll J>Z. 


35.1 


Temp Pole PSP2 


0.0 


BA9PSP 


24.3 


Temp Pole 
uepressionz 


12.5 


RAQ P9P2 




i^ing vjyr v^onxroi 




BA9 Depression 


9.8 


Cing Gyr Control2 


16.2 


BA9 
Depression2 


12.2 


Cing Gyr 
Alzheimer's 


0.0 


BA1 7 Control 


100.0 


Cing Gyr 
Alzheimer's2 


0.0 


BA17Control2 


40.9 


Cing Gyr Parkinson's 


10.5 


BA17 
AIzheimer's2 


28.9 


Cing Gyr 
Parkinson's2 


0.0 


BA17 
Parkinson's 


94.6 


Cing Gyr 
Huntington's 


35.4 


BA17 
Parkinson's2 


28.1 


Cing Gyr 
Huntington's2 


5.0 


BA17 
riuniingion s 


25.9 


Cing Gyr PSP 


0.0 


BA17 
Huntington's2 


19.3 


Cing Gyr PSP2 


9.8 


BA17 
Depression 


33.4 


Cing Gyr Depression 


5.0 


BA17 
Depression2 


56.3 


Cing Gyr 
Depression2 


24.8 



CNS_neurodegeneration_vLO Summary: Ag3016 This panel does not show 
differential expression of the NOV74 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 13D Summary: Ag3016 The NOV74 gene represents a dual specificity 
phosphatase that is expressed preferentially at low to moderate levels across the CNS. Dual- 
specificity phosphatases comprise a family of MAP kinase regulating enzymes, members of 
which are upregulated in brains subjected to insults such as ischemia and seizure activity. 
0 MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. Consequently, 
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agents that modulate the activity of this gene may have utility in attenuating the apoptotic and 
neurodegenerative processes following brain insults. 

In addition, there are low but significant levels of expression in samples derived from 
breast cancer, ovarian cancer, and melanoma cell lines. Thus, expression of this gene could be 
5 used to differentiate between these samples and other samples on this panel and as a marker to 
detect the presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast cancer, ovarian cancer, and 
melanoma. 

References: 

10 Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 

rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 

15 Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 
min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 

20 PAC- 1 , therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 

25 Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 

responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 
PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 

30 found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 

lead to rapid (approximately 3 h) but transient appearance of PAC 1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 
was also expressed in white matter microglia as well as in microglia invading the damaged 
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hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3016 The NOV74 gene is only expressed at detectable levels 
in the kidney (CT = 34.2) among the samples on this panel. Thus, expression of this gene 
5 could be used to distinguish kidney from the other samples on this panel. In addition, the dual- 
specificity protein phospatase encoded for by this gene could allow cells within the kidney to 
respond to specific microenvironmental signals. Furthermore, small molecule therapies 
designed with the protein encoded for by this gene could modulate kidney function and be 
important in the treatment of inflammatory or autoimmune diseases that affect the kidney, 
10 including lupus and glomerulonephritis. 

Panel CNSJt Summary: Ag3016 This panel confirms expression of the NOV74 gene 
in the brain. Please see Panel 1 .3D for discussion of utility of this gene in the central nervous 
system. 

NOV75 

15 Expression of gene NOV75 was assessed using the primer-probe sets Ag3020 and 

Ag2968, described in Tables BPA and BPB. Results of the RTQ-PCR runs are shown in 
Tables BPC, BPD, BPE, BPF and BPG. 

Table BPA. Probe Name Ag3020 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggaatcacccacattctgaat-3 ' 


21 


265 


1291 


Probe 


TET-5 1 -cgtttacactggccccgaattctaca-3 ' - 
TAMRA 


26 


303 


1292 


Reverse 


5 1 -cctctacacccaggtactggat-3 ' 


22 


340 


1293 



20 Table BPB. Probe Name Ag2968 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5 1 -ggaatcacccacattctgaat-3 ' 


21 


265 


1294 


Probe 


TET-5 1 -cgtttacactggccccgaattctaca-3 1 - 
TAMRA 


26 


303 


1295 


Reverse 


5 • -cctctacacccaggtactggat-3 ' 


22 


340 


1296 



Table BPC. General screening panel _vl.4 



Tissue Name 


Rel. Exp.(%) Ag3020, 
Run 221998694 


Tissue Name 


Re!.Exp.(%)Ag3020, 
Run 221998694 


Adipose 


0.4 


Renal ca.TK-10 


0.0 
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Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.1 


Ik yT 1 . ^ jfc 

Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


A A 

0.0 


* _ _ ~tr 1 T^/^V TTT 

Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


X Jf _ 1 _ „ _ afc fl 'XT' 

Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.2 


jsjuauiuua wvii 

carcinoma SCC-4 


0.0 


met) SW620 


0.0 


Testis Pool 


0.0 


Colon ca. HT29 


b.o 


PrnQtflte cji * ^hrwip 

met) PC-3 


0.0 


Colon ca.HCT-116 


0.0 


Prostata Ponl 


ft ? 


v^UJUIl Co.. v>dV^U-^ 


ft ft 


Placenta 


0.1 


Colon cancer tissue 


31.0 [ 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 . 


Colon ca. Colo-205 


100.0 


Ovarian ca. SK-OV- 
3 


0.3 


Colon ca. SW-48 


5.7 


Ovarian ca. 

fW 7t> ATI A 

OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.1 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
i 

1 


18.9 


Stomach Pool 


0.0 


Ovarian ca 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.1 


Fetal Heart 


12.7 


Breast ca. MCF-7 


0.0 


Heart Pool 


4.9 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.1 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


23.3 


Breast ca. T47D 


0 0 




0(\ ft 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


ureast rool 


A' A 

0.0 


Thymus Pool 


0.0 


Trachea 


0.3 


CNS cancer (glio/astro) 

TT87 \ACl 
Uo/-iVl\j 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) 
U-118-MG 


0.1 


Fetal Lung 


0.2 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lungca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.7 
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Lungca. LX-1 


12.7 


CNS cancer (astro) 

OlNtJ-/ J 


0.1 


Lung ca. NCI-H146 


0.1 


\^iN o cancer vgiio j 
SNB-19 


15.9 


Lung ca. SHP-77 


0.0 


L/iNo cancer ^giioj or- 
295 


0.5 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.1 


T itrirr r>o "NIPT tKOA ; 

ivung ca. iNL^i-rijzo 


u.u 


Jbrain (cereDenuni/ 


u.u 


T una nrk MPT 


fi n 
u.u 


r>rain (ietaij 


fi i 

U.l 


Lungca.NCI-H460 


0.0 


Jt>rain (riippocampus ) 
Pool 


0.1 


Lung ca. HOP-62 


0.1 


Cerebral Cortex Pool 


0.1 


Lung ca.NCI-H522 


0.0 


Brain ( Substantia nigral 
Pool 


0.1 


Liver 


0.0 


Brain (Thalamus) Pool 


0.1 


Fetal Liver 


0.1 


Brain (whole) 


0.1 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.1 


Adrenal Gland 


0.0 


Fetal Kidney 


0.1 


Pituitary gland Pool 


0.0 ; 


Renal ca. 786-0 


o.o 


Salivary Gland 


0.1 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.4 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



Table BPD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


Rel. Exp.(%) 
Ag3020,Run 
167819114 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


ReI.Exp.(%) 
Ag3020,Run 
167819114 


Liver 
adenocarcinoma 


0.0 


0.1 


Kidney (fetal) 


0.0 


0.3 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.1 


0.2 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 


Thyroid 


0.3 


1.6 


Renal ca. 
ACHN 


0.0 


0.0 


Salivary gland 


0.8 


0.6 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 
10 


0.0 


0.0 


Brain (fetal) 


0.0 


0.1 


Liver 


0.0 


0.0 


Brain (whole) 


0.4 


1.0 


Liver (fetal) 


0.5 


0.0 
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Brain (amygdala) 


0.2 


1.0 


Liver ca. 
(hepatoblast) 

T T /~* 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.5 


Brain 
(hippocampus) 


0.0 


0.0 


Ltmcr rtfetaH 


0.0 


0.0 


Brain (substantia 
niera} 


0.2 


0.1 


Lung ca. 
(small cell) 
LX-1 


10.7 


16.2 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.5 


Cerebrai Cortex 


0.4 


0.1 


Lung ca. 
(s.cell van) 
SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.2 


glio/astro U-118- 
MG 


0.0 


0.1 


Lung ca. (non- 
s.cell) NCI- 
H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell)HOP-62 


0 1 


0 1 


neuro*; met SK-N- 
AS 


0.0 . 


0.0 


Lung ca. (non- 
s.cl)NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


12.1 


8.4 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.3 


Lung ca. 
(squam.) NCI- 
H596 


0.1 


0.4 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.2 


0.2 


glioma U251 


0.4 


0.0 


Breast ca.* 
(pl.ef)MCF-7 


0.0 


0.0 


glioma SF-295 


0.3 


0.3 

— , 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


Heart (fetal) 


7.4 


26.8 


JDicaSL Ca. 

(pl.ef) T47D 


0.0 


0.0 


Heart 


29.9 


35.4 


Breast ca. BT- 
549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


10.8 


33.9 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


100.0 


100.0 


Ovary 


0.0 


0.1 
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Bone marrow 


0.1 . 


0.6 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


0.1 


0.1 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.6 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


02 


0.2 


Ovarian ca. 
IGROV-1 


26.2 


26.2 


Stomach 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.2 


1.0 


Small intestine 


0.0 


0.0 


Uterus 


0.0 


0.0 


Colon ca. SW480 


0.0 


0.2 


Placenta 


1.0 


0.0 


Colon ca.* 
SW620(SW480 
met) 


1.6 


6.1 


Prostate 


0.2 


0.1 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


0.2 


02 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


21.9 


30.6 


Melanoma* 

(met) 
Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.5 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 


0.0 


0.0 


Melanoma 

Ml 4 


0.0 


0.0 


Bladder 


0.1 


0.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.8 


0.6 


Melanoma* 
(met)SK- 
MEL- 5 ! 


0.0 


0.0 


Kidney 


0.0 


0.0 


Adipose 


1.1 


1.7 



Table BPE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
170188142 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
170188142 


Daoy- Medulloblastoma 


0.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 
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TE671- Medulloblastoma 


2.2 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulioblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin oh 


0.0 


rr 5>K- 1 - rnmiti ve 
Neuroectodermal 


0.0 


Ramos- Stimulated with 

PA/1 A/irn*inmvrin 14H 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 

^1 IlCg, L?JVal ywiaolj 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


br-zoo- (glioblastoma 


A A 

0.0 


Daudi- Burkitt s lymphoma 


0.0 


1 VoCj- ulioblastoma 


A A 

0.0 


U266- B-cell plasmacytoma 


0.0 


Cl/" XT OTJ" 

Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin s B-cell 
lymphoma 


0.0 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-l-Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


0.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


0.0 


NC1-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


0.0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1 - Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727-Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 
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LX-1- Small cell lung 
cancer 


4.4 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


100.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


4.3 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


10.1 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


1.2 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-16- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.7 


NCI-SNU-1- Gastric 
carcinoma 


0.1 


A431- Epidermoid carcinoma 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
Carcinonia 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


GAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table BPF. Panel 4D 



Tissue Name 



Rel. Exp.(%) 



Tissue Name \ Rel. Exp.(%) 
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Ap3020 Run 
164528102 




Ap3020 Run 
164528102 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0 0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 

fcWI 111 1 IU 


0.0 


Secondary Hil rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


secondary 1 nz rest 


a a 

u.u 


TTT T\7"C/~ >, TT 11 

HU VfciC 1L-1 1 


U.U 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
1 IN r alpha + 1L- 1 beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
l in r alpna + 1L- 1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
INb alpha + 1L1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha 4- IL-lbeta 


0.0 


v^uq lympnocyte act 


u.u 


Astrocytes rest 


65.1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 

TT 1 L n +_ 

IL-lbeta 


21.0 


Secondary CD8 
lympnocyie act 


0.0 


KU-812 (Basophil) rest 


10.7 


CD4 lymphocyte none 


0.0 


TAT T Oil ZD U :i\ 

ivu-olz (Basophil) 
PMA/i onom vc i n 

■*■ »y ■ viiviti y will 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


13.3 


CCD1 106 (Keratinocvtes^ 
TNFalpha + IL-lbeta 


0.0 


LAK cells LL-2 


0.0 


Liver cirrho^i^ 

J-(I Vwl wll lllvjij 


100 0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2rest 


8.0 


NCI-H292 IL-13 


0.0 
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1 WO Way IVli^lv 3 uay 


u.u 


>jn.H9Q9 TFNI crnm ma 
iN^i-rxZ7Z IF IN gaUIHla 


u.u 


J wo Way lvi_L>j\. j UiXy 


u.u 


HPAPP none 
n inDV.- nunc 


ft 0 
u.u 


Two Way MLR 7 day 


10.6 


HPAFP TNTF alnha + TT -1 

beta 


11.2 




U.U 


L/Ung iiuruuiabL nunc 


U.U 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 

i IJU-l L/C Ld. 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


b.o 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

] TT >4 

and IL-4 


9.9 


Dermal fibroblast 
CCD1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
| CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
rivi/v lonomy c m 


0.0 


Dermal fibroblast 

fPTUfWflTT 1 Koto 


0.0 


Dendritic cells none 


23.8 


uermai iiDTODiasi itjn 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


44.4 


Macrophages rest 


0.0 


Lung 


26.6 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


o.o 







Table BPG. Panel 5D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2968, 

Run 
169270971 


Rel. 
Exp.(%) 
Ag3020, 

Run 
172171108 


Tissue Name 


Rel. 
Exp.(%) 
Ag2968, 

Run 
169270971 


Rel. 
Exp.(%) 
Ag3020, 

Run 
172171108 


97457JPatient- 
02go_adipose 


0.0 


0.2 


94709_Donor 2 AM - 
A_adipose 


0.0 


0.0 


97476_Patient- 
07sk_skeletal 
muscle 


1.2 


8.4 


94710_Donor2AM- 
B_adipose 


0.0 


0.0 


97477_Patient- 
07ut_uterus 


0.1 


0.0 


9471 l_Donor 2 AM- 
Cadipose 


0.0 


0.0 j 
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97478 Patient- 
07pl_placenta 


0.0 


1.3 


947 1 2_Donor 2 AD - 
A_adipose 


0.0 


3.0 


97481_Patient- 
08sk_skeletal 
muscle 


2.8 


12.6 


947l3_Donor2AD- 
B_adipose 


100.0 


0.0 


97482_Patient- 
08ut_uterus 


0.0 


0.0 


947l4_Donor2AD- 
C_adipose 


0.0 


0.0 


97483_Patient- 
08plj>lacenta 


0.0 


0.0 


94742_Donor3 U- 
A_Mesenchymal Stem Cells 


0.0 


0.0 


97486_Patient- 
09sk_skeletal 


3.0 


12.9 


94743_Donor3U- 
B_Mesenchymal Stem Cells 


0.0 


0.0 


97487_Patient- 

HOI \\ lit anio 

\>y ui__iucrus 


0.1 


0.7 


94730_Donor 3 AM - 
A_adipose 


0.0 


0.0 


97488_Patient- 
\jy pi pidLcnia 


0.1 


0.2 


9473l_Donor 3 AM - 
B_adipose 


0.0 


0.0 


97492_Patient- 
10ut_uterus 


0.0 


0.2 


94732_Donor 3 AM - 
C_adipose 


0.0 


0.0 


97493_Patient- 
10pl_placenta 


0.2 


l.l 


94733_Donor 3 AD - 
A_adipose 


0.0 


0.0 


97495_Patient- 
llgo_adipose 


0.0 


0.0 


94734_Donor 3 AD - 
B_adipose 


6.0 


0.0 


97496 JPatient- 
llsk_skeletal 


8.5 


53.6 


94735_Donor 3 AD - 
C_adipose 


0.0 


0.0 


97497_Patient- 
llut_uterus 


0.0 


0.3 


77l38_Liver_HepG2untreated 


0.0 


0.0 


97498JPatient- 
llpl_placenta 


0.1 | 


2.3 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


0.0 


97500_Patient- 
12go_adipose 


0.0 


0.4 


8l735_Small Intestine 


0.1 


0.0 


9750l_Patient- 
I2sk_skeletal 
muscle 


I9.l 


100.0 


72409_Kidney_Proximal 
Convoluted Tubule 


0.0 


0.0 


97502 J>atient- 
I2ut_uterus 


0.1 


0.0 


82685_Small 
intestine Duodenum 


0.0 


0.0 


97503JPatient- 
I2pljplacenta 


0.1 


1.3 


90650_Adrenal_Adrenocortical 
adenoma 


0.0 


0.2 


94721 Donor 2 
U- 

Stem Ceils 


0.0 


0.0 


724lO_Kidney_HRCE 


0.0 


0.3 


94722 Donor 2 
U- 

BMesenchymal 
Stem Cells 


0.0 


0.0 


724ll_Kidney_HRE 


0.0 


0.0 


94723_Donor2 


0.0 j 0.0 


73139 Uterus Uterine smooth 


0.0 


0.0 
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u- 

G_Mesenchymal 
Stem Cells 






muscle cells 







General_screening_panel_vl.4 Summary: Ag3020 The NOV75 gene is expressed in 
brain, colon, lung and ovarian cancer cell lines with highest expression in a colon cancer cell 
line Colo-205 (CT=24.37). This suggests that this gene can be used as a diagnostic marker for 
5 these types of cancer. Furthermore, inhibition of the protein using small molecule drugs could 
potentially be useful for the treatment of brain, colon, lung and ovarian cancer. 

In addition, this gene has low expression in adipose and high expression in adult and 
fetal heart and skeletal muscle. Thus, this protein phosphatase may be a small molecule target 
for the treatment of obesity, Type 2 diabetes and cardiac and skeletal muscle disease. 

10 Panel 13D Summary: Ag2968/Ag3020 Results from two experiments using identical 

probe/primer sets are in excellent agreement. Expression of the NOV75 gene is highest in 
adult skeletal muscle (CTs = 26-28). Significant but somewhat lower expression is also seen in 
fetal skeletal muscle and adult/fetal heart. Thus, expression of this gene may be used to 
distinguish these samples from the other samples on this panel. 

15 This gene is also expressed in brain, colon, lung and ovarian cancer cell lines, 

consistent with General_screeningj>anel_vl.4. This suggests that this gene can be used as a 
diagnostic marker for these types of cancer and inhibition of the protein using small molecule 
drugs can be used for the treatment of brain, colon, lung and ovarian cancer. 

Panel 3D Summary: Ag2968 Expression of the NOV75 gene is highest in colon 

20 cancer cell line Colo-205 (CT = 25.6). In addition, significant expression of this gene is seen 
in two other colon cancer cell lines. Thus, expression of this gene may be used to distinguish 
these colon cancer cell lines from the other samples on this panel. Moreover, therapeutic 
modulatibn of the activity of this gene or its protein product, using small molecules, antibodies 
or protein therapeutics, may be of benefit in the treatment of colon cancer. 

25 Panel 4D Summary: Ag3020 Expression of the NOV75 gene is highest in a liver 

cirrhosis sample (CT - 33.3). Furthermore, expression of this gene is not detected in normal 
liver in Panels 1.3D or 1.4, suggesting that its expression is unique to liver cirrhosis. This gene 
encodes a putative protein phosphatase; therefore, antibodies or small molecule therapeutics 
could reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 

30 protein could also be used for the diagnosis of liver cirrhosis. Low levels of expression are 
also seen in colon and resting astrocytes. 
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Ag2968 Expression of this gene is low/undetectable (CTs > 35) across all of the 
samples on this panel (data not shown). 

Panel 5D Summary: Ag3020 Expression of the NOV75 gene is primarily restricted to 
samples from skeletal muscle. This specific expression is in agreement with the results in 
5 Panels 1 .3D and 1 A Thus, expression of this gene could be used to differentiate between 
skeletal muscle and other samples on this panel, and as a marker of skeletal muscle. 

NOV76a 

Expression of gene NOV76a was assessed using the primer-probe sets Ag3022 and 
Ag4891, described in Tables BQA and BQB. Results of the RTQ-PCR runs are shown in 
1 0 Tables BQC and BQD. 

Table BQA. Probe Name Ag3022 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tgcttgcaggtttattcttagg-3 » 


22 


431 


1297 


Probe 


TET-5 ' -tgctgagttctctcgttgtttccctg-3 1 - 
TAMRA 


26 


459 


1298 


Reverse 


5 1 -tgagaaatgcaggtagggacta-3 • 


22 


509 


1299 



Table BQB. Probe Name Ag4891 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -aatacctgtccaaagcctgact-3 1 


22 


661 


1300 


Probe 


TET-5 • -ttatccccgagtctcatttcctgcgt-3 1 - 
TAMRA 


26 


683 


1301 


Reverse 


5 ■ -ttctcacaaaagctgtcattca-3 1 


22 


716 


1302 



15 Table BQC. General, screening panel vl.5 



Tissue Name 


Rel. Exp.(%) Ag4891, 
Run 228714709 


Tissue Name 


Rel. Exp.(%) Ag4891, 
Run 228714709 


Adipose 


10.8 


Renal ca. TK-10 


24.5 | 


Melanoma* 
Hs688(A).T 


5.6 


Bladder 


22.8 


Melanoma* 
Hs688(B).T 


6.3 


Gastric ca. (liver met.) 
NCI-N87 


46.7 


Melanoma* M14 


27.4 


Gastric ca. KATO III 


22.2 


Melanoma* 
LOXIMVI 


9.5 


Colon ca. SW-948 


11.5 


Melanoma* SK- 
MEL-5 


12.4 


Colon ca. SW480 


» 20.7- 
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^niiamniw f*pll 

carcinoma SCC-4 


10.2 


colon ca. W'tsu 
met) SW620 


11.6 


Testis Pool 


ll.3 


Colon ca. HT29 


10.6 


met) PC-3 


6.7 


Colon ca.HCT-1 16 


12.2 


Prostate Pnr>l 


I O.o 


v^oion ca. caco-z 




Placenta 


33.7 


Colon cancer tissue 


45.4 


Uterus Pool 


9.7 


Colon ca. SW1116 


3.0 


Ovarian ca. 
OVCAR-3 


30.4 


Colon ca. Colo-205 


5.4 


Ovarian ca. SK-OV- 

3 


10.2 


Colon ca. SW-48 


10.2 


Ovarian ca. 

/"v\ //-> A T"> vl 

OVCAR-4 


8.5 


Colon Pool 


15.9 


Ovarian ca. 
OVCAR-S 


21.8* 


Small Intestine Pool 


13.2 


Ovarian ca. IGROV- 

i 
i 


10.6 


Stomach Pool 


7.4 


Ovarian on 

OVCAR-8 


5.6 


Bone Marrow Pool 


5.7 


Ovary 


6.5 


Fetal Heart 


113 


Breast ca. MCE-7 


52.5 


Heart Pool 


70 


Breast ca. MDA- 
MB-231 


11.5 


Lymph Node Pool 


18.4 


Breast ca. BT 549 


41.8 


Fetal Skeletal Muscle 


9.2 


Rrpaot ra T4TH 


A A 


skeletal Muscle rool 


16.3 


Breast ca. MDA-N 


12.1 


Spleen Pool 


19.3 


Breast Pool 


17.8 


Thymus Pool 


14.7 


Trachea 


12.1 


CNS cancer (glio/astro) 
U87-MG 


12.2 


Lung 


4.7 


CNS cancer (glio/astro) 
U-118-MG 


11.8 


Fetal Lung 


36.3 


CNS cancer 
(neuro;met) SK-N-AS 


24.5 


Lungca. NCI-N417 


4.4 


CNS cancer (astro) SF- 
539 


5.4 


Lung ca. LX-1 


9.9 


CNS cancer (astro) 

OIN.D-/D 


13.0 


T \TOT TT1 A y 

Lung ca. NCI-H146 


15.0 


CNS cancer ( &U6) 
SNB-19 


11.7 


Lung ca. SHP-77 


6.8 


CNS cancer (glio) SF- 
295 


15.7 


Lung ca. A549 


28.7 


Brain (Amygdala) Pool 


10.8 


Lung ca. NCI-H526 


0.8 


Brain (cerebellum) 


35.8 


Lungca.NCI-H23 


38.4 


Brain (fetal) 


9.3 
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Lung ca. NCI-H460 


10.0 


J314H1 ^rllUpUUalllJJUD^ 

Pool 


10.2 


Lung ca. HOP-62 


11.7 


Cerebral Cortex Pool 


9.7 


Lung ca. NCI-H522 


29.5 


Brain (Substantia nigra) 
Pool 


10.3 


Liver 


6.1 


Brain (Thalamus) Pool 


15.1 


i Fetal Liver 


19.3 


Brain (whole) 


10.3 


Liver ca. HepG2 


34.4 


Spinal Cord Pool 


10.3 


Kidney Pool 


23.7 


Adrenal Gland 


100.0 


Fetal Kidney 


11.6 


Pituitary gland Pool 


5.3 


Renal ca. 786-0 


14.6 


Salivary Gland 


12.5 


Renal ca. A498 


5.4 


Thyroid (female) 


6.7 


Renal ca. ACHN 


21.6 


Pancreatic ca. CAPAN2 


21.0 


Renal ca. UO-31 


25.9 


Pancreas Pool 


25.9 



Table BQD. Panel 4.1D 



Tissue Name 


ReL Exp.(%) 
Ag4891,Run 


Tissue Name 


ReL Exp.(%) 
Ag4891, Run 


Secondary Thl act 


55.1 


HUVEC IL-lbeta 


17.7 


Secondary Th2 act 


100.0 


HUVEC IFN gamma 


6.2 


Secondary Trl act 


74.2 


HUVEC TNF alpha + IFN 
gamma 


9.0 


Secondary Thl rest 


18.8 


HUVEC TNF alpha + IL4 


11.7 


Secondary Th2 rest 


34.4 


HUVEC IL-11 


2.9 


Secondary Trl rest 


20.9 


Lung Microvascular EC 
none 


21.3 


Primary Thl act 


18.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


26.2 


Primary Th2 act 


60.3 


Microvascular Dermal EC 
none 


6.7 


Primary Trl act 


35.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


15.1 


Primary Thl rest 


17.7 


Bronchial epithelium 
TNFalpha + ILlbeta 


16.8 


Primary Th2 rest 


24.8 


Small airway epithelium 
none 


4.0 


Primary Trl rest 


14.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


13.3 


CD45RA CD4 
lymphocyte act 


15.1 


Coronery artery SMC rest 


8.4 


CD45RO CD4 
lymphocyte act 


63.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


8.2 


CD8 lymphocyte act 


23.0 


Astrocytes rest 


8.4 
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Secondary CD8 
lymphocyte rest 


46.0 


Astrocytes TNFalpha + 
IL-lbeta 


3.1 


Secondary CDS 
lymphocyte act 


18.6 


KU-812 (Basophil) rest 


0.5 


CD4 lymphocyte none 


15.5 


KU-812 (Basophil) 
r ma/ ionomycin 


2.3 


2ry Thl/Th2/Trl_anti- 


47.0 


CCDl 106 (Keratinocytes) 
none 


3.3 


LAK cells rest 


15.4 


ecu 1 1 Uo (jveratmocytes) 
TNFalpha + IL-lbeta 


10.7 


I AlcT ™*llc IT 0 
LAN CeilS 11/-Z 


3fi A 


Liver cirrhosis 


1 O 7 

13./ 


I Afcf <^llc TT O-I-TT 10 


10 O 


MPT T-JOOO tiAtia 

jNui-rizyz none 


OQ O 


JLr/\lv CcllS UL#-Z i irlN 

gamma 


12.8 


NCI-H292 IL-4 


24.0 


TAT/'^llcTT O-UTT 12 
LAK CeilS 1L-Z"T 1L,-15 


OO A 
ZZ.4 


XTPT T-TOOO TT O 


CO Q 


PMA/ionomycin 


24.8 


NCI-H292 IL-13 


26.6 


W pplJc TT 0 rAC t 

injv v^ens ijl»-z resx 


on n 


XTf^T T-JOOO TT7M rrommo 

iNd-rizyz lriN gamma 


in q 


i wo w ay iviuiv o oay 


OO H 


rlrAxiL, none 


A O 

4.2 


Two Way MLR 5 day 


32.5 


rlrAbC 1 Mr alpha + 1L-1 
beta 


58.2 


Two Way MLR 7 day 


42.6, 


Lung fibroblast none 


10.0 


PBMC rest 


10.3 


Lung iioroDlast 1 Nt alpha 
+ IL-lbeta 


7.5 




An f\ 


Lung libro blast IL-4 


4.5 


PBMC PHA-L 


49.7 


Lung fibroblast IL-9 


5.9 


Ramos (B cell) none 


1.6 


Lung fibroblast IL-13 


3.4 


Ramos (B cell) 
ionomycin 


1.5 


Lung fibroblast IFN 
gamma 


8.4 


B lymphocytes PWM 


28.1 


Dermal fibroblast 
CCDl 070 rest 


4.1 


B lymphocytes CD40L 

o«/i XT _/i 


11.4 


Dermal fibroblast 
cuuiu/u ifNr aipna 


42.9 


EOL-1 dbcAMP 


1.5 


Dermal fibroblast 
cuJJiu/u iL-i Deta 


4.9 


FOT .1 HhrAMP 

PMA/ionomycin 


1.9 


jjermai iiDroDiast iriN 
earn ma 


7.3 


Dendritic cells none 


10.2 


Dermal fibroblast IL-4 


6.8 


Dendritic cells LPS 


7.3 


Dermal Fibroblasts rest 


7.7 


Dendritic cells anti- 
CD40 


6.0 


Neutrophils TNFa+LPS 


13.5 


Monocytes rest 


2.5 


Neutrophils rest 


12.8 


Monocytes LPS 


34.9 


Colon 


6.0 


Macrophages rest 


9.4 


Lung 


10.6 


Macrophages LPS 


7.1 


Thymus 


20.3 
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HUVEC none 


5.4 


Kidney 


10.2 


HUVEC starved 


8.5 







CNS_neurodegeneration_vl.O Summary: Ag3022 No significant expression 
detected. The amp plot indicates that there is possibility of a potential chemistry or 
probe/primer failure (data not shown). 
5 GeneraljscreeningjaneLvl^ Summary: Ag3022 The amp plot indicates that there 

is possibility of a potential chemistry or probe/primer failure (data not shown). 

General_screening_panel_vl.5 Summary: Ag4891 The NOV76a gene has moderate 
levels of expression in adipose, liver, heart, skeletal muscle, pituitary, thyroid and pancreas, 
and high levels of expression in adrenal gland. Thus, this gene product may be a small 
10 molecule target for the treatment of metabolic, endocrine and adrenal diseases, including 
obesity, Types 1 and 2 diabetes, and Addison's disease. 

In addition, this gene is expressed at moderate levels in the cancer cell lines in this 
panel. A higher level of expression is observed in colon, lung, breast and ovarian cancer cell 
lines when compared to samples from the normal colon, lung, breast and ovary. Thus, this 
15 gene could be used as a diagnostic marker of cancer in these tissues. Furthermore, inhibition of 
the activity of this gene product using small molecule drugs may be useful for the treatment of 
cancer in these tissues. 

This gene encodes a homolog of a dual specificity phosphatase that is also expressed at 
low to moderate levels across the CNS. Dual-specificity phosphatases comprise a family of 
20 MAP kinase regulating enzymes, members of which are upregulated in brains subjected to 

insults such as ischemia and seizure activity. MAP kinases are known to regulate neurotrophic 
and neurotoxic pathways. Consequently, agents that modulate the activity of this gene may 
have utility in attenuating the apoptotic and neurodegenerative processes following brain 
insults. 
25 References: 

Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 
rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 

30 The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 

Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 

min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 

prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
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in the selectively vulnerable CA 1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 
PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

5 Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 

specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 

Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
10 activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 

inflammatory cytokines, their activation state could determine seizure-related brain damage. 
PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 

r 

found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 

15 the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 
was also expressed in white matter microglia as well as in microglia invading the damaged 
hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

20 Panel 4.1D Summary: Ag4891 The NOV76a gene is expressed in a wide range of cell 

types and tissues (CT=26-34) of significance in the immune response in health and disease. 
Highest expression of this gene is detected in activated secondary Th2 cells (CT=26.48). 
Therefore, targeting of this gene product with a small molecule drug or antibody therapeutic 
may modulate the functions of cells of the immune system as well as resident tissue cells and 

25 lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as asthma, allergies, inflammatory bowel disease, lupus 
erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 

Panel 4D Summary: Ag3022 No significant expression detected. Potential 
probe/primer failure (data not shown). 

30 NOV77 

Expression of gene NOV77 was assessed using the primer-probe sets Ag3023 and 
Ag3373, described in Tables BRA and BRB. Results of the RTQ-PCR runs are shown in 
Tables BRC, BRD, BRE and BRF. 
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Table BRA. Probe Name Ag3023 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctaatgctggatttgtccatca-3 1 


22 


572 


1303 


Probe 


TET-5 1 -tcaggaatatgaagccatctacctagca- 
3 ' -TAMRA 


28 


597 


1304 


Reverse 


5 1 -tggagtggtgacatcatctgta-3 1 


22 


635 


1305 



Table BRB. Probe Name Ag3373 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -atttgtccatcaacttcaggaa-3 ' 


22 


582 


1306 


Probe 


TET-5 ■ -tgaagccatctacctagcaaaattaaca- 
3 ' -TAMRA 


28 


606 


1307 


Reverse 


5 1 -tggagtggtgacatcatctgta-3 • 


22 


635 


1308 



5 Table BRC. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
AgJUZ j, Kun 
209821074 


Rel. Exp.(%) 
Agio /j, Kun 
210154071 


Tissue 
Name 


Rel. Exp.(%) 
AgJUZo, Kun 
209821074 


Rel. Exp.(%) 

A „11«1 T* ____ 

Ag3373, Run 
210154071 


AD 1 Hippo 


10.9 


16.8 


Control 

(Path) 3 
Temporal 
Ctx 


9.1 


8.0 


AD 2 Hippo 


34.2 


37.6 


Control 
(Path) 4 
Temporal 
Ctx 


40.6 


65.5 


AD 3 Hippo 


12.0 


15.8 


AD 1 
Occipital 
Ctx 


. 24.7 


29.1 


AD 4 Hippo 


13.8 


10.3 


AD 2 
Occipital 

Ctx 
(Missing) 


0.0 


0.0 


AD 5 hippo 


60.7 


57.8 


AD3 
Occipital 
Ctx 


14.7 


15.0 


AD 6 Hippo 


80.7 


72.2 


AD4 
Occipital 
Ctx 


35.4 


22.4 


Control 2 
Hippo 


35.8 


38.4 


AD5 
Occipital 
Ctx 


3.9 


30.4 


Control 4 
Hippo 


16.5 


11.7 


AD6 
Occipital 


46.0 


37.4 
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Ctx 






Control (Path) 

^ Hinnn 


13.1 


15.4 


Control 1 
Occipital 
Ctx 


9.9 


10.7 


AD 1 Temporal 
Ctx 


39.0 


31.4 


Control 2 
Occipital 
Ctx 


39.0 


38.4 


AD 2 Temporal 
Ctx 


38.7 


73.2 


Control 3 
Occipital 
Ctx 


23.0 


20.6 


AD 3 Temporal 
Ctx 


9.5 


13.2 


Control 4 
Occipital 
Ctx 


13.3 


13.3 


AD 4 Temporal 
Ctx 


27.9 


34.9 


Control 
(Path) 1 
Occipital 
Ctx 


80.1 


76.3 


AD5Inf 
Temporal Ctx 


59.0 


100.0 


Control 
(Path) 2 
Occipital 
Ctx 


17.3 


20.0 


AD 5 
SupTemporal 
Ctx 


33.2 


44.1 


Control 
(Path) 3 
Occipital 
Ctx 


8.4 


8.7 


AD6Inf 
Temporal Ctx 


100.0 


73.2 


Control 
(Path) 4 
Occipital 
Ctx 


21.2 


20.6 


AD 6 Sup 
Temporal Ctx 


79.6 


80.1 


Control 1 
Parietal Ctx 


12.1 


16.3 


Control 1 
Temporal Ctx 


10.2 


13.7 


Control 2 
Parietal Ctx 


48.0 


40.9 


Control 2 
Temporal Ctx 


41.2 


31 9 ' 


Control 3 
Parietal Ctx 


17 9 


16 3 

1U.J 


Control 3 
Temnoral Ctx 


20.3 


20.0 


Control 
(Path) 1 
Parietal Ctx 


74.7 


64.2 


Control 4 
Temporal Ctx 


9.7 


9.9 


Control 
(Path) 2 
Parietal Ctx 


28.9 


59.9 


Control (Path) 
1 Temporal Ctx 


59.9 


68.3 


Control 
(Path) 3 
Parietal Ctx 


10.2 


9.0 


Control (Path) 
2 Temporal Ctx 


40.3 


41.2 


Control 
(Path) 4 
Parietal Ctx 


44.8 


43.8 
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Table BRD. General screening panel yl,4 



Tissue Name 


ReL Exp.(%) Ag3373, 
Run 217043119 


Tissue Name 


Rel. Exp.(%) Ag3373, 
Run 217043119 


Adipose 


12.0 


Renal ca. TK-10 


20.3 


Melanoma* 
Hs688(A).T 


jv.O 


jDiauucr 


91 9 


Melanoma* 
Hs688(B).T 


U7.J 


Gastric ca. (liver met.) 
NCI-N87 


95 1 


Melanoma* M14 


15.0 


Gastric ca. KATO III 


30.8 


Melanoma* 
LOXIMVI 






0 7 


Melanoma* SK- 
MEL-5 


21.5 


Colon ca. SW480 


35.1 


Squamous cell 
carcinoma SCC-4 


33.0 


Colon ca.* (SW480 
met) SW620 


13.9 


Testis Pool 


19.8 


Colon ca. HT29 


8.5 


Prostate ca.* (bone 
met) PC-3 


100.0 


Colon ca.HCT-1 16 


36.9 


Prostate Pool 


9.2 


Colon ca. CaCo-2 


42.9 


Placenta 


3.8 


Colon cancer tissue 


9.0 


Uterus Pool 


7.4 


Colon ca. SW1116 


5.8 


Ovarian ca. 
OVCAR-3 . 


28.5 


Colon ca. Colo-205 


' 4.3 


Ovarian ca. SK-OV- 
3 


40.3 


Colon ca. SW-48 


4.2 


Ovarian ca. 
OVCAR-4 




Pnlnn Pnnl 


90 7 


Ovarian ca. 
OVCAR-5 


35.1 


Small Intestine Pool 


12.2 


Ovarian ca. IGROV- 
1 


10.9 


Stomach Pool 


9.9 


Ovarian ca. 
OVCAR-R 


9.2 


Bone Marrow Pool 


11.6 


vv<ii y 


O 7 
y.i 


reiai xiean 


zU. / 


Breast ca. MCF-7 


37.6 


Heart Pool 


10.6 


Breast ca. MDA- 
MB-231 


37.1 


Lymph Node Pool 


17.9 


DredSL Ca. D I DHy 


OZ.4 


Fetal Skeletal Muscle 


12.3 


Breast ca. T47D 


61.1 


Skeletal Muscle Pool 


16.0 


Breast ca. MDA-N 


10.0 


Spleen Pool 


11.6 


Breast Pool 


17.3 


, Thymus Pool 


12.2 


Trachea 


12.0 


CNS cancer (glio/astro) 
U87-MG 


29.1 


Lung 


6.7 


CNS cancer (glio/astro) 
U-118-MG 


69.3 
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Fetal Lung 


34.2 


CNS cancer 
(neuro;met) SK-N-AS 


34.9 


Lung ca.NCI-N417 


5.4 


CNS cancer (astro) SF- 
539 


19.1 


Lung ca. LX-1 


17.2 


CNS cancer (astro) 


35.8 


Lung ca.NCI-H146 


3.0 


SNB-19 


11.3 


Lungca. SHP-77 


18.6 


vi>o vaiiL/cr or 

295 


26.4 


Lung ca. A549 


29.1 


Brain (Amygdala) Pool 


4.5 


Luneca NCI-H526 


4 6 


id i am ^oci cucuuuij 


O.l 


Luneca NCI-H23 


31 6 


JJlalll ^icialj 


IJ.Z 


Lung ca. NCI-H460 


18.2 


Rrain fF-Tirvnfipflrnrviic^ 

Ulaul ^Xiip]JUv<llIlL/lU>^ 

Pool 


5.3 


Lung ca. HOP-62 


14.1 


Cerebral Cortex Pool 


5.4 


Lung ca. NCI-H522 


31.6 


Brain (Substantia nigra) 
Pool 


4.8 


Liver 


12 


Brain (Thalamus) Pool 


8.0 


Fetal Liver 


32.3 


Brain (whole) 


62 


Liver ca. HepG2 


14.6 


Spinal Cord Pool 


6.6 


Kidney Pool 


22.1 


Adrenal Gland 


8.1 


Fetal Kidney 


26.1 


Pituitary gland Pool 


3.0 


Renal ca. 786-0 


28.7 


Salivary Gland 


4.7 


Renal ca. A498 


11.3 


Thyroid (female) 


4.4 


Renal ca. ACHN 


12.2 


Pancreatic ca. CAPAN2 


17.3 


Renal ca. UO-31 


24.1 


Pancreas Pool 


17.1 



Table BRE. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3023, 
Run 167966931 


Tissue Name 


Rel. Exp.(%) Ag3023, 
Run 167966931 


Liver adenocarcinoma 


51.1 


Kidney (fetal) 


26.2 


Pancreas 


6.1 


Renal ca. 786-0 


34.2 


Pancreatic ca. CAPAN 
2 


17.7 


Renal ca. A498 


17.6 


Adrenal gland 


3.8 


Renal ca. RXF 393 


17.2 


Thyroid 


3.0 


Renal ca. ACHN 


13.5 


Salivary gland 


3.9 


Renal ca. UO-31 


0.0 


Pituitary gland 


3.6 


Renal ca.TK- 10 


23.0 


Brain (fetal) 


8.1 


Liver 


11.7 


Brain (whole) 


8.5 


Liver (fetal) 


8.0 


Brain (amygdala) 


6.7 - 


Liver ca. 
(hepatoblast) HepG2 


26.2 
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Brain (cerebellum) 


1 5.2 


Lung 


3.1 


Brain (hippocampus) 


C A 

j A 


Lung (fetal) 


11.0 


Brain (substantia nigra) 


9.0 


Lung ca. (small cell) 
LX-1 


12.9 


Brain (thalamus) 


4.2 


Lung ca. (small cell) 
NCI-H69 


9.9 


Cerebral Cortex 


2.0 


Lung ca. (s.cell var.) 
SHP-77 


67.8 


Spinal cord 


6.9 


Lung ca. (large 
cell)NCI-H460 


3.4 


glio/astro U87-MG 


28.5 


Lung ca. (non-sm. 
cell) A549 


45.1 


glio/astroU-118-MG 


46.7 


Lung ca. (non-s.cell) 
NCI-H23 


22.7 


astrocytoma SW1783 


40.6 


Lung ca. (non-s.cell) 
HOP-62 


25.7 


neuro*; met SK-N-AS 


272 


Lung ca. (non-s.cl) 
NCI-H522 


38.2 


astrocytoma SF-539 


29.7 


Lung ca. (squam.) 
SW 900 


27.4 


astrocytoma SNB-75 


35.1 


Lung ca. (squam.) 
NCI-H596 


29.9 


nliAwtn C\TD 1 f\ 

glioma bJNo-iy 


15.6 


Mammary gland 


5.1 


glioma U25l 


37.9 


Breast ca.* (pl.ef) 
MCF-7 


47.0 


glioma SF-295 


18.4 


MDA-MB-231 


22.7 


Heart (fetal) 


2.9 


T47D 


86.5 


Heart 


12 9 




1 ^ Q 


Skeletal muscle (fetal) 


3.4 


Breast ca. MDA-N 


10.4 


oiceieiai muscje 


141 1 


Ovary 


2.9 


Bone marrow 


4.5 


Ovarian ca.OVCAR- 

3 


26.1 


Thymus 


14.3 


Ovarian ca. OVCAR- 
4 


16.3 


Spleen 


8.7 


Ovarian ca. OVCAR- 
5 


83.5 


Lymph node 


11.8 


yjvdiidJi ca. \J vv^/\Jtv- 
8 


9.3 


Colorectal 


10.4 


Ovarian ca. IGROV- 
1 


12.0 


Stomach 


7.8 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


5.1 


Uterus 


4.9 
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Colon ca. SW480 


19.3 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


42.9 


Prostate 


3.9 


Colon ca. HT29 


9.9 


Prostate ca.* (bone 
metlPC-3 


78.5 


Colon ca. HCT-116 


26.2 


Testis 


9.7 


Colon ca. CaCo-2 


41.5 


Melanoma 
Hs688(A).T 


5.9 


Colon ca. 
tissue(OD03866) 


6.3 


Melanoma* (met) 
Hs688(B).T 


14.2 


Colon ca. HCC-2998 


16.0 


Melanoma UACC-62 


14.0 


NCI-N87 


18.8 


Melanoma M 14 


5.7 


Bladder 


30.6 


Melanoma LOX 
IMVI 


8.8 


Trachea 


3.2 


Melanoma* (met) 
SK-MEL-5 


14.7 


Kidney 


9.6 


Adipose 


18.9 



Table BRF. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 

Run 
164516146 


Rel. 
Exp.(%) 
Ag3373, 

Run 
165296617 


Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 
Run 
164516146 


Rel. 
Exp.(%) 
Ag3373, 

Run 
165296617 


Secondary Thl act 


18.6 


17.9 


HUVECIL-lbeta 


20.3 


18.6 


Secondary Th2 act 


24.3 


28.5 


HUVEC IFN 
gamma 


25.3 


22.7 


Secondary Trl act 


22.8 


21.8 


HUVEC TNF 
alpha + IFN 
gamma 


16.3 


18.0 


Secondary Thl rest 


7.5 


6.8 


HUVEC TNF 
alpha + IL4 


18.2 


13.4 


Secondary Th2 rest 


11.6 


9.5 


HUVEC IL-11 


13.7 


9.9 


Secondary Trl rest 


12.1 


10.7 


Lung 
Microvascular EC 
none 


25.7 


21.6 


Primary Thl act 


20.7 


16.5 


Lung 
Microvascular EC 
TNFalpha + IL- 
lbeta 


262 


18.3 


Primary Th2 act 


20.2 


19.3 


Microvascular 
Dermal EC none 


27.5 


21.3 


Primary Trl act 


23.3 


27.7 


Microsvasular 
Dermal EC 


20.7 


19.9 
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TNFalpha + IL- 
Ibeta 






Primary Thl rest 


51.1 


51.4 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


13.0 


16.3 


Primarv Th2 rest 


26.2 


29.5 


Small airway 
epithelium none 


8.1 


8.5 


Primarv Trl rest 

X A X1XXCU T 1111 VlJI 


23.7 


26.1 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


50.3 


39.8 


CD45RA CD4 
lymphocyte act 


14.6 


11.0 


Coronery artery 
SMC rest 


20.2 


18.9 


CD45RO CD4 
lymphocyte act 


25.2 


22.4 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


12.0 


9.8 


CD8 lymphocyte 
act 


20.4 


15.8 


Astrocytes rest 


10.4 


ll. I 


Secondary CD8 
lymphocyte rest 


16.5 


19.9 


Astrocytes 
TNFalpha + IL- 
lbeta 


11.7 


9.8 


Secondary CD8 
lymphocyte act 


13 2 


9 3 


KU-812 
(Basophil) rest 


47 6 


38 2 


CD4 lymphocyte 
none 


" 17.1 


11.6 


KU-812 
(Basophil) 
PMA/ionomycin 


94.0 


92.0 


2ry 

Thl/Th2/Trl anti- 
CD95CH11 


18.3 


16.6 


CCD1106 
(Keratinocytes) 
none 


19.9 


13.2 


LAK cells rest 


25.5 


16.0 


CCD1106 
(Keratinocytes) 

' 1 ' V TT' -1.-1 i TT 

TNFalpha + IL- 
lbeta 


6.0 


4.8 


LAK cells 1L-2 


27.2 


22.5 


Liver cirrhosis 


3.1 


2.7 


LAK cells IL-2+IL- 
12 


27.2 


19.3 


Lupus kidney 


2.1 


1.7 


LAK ceils IL- 
2+IFN gamma 


36:3 


34.4 


NCI-H292 none 


30.1 


18.9 


TAT/" 11„ TT ) 

LAK cells IL-2+ 
IL-18 


35.1 


29.7 


NCI-H292 IL-4 


33.9 


34.6 


LAK cells 
PMA/ionomycin 


12.4 


11.0 


NCI-H292 IL-9 


40.1 


29.1 


NK Cells IL-2 rest 


20.0 


15.0 


NCI-H292IL-13 


16.2 


14.2 


Two Way MLR 3 
day 


24.0 


16.7 


NCI-H292IFN 
gamma 


16.6 


18.4 


Two Way MLR 5 


12.9 


10.1 


HPAEC none 


13.6 


13.5 
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day 










Two Way MLR 7 
day 


11.4 


9.5 


HPAEC TNF 
alpha + IL-1 beta 




25.3 


PBMC rest 


13.7 


10.5 


Lung fibroblast 
none 


1 1 A 


14.2 


PBMCPWM 


69.3 


66.4 


Lung fibroblast 
TNF alpha + IL-1 
beta 


6.1 


7.2 


PBMC PHA-L 


22.8 


17.7 


Lung fibroblast 
IL-4 


28.5 


29.1 


Ramos (B cell) 
none 


24.1 


19.3 


Lung fibroblast 
IL-9 


23.0 


23.3 


Ramos (B cell) 
ionomycin 


100.0 


100.0 


Lung fibroblast 
IL-13 


on £ 

ZU.O 


18.9 


B lymphocytes 
PWM 


71.7 


74.2 


Lung fibroblast 
IFN gamma 


39.0 


32.5 


B lymphocytes 
CD40LandIL-4 


90 1 


?R 7 


Dermal fibroblast 
CCD1070rest 


33.9 


31 0 


EOL-1 dbcAMP 


12.1 


10.5 


Dermal fibroblast 
CCD1070 TNF 
alpha 


76.8 


62.0 


EOL-1 dbcAMP 
PMA/ionomycin 


14.5 


10.9 


Dermal fibroblast 
CCD1070IL-1 
beta 


20.3 


13.9 


Dendritic cells 
none 


13.2 


14.8 


Dermal fibroblast 
IFN gamma 


14.2 


9.5 


Dendritic cells LPS 


11.7 


8.3 


Dermal fibroblast 
IL-4 


26.4 


20.4 


Dendritic cells anti- 
CD40 


17.7 


12.7 


IBD Colitis 2 


2.6 


2.2 


Monocytes rest 


16.7 


17.6 


IBD Crohn's 


2.0 


1.9 


Monocytes LPS 


6.4 


5,0 


Colon • 


11.9 


10.5 


Macrophages rest 


23.5 


22.8 


Lung 


13.3 


11.2 


Macrophages LPS 


9.9 


7.1 


Thymus 


14.4 


12.9 


HUVEC none 


20.6 


17.9 


Kidney 


27.5 


19.6 


HUVEC starved 


43.5 


38.4 









CNS_neurodegeneration_vl.O Summary: Ag3023/Ag3373 This panel does not show 
differential expression of the NOV77 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion of 
5 utility of this gene in the central nervous system. 

General screening panel vl.4 Summary: Ag3373 Highest expression of the 
NOV77 gene is seen in a prostate cancer cell line (CT=27). Overall, this gene is expressed at 
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moderate levels in the cancer cell lines in this panel. A higher level of expression is observed 
in clusters of cell lines derived from prostate, brain, melanoma, colon, lung, breast and ovarian 
cancer when compared to expression in normal prostate, brain, colon, lung, breast and ovary. 
Thus, this gene could potentially be used as a diagnostic marker of cancer in these tissues. 
5 Furthermore, inhibition of the activity of this gene product using small molecule drugs may be 
effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, this gene product has moderate levels of 
expression in adipose, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. Thus, 
this gene product may be a small molecule target for the treatment of endocrine and metabolic 
10 diseases, including obesity and Types 1 and 2 diabetes. 

In addition, this gene appears to be differentially expressed in fetal (CT value = 29) vs 
adult liver (CT value =33) and may be useful for differentiation between the two sources of 
this tissue. 

This gene is also expressed at moderate levels in all central nervous system samples 
15 present on this panel. Please see Panel 1 .3D for discussion of utility of this gene in the central 
nervous system. 

Panel 1.3D Summary: Ag3023 The NOV77 gene is ubiquitously expressed among 
the samples on this panel, with highest expression in an ovarian cancer cell line (CT=28.8). 
Overall, the expression of this gene shows good agreement with panel 1 .4. A higher level of 

20 expression is observed in prostate, brain, melanoma, colon, lung, pancreatic, breast and 
ovarian cancer cell lines than the normal prostate, brain, colon, lung, pancreas, breast and 
ovary. Thus, expression of this gene could be used as a diagnostic marker of cancer in these 
tissues. Furthermore, inhibition of the activity of this gene product using small molecule drugs 
may be effective in the treatment of cancer in these tissues. 

25 Among tissues with metabolic function, expression of this gene is widespread, as in the 

previous panel. Please see Panel 1.4 for discussion of utility of this gene in metabolic disease. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
kinase regulating enzymes, members of which are upregulated in brains subjected to insults 

30 such as ischemia and seizure activity. MAP kinases are kown to regulate neurotrophic and 
neurotoxic pathways. Consequently, agents that modulate the activity of this gene may have 
utility in attenuating the apoptotic and neurodegenerative processes following brain insults. 
References: 
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Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 
rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
5 The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 
Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 
min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
10 mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 

PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 

15 29;8(14):3077-80 

Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 

20 PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 daysPACl was induced in dying CA1 andCA3 neurons. At this time PAC1 mRNA 

25 was also expressed in white matter microglia as well as in microglia invading the damaged 
hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3023/Ag3373 The NOV77 gene is expressed at high to 
moderate levels in a wide range of cell types and tissues of significance in the immune 

30 response in health and disease. Highest expression of this gene is seen in ionomycin treated 
Ramos B cells (CT=26.83). Therefore, targeting of this gene product with a small molecule 
drug or antibody therapeutic may modulate the functions of cells of the immune system as 
well as resident tissue cells and lead to improvement of the symptoms of patients suffering 
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from autoimmune and inflammatory diseases such as asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 

NOV78 

Expression of gene NOV78 was assessed using the primer-probe set Ag3025, 
5 described in Table BSA. Results of the RTQ-PCR runs are shown in Tables BSB, BSC and 
BSD. 



Table BSA. Probe Name Ag3025 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gctgctgcaattgtaataggtt-3 1 


22 


596 


1309 


Probe 


TET-5 1 -tcctgatgaattctgaacaaacctca-3 ' - 
TAMRA 


26 


618 


1310 


Reverse 


5 ' -catatggaaggtcttgcatttt-3 • 


22 


669 


1311 



Table BSB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 209821733 


Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 209821733 


AD 1 Hippo 


21.3 


Control (Path) 3 
Temporal Ctx 


4.8 


AD 2 Hippo 


29.1 


Control (Path) 4 
Temporal Ctx 


50.7 


AD 3 Hippo 


15.9 


AD 1 Occipital Ctx 


22.7 


AD 4 Hippo 


11.4 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


66.4 


AD 3 Occipital Ctx 


9.0 


AD 6 Hippo 


26.6 


AD 4 Occipital Ctx 


13.7 


Control 2 Hippo 


32.5 


AD 5 Occipital Ctx 


54.7 


Control 4 Hippo 


7.2 


AD 6 Occipital Ctx 


8.1 


Control (Path) 3 
Hippo 


4.8 


Control 1 Occipital 
Qx 


2.3 


AD 1 Temporal Ctx 


19.9 


Control 2 Occipital 
Ctx 


40.1 . 


AD 2 Temporal Ctx 


29.9 


Control 3 Occipital 
Ctx 


13.8 


AD 3 Temporal Ctx 


9.3 


Control 4 Occipital 
Ctx 


5.6 


AD 4 Temporal Ctx 


25.2 


Control (Path) 1 
Occipital Ctx 


84.7 


AD 5 Inf Temporal 
Ctx 


79.6 


Control (Path) 2 
Occipital Ctx 


10.0 


AD 5 Sup Temporal 


56.6 


Control (Path) 3 


0.6 
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Ctx 




Occipital Ctk 




AD 6 Inf Temporal 
Ctx 


19.3 


Control (Path) 4 
Occipital Ctx 


12.3 


AD 6 Sup Temporal 
Ctx 


24.5 


Control 1 Parietal 
Ctx 


3.8 


Control 1 Temporal 
Ctx 


9.7 


Control 2 Parietal 
Ctx 


30.1 


Control 2 Temporal 
Ctx 


43.5 


Control 3 Parietal 
Ctx 


21.6 


Control 3 Temporal 
Ctx 


15.2 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 3 Temporal 
Ctx 


7.4 


Control (Path) 2 
Parietal Ctx 


34.9 


Control (Path) 1 
Temporal Ctx 


75.3 


Control (Path) 3 
Parietal Ctx 


1.5 


Control (Path) 2 
Temporal Ctx 


35.6 


Control (Path) 4 
Parietal Ctx 


52.1 



Table BSC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 167968622 


Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 167968622 


Liver adenocarcinoma 


3.6 


Kidney (fetal) 


163 


Pancreas 


6.8 


Renal ca. 786-0 


62 


Pancreatic ca. CAPAN 
2 


2.7 . 


Renal ca. A498 


10.9 


Adrenal gland 


2.3 


Renal ca. RXF 393 


6.7 


Thyroid 


7.2 


Renal ca. ACHN 


3.3 


Salivary gland 


3.2 


Renal ca.UO-31 


5.3 


Pituitary gland 


16.3 


Renal ca.TK-10 


7.2 


Brain (fetal) 


37.6 


Liver 


3.8 


Brain (whole) 


33.2 


Liver (fetal) 


3.4 


Brain (amygdala) 


29.7 


Liver ca. 
(hepatoblast) HepG2 


2.5 


Brain (cerebellum) 


9.7 


Lung 


4.6 


Brain (hippocampus) 


21.9 1 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


12.2 


Lung ca. (small cell) 
LX-1 


3.8 


Brain (thalamus) 


10.4 


Lung ca. (small cell) 
NCI-H69 


2.5 


Cerebral Cortex 


16.3 


Lung ca. (s.cell var.) 
SHP-77 


100.0 


Spinal cord 


10.7 


Lung ca. (large 
cell)NCI-H460 


0.8 


glio/astro U87-MG 


6.0 


Lung ca. (non-sm. 


5.8 
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cell) A549 




glio/astroU-118-MG 


8.8 


Lung ca. (non-s.cell) 
NCI-H23 


15.4 


astrocytoma SW1783 


15.3 


Lung ca. (non-s.cell) 
HOP-62 


7.7 


neuro*; met SK-N-AS 


6.4 


Lung ca. (non-s.cl) 
NCI-H522 


13.4 


astrocytoma SF-539 


5.3 


T / \ 

Lung ca. (squam.) 
SW 900 


1.7 


astrocytoma SNB-75 


7.1 


Lung ca. (squam.) 
NCI-H596 


3.7 


glioma SNB-19 


6.3 


Mammary gland 


A t 

4.1 


glioma U251 


9.9 


Breast ca.* (pl.ef) 
MPF-7 


8.7 


glioma SF-295 


10.4 


J->I Cool L/d. \y\,\Z\) 

MDA-MB-231 


1.7 


Heart (fetal) 


3.3 


Oi wool ud. ypi.d) 

T47D 


14.1 


XlcaiL 


on i 






Skeletal muscle (fetal) 


0.5 


Breast ca. MDA-N 


15.8 


Skeletal muscle 


15.5 


Ovary 


9.0 


Bone marrow 


2.8 


Ovarian ca. OVCAR- 
3 


3.5 


Thymus 


11.9 


Ovarian ca. OVCAR- 
4 


1.0 


Spleen 


5.6 


Ovarian ca. OVCAR- 
5 


19.2 


Lymph node 


3.1 


Ovarian ca. OVCAR- 

Q 
o 


5.1 


Colorectal 


4.9 


Ovarian ca. IGROV- 

i 
i 


0.6 


Stomach 


11.9 


Ovarian ca.* (ascites) 
SK-OV-3 


52,1 


Small intestine 


6 2 




7 1 


Colon ca. SW480 


0.3 


Placenta 


3.1 


Colon ca.* 
SW620(SW480 met) 


17.0 


Prostate 


1.4 


Colon ca. HT29 


2.8 


Prostate ca.* (bone 
met)PC-3 


11.6 


Colon ca.HCT-1 16 


5.1 


Testis 


5.4 


Colon ca. CaCo-2 


15.2 


Melanoma 
Hs688(A).T 


2.2 


Colon ca. 
tissue(OD03866) 


1.7 


Melanoma* (met) 
Hs688(B).T 


2.2 


Colon ca. HCC-2998 


9.0 


Melanoma UACC-62 


12.2 
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Gastric ca.* (liver met) 
NCI-N87 


3.2 


Melanoma M14 


3.6 


Bladder 


3.1 


Melanoma LOX 
IMVI 


3.5 


Trachea 


5.2 


Melanoma* (met) 
SK-MEL-5 


11.8 


Kidney 


17.4 


Adipose 


7.9 



Table BSD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3025,Run 
164528140 


Tissue Name 


Rel. Exp.(%) 
Ag3025, Run 
164528140 


Secondary Thl act 


2.6 


HUVEC IL-lbeta 


12.5 


Secondary Th2 act 


3.3 


HUVEC IFN gamma 


9.5 


ovvuuuui y x i i dvv 


4 R 


HUVEC TNF alpha + IFN 
gamma 




Secondary Thl rest 


1.2 


HUVEC TNF alpha + IL4 


5.1 


Secondary Th2 rest 


2.7 


HUVEC IL-11 


3.4 


Secondary Trl rest 


1.4 


Lung Microvascular EC 
none 


10.7 


Primary Thl act 


6.2 


Lung Microvascular EC 
TNFaIpha + IL-lbeta 


13.5 


Primary Th2 act 


3.5 


Microvascular Dermal EC 
none 


6.0 


Primary Trl act 


2.5 . 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


2.6 


Primary Thl rest 


5.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.2 


Primary Th2 rest 


6i7 


Small airway epithelium 
none 


0.5 


Primary Trl rest 


1.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


8.0 


CD45RACD4 
lymphocyte act 


4.4 


Coronery artery SMC rest 


32.5 


CD45ROCD4 
lymphocyte act 


4.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.9 


CD8 lymphocyte act 


16.8 


Astrocytes rest 


9.2 


Secondary CD8 
lymphocyte rest 


5.2 


Astrocytes TNFalpha + 
IL-lbeta 


3.4 


Secondary CD8 
lymphocyte act 


1.9 


KU-812 (Basophil) rest 


7.3 


CD4 lymphocyte none 


2.3 


KU-812 (Basophil) 
PMA/ionomycin 


29.5 


2ry Thl/Th2/Trl_anti- 


2.6 


CCD1 106 (Keratinocytes) 


1.3 
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CD95 CH11 




none 




LAK cells rest 


5.8 


CCD1 106 (Keratinocvtes^ 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


21.0 


Liver cirrhosis 

M—M ft V W ft VlllllVtJllJ 


3.7 


LAK cells IL-2+IL-12 


11.3 


Ludus Icidnev 


3.7 


LAK cells IL-2+IFN 
gamma 


12.4 


NCI-H292 none 


12.2 


LAK cells IL-2+ IL-1 8 


67.4 


NCI-H292IL-4 


10.2 


LAK cells 
PMA/ionomycin 


0.6 


NCI-H292 IL-9 


100.0 


NK Cells IL-2 rest 


4.2 


NCI-H292IL-13 


5.1 


Two Wav MLR 3 dav 


3.5 


NCI-H292 IFN samma 

X^^'X llXi/^i AX 1 i CluXXIXXICl 


2 7 


Two Wav MLR 5 dav 


1 3 


HPAEC none 


9 3 


Two Way MLR 7 day 


4.9 


HPAEC TNF aloha + IL-1 

X XX xl .1 1 111 (tlUllu < XX.J X 

beta 


2.4 


PBMC rest 


4.2 


Liinf* fibroblast none 


5 5 


PBMCPWM 


24.7 


Lung fibroblast TNF alpha 
+ IL-1 beta 


6.3 


PBMC PHA-L 


4.3 


Lung fibroblast IL-4 


6.8 


Kamos (o ceil) none 


in i 

ly.i 


T XT 1- 1- 1 _ _ ^ it f\ 

Lung tibroblast IL-9 


10.4 


Ramos (B cell) 
lonomycin 


33.4 . 


Lung fibroblast IL-1 3 


4.5 


B lymphocytes PWM 


192 


Lung fibroblast IFN 
gamma 


152 


B lymphocytes GD40L 

orlJ J TT yl 

ana 


5.2 


Dermal fibroblast 
LCD 1070 rest 


17.8 


EOL-l dbcAMP 


0.7 


Dermal fibroblast 
CCD1070 TNF alpha 


36.6 


POT 1 A \yfl> 

PMA/ionomycin 


0.9 


Dermal tibroblast 
CCD1 070 IL-1 beta 


6.5 


Dendritic cells none 


3.7 


Dermal fibroblast IFN 
gamma 


0.6 


Dendritic cells LPS 


6.0 


Dermal fibroblast IL-4 


6.6 


Dendritic cells anti- 
CD40 


3.3 


IBD Colitis 2 


0.7 


Monocytes rest 


2.4 


IBD Crohn's 


0.8 


Monocytes LPS 


0.6 


Colon 


21.5 


Macrophages rest 


5.6 


Lung 


18.3 


Macrophages LPS 


0.8 


Thymus 


52.1 


HUVEC none 


6.6 


Kidney 


19.2 


HUVEC starved 


21.9 







CNSjneurodegeneration_vl.O Summary: Ag3025 This panel does not show 
differential expression of the NOV78 gene in Alzheimer's disease. However, this expression 
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profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion of 
utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3025 Highest expression of the NOV78 gene is seen in a 
lung cancer cell line (CT=30.5). Higher levels of expression are observed in prostate, lung, 
5 breast and ovarian cancer cell lines when compared with the normal prostate, lung, breast and 
ovary. Thus, expression of this gene may be used as a diagnostic marker of cancer in these 
tissues. Furthermore, inhibition of the activity of this gene product using small molecule drugs 
may be effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, this gene has a low level of expression in 
10 pancreas, thyroid, pituitary, heart, and adipose. Therefore, this gene product may be a small 
molecule target for the treatment of metabolic and endocrine diseases, including obesity and 
Types 1 and 2 diabetes. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
15 kinase regulating enzymes that are upregulated in brains subjected to insults such as ischemia 
and seizure activity. MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. 
Consequently, agents that modulate the activity of this gene may have utility in attenuating the 
apoptotic and neurodegenerative processes following brain insults. 

References: 

20 Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 

rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 

25 Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 
min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 

30 PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC 1 in rat brain following seizure activity. Neuroreport 1 997 Sep 
. 29;8(14):3077-80 
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Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 
5 PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 

10 was also expressed in white matter microglia as well as in microglia invading the damaged 
hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3025 The NOV78 gene is expressed at moderate to low levels 
in a wide range of cell types of significance Highest expression is detected in IL-9 treated 

1 5 NCI-H292 mucoepidermoid cells (CT=3 1 .81) with lower expression levels in non-treated 
NCI-H292 cells. Expression is also seen in (i) LAK cells stimulated with IL-2, IL-2 +IL-12, 
IL-2 + IL-1 8, and IL-2 + IFNgamma (ii) stimulated and non-stimulated Ramos B cells and 
polkweed mitogen stimulated B lymphocytes, (iii) starved and IL-1 treated HUVECs, (iv) 
TNF alpha+IL-l beta treated and non treated lung microvascular endothelial cells and resting 

20 coronary artery smooth muscle cells (v) treated Ku-812 basophils (vi) IFN gamma treated lung 
fibroblasts, and (vii) normal tissues represented by colon, lung, thymus and kidney. Based on 
this pattern of expression, this gene product may be involved in both disease and homeostatic 
processes for these and other cell types and tissues. Therefore, modulation of this gene product 
with a functional therapeutic may lead to the alteration of functions associated with these cell 

25 and tissue types and improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as COPD, emphysema, asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 

NOV79: Dual Specificity Phosphatase 

Expression of gene NOV79 was assessed using the primer-probe set Ag3039, 
30 described in Table BTA. Results of the RTQ-PCR runs are shown in Tables BTB, BTC and 
BID. 

Table BTA. Probe Name Ag3039 

Primers] Sequences } Length j Start } SEQ ID NoT] 
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1 


Position 




Forward 


5 1 -gccgaaataagatcacacacat-3 1 


I 22 


320 


1312 


Probe 


TET-5 1 - tctatccatgagtcaccccagcctct-3 1 
TAMRA 


"26 


346 


1313 


Reverse 


5 • -atgcgaaggtaggtgatatcct-3 1 


|22 


377 


1314 



Table BTB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel.Exp.(%)Ag3039, 
Run 211012103 


Ticciip IVamp 


ReL Exp.(%) Ag3039, 
Run 211012103 


AD 1 HiDDO 

•* ***** M. -M. XI 1-r yj \J 


18.4 


Control (Path) 3 
Temporal Ctx 




AD 2 Hippo 


48.0 


Control (Path) 4 
Temporal Ctx 


36 3 


AD 3 Hippo 


9.8 


AD 1 Occipital Ctx 


9.5 


AD 4 Hippo 


13.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


70.2 


AD 3 Occipital Ctx 


6.3 


AD 6 Hippo 


69.3 


AD 4 Occipital Ctx 


20.9 


Control 2 Hippo 


25.5 


AD 5 Occipital Ctx 


18.3 


Control 4 Hippo 


24.0 


AD 6 Occioital Ctx 


43 2 


Control (Path) 3 
Hippo 


7.6 


Control 1 Occipital 
Ctx 


6.0 


AD 1 Temporal Ctx 


24.3 


Control 2 Occipital 
Ctx 


57.0 


AD 2 Temporal Ctx 


36.9 


Control 3 Occipital 
Ctx 


18.7 


AD 3 Temporal Ctx 


4.7 


Control 4 Occipital 
Ctx 


13.9 


AD 4 Temporal Ctx 


24.5 


Control (Path) 1 
Occipital Ctx 


74.2 


AD 5 Inf Temporal 
Ctx 


1UU.U 


Control (Path) 2 
Occipital Ctx 


1 J n 

14.8 


AD 5 SupTemporal 
Ctx 


62.9 


Control (Path) 3 
Occipital Ctx 


4.3 


AD 6 Inf Temporal 
Ctx 


58.2 


Control (Path) 4 
Occipital Ctx 


25.2 


AD 6 Sup Temporal 
Ctx 


49.3 


Control 1 Parietal 
Ctx 


15.9 


Control 1 Temporal 
Ctx 


11.6 


Control 2 Parietal 
Ctx 


58.2 


Control 2 Temporal 
Ctx 


34.4 


Control 3 Parietal 
Ctx 


32.1 


Control 3 Temporal 
Ctx 


20.0 


Control (Path) 1 
Parietal Ctx . 


66.9 


Control 4 Temporal 


20.7 


Control (Path) 2 


39.0 
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Ctx 




Parietal Ctx 




Control (Path) 1 
Temporal Ctx 


44.4 


Control (Path) 3 
Parietal Ctx 


4.6 


Control (Path) 2 
Temporal Ctx 


30.4 


Control (Path) 4 
Parietal Ctx 


35.8 



Table BTC. Panel 13D 



' 1 'iCCllA nJniMA 


Rel. Exp.(%)Ag3039, 
Run 167961816 


l issue in a me 


Rel. Exp.(%) Ag3039, 
Run 167961816 


Liver adenocarcinoma 


1.7 


Kidney (fetal) 


38.2 


Pancreas 


2.0 


Renal ca. 786-0 


0.6 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.4 


Adrenal gland 


1.2 


Renal ca. RXF 393 


6.1 


Thyroid 


6.0 


Renal ca. ACHN 


1.5 


Salivary gland 


6.8 


Renal ca. UO-31 


0.0 


Pitiiitfirv oi anH 


JmXi 


lYCIldl Cd. i IV" 1 U 




Brain (fetal) 


7.4 


Liver 


0.3 


Brain (whole) 


7.7 


Liver (fetal) 


0.7 


Brain (amygdala) 


6.0 


Liver ca. 
(hepatoblast) HepG2 


0.1 


Brain (cerebellum) 


5.9 


Lung 


1.3 


Brain (hippocampus) 


3.9 


Lung (fetal) 


3.0 


Brain (substantia nigra) 


16.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


3.8 


Lung ca. (small cell) 
NCI-H69 


1 5 


Cerebral Cortex 


10.8 


Lung ca. (s.cell var.) 
SHP-77 


3.7 


Spinal cord 


15.2 


Lung ca. (large 
ce!l)NCI-H460 


0.1 


glio/astro U87-MG 


0.2 


Lung ca. (non-sm. 
cell)A549 


1.6 


glio/astroU-118-MG 


0.1 


Lung ca. (non-s.ceIl) 
NCI-H23 


0.4 


astrocytoma SW1783 


0.6 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.3 \ 


Lung ca. (non-s.cl) 
NCI-H522 


1.4 


astrocytoma SF-539 


1.5 


Lung ca. (squam.) 
SW900 


0.7 


astrocytoma SNB-75 


1.2 


Lung ca. (squam.) 
NCI-H596 


5.7 


glioma SNB-19 


2.1 


Mammary gland 


2.4 
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glioma U251 


0.3 


Breast ca.* (pl.ef) 


0.7 


glioma SF-295 


2.3 


Breast ca. ^pi.eij 
MDA-MB-231 


0.1 


Heart (fetal) 


14.5 


t>reasi ca. {pi.eij 
T47D 


13.5 


jnccu l 




OicaSL Ca. D l -jU 


l.Z 


Skeletal muscle (fetal) 


5.1 


Breast ca. MDA-N 


6.4 


Skeletal muscle 


0.0 


Ovary 


3.9 


Bone marrow 


0.4 


Ovarian ca. OVCAR- 
3 


0.3 


Thymus 


0.2 


Ovarian ca. OVCAR- 
4 


13.8 


Spleen 


3.1 


Ovarian ca. OVCAR- 
5 


1.5 


Lymph node 


0.9 


Ovarian ca. OVCAR- 

o 
o 


0.0 


Colorectal 


0.4 


Ovarian ca. IGROV- 
i 


0.0 


Stomach 


0.6 


Ovarian ca.* (ascites) 

CT/' C\\7 1 


0.4 


C tri oil i ntoctino 

omaii lines tine 


i .u 


Uterus 


1.Z 


Colon ca. SW480 


6.7 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


2.7 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)r\_,-j 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


100.0 


Colon ca. CaCo-2 


12.2 


Melanoma 
Hs688(A).T 


0.1 


Colon ca. 
tissue(OD03866) 


12 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


3.0 


Melanoma UACC-62 


5.2 


Gastric ca.* (liver met) 
NCI-N87 


1.2 


Melanoma M14 


0.8 


Bladder 


1.S 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.6 


Melanoma* (met) 
SK-MEL-5 


2.0 


Kidney 


41.8 


Adipose j 


0.6 


Table BTD. Panel 4D 


Tissue Name 


ReI.Exp.(%) Tissue Name ReLExp.(%) 
Ag3039,Run UssueJName Ag 3039,Run 
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162427949 




162427949 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Sf-pnnHjirv Th2 art 


0 0 


HTIVFC' TFN mmma 




Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 

era mm a 

gClllllllu 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary I iiz rest 


0.0 


T TT "PI rT?/"*i TT 1 1 

HUVEC IL-ll 


0.3 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


1.3 


Primary Thl act 


0.0 


Lung Microvascular EC 

HPXTT"* 1 1_ I TT 1 L a. 

TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.3 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.4 


Primary Thl rest 


0.0 


Bronchial epithelium 

ii iv | ft tl^i TT It— a. 

TNFalpha + ILlbeta 


1.4 


Primary Th2 rest 




0.0 

— — _____ 


Small airway epithelium 
none 


1.1 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.3 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CJDo lymphocyte act 


0.0 


Astrocytes rest 


4.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


2.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 

p\/f A I\ on nmvr i n 


0.0 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


0.0 


CCD1 106 (Keratinocytes) 

none 

1 11/11 W 


0.4 


LAK cells rest 


0.1 


CCD1 106 nCeratinocvte<ii 

TNFalpha + IL-lbeta 


0.4 


LAK cells IL-2 


0.4 


-L*1VC1 wlXIllUoiJ 


ft 7 


LAK cells IL-2+IL-12 


0 0 


i_fU|JU5 jviuuc v 




LAK cells IL-2+IFN 
gamma 


0.4 


NCI-H292 none 


3.5 


LAKcellsIL-2+IL-18 


0.8 


NCI-H292 IL-4 


0.9 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292IL-9 


1.8 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


1.2 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


3.6 
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i wo w cty ivij_»i\. j tidy 


o n 
u.u 


HP AFP nnnp 

rir/\iZ/V_. none 


ft ft 


Two Way MLR 7 day 


0.0 


T4PAFP TNF alnfin 4- TT -1 

beta 


0.0 


x JDiviv^ rest 


ft ^ 


LfUug iiDruDiasi none 


ft ft 


PBMC PWM 


2.5 


Lung fibroblast TNF alpha 


0.0 


PBMC PHA-L 


1.6 


Lung fibroblast LL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.5 


B lymphocytes PWM 


5.5 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

J TT A 

andIL-4 


1.3 


Dermal fibroblast 
CCD1 070 rest 


0.0 


EOL-1 dbcAMP 


0.6 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
jrivi/v lunuinycin 


0.0 


Dermal fibroblast 

rrni ft7fi tt i k^+o 
la^ljiu/u ijl-i oeia 


0.0 


Dendritic cells none 


0.0 


normal -ftKv/\K1oo-f TTTXT 

jjermai iiDroDiasi iriN 
gamma 


0.0 


Dendritic cells LPS 


0.4 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


1.1 


Monocytes LPS 


0.4 


Colon 


2.5 


Macrophages rest 


1.0 


Lung 


5.1 


Macrophages LPS 


0.4 


Thymus 


100.0 


HUVEC none 


0.6 


Kidney 


3.1 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3039 No differential expression of the 
NOV79 gene is detected between the postmortem brains of Alzheimer's diseased patients and 
those of non-demented controls. However, this panel confirms the expression of this gene in 
5 the CNS. Please see panel 1.3D for a discussion of utility of this gene in the central nervous 
system. 

Panel L3D Summary: Ag3039 Highest expression of the NOV79 gene is seen in the 
testis (CT=29). In addition, expression of this gene is extremely low in renal and brain cancer 
cell lines but is expressed in the normal brain and kidney tissues on this sample. Therefore, 
1 0 this gene may be used as a diagnostic marker for brain and kidney cancer and prostate tissue. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of brain and renal cancers. 
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In addition, this gene is expressed at low levels in metabolic tissues including pancreas, 
adrenal, thyroid, pituitary, adult and fetal heart, and adipose. This novel protein phosphatase 
may be a small molecule target for the treatment of metabolic and endocrine disease, including 
obesity and Types 1 and 2 diabetes. This gene is also differentially expressed in fetal (CT 
5 values = 32-33) vs adult skeletal muscle (CT values = 35-40) and may be useful for the 
differentiation of adult and fetal skeletal muscle. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
kinase regulating enzymes that are upregulated in brains subjected to insults such as ischemia 
10 and seizure activity. MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. 
Consequently, agents that modulate the activity of this gene may have utility in attenuating the 
apoptotic and neurodegenerative processes following brain insults. 

References: 

Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 
15 rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 
Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 

20 min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mKNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 
PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 

25 ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 

Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
30 responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 
PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
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lead to rapid (approximately 3 h) but transient appearance'of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC 1 was induced in dying CA1 and CA3 neurons. At this time PAC 1 mRNA 
was also expressed in white matter microglia as well as in microglia invading the damaged 

5 hippocampus. PAC 1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3039 Expression of the NOV79 gene is highest and almost 
exclusive to the thymus (CTs=29-30). Expression of this gene could be used to distinguish 
thymus from the other samples on this panel. The putative dual-specificity phosphatase 

10 encoded by this gene may play an important role in T cell development. Small molecule 

therapeutics designed against the protein encoded by this gene could therefore be utilized to 
modulate immune function (T cell development) and be important for organ transplant, AIDS 
treatment or post chemotherapy immune reconstitiution. 

NOV80 

1 5 Expression of gene NOV80 was assessed using the primer-probe set Ag3044, 

described in Table BUA. Results of the RTQ-PCR runs are shown in Tables BUB and BUC. 

Table BUA. Probe Name Ag3044 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 1 - tgacgcagaatggaataagct - 3 1 


*i 


650 


1315 


Probe 


TET-5 ' -acgtcctctatgccagcaactcctg-3 1 - 
TAMRA 


25 


671 


1316 


Reverse 


5 1 -gcaagaagtggctctggtagat-3 f 


22 


712 


1317 



Table BUB. Panel 1 .3D 



Tissue Name 


Rel. Exp.(%) Ag3044, 
Run 167972762 


Tissue Name 


Rel. Exp.(%) Ag3044, 
Run 167972762 


Liver adenocarcinoma 


0.1 


Kidney (fetal) 


0.5 


Pancreas 


0.1 


Renal ca. 786-0 


0.2 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca.A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.1 


Thyroid 


0.0 


Renal ca. ACHN 


0.4 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.1 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.4 


Liver 


0.0 


Brain (whole) 


0.2 


Liver (fetal) 


0.0 
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Brain (amygdala) 


0.1 


Liver ca. 


0.1 


Brain (cerebellum) 


0.5 


Lung 


0.0 


Brain (hippocampus) 


A 1 

0.1 


Lung (fetal) 


0.1 


Brain (substantia nigra) 


0.1 


Lung ca. (small cell) 
LX-1 


O.l 


Brain (thalamus) 


0.1 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.1 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.2 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.4 


astrocytoma SW1783 


0.1 


Lung ca. (non-s.cell) 
HOP-62 


0.1 


neuro*; met SK-N-AS 


0.1 


Lung ca. (non-s.cl) 
NCI-H522 


03 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW900 


0.2 


astrocytoma SNB-75 


0.2 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.1 


Mammary gland 


0.0 


glioma U251 


0.1 


Breast ca.* (pl.ef) 

MPP-7 
ivi / 


0.0 


glioma SF-295 


0.1 


MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Jjicaol La. ^pi.Cly 

T47D 


0.2 


Heart 


0 1 


Rrea«tf ea RT-S49 


o n 

v.v 


Skeletal muscle (fetal) 


0.1 


Breast ca. MDA-N 


bo 


oKeieiai muscle 


C\ 1 

0.1 


Ovary 


0.1 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.1 


Thymus 


0.1 


Ovarian ca. OVCAR- 
4 


0.1 


Spleen 


0.0 


uvanan ca. uv^ak- 


0.3 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.1 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.1 


Ovarian ca.* (ascites) 


0.1 
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SK-OV-3 


Small intestine 


u.u 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.2 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


0.1 


Colon ca. CaCo-2 


0.2 


Melanoma 
Hs688(A).T 


0 0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


00 


Colon ca. HCC-2998 


0.1 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.1 


Melanoma M14 


0.0 


Bladder 


0.1 


Melanoma LOX 
IMVI 


0.0 


Trachea 


100.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.2 


Adipose 


0.0 



Table BUC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3044, Run 
164334372 


Tissue Name 


Rel. Exp.(%) 
Ag3044,Run 
164334372 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


3.5 


Secondary Th2 act 


5.1 


. HUVEC IFN gamma 


0.0 


Secondary Trl act 


1.3 


HUVEC TNF alpha + IFN 
gamma 


2.0 


Secondary Thl rest 


1.2 


HUVEC TNF alpha + IL4 


2.0 


Secondary Th2 rest 


0.7 


HUVEC IL-11 


0.0 


Secondary Trl rest 


2.1 


Lung Microvascular EC 
none 


1.6 


Primary Thl act 


2.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.3 


Primary Th2 act 


4.0 


Microvascular Dermal EC 
none 


1.6 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


2.7 


Primary Thl rest 


9.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.7 


Primary Th2 rest 


11.9 


Small airway epithelium 
none 


2.5 


Primary Trl rest 


4.6 


Small airway epithelium 


20.9 



1365 



WO 02/068649 



PCT/US02/02785 







TNFalpha + IL-lbeta 




CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


4.3 


CD45RO CD4 
lymphocyte act 


1.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.6 


CD8 lymphocyte act 


5.3 


Astrocytes rest 


4.3 


Secondary CD8 
lymphocyte rest 


1.9 


Astrocytes TNFalpha + 
IL-lbeta 


2.9 


Secondary CD8 
lymphocyte act 


22.8 


KU-812 (Basophil) rest 


6.8 


CD4 lymphocyte none 


0.1 


KU-812 (Basophil), 
r ivi/v i onomy c in 


9.5 


2ry Thl/Th2/Trl_anti- 


2.5 


CCDl 106 (Keratinocytes) 
none 


2.1 


LAK cells rest 


4.5 


^i^jji luo {jveratinocyies^ 
TNFalpha + IL-lbeta 


0.0 


T AY p^llc TI 9 
iv/\IV CCUS 1JL~Z 


ft ft 


jLiver cuTnosis 


1 7 H 


T AY rplk TT -94-TT 19 
JUrvTk. LCili> • lJ-»- 1 Z 




iwUpus Kianey 


£ 1 


T p#»11c TT 04-TT7KT 
J-f/vTw CcJlS 1 IN 

gamma 


8.6 


NCI-H292 none 


52.5 


LAK cells IL-2+IL- 18 


6.5 


NCI-H292 IL-4 


20.6 


PMA/ionomyc in 


0.2 


NCI-H292 IL-9 


31.4 


NK Cells IL-2 rest 


2.1 


NCI-H292 IL-13 


16.3 


i wu Way iyll»iv j Uay 


1 8 


"MPT TFTsJ from ma 
INv^l-JtlZyZ JJTiN gamma 


1 1 0 

1 1.7 


i wu w ay iviljIv. j uay 


ft ft 


rir/\iiv^ none 


u.u 


Two Way MLR 7 day 


0.0 


ixi/VD\^ iiNr aipna x ll,-i 
beta 


1.0 


IT J->1V|.V_, ICSl 




T linn TlnrAnlOCt nnnp 

j^ung iiDrooiasi none 


J. 5 


PBMC PWM 


3.2 


Lung fibroblast TNF alpha 

4- TT -1 Kpta 


3.6 


PBMC PHA-L 


3.0 


Lung fibroblast IL-4 


5.6 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


6.9 


Ramos (B cell) 
ionomycin 


02 


Lung fibroblast IL-13 


2.9 


B lymphocytes PWM 


11.7 


Lung fibroblast IFN 
gamma 


3.0 


B lymphocytes CD40L 

nnA TT A 


86.5 


Dermal fibroblast 
cCDiu/u rest 


5.8 


EOL-l dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


9.9 


EOL-1 dbcAMP 
PMA/ioriomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.9 


Dendritic cells none 


4.1 


Dermal fibroblast IFN 
gamma 


1.0 
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Dendritic cells LPS 


1.1 


Dermal fibroblast IL-4 


4.6 


Dendritic cells anti- 
CD40 


1.1 


IBD Colitis 2 


0.0 


Monocytes rest 


1.7 


IBD Crohn's 


3.8 


Monocytes LPS 


2.5 


Colon 


17.9 


Macrophages rest 


4.1 


Lung 


7.6 


Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


4.8 


Kidney 


10.1 


HUVEC starved 


2.2 







Panel 1.3D Summary: Ag3044 Results from one experiment with the NOV80 gene 
are not included. The amp plot indicates that there were experimental difficulties with this run 
(data not shown). 

5 Panel 4D Summary: Ag3044 The NOV80 gene is expressed at low levels in a wide 

range of cell types of significance in the immune response in health and disease. These cells 
include: (i) resting LAK and LAK cells stimulated with IL-2+IL-12, IL-2 + IL-18, and IL-2 + 
IFNgamma (ii) activated primary and secondary Th2 cells, resting primary Thl, Th2 and Trl 
cells, and activated CD8 and secondary CD8 lymphocytes, (iii) EL-1 beta treated HUVECs, 

10 (iv) polkweed mitogen stimulated and CD40L + EL-4 stimulated B lymphocytes, (v) treated . 
and non-treated Ku-812 basophils and non-treated dendritic cells, (vi) treated and non-treated 
peripheral blood mononuclear cells and resting macrophages (vii) treated and non-treated 
NCI-H292 mucoepidernioid, (viii) treated and non-treated lung fibroblasts, (viii) treated and 
non-treated astrocytes (ix) resting coronery artery SMCs, (x) resting and TNFalpha treated 

15 CCD1070 dermal fibroblasts and EL-4 treated dermal fibroblasts (xi) IBD Crohn's diseases 
tissue and normal tissues represented by colon, lung, thymus and kidney with the highest 
expression being detected in thymus tissue (CT=29.81). This expression profile suggests that 
this gene product may be involved in both disease and homeostatic processes in these and 
other cell types and tissues. Therefore, modulation of this gene product with a functional 

20 therapeutic may lead to the alteration of functions associated with these cell and tissue types 
and lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as COPD, emphysema, asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 

NOV81aandNOV81b 
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Expression of gene NOV81a and the full length clone, NOV81b, was assessed using 
the primer-probe set Ag2906, described in Table BVA. Results of the RTQ-PCR runs are 
shown in Tables BVB, BVC and BVD. 



Table BVA. Probe Name Ag2906 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -ctacctgggtgaggtctttacc-3 1 


22 


845 


1318 


Probe 


TET-5 1 -ctccggaagccaggaggaccctt-3 ' - 
TAMRA 


23 


879 


1319 


Reverse 


5 1 -agaaggactcgggcacatag-3 • 


20 


902 


1320 



Table BVB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2906, 
Run 162556445 


Tissue Name 


Rel. Exp.(%) Ag2906, 
Run 162556445 


Liver adenocarcinoma 


2.0 


Kidney (fetal) 


13.5 


Pancreas 


1.2 


Renal ca. 786-0 


1.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


2.8 


Adrenal gland 


1.1 


Renal ca. RXF 393 


1.9 


Thyroid 


4.0 


Renal ca. ACHN 


0.7 


Salivary gland 


3.3 


Renal ca.UO-31 


0.0 


Pituitary gland 


2.4 


Renal ca.TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


6.8 


Brain (amygdala) 


2.0 


Liver ca. 
(hepatoblast) HepG2 


5.4 


Brain (cerebellum) 


3.8 


Lung 


42.0 


Brain (hippocampus) 


4.7 


Lung (fetal) 


11.8 


Brain (substantia nigra) 


0.9 


Lung ca. (small cell) 
LX-1 


0.7 


Brain (thalamus) 


4.6 


Lung ca. (small cell) 
NCI-H69 


1.0 


Cerebral Cortex 


6.4 


Lung ca. (s.cell var.) 
SHP-77 


4.5 


Spinal cord 


7.6 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


3.0 


Lung ca. (non-sm. 
cell)A549 


2.1 


glio/astroU-I18-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


1.6 


astrocytoma SW1783 


4.1 


Lung ca. (non-s.ceIl) 
HOP-62 


4.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 


0.0 
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NCI-H522 




1 astrocytoma SF-539 


4.5 


Lung ca. (squam.) 
SW 900 


8.4 


astrocytoma SNB-75 


4.9 


Lung ca. (squam.) 
NCI-H596 


0.2 


glioma SNB-1 9 


7.2 


Mammary gland 


4.0 


glioma U251 


3.2 


Breast ca.* (pl.ef) 


0.0 


glioma SF-295 


4.0 


oreasi ca. ^pj.eij 
MDA-MB-231 


3.9 


Heart (fetal) 


100.0 


jtsreast ca. ipi.eij 
T47D 


1.0 


rlCaiX 


m 7 


OicaSl Ca. J3 1 - DH y 


1 8 
l .0 


Skeletal muscle (fetal) 


35.8 


Breast ca. MDA-N 


0.3 


Skeletal muscle 


0.3 


Ovary 


2.2 


Bone marrow 


20.6 


Ovarian ca. OVCAR- 
3 


1.8 


Thymus 


15.8 


Ovarian ca. OVCAR- 
4 


0.6 


Spleen 


13.8 


Ovarian ca. OVCAR- 
5 


0.8 


Lymph node 


5.5 


Ovarian ca. OVCAR- 

Q 
O 


0.0 


Colorectal 


31.2 


Ovarian ca. IGROV- 

i 
l 


0.0 


Stomach 


7.3 


Ovarian ca.* (ascites) 

SJV-W V-J 


0.0 


oman iniesune 


1\ 0 
jl.y 


uierus 


d St 
n.o 


Colon ca. SW480 


1.0 


Placenta 


9.9 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


9.8 


Colon ca. HT29 


1.6 


Prostate ca.* (bone 

met JrUO 


0.0 


Colon ca. HCT-116 


1.5 


Testis 


2.4 


Colon ca. CaCo-2 


4.4 


Melanoma 
Hs688(A).T 


1.0 


Colon ca. 
tissue(OD03866) 


8.7 


Melanoma* (met) 
Hs688(B).T 


2.6 


Colon ca. HCC-2998 


O O 

8.8 


TV /f _ 1 TTAPP /TO 

Melanoma UACC-62 


O.U 


Gastric ca.* (liver met) 
NCI-N87 


2.3 


Melanoma Ml 4 


0.0 


Bladder 


9.4 


Melanoma LOX 
IMVI 


0.0 


Trachea 


14.1 


Melanoma* (met) 
SK-MEL-5 


2.1 
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Kidney 



11.8 



Adipose 



6.9 



Table BVC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2906, Run 
162345752 


Tissue Name 


Rel. Exp.(%) 
Ag2906, Run 
162345752 


Normal Colon 


39.5 


Kidney Margin 
8120608 


3.0 


CC Well to Mod Diflf 
(OD03866) 


16.3 


Kidney Cancer 
8120613 


17.3 


CC Margin (OD03866) 


33.9 


Kidney Margin 
8120614 


6.5 


CC Gr.2 rectosigmoid 
(OD03868) 


14.4 


Kidney Cancer 
9010320 


3.4 


CC Margin (OD03868) 


12 


Kidney Margin 
9010321 


72 


CCModDiff(ODO3920) 


27.9 


Normal Uterus 


4.0 


CC Margin (ODO3920) 


30.8 


Uterus Cancer 064011 


4.3 


CC Gr.2 ascend colon 
(0DO3921) 


100.0 


Normal Thyroid 


2.6 


CC Margin (OD03921) 


33.9 


Thyroid Cancer 
064010 


3.3 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


16.2 


Thyroid Cancer 
Aim is") 


10.7 


Liver Margin (ODO4309) 


2.9 


Thyroid Margin 
A302153 


5.5 


Colon mets to lung 
(OD04451-01) 


15.6 


Normal Breast 


7.5 


Lung Margin (OD0445 1 - 
02) 


15.8 


Breast Cancer 
(OD04566) 


7.4 


Normal Prostate 6546-1 


24.5 


Breast Cancer 
(OD04590-01) 


11.6 


Prostate Cancer 
(OD04410) 


4.8 


Breast Cancer Mets 
(OD04590-03) 


30.8 


Prostate Margin 
(OD04410) 


5.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


7.0 


Prostate Cancer 
(OD04720-01) 


9.5 


Breast Cancer 064006 


5.7 


Prostate Margin 
* (OD04720-02) 


19.5 


Breast Cancer 1024 


7.5 


Normal Lung 061010 


16.5 


Breast Cancer 
9100266 


13.7 


Lung Met to Muscle 


3.9 


Breast Margin 


2.7 
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(OD04286) 








Muscle Margin 
(0004286^ 


6.2 


Breast uancer 
A209073 


6.1 


T liner Malicmfint danger 

(OD03126) 


64.2 


Breast Margin 
A2090734 


5.6 


T iin«r Margin fOD031261 

.L<Uiig IVJiaJ gill ^wL/uJ 1 Z»W J 


29.1 


Normal Liver 


1.9 


Lung Cancer (OD04404) 


16.7 


Liver Cancer 064003 


0.4 


Lung Margin (OD04404) 


15.1 


Liver cancer iuzd 


\jSj 


Lung Cancer (OD04565) 


27.7 


Liver Cancer 1026 


2.0 


Lung Margin (OD04565) 


19.8 


Liver Cancer 6004-T 


0.9 


Lung Cancer (OD04237- 
01) 


4.5 


Liver Tissue 6004-N 


2.8 


Lung Margin (OD04237- 
02) 


12.9 


Liver Cancer 6005-T 


0.4 


Ocular Mel Met to Liver 
(ODO4310) 


1.8 


Liver Tissue 6005-N 


1.0 


Liver Margin (ODO4310) 


2.6 


Normal Bladder 


8.5 


Melanoma Mets to Lung 
(OD04321) 


4.2 


Bladder Cancer 1023 


2.4 


Lung Margin (OD04321) 


35.6 


Bladder Cancer 
A302173 


0.8 


Normal Kidney 


5.4 


Bladder Cancer 
(OD04718-01) 


5.8 


Kidney Ca, Nuclear grade 
2 (00043381 


4.1 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


2.0 


Kidney Margin 
(OD04338) 


11.0 


Normal Ovary 


3.1 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


5.0 


Ovarian Cancer 
064008 


11.0 


Kidney Margin 
(OD04339) 


2.0 


Ovarian Cancer 
(OD04768-07) 


7.9 


Kidney Ca, Clear cell 
type (OD04340) 


3.8 


Ovary Margin 
(OD04768-08) 


1.3 


Kidney Margin 
(OD04340) 


5.6 


Normal Stomach 


2.9 


Kidney Ca, Nuclear grade 
3 (OD04348) 


2.7 


Gastric Cancer 
9060358 


1.0 


Kidney Margin 
(OD04348) 


5.0 


Stomach Margin 
9060359 


3.7 


Kidney Cancer 
(OD04622-01) 


5.9 


Gastric Cancer 
9060395 


12.0 


Kidney Margin 
(OD04622-03) 


2.1 


Stomach Margin 
9060394 


14.2 


Kidney Cancer 


0.8 


Gastric Cancer 


31.9 
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(OD04450-01) 




9060397 




Kidney Margin 
(OD04450-03) 


4.5 


Stomach Margin 
9060396 


5.6 


Kidney Cancer 8120607 


14.6 


Gastric Cancer 
064005 


8.7 



Table BVD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2906,Run 
159078634 


Tissue Name 


Rel. Exp.(%) 
Ag2906, Run 
159078634 


Secondary Thl act 


5.6 


HUVEC IL-lbeta 


5.7 


Secondary Th2 act 


5.3 


HUVEC IFN gamma 


9.2 


Secondary Trl act 


8.6 


HUVEC TNF alpha + IFN 
gamma 


9.5 


Secondary Thl rest 


11.0 


HUVEC TNF alpha + IL4 


4.6 


Secondary Th2 rest 


6.6 


HUVEC IL-11 


9.1 


oeconoary in rest 


in c 


Lung Microvascular EC 
none 


|A A 

1U.U 


Primary Thl act 


z.U 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


5.6 


Primary Th2 act 


1.2 


Microvascular Dermal EC 
none 


102 


Primary Trl act 


1.1 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


2.5 


Primary Thl rest 


10.7 


Bronchia] epithelium 
TNFalpha + IL1 beta 


20.4 


Primary Th2 rest 


4.4 


Small airway epithelium 
none 


8.0 


Primary Trl rest 


18.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


22.4 


CD45RACD4 
lymphocyte act 


8.0 


Coronery artery SMC rest 


4.0 


CD45RO CD4 
lymphocyte act 


5.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.1 


CD8 lymphocyte act 


3.7 


Astrocytes rest 


7.4 


Secondary CD8 
lymphocyte rest 


2.5 


Astrocytes TNFalpha + 
IL-lbeta 


3.1 


Secondary CD8 
lymphocyte act 


1.5 


KU-812 (Basophil) rest 


2.6 


CD4 lymphocyte none 


6.5 


KU-812 (Basophil) 
PMA/ionomycin 


2.7 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


11.6 


CCD1 106 (Keratinocytes) 
none 


6.4 


LAK cells rest 


14.6 


CCD1 106 (Keratinocytes) 


3.6 
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TNFalpha + IL-lbeta 




¥ A T T It ¥T *\ 

LAK cells IL-2 


8.6 


JUJ VW1 Vli 1 HV-JolO 


8.0 


T A fS 11 TT ^ ■ yT 1 ^ 

LAK cells IL-2+IL-12 


6.3 


T nniiQ HHnev 


2.5 | 


LAK cells IL-2+IFN 
gamma 


14.8 


NCI-H292 none 


3.7 


LAK cells IL-2+ IL-18 


2.5 


NCI-H292IL-4 


4.9 


LAK cells 
PMA/ionomycin 


7.6 


NCI-H292 IL-9 


0.7 


NK Cells IL-2 rest 


112 


NCI-H292IL-13 


1.4 


Two Way MLR 3 day 


6.4 


IN \_/i*xiz. y l* iir in gamma 


1.3 


Two Way MLR 5 day 


8.3 


jnuT-rvXiVw' nunc 


4.0 


Two Way MLR 7 day 


2.7 


HPAFP TMF alnha + IL-1 
beta 


8.0 


PBMC rest 


62 


T lino - ■filnYVhlaof Tinni* 


5 3 


PBMC PWM 


8.2 


Lung fibroblast TNF alpha 
+ TT -1 beta 


0.0 


PBMC PHA-L 


2.6 


Lung fibroblast IL-4 


1.6 


Ramos (B cell) none 


7.5 


Lung fibroblast IL-9 


2.0 


Ramos (B cell) 
ionomycin 


2.2 


Lung fibroblast IL-1 3 


3.6 


B lymphocytes PWM 


2.4 


Lung fibroblast IFN 
gamma 


2.7 


B lymphocytes CD40L 
and IL-4 


100.0 


Dermal fibroblast 
CCD 1070 rest 


2.6 


FOI -1 dhcAMP 


444 


Dermal fibroblast 
CCD1070 TNF alpha 


8.5 


EOL-1 dbcAMP 
PMA/ionomycin 


77.9 


Dermal fibroblast 
rrn 1070 n -i heta 


5.6 


Dendritic cells none 


21.2 


T^atttiqI ■filvmhlael" Tl^W 

j_/ciiiiai iiuruuiaai if in 

gamma 


5.7 


npnHritir rpll'c T P<\ 


0 


Dermal fibroblast IL-4 


9.0 


Dendritic cells anti- 
CD40 


9.9 


IBD Colitis 2 


0.0 


Monocytes rest 


28.1 


IBD Crohn's 


2.6 


Monocytes LPS 


10.3 


Colon 


97.9 


Macrophages rest 


33.0 


Lung 


41.5 


Macrophages LPS 


12.7 


Thymus 


5.5 


HUVEC none 


11.9 


Kidney 


6.9 


HUVEC starved 


20.4 







Panel L3D Summary: Ag2906 The NOV81a gene has a low level of expression in 
adipose and may be a small molecule target for the treatment of obesity and obesity-related 
diseases, including Type 2 diabetes. In addition, this gene product appears to be differentially 
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expressed in fetal (CT value = 3 1) vs adult heart (CT value = 34) and may be useful for the 
differentiation between the two tissue types. 

Overall, there appears to be higher expression of this gene in the normal tissues 
compared to the cell lines. Thus, this difference in expression might be of use as a diagnostic 
5 marker of cancer. 

Panel 2D Summary: Ag2906 The NOV81a gene is expressed at low levels in this 
panel. A higher level of expression is observed in gastric, bladder, thyroid, breast and ovarian 
cancer samples when compared to expression in the normal adjacent gastric, bladder, thyroid, 
breast and ovary tissues. Thus, this gene could potentially be used as a diagnostic marker of 
1 0 cancer in these tissues. Furthermore, inhibition of the activity of this gene product using small 
molecule drugs may be useful for the treatment of cancer in these tissues. 

Panel 4D Summary: Ag2906 Expression of the NOV81a is widespread in this panel, 
with highest expression in B lymphocytes treated with CD40L and IL-4 (CT=29.8). 
Significant expression is also seen in treated eosinophils, resting macrophages and monocytes, 
15 and normal colon and lung. Based on this pattern of expression, this gene product may be 
involved in both disease and homeostatic processes for these and other cell types and tissues. 
Therefore, modulation of this gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell and tissue types and improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as COPD, 
20 emphysema, asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, 
rheumatoid arthritis, and osteoarthritis. In addition, the higher levels of expression in colon 
(CT-30) when compared to colon from patients with inflammatory bowel diseases 
(IBD)(CTs=35-40) suggests that expression of this gene could be used to differentiate between 
normal and inflamed colon. Therapeutic modulation of the expression or function of this gene 
25 may be effective in the treatment of IBD. 

NOV82 



Expression of gene NOV82 was assessed using the primer-probe sets Ag3 198 and 
Ag3063, described in Tables BWA and BWB. 

Table BWA. Probe Name Ag3198 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cgtggtcaccagacagttaatt-3 1 


22 


179 


1321 


Probe 


TET-5 1 -cctaccagacaccattgtgtccaagg-3 1 - 
TAMRA 


26 


212 


1322 


Reverse 


5 » -gtctttcctttgtgcttgtgaa-3 1 


22 


246 


1323 



1374 



WO 02/068649 



PCT/US02/02785 



Table BWB. Probe Name Ag3063 



Primers j Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs ' -cgtggtcaccagacagttaatt-3 ' 


22 


179 


1324 


IteT-5 1 -cctaccagacaccattgtgtccaagg-3 1 - 

Probe |tamra 


26 


212 


1325 


Reversejs 1 -gtctttcctttgtgcttgtgaa-3 1 


22 


246. 


1326 



CNS_neurodegeneration_vl.O Summary: Ag3198 Expression of this gene is 
5 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). The amp 
plot indicates that there is a high probability of a potential probe or chemistry failure. 

Panel 1.3D Summary: Ag3063 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). The amp plot indicates that there 
is a high probability of a potential probe or chemistry failure. 
10 Panel 4D Summary: Ag3198 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). The amp plot indicates that there is a 
high probability of a potential probe or chemistry failure. 

NOV83 

Expression of gene NQV83 was assessed using the primer-probe sets Ag3046 
15 and Ag4125, described in Tables BXA and BXB. Results of the RTQ-PCR runs are 
shown in Tables BXC and BXD. 



Table BXA. Probe Name Ag3046 



Primers! Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forwardjs 1 -gaagcaaagaactctgcaagac-3 ■ 


22 


1215 


1327 


ob jTET-5 ■ -ttccagcatgataacttcacagagga-3 1 - 


26 


1246 


1328 


Reversejs • -gagcctgcaaatatcttttgct-3 ' 


22 


1272 


1329 



Table BXB. Probe Name Ag4125 



Primers 


Sequences 


Length ! 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gaagcaaagaactctgcaagac-3 ' 


22 


1215 


1330 


Probe 


TET-5 1 -ttccagcatgataacttcacagagga-3 ' - 
TAMRA 


26 


1246 


1331 


Reverse 


5 1 -gagcctgcaaatatcttttgct-3 ' 


22 


1272 


1332 



20 
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Table BXC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3046, Run 
162559104 


Tissue Name 


Rel. Exp.(%) 
Ag3046, Run 
162559104 


Normal Colon 


0.0 


Kidney Margin 
8120608 


0.0 


CCWelltoModDiff 
(OD03866) 


0 0 

U.V 


Kidney Cancer 
8120613 


ft ft 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(0DO3868) 


0.0 


Kidney Cancer 
9010320 


0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


0.0 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


0.0 • 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


o.o 


Colon mets to lung . 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


100.0 


Normal Prostate 6546-1 


0.1 


Breast Cancer 
(OD04590-01) • 


0.0 


Prostate Cancer 
(OD04410) 


A A 

0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin ] 
(OD04410) 


0.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


* 

0.0 


Prostate Cancer 
(OD04720-01) 


0.1 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


0.0 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 
9100265 


0.0 


Muscle Margin 
(ODQ4286) 


0.0 


Breast Cancer 
A209073 


0.0 
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Lriing Maiignam i^ancer 
(OD03126) 


0.0 


JJ I w/CXol iviai gin 

A2090734 


0.0 


Lung Margin ^uuuj izo; 


o n 

XJ.XJ 


Normal T ivpr 


0.0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


0.0 


Lung Margin (OD04404) 


0.0 


Liver Cancer 1025 


A A 
U.U 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


0.0 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 

Z \}JLJ\J l TJjo) 


0.0 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


0 0 


Normal Ovarv 

IN Ul 111 <*1 v-/VClljr 


0.0 


Kidney Ca Nuclear grade 
1/2(OD04339) 


0 0 


Ovarian Cancer 
064008 


0 0 


Kidney Margin 
(OD04339) 


0 0 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0 0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


00 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


00 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 
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Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table BXD. Panel 4.1D 



Tissue Name 


ReL Exp.(%) 
Ag41Z5, Kun 
172859315 


Tissue Name 


Rel. Exp.(%) 

Ag41Z5, Klin 
172859315 


■secondary ini act 


j.y) 


ITT lAAE/"" TT 1 koto 


U.U 


Secondary Th2 act 


4.2 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


2.7 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


3.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


11.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


7.4 


Primary Thl rest 


7.2 


Bronchial epithelium 
TNFalpha + ILlbeta 


7.5 


Primary Th2 rest 


3.1 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RACD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


occonaary l>L/o 
lymphocyte rest 


0.0 


Astrocytes iNraipna + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


3.3 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


5.3 


NCI-H292 none 


o.o i 
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LAK Cells JJLrZ+lrIN 

gamma 


1.8 


NCI-H292 IL-4 


5.8 


LyAJv cells lL»-2rr IL,- 1 o 








T At/ r*t±\\a 

LrAJtv. Cells 

PMA/ionomycin 


0.0 


NCI-H292 IL-13 


1.3 


JNlv cells lL-z rest 




XTPT 140 OO TT7M rromma 

lNx^i-rizyz iriN gamma 


U.o 


1 wo way jvllk 3 day 




nJr/vC/^ none 


u.u 


Two Way MLR 5 day 


3.2 


T-TPAFr* TXTP Qlnlio 4- TT -1 

xir AcL/ liNr aipna • 
beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


T una -ftKiriKlacf TKTE o lr» Via 

idling iiDroDiasi i iNr aipna 
+ IL-l beta 


2.3 


roNlK, rWM 


J.U 


T lin ft "ft V\it\ r% 1 0 o+ 1 1 /I 

jwung iioruDiasi jjl-h 


u.u 


PBMC PHA-L 


1.6 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD1070 rest 


1.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CL/JJiu/u lJNr aipna 


0.0 


EOL-1 dbcAMP 


2.1 


Dermal fibroblast 
lajjiu/u lL-i oeta 


0.0 


bUL-1 abCAMr 
PMA/ionomvcin 


3.3 


Dermal iiDrooiast lrN 
gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocles rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


13.5 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


5.4 


Macrophages LPS 


0.0 


Thymus 


18.4 


HUVEC none 


0.0 


Kidney 


10O.O 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3046 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Ag4125 Results from one experiment with this gene are not included. The amp plot 
5 indicates that there were experimental difficulties with this run (data not shown). 

General screening panel vl.4 Summary: Ag4125 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
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Panel 1.3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag3046 Significant expression of this gene is seen exclusively in 
a breast cancer sample (CT = 25.2). Therefore, expression of this gene may be used to 
5 distinguish breast cancers from the other samples on this panel. Furthermore, therapeutic 
modulation of the activity of the GPCR encoded by this gene may be beneficial in the 
treatment of breast cancer. 

Panel 3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
10 Panel 4.1D Summary: Ag4125 This gene is only expressed at detectable levels in the 

kidney (CT = 32.6). The putative GPCR encoded for by this gene could allow cells within the 
kidney to respond to specific microenvironmental signals (For example, ref. 1). Therefore, 
antibody or small molecule therapies designed with the protein encoded for by this gene could 
modulate kidney function and be important in the treatment of inflammatory or autoimmune 
15 diseases that affect the kidney, including lupus and glomerulonephritis. 

References: 

1. MarkNLD., Wittemann S., Herlitze S. (2000) G protein modulation of recombinant 
P/Q-type calcium channels by regulators of G protein signalling proteins. J. Physiol. 528 Pt 1 : 
65-77. 

20 1 . Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 

channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 
(GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating proteins 
(GAPs), which are responsible for >100-fold increases in the GTPase activity of G proteins 
and might be involved in the regulation of presynaptic Ca2+ channels. In this study we 

25 investigated the effects of RGS2 on G protein modulation of recombinant P/Q-type channels 
expressed in a human embryonic kidney (HEK293) cell line using whole-cell recordings. 2. 
RGS2 markedly accelerates transmitter-mediated inhibition and recovery from inhibition of 
Ba2+ currents (IBa) through P/Q-type channels heterologously expressed with the muscarinic 
acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 and RGS4 modulate the prepulse 

30 facilitation properties of P/Q-type Ca2+ channels. G protein reinhibition is accelerated, while 
release from inhibition is slowed. These kinetics depend on the availability of G protein alpha 
and betagamma subunits which is altered by RGS proteins. 4. RGS proteins unmask the Ca2+ 
channel beta subunit modulation of Ca2+ channel G protein inhibition. In the presence of 
RGS2, P/Q-type channels containing the beta2a and beta3 subunits reveal significantly altered 



1380 



WO 02/068649 



PCT/US02/02785 



kinetics of G protein modulation and increased facilitation compared to Ca2+ channels 
coexpressed with the beta lb or beta4 subunit. 
PMID: 11018106 

Panel 4D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 35) 
5 across all of the samples on this panel (data not shown). 

NOV84 

Expression of gene NOV84 was assessed using the primer-probe set Ag3051, 
described in Table BYA. 

Table BYA, Probe Name Ag3051 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gcagctcattcagacctatgag-3 1 


22 


847 


1333 


Probe 


TET-5 1 -ctctcctgccacccctatgacactg-3 1 - 
TAMRA 


25 


883 


1334 


Reverse 


5 ' -cgacaacaggtacatcatgaag-3 1 


22 


913 


1335 



10 

Panel 13D Summary: Ag3051 Results from one experiment with this gene are not 
included. The amp plot suggests that there were experimental difficulties with this run (data 
not shown). 

Panel 2D Summary: Ag3051 Expression of this gene is low/undetectable (CTs > 35) 
1 5 across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3051 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

NOV85 

Expression of gene NOV85 was assessed using the primer-probe set Ag3057, 
20 described in Table BZA. Results of the RTQ-PCR runs are shown in Tables BZB, BZC, BZD 
and BZE. 



Table BZA. Probe Name Ag3057 



Primers 


Sequences 


Length i 


Start | 
Position 


SEQ XD NO: 


Forward 


5 ' -aacattggaaggacaggagtct-3 \ 


22 


2314 


1336 


Probe 


TET-5 * -ccccaggagatgtatcagattcagct-3 1 - 
TAMRA 


26 


2336 


1337 


Reverse 


5 ' -cagatccccaagaaccctta-3 • 


20 


2382 


1338 
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Table BZB. CNS_neurodegeneration_vl.O 



1 15511 C name 


Rel. Exp.(%) Ag3057, 
Run 211012795 




Rel.Exp.(%)Ag3057, 
Run 211012795 


AD 1 Hippo 


12.9 


Control (Path) 3 
Temporal Ctx 


8.4 


AD 2 Hippo 


27.2 


Control (Path) 4 

1 CmpUiaJ V^IA 


25.0 


AD 3 Hippo 


11.2 


AD 1 Occipital Ctx 


29.5 


AD 4 Hippo 


13.0 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


23.8 


AD 3 Occipital Ctx 


15.4 


AD 6 Hippo 


77.4 


AD 4 Occipital Ctx 


28.9 


Control 2 Hippo 


34.9 


AD 5 Occipital Ctx 


15.9 


Control 4 Hippo 


15.5 


AD 6 Occipital Ctx 


14.7 


Control (Path) 3 
Hippo 


11.2 


Control 1 Occipital 

Ctx . 


8.0 


AD 1 Temporal Ctx 


45.7 


Control 2 Occipital 
Ctx 


28.3 


AD 2 Temporal Ctx 


31.4 


Control 3 Occipital 
Ctx 


14.8 


AD 3 Temporal Ctx 


19.3 


Control 4 Occipital 
Ctx 


23.8 


AD 4 Temporal Ctx 


28.1 


. Control (Path) 1 
Occipital Ctx 


73.7 


AD 5 Inf Temporal 
Ctx 


41.2 


Control (Path) 2 
Occipital Ctx 


16.8 


AD 5 SupTemporal 
Ctx 


23.3 


Control (Path) 3 
Occipital Ctx 


7.3 


AD 6 Inf Temporal 
Ctx 


100.0 


Control (Path) 4 
Occipital Ctx 


13.0 


AD 6 Sup Temporal 
Ctx 


75.3 


Control 1 Parietal 
Ctx 


14.8 


Control 1 Temporal 
Ctx 


9.7 


Control 2 Parietal 
Ctx 


29.3 


Control 2 Temporal 
Ctx 


21.8 


Control 3 Parietal 
Ctx 


12.8 


Control 3 Temporal 
Ctx 


16.0 


Control (Path) 1 
Parietal Ctx 


42.6 


Control 4 Temporal 
Ctx 


9.3 


Control (Path) 2 
Parietal Ctx 


30.4 


Control (Path) 1 
Temporal Ctx 


45.4 


Control (Path) 3 
Parietal Ctx 


9.7 


Control (Path) 2 
Temporal Ctx 


28.9 


Control (Path) 4 
Parietal Ctx 


30.8 
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Table BZC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3057, 
Kun 105519950 


Tissue Name 


Rel. Exp.(%) Ag3057, 

Pun 1^^1000^ 

Kun iDj^iyyyo 


Liver adenocarcinoma 




Kianey {ietai^ 


D.I 


Pancreas 




Kenai ca. /oo-u 


0.6 


Pancreatic ca. CAPAN 
2 


40.9 


Renal ca. A498 


15.1 


Adrenal gland 


i /i ^ 

14.S 


KCnai ca. KAr 5yD 


1 j.U 


1 nyroid 


o.z 


Kenai ca. acjiin 


9 1 


Salivary gland 


in o 

lO.o 


Kenai ca. uu-j i 


11 1 


Pituitary gland 


10.2 


Renal ca. TK-10 


3.9 


Brain (fetal) 


51.4 


Liver 


1.9 


Brain (whole) 


100.0 


Liver (fetal) 


4.8 


Brain (amygdala) 


48.0 


Liver ca. 
(hepatoblast) HepG2 


17.3 


Brain (cerebellum) 


49.3 


Lung 


10.4 


Brain (hippocampus) 


47.6 


Lung (fetal) 


7.2 


Brain (substantia nigra) 


70.2 


Lung ca. (small cell) 
LX-1 


7.6 


Brain (thalamus) 


51.8 


Lung ca. (small cell) 
NCI-H69 


1.0 


Cerebral Cortex 


11.3 


Lung ca. (s.cell var.) 
SHP-77 


7.3 


Spinal cord 


85.9 


Lung ca. (large 
ce!l)NCI-H460 


16.6 


glio/astro U87-MG 


16.5 


Lung ca. (non-sm. 
cell) A549 


4.5 


glio/astro U-118-MG 


27.9 


Lung ca. (non-s.cell) 
NCI-H23 


6.1 


q etrr»r»\/trvm ci QAX71 7d 




Lung ca. (non-s.cell) 
HOP-62 


O.J 


neuro , mci oiV"iN-/\o 


19 9 


Lung ca. (non-s.cl) 
NCI-H522 




astrocytoma or-jjy 




Lung ca. (squam.) 
SW 900 


0 -J 

O.J 


astrocytoma orso-/ j 


90 0 


Lung ca. (squam.) 
NCI-H596 




glioma SNB-19 


23.0 


Mammary gland 


17.3 


glioma U251 


25.7 


Breast ca.* (pl.ef) 
MCF-7 


7.3 


glioma SF-295 


9.8 


Breast ca.* (pl.ef) 
MDA-MB-231 


35.8 


Heart (fetal) 


0.2 


Breast ca.* (pl.ef) 
T47D 


5.5 


Heart J 11.1 


Breast ca. BT-549 


32.1 
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Skeletal muscle (fetal) 


3.2 


Breast ca. MDA-N 


3.3 


Skeletal muscle 


18.9 


Ovary 


1.9 


Bone marrow 


3.8 


Ovarian ca. OVCAR- 
3 


3.8 


Thymus 


1,5 


Ovarian ca. OVCAR- 
4 


6.9 


Spleen 


5.6 


Ovarian ca. OVCAR- 
5 


7.7 


Lymph node 


7.3 


Ovanan ca. OVCAR- 

o 
o 


5.7 


Colorectal 


92 


Ovarian ca. IGROV- 

i 
i 


2.5 


Stomach 


15.0 


Ovarian ca.* (ascites) 

o Jv- KJ V J 


20.9 


Qm nil intfctinp* 
OH 1 all ill LCD L1UC 




uterus 




Colon ca. SW480 


5.6 


Placenta 


6.0 


Colon ca.* 
SW620(SW480 met) 


4.9 


Prostate 


3.4 


Colon ca. HT29 


2.8 


Prostate ca.* (bone 
mei/r^-o 


8.4 


Colon ca.HCT-1 16 


3.7 


Testis 


9.8 


Colon ca. CaCo-2 


10.3 


Melanoma 
Hs688(A).T 


6.1 


Colon ca. 
tissue(OD03866) 


9.2 


Melanoma* (met) 
Hs688(B).T 


6.0 


Colon ca. HCC-2998 


5.3 


Melanoma UACC-62 


1.8 


vjasuic ca. ^Jiver mei; 
NCI-N87 


34.9 


Melanoma M14 


14.5 


Bladder 


14.0 


Melanoma LOX 
IMVI 


0.9 


Trachea 


6.5 


Melanoma* (met) 
SK-MEL-5 


3.8 


Kidney 


3.0 


Adipose 


12.8 



Table BZD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3057,Run 
163577596 


Tissue Name 


Rel. Exp.(%) 
Ag3057, Run 
163577596 


Normal Colon 


81.8 


Kidney Margin 
8120608 


1.1 


CC WelltoModDiff 
(0DO3866) 


10.7 


Kidney Cancer 
8120613 


8.1 


CC Margin (OD03866) 


17.6 


Kidney Margin 
8120614 


1.1 
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CC Gr.2 rectosigmoid 
[KJxJyjjoOo) 


11.2 


Kidney Cancer 


2.7 


CC Margin (OD03868) 


4.8 


Kidney Margin 


2.8 


CC Mod Diff(ODO3920): 


15.5 


Normal Uterus 


10.7 


CC Margin (ODO3920) 


20.2 


Uterus Cancer 06401 1 


31.2 


CC Gr.2 ascend colon 
(OD03921) 


36.9 


Normal Thyroid 


19.1 


CC Margin (OD03921) 


9.7 


Thyroid Cancer 
064010 


7.6 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


26.2 


Thyroid Cancer 
A302152 


5.6 


Liver Margin (ODO4309) 


13.1 


Thyroid Margin 
A302153 


13.7 


Colon mets to lung 
(OD04451-01) 


2.9 


Normal Breast 


41.5 


Lung Margin (OD04451- 
02) 


6.0 


Breast Cancer 
(OD04566) 


7.5 


Normal Prostate 6546-1 


60.3 


Breast Cancer 
(GD04590-01) 


40.6 


Prostate Cancer 
(OD04410) 


20.2 


Breast Cancer Mets 
(OD04590-03) 


32.3 


Prostate Margin 
(OD04410) 


24.5 


Breast Cancer 

Metastasis 
(OD04655-05) 


17.9 


Prostate Cancer 
(OD04720-01) 


23.8 


Breast Cancer 064006 


15.7 i 


Prostate Margin 
(OD04720-02) 


37.4 


Breast Cancer 1024 


10.2 


Normal Lung 061010 


42.6 


Breast Cancer 
9100266 


6.5 


Lung Met to Muscle 

[kJLJkJ'tZ.oO ) 


38.7 


Breast Margin 


6.4 


iviuscie iviargin 
(OD04286) 


9.2 


oreasi cancer 
A209073 


20.2 


Lung Malignant Cancer 
(OD03126) 


20.7 


Breast Margin 
A2090734 


15.2 


Lung Margin (OD03126) 


17.8 


Normal Liver 


7.4 


Lung Cancer (OD04404) 


36.9 


Liver Cancer 064003 


7.2 


Lung Margin (OD04404) 


11.0 


Liver Cancer 1025 


3.2 


Lung Cancer (OD04565) 


11.3 


Liver Cancer 1026 


3.5 


Lung Margin (OD04565) 


10.6 


Liver Cancer 6004-T 


4.8 


Lung Cancer (OD04237- 
01) 


30.4 


Liver Tissue 6004-N 


4.8 
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Lung Margin (OD04237- 
02) 


21.0 


Liver Cancer 6005-T 


2.5 


Ocular Mel Met to Liver 
(ODO4310) 


11.7 


Liver Tissue 6005-N 


1.1 


Liver Margin (ODO4310) 


13.5 


Normal Bladder 


32.1 


Melanoma Mets to Lung 
(OD04321) 


10.2 


Bladder Cancer 1023 


5.6 


Lung Margin (OD04321) 


36.3 


Bladder Cancer 
A302173 


49.3 


Normal Kidney 


100.0 


Bladder Cancer 
(OD04718-01) 


28.7 


Kidney Ca, Nuclear grade 
2(OD04338) 


40.6 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


24.8 


Kidney Margin 
(OD04338) 


16.8 


Normal Ovary 


6.5 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


11.0 


Ovarian Cancer 
064008 


21.5 


Kidney Margin 
(OD04339) 


21.9 


Ovarian Cancer 
(OD04768-07) 


35.8 


Kidney Ca, Clear cell 
type (OD04340) 


66.4 


Ovary Margin 
(OD04768-08) 


5.4 


Kidney Margin 
(OD04340) 


18.0 


Normal Stomach 


80.1 


Kidney Ca, Nuclear grade 
3 (OD04348) 


3.4 


Gastric Cancer 
9060358 


22 


Kidney Margin 
(OD04348) 


10.2 


Stomach Margin 
9060359 


12.3 


Kidney Cancer 
(OD04622-01) 


12.3 


Gastric Cancer 

f\c\ f r\ r 

9060395 


23.2 


Kidney Margin 


12 


Stomach Margin 
on fk(\ io a 


14.6 


Kidney Cancer 
(OD04450-01) 


26 A 


Gastric Cancer 
9060397 


11.8 


Kidney Margin 
(OD04450-03) 


21.5 


Stomach Margin 
9060396 


5.8 


Kidney Cancer 8120607 


2.2 


Gastric Cancer 
064005 


55.5 



Table BZE. Panel CNS_1 



Tissue Name 


Rel.Exp.(%)Ag3057, 
Run 171694175 


Tissue Name 


Rel. Exp.(%) Ag3057, 
Run 171694175 


BA4 Control 


7.9 


BA17PSP 


21.3 


BA4 Control2 


16.0 


BA17PSP2 


5.4 


BA4 


2.9 


Sub Nigra Control 


58.6 
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Alzheimer's2 








BA4 Parkinson's 


32.8 


Sub Nigra Control2 


25.9 


BA4 

Parkinson's2 


46.7 


Sub Nigra 
Alzheimer ? s2 


20.0 


BA4 
Huntington's 


25.9 


Sub Nigra 
Parkinson f s2 


85.3 


BA4 
Huntington ! s2 


4.1 


Sub Nigra 
Huntington's 


100.0 


BA4 PSP 


5.7 


Sub Nigra 
Huntington's2 


59.0 


BA4 PSP2 


36.9 


Sub Nigra PSP2 


252 


BA4 Depression 


15.6 


Sub Nigra 
Depression 


. 35.8 


BA4 
Depression2 


27.2 


Sub Nigra 
Depression2 


26.1 


B A7 Control 


12.9 


GlobPalladus . 
Control 


54.3 


B A7 Control2 


12.4 


Glob Palladus 
Control2 


15.2 


BA7 
Alzheimer r s2 


5.2 


Glob Palladus 
Alzheimer's 


17.4 


B A7 Parkinson's 


15.4 


Glob Palladus 
Alzheimer's2 


15.6 


BA7 
Parkinson ! s2 


23.2 


Glob Palladus 
Parkinson's 


73.7 


BA7 
Huntington's 


34.9 


Glob Palladus 
Parkinson's2 


15.8 


"DAT 

Huntington's2 


56.3 


Glob Palladus PSP 


17.7 


DAT TlCT) 

BA/ rbr 


29.1 


olob ralladus rJSrz 


6.4 


BA7PSP2 


12.6 


Glob Palladus 
Depression 


24.1 


B A7 Depression 


12.2 


Temp Pole Control 


0.8 


BA9 Control 


4.4 


Temp Pole Control2 


13.9 


BA9 Control2 


36.1 


Temp Pole 
Alzheimer's 


2.7 


BA9 Alzheimer's 


3.8 


Temp Pole 
Alzheimer's2 


2.0 


BA9 
Alzheimer^ 


2.1 


Temp Pole 
Parkinson's 


15.9 


BA9 Parkinson's 


17.2 


Temp Pole 
Parkinson's2 


12.9 


BA9 
Parkinson's2 


20.7 


Temp Pole 
Huntington's 


19.2 


BA9 


36.9 


Temp Pole PSP 


6.2 
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nLUilLlilgtUIl o 








BA9 

T-Ti i nti n fftnn 1 c7 

JTllUlllliglAJIi J>Z. 


15.2 


Temp Pole PSP2 


1.8 


BA9 PSP 


27.4 


Temp Pole 

lyepresMonz 


10.8 


RAQ 


14 fl 


v^ing vjyr L^oniroi 


DJ.O 


BA9 Depression 


7.5 


Cing Gyr Control2 


15.2 


BA9 
Depression2 


11.7 


Cing Gyr 
Alzheimer's 


15.7 


BA17 Control 


20.7 


Cing Gyr 
Alzheimer f s2 


10.7 


BA17 Control2 


15.3 


Cing Gyr Parkinson's 


47.0 


BA17 
Alzheimer's2 


42 


Cing Gyr 
Parkinson's2 


48.6 


BA17 
Parkinsons 


41.8 


Cing Gyr 
Huntington's 


85.3 


BA17 
Parkinson's2 


27.5 


Cing Gyr 
Huntington's2 


53.2 


BA17 
riunungion s 


17.3 


Cing Gyr PSP 


80.1 


BA17 
Huntington's2 


27.0 


Cing Gyr PSP2 


6.9 


BA17 
Depression 


81.8 


Cing Gyr Depression 


18.6 


. BA17 
Depression2 


51.4 


Cing Gyr 
Depression2 


69.7 



CNS_neurodegeneration_vl.O Summary: Ag3057 The NOV85 gene is found to be 
slightly but significantly (p=0.0 1 6) upregulated in the Alzheimer's disease (AD) temporal 
cortex. The temporal cortex is the region of the brain where neurons degenerate in the mid 
5 stages of AD. This increase in expression is not apparent in the occipital cortex, which does 
not experience neurodegeneration in AD. Since the upregulation of this gene appears to be 
neurodegeneration-specific both within an individual brain and between brains, this gene is an 
excellent small molecule target. Therefore, treatment with an antagonist may decrease the 
pathology seen in Alzheimer's disease. 
10 Panel 1 3D Summary: Ag3057 Highest expression of the NOV85 gene is seen in the 

CNS. Please see CNS_Neurodegeneration for discussion of utility of this gene in the central 
nervous system. 

Among tissues with metabolic function, this gene has low levels of expression in 
pancreas, adrenal, thyroid, pituitary, skeletal muscle and adipose. Therefore, modulation of 
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this gene product may be a treatment for metabolic and endocrine diseases, including obesity 
and Types 1 and 2 diabetes. 

In addition, this gene is expressed at low levels in the cancer cell lines in this panel. 
This difference in expression is particularly prominent in the CNS cancer cell lines when 
5 compared to the normal brain tissues. Thus, this gene could potentially be used as a diagnostic 
marker in CNS cancers. 

Panel 2D Summary: Ag3057 The NOV85 gene is expressed at moderate to low levels 
in this panel. A higher level of expression is observed in lung, kidney, uterine, gastric and 
ovarian cancer when compared to the normal adjacent lung, kidney, uterine, gastric and 
10 ovarian tissues in this panel. Thus, this gene could be used as a diagnostic marker of cancer in 
these tissues. Futhermore, inhibition of the activity of this gene product using small molecule 
drugs may be useful for the treatment of cancer in these tissues 

Panel 4D Summary: Ag3057 The amp plot indicates that there is a high probability of 
experimental failure. (Data not shown.) 
15 Panel CNSJL Summary: Ag3057 These results confirm expression of the NOV85 

gene in the brain. Please see CNSJSJeurodegeneration for discussion of utility of this gene in 
the central nervous system. 

NOV86: GTPASE-ACTIVATING PROTEIN 

Expression of gene NOV86 was assessed using the primer-probe set Ag3058, 
20 described in Table CAA. Results of the RTQ-PCR runs are shown in Tables CAB, CAC and 
CAD. 



Table CAA. Probe Name Ag3058 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -agtacccgctgctgaacac-3 1 


19 


534 


1339 


Probe 


TET-5 1 -accctcattgccaaggtcaaagcct-3 • - 
TAMRA 


25 


578 


1340 


Reverse 


5' -tcattgttgctctcataatgga-3 1 


22 


603 


1341 



Table CAB. Panel 13D 



Tissue Name 


ReL Exp.(%) Ag3058, 
Run 165533238 


Tissue Name 


Rel. Exp.(%) Ag3058, 
Run 165533238 


Liver adenocarcinoma 


5.5 


Kidney (fetal) 


0.9 


Pancreas 


1.2 


Renal ca. 786-0 


0.6 


Pancreatic ca. CAPAN 
2 


3.9 


Renal ca. A498 


0.1 
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Adrenal gland 


1.8 


Renal ca. RXF 393 


1.0 


Thyroid 


1.7 


Renal ca. ACHN 


0.4 


Salivary gland 


6.4 


Renal ca.UO-31 


15.7 


Pituitary gland 


1.1 


Renal ca.TK-10 


0.1 


Brain (fetal) 


0.6 


Liver 


3.5 


4_Ma.Ul ^WllUlvy 


2 7 






Brain (amygdala) 


3.0 


Liver ca. 


5.3 


Brain (cerebellum) 


0.4 


Lung 


16.5 


Brain (hippocampus) 


2.4 


Lung (fetal) 


8.4 


Brain (substantia nigra) 


4.7 


Lung ca. (small cell) 

T ~\T 1 

LX-l 


1.8 


Brain (thalamus) 


2.7 


Lung ca. (small cell) 
NCI-H69 


1.0 


Cerebral Cortex 


.1-2 


Lung ca. (s.cell var.) 

OTTTI *'f ^ 

SHP-77 


1.3 


Spinal cord 


4.2 


Lung ca. (large 

1 l\-v T^""IT TTi( /A 

celI)NCI-H460 


0.9 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.1 


glio/astroU-118-MG 


02 


Lung ca. (non-s.cell) 
NCI-H23 


0.2 


astrocytoma SW1783 


0.1 


Lung ca. (non-s.cell) 
HOP-62 


4.6 


neuro*;metSK-N-AS. 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


2.5 


Lung ca. (squam.) 
SW 900 


6.9 


astrocytoma SNB-75 


5.4 


Lung ca. (squam.) 
NCI-H596 


0.5 


glioma aNo-iy 


0.7 


Mammary gland 


1.7 


glioma U251 


1.1 


Breast ca.* (pl.ex) 
MCF-7 


0.0 


glioma SF-295 


0.3 


Breast ca * fnl eft 
MDA-MB-231 


19.2 


Heart (fetal) 


1.0 


Breast ca * fnl eft 

T47D 


2.4 


Heart 


1 4 


Breast ea BT-54Q 


4 7 


Skeletal muscle (fetal) 


0.5 


Breast ca. MDA-N 


0.9 


Skeletal muscle 


1.0 


Ovary 


1.8 


Bone marrow 


32.1 


Ovarian ca. OVCAR- 
3 


0.3 


Thymus 


29.3 


Ovarian ca. OVCAR- 
4 


0.3 


Spleen 


46.3 


Ovarian ca. OVCAR- 


9.0 ! 
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5 




Lymph node 


100.0 


Ovarian ca. OVCAR- 

o 
o 


0.7 


Colorectal 


1.4 # 


Ovarian ca. IGROV- 

1 

I 


0.7 


Stomach 


12.1 


Ovarian ca.* (ascites) 
blv-UVo 


0.2 


Small intestine 


lo.o 


I If Al*l 1 O 

uterus 


9 A 


Colon ca. SW480 


2.3 


Placenta 


13.8 


Colon ca.* 
SW620(SW480 met) 


0.9 


Prostate 


0.9 


Colon ca. HT29 


1.2 


Prostate ca.* (bone 
metjrCo 


3.1 


Colon ca.HCT-116 


2.9 


Testis 


0.2 


Colon ca. CaCo-2 


3.5 


Melanoma 
Hs688(A)T 


0.1 


Colon ca. 
tissue(OD03866) 


4.6 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


2.8 


Melanoma UACC-62 


0.1 


Gastric ca.* (liver met) 
NCI-N87 


4.6 


Melanoma M14 


7.2 


Bladder 


1.9 


Melanoma LOX 
IMVI 


1.6 


Trachea 


8.1 


Melanoma* (met) 
SK-MEL-5 


0.1 


Kidney 


0.3 


Adipose 


3.5 



Table CAC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3058, Run 
162569974 


Tissue Name 


Rel.Exp.(%) 
Ag3058, Run 
162569974 


Normal Colon 


30.8 


Kidney Margin 
8120608 


6.5 


CCWelltoModDiff 
(OD03866) 


6.8 


Kidney Cancer 
8120613 


2.9 


CC Margin (OD03866) 


6.7 


Kidney Margin 
8120614 


6.3 


CC Gr.2 rectosigmoid 
(OD03868) 


8.5 


Kidney Cancer 
9010320 


35.4 


CC Margin (OD03868) 


1.3 


Kidney Margin 
9010321 


17.0 


CC Mod Diff (ODO3920) 


9.9 


Normal Uterus 


2.4 


CC Margin (ODO3920) 


11.3 


Uterus Cancer 064011 


10.2 


CC Gr.2 ascend colon 


14.7 


Normal Thyroid 


6.8 
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(OD03921) 








CC Margin (OD03921) 


10.7 


Thyroid Cancer 
064010 


5.1 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


18.7 


Thyroid Cancer 
A302152 


6.8 


Liver Margin (ODO4309) 


12.4 


Thyroid Margin 
A302153 


7.9 


Colon mets to lung 
(UJJU4451-01) 


12.7 


Normal Breast 


17.7 


Lung Margin (OD04451- 
02) 


18.7 


Breast Cancer 
(OD04566) 


10.4 


Normal Prostate 6546-1 


13.8 


Breast Cancer 
(OD04590-01) 


18.2 


Prostate Cancer 
(OD04410) 


13.8 


Breast Cancer Mets 
(OD04590-03) 


59.9 


Prostate Margin 
(OD04410) 


11.7 


Breast Cancer 

Metastasis 
(OD04655-05) 


52.5 


Prostate Cancer 

(UJJU472U-0 1 ) 


7.3 


Breast Cancer 064006 


21.3 


Prostate Margin 
(UJJU4720-U2) 


19.2 


Breast Cancer 1024 


9.7 


Normal Lung 061010 
— ■ ■ 


100.0 


Breast Cancer 
9100266 


12.9 


Lung Met to Muscle 
COD04286) 


29.5 


Breast Margin 
9 100265 


14.1 


Muscle Margin 
(OD04286) 


8.2 


Ulwuol v>allL>vi 

A209073 


11.6 


Lung Malignant Cancer 
(OD03126) 


31.6 


Breast Margin 
A2090734 


6.6 


Lung Margin (OD03 1 26) 


48.6 


Normal Liver 


7.2 


Lung Cancer (OD04404) 


16.4 


Liver Cancer 064003 


5.4 


Lung Margin (OD04404) 


17.1 


Liver Cancer 1 025 


U.O 


Lung Cancer (OD04565) 


6.7 


Liver Cancer 1026 


13.2 


i^ung Margin (ltliihoodj 


OA 1 


Liver Cancer 6004-T 


7.2 


T liner Pan^pr f CWiClA'J'X'7 

01) 


24.0 


Liver Tissue 6004-N 


6.7 


Lune Margin COD04237- 
02) 


43.2 


Liver Cancer 6005-T 


14.7 


Ocular Mel Met to Liver 
(ODO4310) 


3.4 


Liver Tissue 6005-N 


4.5 


Liver Margin (OD043 1 0) 


8.2 


Normal Bladder 


20.2 


Melanoma Mets to Lung 
(OD04321) 


6.0 


Bladder Cancer 1023 


9.2 
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Lung Margin (OD04321) 


69.3 


Bladder Cancer 
A302173 


9.6 


Normal Kidney 


13.0 


Bladder Cancer 
(OD04718-01) 


38.2 


Kidney Ca, Nuclear grade 
2 (OD04338) 


22.5 


Bladder Normal 
Adjacent (OD0471 8- 
03) 


14.9 


Kidney Margin 
(OD04338) 


11.1 


Normal Ovary 


5.8 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


20.4 


Ovarian Cancer 
064008 


17.4 


Kidney Margin 
(OD04339) 


6.5 


Ovarian Cancer 
(OD04768-07) 


3.7 


Kidney Ca, Clear cell 
type (OD04340) 


28.5 


Ovary Margin 
(OD04768-08) 


5.2 


Kidney Margin 
(OD04340) 


13.7 


Normal Stomach 


14.8 


Kidney Ca, Nuclear grade 
3 (OD04348) 


9.7 


Gastric Cancer 
9060358 


8.0 


Kidney Margin 
(OD04348) 


11.3 


Stomach Margin 
9060359 


17.4 


Kidney Cancer 
(OD04622-0I) 


27.9 


Gastric Cancer 
9060395 


17.7 


Kidney Margin 

(KJLfU'iO^Z-Vj ) 


2.6 


Stomach Margin 


26.8 


Kidney Cancer 
(OD04450-01) 


2.3 


Gastric Cancer 
9060397 


20.2 


Kidney Margin 
(OD04450-03) 


5.3 


Stomach Margin 
9060396 


12.9 


Kidney Cancer 8120607 


7.4 


Gastric Cancer 
064005 


29.7 



Table CAD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3058,Run 
162562989 


Tissue Name 


Rel. Exp.(%) 
Ag3058, Run 
162562989 


Secondary Thl act 


5.9 


HUVECIL-lbeta 


21.9 


Secondary Th2 act 


6.2 


HUVEC IFN gamma 


16.8 


Secondary Trl act 


9.5 


HUVECTNF alpha + IFN 
gamma 


20.6 


Secondary Thl rest 


1.0 


HUVECTNF alpha + IL4 


28.1 


Secondary Th2 rest 


1.2 


HUVEC IL-11 


12.3 


Secondary Trl rest 


1.4 


Lung Microvascular EC 
none 


28.3 


Primary Thl act 


8.7 


Lung Microvascular EC 


46.7 
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TNFalpha + IL-lbeta 




Primary Th2 act 


3.5 


Microvascular Dermal EC 
none 


38.7 


Primary Trl act 


8.3 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


24.7 


Primary Thl rest 


7.6 


Bronchial epithelium 
TNFalpha + ILlbeta 


9.2 


Primary Th2 rest 


3.1 


Small airway epithelium 
none 


13.8 


Primary Trl rest 


4.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


13.7 


Coronerv arterv SMC rest 


22.2 


CD45ROCD4 
lymphocyte act 


7.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


11.3 


CD8 lymphocyte act 2.3 


Astrocytes rest 


14.7 


Secondary CD8 
lymphocyte rest 


3.4 


Astrocytes TNFalpha + 
IL-lbeta 


12.0 


Secondary CD8 
lymphocyte act 


4.4 


KU-8I2 <T3asonhil i rest 


12.2 


CU4 lympnocyte none 


1.5 


KU-812 (Basophil) 
PMA/ionomycin 


46.7 


2ryThl/Th2/Trl anti- 
CD95CH11 


1 3 


CCD1106 (Keratinocytes) 
none 


24.0 


LAK cells rest 


15.3 


CCD1 106 (Keratinocytes) 
1 NFalpna + IL-lbeta 


1.6 


LAK cells IL-2 


4.5 


Liver cirrhosis 


J. / 


LAK cells IL-2+IL-12 


4.5 


Lupus kidney 


1 O 


LAK cells IL-2+IFN 
gamma 


8.1 


NCI-H292 none 


46.3 


LAK cells IL-2+ IL-18 


7.9 




57.4 


LAK cells 
PMA/ionomycin 


102 


NCI-H292 IL-9 


63.3 


NK Cells IL-2 rest 


2.0 


XTf T UOOO TT 11 


28.1 


Two Way MLR 3 day 


2.6 


XTP'T TJOOO TT7XT /to»vi«,n 

iNUi-rizyz iriN gamma 


o 


Two Way MLR 5 day 


3.1 


nrAbC none 


\A £ 
l*Ko 


Two Way MLR 7 day 


2.3 


HPAEC TNF alpha + IL-1 

hfta 

Uvia 


19.9 


PBMC rest 


2.4 


Lung fibroblast none 


29.1 


PBMC PWM 


12.3 


Lung fibroblast TNF alpha 
+ IL-1 beta 


13.3 


PBMC PHA-L 


5.1 


Lung fibroblast IL-4 


39.8 


Ramos (B cell) none 


0.0 i 


Lung fibroblast IL-9 


24.5 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


19.1 
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B lymphocytes PWM 


32.3 


Lung fibroblast IFN 
gamma 


45.7 


B lymphocytes CD40L 
and IL-4 


4.1 


Dermal fibroblast 
CCD1070 rest 


57.4 


EOL-1 dbcAMP 


0.7 


Dermal fibroblast 
CCD1070TNF alpha 


81.8 


EOL-1 dbcAMP 
PMA/ionomycin 


6.7 


Dermal fibroblast 

/"""/Til (V7f\ TT 1 ka+o 

i~CDiu/u i.L-1 beta 


25.0 


Dendritic cells none 


12.8 


Dermal tioroDiast irN 
samma 


51.1 


Dendritic cells LPS 


11.8 


Dermal fibroblast IL-4 

v W11I1U1 Vl/lUJl ^.li J • 


52.9 


Dendritic cells anti- 
CD40 


13.8 


IBD Colitis 2 


0.7 


Monocytes rest 


6.2 


IBD Crohn's 


3.4 


Monocytes LPS 


6.5 


Colon 


22.5 


Macrophages rest 


18.3 


Lung 


192 


Macrophages LPS 


10.0 


Thymus 


24.7 


HUVEC none 


33.7 


Kidney 


13.0 


HUVEC starved 


49.0 







Panel 1.3D Summary: Ag3058 Highest expression of the NOV86 gene, a GTPase- 
activating protein homolog, is seen in the lymph node (CT=27.8). Among tissues with 
metabolic function, this gene has low levels of expression in pancreas, adrenal, thyroid, 
5 pituitary, heart, skeletal muscle, liver and adipose. Rab GTPases are integral to vesicular 
transport in the secretory and endocytic pathways. Therefore, therapeutic modulation of this 
gene product may be a treatment for metabolic and endocrine diseases, including obesity and 
Types 1 and 2 diabetes. 

This GTPase activating enzyme like molecule is also expressed at low levels in the 
10 CNS. Thus, it may be useful in treating diseases of the nervous system, stroke or CNS trauma. 

In addition, this gene is expressed at low levels in the cancer cell lines in this panel. 
Therefore, modulation of expression of this gene may be useful in treating cancer. 

Panel 2D Summary: Ag3058 The NOV86 gene is expressed at low levels in this 
panel. There is higher expression in kidney, breast, liver and bladder cancer samples compared 
15 to the adjacent normal tissue. Conversely, there is lower expression in lung cancer samples 
compared to the adjacent normal tissue. Thus, the expression of this gene could be used as a 
diagnostic marker for kidney, breast, liver, bladder and lung cancers. Furthermore, modulation 
of expression of this gene may also be used for therapy of these cancers. 

Panel 4D Summary: Ag3058 The NOV86 gene is expressed at high to moderate 
20 levels in a wide range of cell types of significance in the immune response and tissue response 
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in health and disease, with the highest expression being detected in TNF alpha plus IL-1 beta 
treated small airway epithelial cells (CT=28.03). Therefore, targeting of this gene product with 
a small molecule drug or antibody therapeutic may modulate the functions of cells of the 
immune system as well as resident tissue cells and lead to improvement of the symptoms of 
5 patients suffering from autoimmune and inflammatory diseases such as COPD, emphysema, 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, and arthritis, including 
osteoarthritis and rheumatoid arthritis. 

NOV87aandNOV87b 

Expression of gene NOV87a and full length clone NOV87b was assessed using the 
10 primer-probe set Ag3040, described in Table CBA. 

Table CBA. Probe Name Ag3040 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gagccctgaagctcttctttc-3 • 


21 


617 


1342 


Probe 


TET-5 1 -cttctcgcacttccgccagttcatt-3 1 - 
TAMRA 


25 


663 


1343 


Reverse 


5 1 -cctggtcctgctcactgat-3 ' 


19 


694 


1344 



CNS_neurodegeneration_yl.O Summary: Ag3040 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
15 Panel 1.3D Summary: Ag3040 Expression of this gene is low/undetectable (CTs > 

35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3040 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

NOV88 

20 Expression of gene NOV88 was assessed using the primer-probe set Ag2923, 

described in Table CCA. 

Table CCA. Probe Name Ag2923 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -agtcaacagatttggccacat-3 ■ 


21 


45 


1345 


Probe 


TET-5 1 -tcaccagggctgcttttaactctggt-3 ■ - 
TAMRA 


26 


77 


1346 


Reverse 


5 ' -cagtgaaggggtcactgatg-3 * 


20 


120 


1347 
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CNS_neurodegeneration_vl.O Summary: Ag2923 Expression of this gene is 
low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag2923 Expression of this gene is low/undetectable (CTs > 
34.5) across all of the samples on this panel (data not shown). 
5 Panel 2D Summary: Ag2923 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2923 Expression of this gene is low/undetectable (CTs > 
34.5) across all of the samples on this panel (data not shown). 

NOV89 

1 0 Expression of gene NOV89 was assessed using the primer-probe set Ag2924, 

described in Table CD A. 

Table CD A. Probe Name Ag2924 



Primers 


Sequences 


Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 1 -gaacgggaagcttgttatcaat-3 1 


22 


231 i 


1348 


Probe 


TET-5 ' -agatctcaccaaaatcaaatggggca-3 ' - 
TAMRA 


26 


282 


1349 


Reverse 


5 1 -atgatgtactcagtgccagcat-3 ' 


22 


308 


1350 



CNS_neurodegeneration_vLO Summary: Ag2924 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag2924 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2924 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Panel 5 Islet Summary: Ag2924 Run 242285280 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). Run 
243564308 The amp plot indicates that there were experimental difficulties with this run. 

NOV90 

Expression of gene NOV90 was assessed using the primer-probe set Ag3045, 
25 described in Table CEA. Results of the RTQ-PCR runs are shown in Tables CEB, CEC, CED, 
CEE, CEF and CEG. 



15 



20 
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Table CEA. Probe Name Ag3045 



Primers 


Sequences 


-u 1 Start 
Length 1 Position 


SEQ ID NO: 


Forward 


5 1 - caccgga t catatgaaatcaat - 3 • 


22 j605 


1351 


Probe 


TET-5 ' -tgtaattgaccctgttcctgcaccag-3 1 - 
TAMRA 


26 639 


1352 


Reverse 


5 1 -ccaccatcaacatttgaatca-3 ' 


21 j669 


1353 



Table CEB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3045, 
Run 211012233 


Tfc^iie Name 


Rel. Exp.(%) Ag3045, 
Run211012233 


AD 1 Hippo 


0.8 


Control (Path) 3 
Temporal Ctx 


0.2 


AD 2 Hippo 


0.1 


Control (Path) 4 
Temporal Ctx 


4.4 


AD 3 Hippo 


0.5 


AD 1 Occipital Ctx 


2.1 


AD 4 Hippo 


0.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


8.5 


AD 3 Occipital Ctx 


0.3 


AD 6 Hippo 


0.3 


AD 4 Occipital Ctx 


2.0 


Control 2 Hippo 


0.5 


AD 5 Occipital Ctx 


0.8 


Control 4 Hippo 


0.5 


AD 6 Occipital Ctx 


1.1 


Control (Path) 3 
Hippo 


0.4 


Control 1 Occipital 
Ctx 


0.0 


AD 1 Temporal Ctx 


1.4 


Control 2 Occipital 
Ctx 


2.2 


AD 2 Temporal Ctx 


0.0 


Control 3 Occipital 
Ctx 


2.1 


AH ^ T^mTv^ml Ptv 
r\±J D 1 Cllipuial LA 


u./ 


Control 4 Occipital 
Ctx 


U.2 


AD 4 Temporal Ctx 


2.7 


Control (Path) 1 
Occipital Ctx 


4.1 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


2.4 


AD 5 SupTemporal 
Ctx 


4.4 


Control (Path) 3 \ 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


0.9 


Control (Path) 4 
Occipital Ctx 


2.7 


AD 6 Sup Temporal 
Ctx 


0.9 


Control 1 Parietal 
Ctx 


0.0 


Control 1 Temporal 
Ctx 


0.0 


Control 2 Parietal 
Ctx 


4.1 


Control 2 Temporal 
Ctx 


0.7 


Control 3 Parietal 
Ctx 


1.5 


Control 3 Temporal 


1.5 


Control (Path) 1 


2.6 
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Ctx 




Parietal Ctx 




Control 4 Temporal 
Ctx 


0.6 


Control (Path) 2 
Parietal Ctx 


2.6 


Control (Path) 1 
Temporal Ctx 


3.1 


Control (Path) 3 
Parietal Ctx 


0.2 


Control (Path) 2 
Temporal Ctx 


3.6 


Control (Path) 4 
. Parietal Ctx 


3.4 



Table CEC. Panel 1.3D 



Tissue Name 


ReL Exp.(%) Ag3045, 
Run 165519994 


Tissue Name 


ReL Exp.(%) Ag3045, 
Run 165519994 


Liver adenocarcinoma 


10.3 


Kidney (fetal) 


1.9 1 


Pancreas 


3.7 


Renal ca. 786-0 


4.8 


Pancreatic ca. CAPAN 
2 


3.5 


Renal ca. A498 


16.2 


Adrenal gland 


7.5 


Renal ca. RXF 393 


6.6 


i nyroiu 


JL.JL 


Kenai ca. AL>riJN 


l.U 


Salivary gland 


8.6 


Renal ca. UO-31 


6.7 


Pituitary gland 


28.7 


Renal ca.TK-10 


1.1 


Brain (fetal) 


20.3 


Liver 


4.4 


Brain (whole) 


100.0 


Liver (fetal) 


1.9 


Rrain famvfrdala^ 


8 4 


Liver ca. 
(hepatoblast) HepG2 




Brain (cerebellum) 


50.7 


Lung 


4.1 


Brain (hippocampus) 


36.6 


Lung (fetal) 


2.3 


Brain (substantia nigra) 


lot 

12.1 


Lung ca. (small cell) 
LX-1 


0.8 


Brain (thalamus) 


29.7 


Lung ca. (small cell) 
NCI-H69 


2.5 


Cerebral Cortex 


4.2 


Lung ca. (s.cell var.) 
SHP-77 


0.6 


Spinal cord 


10.4 


Lung ca. (large 
cell)NCI-H460 


2.1 


glio/astro U87-MG 


3.2 


Lung ca. (non-sm, 
cell) A549 


0.0 


glio/astroU-118-MG 


6.6 


Lung ca. (non-s.cell) 
NCI-H23 


3.2 


astrocytoma SW1783 


2.4 


Lung ca. (non-s.cell) 
HOP-62 


4.5 


neuro^ met SK-N-AS 


3.5 


Lung ca. (non-s.cl) 
NCI-H522 


0.6 


astrocytoma SF-539 


35.4 


Lung ca. (squam.) 
SW 900 


16.4 


astrocytoma SNB-75 


14.0 


Lung ca. (squam.) 


276 
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| NCI-H596 




gjioma 5>iNr>-iy 


37.9 j Mammary gland 


2.9 


glioma U251 


AAA ! B reast ca -* (P'- er ) 
j MCF-7 


4.8 


glioma SF-295 


n . j Breast ca.* (pl.ef) 
j MDA-MB-231 


7.1 


Heart (fetal) 


0.4 


Breast ca.* (pl.ef) 
T47D 


8.5 


Heart 


9.5 


Breast ca. BT-549 


J.O 


Skeletal muscle (fetal) 


1.0 


Breast ca. MDA-N 


1.3 


oKeietai muscle 


5.4 


Ovary 


1.3 


Bone marrow 


9.9 


Ovarian ca. OVCAR- 
3 


2.8 


Thymus 


8.8 


Ovarian ca. OVCAR- 
4 


0.9 


Spleen 


9.4 


Ovarian ca. OVCAR- 
5 


3.6 


Lymph node 


21.8 


Ovarian ca. OVCAR- 
8 


202 


Colorectal 


3.3 


Ovarian ca. IGROV- 
1 


0.6 


Stomach 


6.3 


Ovarian ca.* (ascites) 

Of fWT 1 

aK-uv-j 


11.4 


Small intestine 


8.0 


Uterus 


i o.o 


Colon ca. SW480 


4.1 


Placenta 


7.0 


Colon ca.* 
SW620(SW480met) 


2.1 


Prostate 


15.7 


Colon ca. HT29 


0.5 


Prostate ca.* (bone 
met)PC-3 


2.8 


Colon ca.HCT-1 16 




Testis 


11.4 


Colon ca. CaCo-2 


1.9 


Melanoma 
Hs688(A).T 


0.4 


v^uion Ca. 

tissue(OD03866) 


2.4 


Melanoma* (met) 
Hs688(B).T | 


0.4 




2.4 


Melanoma UACC-62 


3.1 


Gastric ca.* (liver met) 
NCI-N87 


3.3 


Melanoma M 14 


7.9 


Bladder 


11.3 


Melanoma LOX 
IMVI 


0.0 


Trachea 


1.7 


Melanoma* (met) 
SK-MEL-5 


1.8 


Kidney 


102 


Adipose 


1.4 
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Table CED. Panel 2D 



Tissue Name 


Ag3045, Run 
163577595 


Tissue Name 


Rel Exn(%} 

l\Wli LF»y 

Ag3045,Run 
163577595 


Normal Colon 


30.4 


Kidney Margin 
8120608 


1.1 


CCWelltoModDiff 
(OD03866) 


2.9 


Kidney Cancer 
8120613 


2.5 


CC Margin (OD03866) 


1.1 


Kidney Margin 
8120614 


0.2 


CC Gr.2 rectosigmoid 
(OD03868) 


2.7 


Kidney Cancer 
9010320 


2.9 


CC Margin (OD03868) 


4.1 


Kidney Margin 
9010321 


4.0 


CC Mod Diff(ODO3920) 


1.9 


Normal Uterus 


2.1 


CC Margin (ODO3920) 


3.9 


Uterus Cancer 064011 


22.2 


CC Ctr 0 flQfpnH rr^lnn 

V-/ VJ1 .Z. doW/dlll CLMUU 

(OD03921) 


2.7 


Normal Thyroid 


7.1 


CC Margin (OD03921) 


2.6 


lnyroiu dancer 
064010 


2.0 


CC from Partial 

Hepatectomy (ODO4309) 
Mets 


3.6 


Thyroid Cancer 
A302152 


2.0 


Liver Margin (ODO4309) 


3.6 


Thyroid Margin 
A302153 


20.9 


Colon mets to lung 
(OD04451-01) 


1.8 


Normal Breast 


9.6 


T una Maratn fOnA4.4S 1 - 

J_*UUg IVlCli gill ^V-/JL/U*T*t J 1 

02) 


2.5 


T3 rp»Q erf Pan^pr 
JDica&l V_/dJlLCr 

(OD04566) 


5.3 


Normal Prostate 6546-1 


100.0 


(OD04590-01) 


55.5 


Prostate Chancer 
(OD04410) 


39.0 


J_Vl^Ut3l V_/Cll lvwi IVAvld 

(OD04590-03) 


92.7 


Prostate Margin 
(OD04410) 


17.3 


Metastasis 
(OD04655-05) 


31.9 


Prostate Cancer 
(OD04720-01) 


15.1 


Breast Cancer 064006 


21.8 


Prostate Margin 
(OD04720-02) 


37.1 


Breast Cancer 1024 


6.5 


Normal Lung 061010 


47.0 


Breast Cancer 
9100266 


8.4 


Lung Met to Muscle 
(OD04286) 


4.4 


Breast Margin 
9100265 


4.0 


Muscle Margin 
(OD04286) 


10.3 


Breast Cancer 
A209073 


12.2 
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Lung Malignant Cancer 
(OD03126) 


O 1 

8.1 


Breast Margin 
A2090734 


2J2 


Lung Margin (OD03126) 


8.8 


Normal Liver 


1.0 


Lung Cancer (OD04404) 


47.0 


Liver Cancer 064003 


0.8 


Lime Marffin fOD044041 


1 1 0 


T ivpr Panrer 1 09S 


0 7 


Lung Cancer (OD04565) 


6.9 


Liver Cancer 1026 


0.6 


Lung Margin (UD04565) 


1 T 

1.7 


Liver Cancer 6004-T 


3.1 


Lung Cancer (UDU42 J7- 
01) 


12.2 


Liver Tissue 6004-N 


1.7 


Lung Margin (OD04237- 
02) 


5.4 


Liver Cancer 6005-T 


0.8 


Ocular Mel Met to Liver 
(ODO4310) 


5.1 


Liver Tissue 6005-N 


0.8 


Liver Margin (ULJU431U) 


1 A 

1.0 


Normal Bladder 


12.5 


Melanoma Mets to Lung 
(OD04321) 


3.3 


Bladder Cancer 1023 


1.4 


Lung Margin (OD04321) 


5.7 


Bladder Cancer 
A302173 


6.7 


Normal Kidney 


7.2 


Bladder Cancer 
(OD04718-01) 


16.4 


Kidney Ca, Nuclear grade 
2(OD04338) 


8.4 


Bladder Normal 
Adjacent (OD04718- 
03) 


23.3 


Kidney Margin 
{VDu433&) 


2.9 


Normal Ovary 


2.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.8 


Ovarian Cancer 
064008 


"5.6 


Kidney Margin 


1.8 


Ovarian Cancer 
(OD04768-07) 


2.4 


Kidney Ca, Clear cell 
type (UUU4i40) 


2.1 


Ovary Margin 
(OD04768-08) 


2.4 


Kidney Margin 

{CiTifiAIArW 
\\JLJ\J £ t j*f v j 


4.2 


Normal Stomach 


1.0 


Kidney Ca 3 Nuclear grade 


1.6 


Gastric Cancer 

OA£A1 CO 


0.5 


Kidney Margin 
fr\r\(\AiAfi\ 

\VJLJ\J £ t3 £ to) 


10.4 


Stomach Margin 

Ofl£AQ CO 


2.4 


Kidney Cancer 


1.3 


Gastric Cancer 


4.5 


Kidney Margin 
(OD04622-03) 


1.1 


Stomach Margin 
9060394 


1.9 


Kidney Cancer 
(OD04450-01) 


9.0 


Gastric Cancer 
9060397 j 


7.7 


Kidney. Margin 
(OD04450-03) 


14.0 


Stomach Margin 
9060396 


0.3 
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Kidney Cancer 8120607 


0.9 


Gastric Cancer 
064005 


8.3 



Table CEE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag3045, Run 
164886427 


Tissue Name 


Rel. Exp.(%) 
Ag3045,Run 
164886427 


Daoy- Medulloblastoma 


3.4 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


6.6 


TE671- Medulloblastoma 


4.4 


ES-2- Ovarian clear cell 
carcinoma 


2.1 


D283 Med- 
Medulloblastoma 


4.2 


Ramos- Stimulated with 
PMA/ionomycin 6h 


6.0 


PFSK-1- Primitive 
Neuroectodermal 


1.7 


Ramos- Stimulated with 
PMA/ionomycin 14h 


1.7 


VE /too /~nvTO 


7.1 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 




SNB-78- Glioma 


2.2 


Raji- Burkitt's lymphoma 


2.0 


SF-268- Glioblastoma 


6.8 


Daudi- Burkitt's lymphoma 


3.4 


T98G- Glioblastoma 


1.3 


U266- B-cell plasmacytoma 


4.9 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


3.1 


CA46- Burkitt's lymphoma 


0.5 


SF-295- Glioblastoma 


2.0 


RL- non-Hodgkin's B-cell 
lymphoma 


2.1 


Cerebellum 


100.0 


JM1- pre-B-cell lymphoma 


5.8 


Cerebellum 


1.4 


Jurkat- T cell leukemia 


3.5 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


23.0 


TF-1- Erythroleukemia 


8J 


DMS-114- Small cell 
lung cancer 


8.4 


HUT 78- T-cell lymphoma 


3.4 


DMS-79- Small cell lung 
cancer 


26.6 


U937- Histiocytic lymphoma 


12 


NCI-H146- Small cell 
lung cancer 


5.0 


KU-812- Myelogenous 
leukemia 


2.6 


NCI-H526- Small cell 
lung cancer 


30.6 


769-P- Clear cell renal 
carcinoma 


1.4 


NCI-N417- Small cell 
lung cancer 


1.3 


Caki-2- Clear cell renal 
carcinoma 


3.3 


NCI-H82- Small cell 
lung cancer 


4.2 


SW 839- Clear cell renal 
carcinoma 


1.6 


NCI-H157- Squamous 
cell lung cancer 


2.6 


G401- Wilms' tumor 


2.0 
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(metastasis) 








NCI-HI 155- Large cell 
lung cancer 


50.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


5.3 


NCI-H1299- Large cell 
lung cancer 


10.5 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


2.6 


NCI-H727-Lung 
carcinoid 


2.6 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


6.5 


NCI-UMC-ll-Lung 
carcinoid 


8.5 


BxPC-3- Pancreatic 
adenocarcinoma 


1.5 


LX-1- Small cell lung 
cancer 


2.0 


HPAC- Pancreatic 

• • 
adenocarcinoma 


2.8 


Colo-205- Colon cancer 


3.0 


MIA PaCa-2- Pancreatic 
carcinoma 


1.9 


KM12- Colon cancer 


2.6 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


6.9 


KM20L2- Colon cancer 


0.8 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


7.5 


NCI-H716- Colon cancer 


3.7 


T24- Bladder carcinma 
(transitional cell) 


2.3 


SW-48- Colon 
adenocarcinoma 


2.6 


5637- Bladder carcinoma 


4.0 


SW1 116- Colon 

l * 

adenocarcinoma 


1.0 


HT-1 197- Bladder carcinoma 


3.8 


LS174T- Colon 
adenocarcinoma 


3.2 


UM-UC-3- Bladder carcinma 
(transitional cell) 


1.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


1.2 j 


SW-480- Colon 
adenocarcinoma 


1.1 


HT-1080- Fibrosarcoma 


6.8 


NCI-SNU-5- Gastric 
carcinoma 


3.2 


MG-63- Osteosarcoma 


1-3 


KATO HI- Gastric 
carcinoma 


3.9 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


6.3 


NCI-SNU-l 6- Gastric 
carcinoma 


1.6 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


2.6 ' 


NCI-SNU-l- Gastric 
carcinoma 


3.2 


A431- Epidermoid carcinoma 


5.1 


RF-1- Gastric 

axX^UKJK^CU \ziUSJlxlCi 


4.2 


WM266-4- Melanoma 


4.5 


RF-48- Gastric 
adenocarcinoma 


4.5 


DU 145- Prostate carcinoma 
(brain metastasis) 


1.0 


MKN-45- Gastric 
carcinoma 


4.7 


MDA-MB-468- Breast 
adenocarcinoma 


2.3 


NCI-N87- Gastric 
carcinoma 


0.9 


SCC-4- Squamous cell 
carcinoma of tongue 


1.4 



1404 



WO 02/068649 



PCT/US02/02785 



OVCAR-5- Ovarian 
carcinoma 


1.4 


SCC -9- Squamous cell 
carcinoma of tongue 


0.3 


RL95-2- Uterine 
carcinoma 


0.3 


SCC-15- Squamous cell 
carcinoma of tongue 


0.7 


HelaS3- Cervical 
adenocarcinoma 


2.3 


CAL 27- Squamous cell 
carcinoma of tongue 


8.0 


Table CEF. Panel 4D 


Tissue Name 


Rel.Exp.(%) 
Ag3045, Run 
162559632 


Tissue Name 


Rel. Exp.(%) 
Ag3045, Run 
162559632 


Secondary Thl act 


10.0 


HUVEC IL-lbeta 


2.2 


Secondary Th2 act 


12.0 


HUVEC IFN gamma 


4.2 


Secondary Trl act 


21.9 


HUVEC TNF alpha + IFN 
gamma 


3.2 


Secondary Thl rest 


3.3 


HUVEC TNF alpha + IL4 


4.4 


Secondary Th2 rest 


5.6 


HUVEC IL-11 


2.9 


Secondary Trl rest 


6.7 


Lung Microvascular EC 
none 


6.9 


Primarv Th 1 art 

x * until y i ij i aui 


13.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


A O 


Primarv Th2 art 

x i until y i lu> uvl 


162 


Microvascular Dermal EC 
none 


o.z 


Primarv Trl act 

JL 11111IU j til HVt 


17.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 




Primarv Th 1 rest 


47.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


1 1 
1 . i 


Primary Th2 rest 




21.9 . 


Small airway epithelium 
none 


• 2.3 


Primary Trl rest 




33.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


34.2 


CD45RA CD4 
lymphocyte act 


5.1 


Coronery artery SMC rest 


2.4 


CD45RO CD4 
lymphocyte act 


11.6 


Coronery artery SMC 
TNFalpha + IL-lbeta 


2.1 


CD8 lymphocyte act 


8.6 


Astrocytes rest 


5.4 


Secondary CD8 
lymphocyte rest 


8.1 


Astrocytes TNFalpha + 
IL-lbeta 


3.6 


Secondary CD8 
lymphocyte act 


6.0 


KU-812 (Basophil) rest 


4.3 


CD4 lymphocyte none 


2.3 


KU-812 (Basophil) 
PMA/ionomycin 


14.1 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


12.0 


CCD1 1 06 (Keratinocytes) 
none 


6.0 
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LAK cells rest 


10.7 


CCD1 106 nCeratinocvte<i\ 
TNFalpha + IL-1 beta 


0.5 


LAK cells IL-2 


8 6 


T Jver cirrhosis 


3 3 


LAK cells IL-2+IL-12 

X-lJ\.X\^ V> V> llhJ XX-J 4m ' IJ^ 1X< 


8 5 


T iinn^ lcidnf*v 

uUL/U^ tVlvil lv Y 


1 7 
i • / 


LAK cells IL-2+IFN 
gamma 


14.7 


NCI-H292 none 


100.0 


LAK cells EL-2+IL-18 


18.0 


NCI-H292 IL-4 


84.1 


LAK cells 
PMA/ionomycin 


3.7 


NCI-H292 IL-9 


96.6 


NK Cells IL-2 rest 


6.8 


NCI-H292 IL-13 


24.1 


Two Wav MLR 3 dav 


6 6 


NCI-H292 I FN gamma 

liA/^ 11 IN ^dl 111 lid 


22 8 


Two Wav MLR 5 dav 


8.8 


HPAEC none 


4 9 i 


Two Way MLR 7 day 


5 - 7 


HPAEC TNF aloha + IL-1 
beta 


3.9 


PBMC rest 


4 5 


Liiim* fihrnhla^t nnnp 


3 8 


PBMC PWM 


27.5 


Lung fibroblast TNF alpha 
+ TT - 1 beta 

* JUL* i uwia 


2.6 


PBMC PHA-L 


7.0 


Lung fibroblast IL-4 


9.5 


Ramos (B cell) none 


o.o 


T XT1 _ _a TT f\ 

Lung fibroblast IL-9 


5.5 


Ramos (B cell) 
ionomycin 


34.4 


Lung fibroblast IL-13 


6.0 


B lymphocytes PWM 


29.1 


Lung fibroblast IFN 
gamma 


5.2 


B lymphocytes CD40L 
and 1L-4 


10.7 


Dermal fibroblast 
CCD 1070 rest 


11.4 


EOL-1 dbcAMP 


6.3 


Dermal fibroblast 
CCD1070 TNF alpha 


32.5 


JiULr-I aDCAJVLr 
PMA/ionomycin 


12.2 


Dermal fibroblast 
CCD 1070 IL-1 beta 


5.7 


Dendritic cells none 


9.7 


Dermal fibroblast TFN 

11 JCll 1.1 1/1 UUItUl 11 1 > 

gamma 


3.6 


Dendritic cells LPS 


6.1 


Dermal fibroblast IL-4 


5.9 


Dendritic cells anti- 
CD40 


11.7 


IBD Colitis 2 


0.3 


Monocytes rest 


10.2 


IBD Crohn's 


0.4 


Monocytes LPS 


9.5 


Colon 


8.7 


Macrophages rest 


22.5 


Lung 


8.0 


Macrophages LPS 


2.4 


Thymus 


13.5 


HUVEC none 


5.7 


Kidney 


43.5 


HUVEC starved 


9.1 







Table CEG. Panel CNS 1 



Tissue Name \ Rel. Exp.(%) Ag3045, [ Tissue Name j Rel. Exp.(%) Ag3045, 
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Run 171694539 j 


Run 171694539 




0.9 


BA17PSP 


0 0 


BA4 Control2 


0.0 


BA17PSP2 


6.5 


BA4 
Alzheimer's2 


93 


Sub Nigra Control 


2.6 


BA4 Parkinson's 


55.1 


Sub Nigra Control2 


0.0 


BA4 
Parkinson's! 


19.1 


Sub Nigra 
Alzheimer's2 


11.2 


BA4 
Huntington's 


18.7 


Sub Nigra 
Parkinson's2 


9.0 


BA4 
Huntington's2 


6.5 


Sub Nigra 
Huntington's 


22.8 


BA4PSP 


0.0 


Sub Nigra 
Huntington f s2 


21.9 


BA4 PSP2 


6.1 


Sub Nigra PSP2 


l 2.3 


BA4 Depression 


15.1 


Sub Nigra 
Depression 


0.0 


BA4 
Depression2 


6.6 


Sub Nigra 
Depression2 


3.0 


BA7 Control 


8.7 


Glob Palladus 
Control 


2.1 


BA7 Controli 


0.0 


Glob Palladus 
Control2 


0.0 


BA7 
Alzheimer f s2 


14.7 


Glob Palladus 
Alzheimer's 


0.9 


BA7 Parkinson's 


26.8 


Glob Palladus 
Alzheimer l s2 


2.8 


BA7 
Parkinson's2 


16.3 


Glob Palladus 
Parkinson's 


61.6 


BA7 
nuntingion s 


61.1 


Glob Palladus 
Parkinson's2 


2.9 


Dr\ / 

Huntington's2 


23.5 


Glob Palladus PSP 


0.0 




0.0 


Glob Palladus PSP2 


0 8 


BA7PSP2 


11.6 


Glob Palladus 
Depression 


0.0 


BA7 Depression 


8.1 


Temp Pole Control 


0.0 


BA9 Control 


0.9 


Temp Pole Control2 


1.3 


BA9 Control2 


0.0 


Temp Pole 
Alzheimer's 


0.0 


BA9 Alzheimer's 


0.0 


Temp Pole 
Alzheimer's2 


5.6 


BA9 
Alzheimer's2 


20.6 


Temp Pole 
Parkinson's 


27.9 


BA9 Parkinson's 


31.4 


Temp Pole 


2.0 
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Parkinson's2 




BA9 
Parkinson ! s2 


21.9 


Temp Pole 
Huntington's 


29.1 


T) A A 

Huntinfrton's 


23.0 


Temp Pole PSP 


0.0 


BA9 
Hunti n ffton 's2 


15.2 


Temp Pole PSP2 


0.9 


BA9 PSP 


0.0 


Temp Pole 
jL'cprcbbiuiiz 


5.5 


BA9 PSP2 


3.6 


Cing Gyr Control 


2 0 


BA9 Depression 


2.0 


Cing Gyr Control2 


0.9 


n AO 

Depression2 


2.9 


Cing Gyr 

Alt * 1 

Alzheimers 


0.0 


BA1 7 Control 


. 15.1 


Cing Gyr 
Alzheimer's2 


14.5 


rJAl / L/OntTOli 


3.3 


Cing Gyr Parkinson's 


26.4 


BA17 
Alzheimer's2 


15.5 


Cing Gyr 
Parkinson's2 


12.0 


BA17 
Parkinson ! s 


100.0 


Cing Gyr 
Huntington's 


46.0 


BA17 
Parkinson , s2 


54.0 


Cing Gyr 
Huntington f s2 


19.8 


BA17 

nUIHIIlglUIl a 


32.3 


Cing Gyr PSP 


0.5 


BA17 
Huntington's2 


5.9 


Cing Gyr PSP2 


1.9 


BA17 
Depression 


4.0 


Cing Gyr Depression 


3.9 - 


BA17 
Depression2 


5.6 


Cing Gyr 
Depression2 


8.4 



CNSj»eurodegeneration_vl.O Summary: Ag3045 The NOV90 gene is not 
differentially expressed in the postmortem brains of Alzheimer's diseased patients when 
compared to non-demented control. However, this panel does confirm the expression of this 
5 gene in the CNS of an independent sample of individuals. See panel 1 for a discussion of 
utility of this gene in the central nervous system. 

Panel 1 3D Summary: Ag3045 Expression of the NOV90 gene shows a brain- 
preferential expression profile, and is expressed at moderate levels in all regions examined. 
Thus, this gene may be of utility as a small molecule target in neurologic disease. 
1 0 In addition, significant expression is seen in a cluster of brain cancer cell lines. Thus, 

expression of this gene could be used to differentiate between this sample and other samples 
on this panel and as a marker to detect the presence of brain cancer. Furthermore, therapeutic 
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modulation of the expression or fiinction of this gene may be effective in the treatment of 
brain cancer. 

Panel 2D Summary: Ag3045 The NOV90 gene is expressed at low levels in this 
panel. There is higher expression in gastric, breast, uterus and lung cancers then the normal 
5 samples from these organs. Expression of this gene could therefore be used as a diagnostic 
marker for gastric, lung, breast and uterine cancers. Furthermore, modulation of the gene 
product using small molecule inhibitors could be used for the treatment of these cancers. 

Panel 3D Summary: Ag3045 The NOV90 gene is expressed at a low level in most of 
the cancer cell lines on this panel. Modulation of the gene product using small molecule 
1 0 inhibitors culd therefore be used for the treatment of cancer. Highest expression of this gene is 
seen in the cerebellum, confirming the results seen in Panel 1.3D. 

Panel 4D Summary: Ag3045 The NOV90 gene, a serine/threonine-protein kinase 
homolog is expressed at moderate levels in pulmonary mucoepidermoid cells prepared under 
several conditions of cell activation: NCI-H292 none (CT=29.01), NCI-H292 IL-4 
15 (CT=29.26), NCI-H292 IL-9 (CT=29.06), NCI-H292 IL-13 (CT=31.06), NCI-H292 EFN 
gamma (CT—3L14). Therefore, small molecule antagonists that block the function of the 
NOV90 gene product may be useful in several autoimmune and inflammatory diseases of the 
lung including, but not limited to, chronic obstructive pulmonary disease, asthma, and 
emphysema. 

20 Panel CNSJL Summary: Ag3045 The expression in this panel confirms the presence 

of the NOV90 in the brain. Thus, this gene may be of utility as a small molecule target in 
neurologic disease. 

NOV91 

Expression of gene NOV9 1 was assessed using the primer-probe set Ag30 1 8, 
25 described in Table CFA. Results of the RTQ-PCR runs are shown in Tables CFB, CFC and 
CFD. 

Table CFA. Probe Name Ag3018 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctgcagggttggagaaatg-3 ' 


19 


71 


1354 


Probe 


TET-5 1 -ccatcctgggcaaacccaaggat-3 » - 
TAMRA ' 


23 


107 


1355 


Reverse 


5 1 -ctacacccatcatgttcacatg-3 ■ 


22 


130 


1356 
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Table CFB. CNS_neurodegeneration_vl.O 



Tissue Name 

X IojUV 1 ill 111 V 


Rel. Exp.(%) Ag3018, 
Run 209820676 


Ticciif* IVarrip 


Rel. Exp.(%) Ag3018, 
Run 209820676 


AD 1 Hippo 


0.0 


Control (Path) 3 
Temporal Ctx 


4.5 


AD 2 Hippo 


12.5 


Control (Path) 4 


22.7 


AD 3 Hippo 


2.4 


AD 1 Occipital Ctx 


11.3 


AD 4 Hippo 


7.7 


A t\ » "« t />ii 

AD 2 Occipital Ctx 
(Missing) 


42.6 


AD 5 hippo . 


72.7 


AD 3 Occipital Ctx 


3.0 


AD 6 Hippo 


47.3 


AD 4 Occipital Ctx 


14.2 


Control 2 Hippo 


12.9 


AD 5 Occipital Ctx 


52.1 


Control 4 Hippo 


3.6 


AD 6 Occipital Ctx 


48.0 


Control (Path) 3 
Hippo 


3.8 


Control 1 Occipital 
Ctx 


1.6 


AD 1 Temporal Ctx 


13.6 


Control 2 Occipital 
Ctx 


32.1 


AD 2 Temporal Ctx 


29.9 


Control 3 Occipital 
Ctx 


6.7 


AD 3 Temporal Ctx 


4.0 


Control 4 Occipital 
Ctx 


0.0 


AD 4 Temporal Ctx 


26.8 


Control (Path) 1 
Occipital Ctx 


73.7 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


12.6 


AD 5 SupTemporal 
Ctx 


54.7 


Control (Path) 3 
Occipital Ctx 


1.7 


AD 6 Inf Temporal 
Ctx 


48.6 


Control (Path) 4 
Occipital Ctx 


23.3 


AD 6 Sup Temporal 
Ctx 


49.0 


Control 1 Parietal 
Ctx 


6.5 


Control 1 Temporal 
Ctx 


1.7 


Control 2 Parietal 
Ctx 


57.8 


Control 2 Temporal 
Ctx 


9.9 


Control 3 Parietal 
Ctx 


7.3 


Control 3 Temporal 
Ctx 


14.4 


Control (Path) 1 
Parietal Ctx 


57.8 


Control 4 Temporal 
Ctx 


2.3 


Control (Path) 2 
Parietal Ctx 


21.2 


Control (Path) 1 
Temporal Ctx 


30.4 


Control (Path) 3 
Parietal Ctx 


3.6 


Control (Path) 2 
Temporal Ctx 


36.9 


Control (Path) 4 
Parietal Ctx 


17.6 
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Table CFC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3018, 
Run 167819112 


Tissue Name 


Rel. Exp.(%) Ag3018, 
Run lo/oiyilZ 


^ : ^ 

Liver adenocarcinoma 


13.0 


Kidney (tetal) 


/.J 


Pancreas 


£. A 

6.0 


Renal ca. 786-0 




Pancreatic ca. CAPAN 
2 


12.9 


Renal ca. A498 


4.7 


Adrenal gland 


1.3 


Renal ca. RXr JyJ 




Thyroid 


1.3 


Renal ca. ACHN 


y.5 


Salivary gland 


A A 

0.0 


Renal ca. UO-31 


A 1 

4.1 


Pituitary gland 


7.9 


Renal ca. TK-10 


12.7 


Brain (fetal) 


7.9 


Liver 


2.6 


Brain (whole) 


44.1 


Liver (fetal) 


2.9 


Brain (amygdala) 


3.2 


Liver ca. 
(hepatoblast) HepG2 


10.2 


Brain (cerebellum) 


65.1 


Lung 


6.7 


Brain (hippocampus) 


9.8 


Lung (fetal) 


14.9 


Brain (substantia nigra) 


4.9 


Lung ca. (small cell) 
LX-1 


26.4 


Brain (thalamus) 


8.8 


Lung ca. (small cell) 
NCI-H69 


4.6 


Cerebral Cortex 


10.7 


Lung ca. (s.cell var.) 
SHP-77 


75.3 


Spinal cord 


7.6 


Lung ca. (large 
celI)NCI-H460 


2.9 


glio/astro U87-MG 


232 


Lung ca. (non-sm. 
cell)A549 


55.1 


glio/astroU-118-MG 


37.9 


Lung ca. (non-s.cell) 
NCI-H23 


9.6 


astrocytoma owi /oj 


IS. 1 


Lung ca. (non-s.cell) 
HOP-62 


o.o 


neuro , met oK-in-Ao 


Z./ 


Lung ca. (non-s.cl) 
NCI-H522 


1 1 fi 
I l.U 


astrocytoma oro^v 




Lung ca. (squam.) 
SW 900 


4./ 


astrocytoma aJNts-/j 


1U.0 


Lung ca. (squam.) 
NCI-H596 


1 /.O 


glioma SNB-19 


24.8 


Mammary gland 


2.2 


glioma U251 


28.5 


Breast ca.* (pl.ef) 
MCF-7 


17.3 


glioma SF-295 


34.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


17.3 


Heart (fetal) 


6.4 


Breast ca.* (pl.ef) 
T47D 


6.5 


Heart 


162 


Breast ca. BT-549 


6.3 
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Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


8.4 


Skeletal muscle 


1 1 o 
1 1.7 


Ovary 


0.0 


Bone marrow 


9.2 


Ovarian ca. OVCAR- 

3 


6.2 


Thymus 


27.7 


Ovarian ca. OVCAR- 
4 


9.7 


Spleen 


5.6 


Ovarian ca. OVCAR- 
5 


25.2 


Lymph node 


5.8 


Ovarian ca. OVCAR- 
8 


20.3 


Colorectal 


15.0 


Ovarian ca. IGROV- 

i 
i 


9.2 


Stomach 


1.3 


Ovarian ca.* (ascites) 

jIV"U V J 


26.6 


Slmall intp^tinf* 


1 7 




1 A 


Colon ca. SW480 


14.7 


Placenta 


0.0 


Colon ca. 
SW620(SW480 met) 


100.0 


Prostate 


4.2 


Colon ca. HT29 


2.1 


Prostate ca.* (bone 


15.3 


Colon ca. HCT-116 


12.4 


Testis 


4.5 


Colon ca. CaCo-2 


22.7 


Melanoma 
Hs688(A).T 


3.9 


Colon ca. 
tissue(OD03866) 


7.0 


Melanoma* (met) 
Hs688(B).T 


9.8 


Colon ca. HCC-2998 


35.6 


Melanoma UACC-62 


4.1 


iu \j<x. ^uvci iiicif 

NCI-N87 


19.6 


Melanoma Ml 4 


7.9 


Bladder 


20.3 


Melanoma LOX 
IMVI 


11.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.7 


Kidney 


5.7 


Adipose 


0.0 



Table CFTX Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3018, Run 
164528110 


Tissue Name 


Rel. Exp.(%) 
Ag3018, Run 
164528110 


Secondary Thl act 


14.9 


HUVEC IL-lbeta 


7.9 


Secondary Th2 act 


22.1 


HUVEC IFN gamma 


4.9 


Secondary Trl act 


28.9 


HUVEC TNF alpha + IFN 
gamma 


4.5 


Secondary Thl rest 


4.1 


HUVEC TNF alpha + IL4 


6.7 


Secondary Th2 rest 


5.9 


HUVEC IL-11 


4.7 



1412 



WO 02/068649 



PCT/US02/02785 



Secondary Trl rest 


4.4 


"Lung Microvascular EC 
none 


10.9 


Primary Thl act 


19.5 


Lung Microvascular EC 
TNFalpha + DL-lbeta 


10.1 


Primary Th2 act 


9.4 


Microvascular Dermal EC 
none 


13.2 


Primary Trl act 


13.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


6.7 


Primary Thl rest 


45.7 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.2 


Primary Th2 rest 


18.6 


Small airway epithelium 
none 


2.4 


Primary Trl rest 


16.5 


Small airway epithelium 
TNFalpha + IL-lbeta 


17.6 


CD45RA CD4 
lymphocyte act 


5.6 


Coronery artery SMC rest 


3.5 


CD45RO CD4 
lymphocyte act 


17.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.0 


CD8 lymphocyte act 


16.5 


Astrocytes rest 


7.3 


Secondary CD8 
lymphocyte rest 


9.6 


Astrocytes TNFalpha + 
IL-Ibeta 


3.1 


Secondary CD8 
lymphocyte act 


5.3 


KU-812 fBasoohiFl rest 


2 0 


CD4 lymphocyte none 


4.6 


KU-812 (Basophil) 
PMA/ionomycin 


6.7 


2ryThl/Th2/Trl_anti- 


12.0 


CCD1106 (Keratinocytes) 
none 


4.1 


LAK cells rest 


13.5 


CCD 1106 (Keratmocytes) 
TNFalpha + B.-lbeta 


1.8 


T AT/ TT O 

LrAis. ceils ILrZ 


1 o c 

lo.o 


Liver cirrhosis 


2.1 


T AF nolle TT O-UTT 10 


1 A £ 
14.0 


Lupus kidney 


1.7 


T Alf nolle TT O-4-TCTsJ 

gamma 


16.3 


NCI-H292 none 


13.5 


LAKcelIsIL-2+IL-18 


33.9 


NCI-H292 IL-4 


- 14.5 


T AT^ nolle 

PMA/ionomycin 


4-5 


NCI-H292 IL-9 


23.3 


NK Cells IL-2 rest 


17.3 


NCI-H292 IL-13 


8.0 


l wo way jyllk i day 


1 £ /I 

15.4 


\T/*iT TT^AO TF\T 

JSICI-H292 IFN gamma 


6.4 


Two Wav MLR 5 dav 


6 4 


HP AFP nnnp 

riirrVEjKs none 




Two Way MLR 7 day 


6.3 


HPAECTNF alpha + IL-1 
beta 


6.4 


PBMC rest 


4.2 


Lung fibroblast none 


5.0 


PBMC PWM 


51.1 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.5 


PBMC PHA-L 


14.7 


Lung fibroblast EL-4 


13.6 
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Ramos (B cell) none 


24.3 


Lung fibroblast IL-9 


7.2 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-13 


13.1 


B lymphocytes PWM 


34.4 


Lung fibroblast IFN 
gamma 


6.8 


B lymphocytes CD40L 

_„ J TT A 

and IL-4 


33.0 


Dermal fibroblast 
CCD 1070 rest 


15.4 


EOL-1 dbcAMP 


13.3 


Dermal iibroblast 
CCD1070TNF alpha 


44.8 


bUL-l abcAMr 
PM A /i cst\ nm vc i n 

x ixn*i luuviuy will 


7.4 


Dermal fibroblast 

CCD 1070 TT -1 heta 
y^y-ri-j iv/u Uj-i ucia 


5.9 


Dendritic cells none 


3.0 


Dermal fihrnhlact TFNJ 

gamma 


3.6 


Dendritic cells LPS 


8.8 


Dermal fibroblast IL-4 


9.7 


Dendritic cells anti- 
CD40 


6.0 


IBD Colitis 2 


0.9 


Monocytes rest 


5.2 


IBD Crohn's 


0.0 


Monocytes LPS 


2.6 


Colon 


6.3 


Macrophages rest 


4.2 


Lung 


3.7 


Macrophages LPS 


1.3 


Thymus 


9.9 


HUVEC none 


5.3 


Kidney 


27.2 


HUVEC starved 


12.8 







CNS_neurodegeneration_vl.O Summary: Ag3018 This panel does not show 
diiferential expression of the NOV91 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 13D Summary: Ag301 8 The NOV91 gene is expressed in the brain at low 
levels. D-dopachrome tautomerase has been implicated in the production of neuromelanin 
from the toxic quinone products of dopamine, and this pathway has been implicated in the 
death of dopaminergic neurons in Parkinson's disease. Thus, this gene may represent an 
1 0 excellent small molecule target for the treatment or prevention of Parkinson's disease. 

In addition, significant expression is seen in a cluster of lung, brain, and colon cancer 
cell lines. Thus, expression of this gene could be used to differentiate between thes sample and 
other samples on this panel and as a marker to detect the presence of these cancers. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
1 5 effective in the treatment of lung, brain, and colon cancer. 

References: 
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Matsunaga J, Sinha D, Solano F, Santis C, Wistow G, Hearing V. Macrophage 
migration inhibitory factor (MIF)--its role in catecholamine metabolism. Cell Mol Biol 
(Noisy-le-grand) 1999 Nov;45(7):1035-40 

Macrophage migration inhibitory factor (MIF) was originally identified several 
5 decades ago as a lymphokine-derived protein that inhibited monocyte migration. Recently, it 
has been reported that MIF has D-dopachrome tautomerase, phenylpyruvate tautomerase and 
thiol protein oxidoreductase activities, although the physiological significance of those 
activities is not yet clear. Here we show that MIF is able to catalyze the conversion of 
dopaminechrome and norepinephrinechrome, toxic quinone products of the neurotransmitters 

10 dopamine and norepinephrine, respectively, to indole derivatives that may serve as precursors 
to neuromelanin. Since MIF is highly expressed in human brain, these observations raise the 
possibility that MIF participates in a detoxification pathway for catecholamine products and 
could therefore have an important role for neural tissues. The potential role of MIF in the 
formation of neuromelanin from catecholamines is also an extremely interesting possibility. 

1 5 Drukarch B, van Muiswinkel FL. Neuroprotection for Parkinson's disease: a new 

approach for a new millennium. Expert Opin Investig Drugs 2001 Oct;10(10):1855-68 

Parkinson's disease (PD) is the only neurodegenerative disorder in which 
pharmacological intervention has resulted in a marked decrease in morbidity and a significant 
delay in mortality. However, the medium to long-term efficacy of this pharmacotherapy, 

20 mainly consisting of dopaminomimetics like L -dopa and dopamine receptor agonists, suffers 
greatly from the unrelenting progression of the disease process underlying PD, i.e., the 
degeneration of neuromelanin-containing, dopaminergic neurones in the substantia nigra. 
Efforts concentrated on understanding the mechanisms of dopaminergic cell death in 
Parkinson's disease have led to identification of a large variety of pathogenetic factors, 

25 including excessive release of oxygen free radicals during enzymatic dopamine breakdown, 
impairment of mitochondrial function, production of inflammatory mediators, loss of trophic 
support, and apoptosis. Therapeutic approaches aimed at correcting these abnormalities are 
currently being evaluated on their efficacy as neuroprotectants for PD. Here, we focus on the 
process of dopamine auto-oxidation, the chain of reactions leading to the formation of 

30 neuromelanin, as an often overlooked, yet obvious pathogenetic factor. In particular, we 

discuss the option of drug-mediated stimulation of endogenous mechanisms responsible for 
the detoxification of dopamine auto-oxidation products as a novel means of neuroprotection in 
Parkinson's disease. 
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Panel 4D Summary: Ag301 8 The NOV91 gene, a D-dopachrome tautomerase 
homolog, is widely expressed in this panel, with highest expression in Ramos (B cells) 
activated by treatment with ionomycin (CT=31-28). Therefore, small molecule antagonists that 
block the function of theNOV91 gene product may be useful in several autoimmune and 
5 inflammatory diseases in which activated B cells can play major roles as sources of 

autoantibody-producing cells and also as powerful antigen-presenting cells, including, but hot 
limited to, Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

Panel 5 Islet Summary: Ag301 8 Expression of this gene is low/undetectable (CTs > 
10 35) across all of the samples on this panel (data not shown). 

NOV92 

Expression of gene NOV92 was assessed using the primer-probe set Ag3048, 
described in Table CGA. Results of the RTQ-PCR runs are shown in Tables CGB, CGC, CGD 
and CGE. 

1 5 Table CGA. Probe Name Ag3048 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cacaaccagctgacagacagt-3 1 


21 


490 


1357 


Probe 


TET- 5 1 -ccaccaccttcagcaagctgcatag-3 ' - 
TAMRA 


25 


521 


1358 


Reverse 


5 ' -gggagagatccaggtattcaag-3 ' 


22 j547 


1359 



Table CGB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3048, 
Run 168017062 


Tissue Name 


Rel. Exp.(%) Ag3048, 
Run 168017062 


Liver adenocarcinoma 


2.0 


. Kidney (fetal) 


0.6 


Pancreas 


0.0 


Renal ca. 786-0 


3.8 


Pancreatic ca. CAPAN 
2 


0.3 


Renal ca. A498 


17.7 


Adrenal gland 


0.0 


Renal ca. RXF 393 


12.5 


Thyroid 


0.5 


Renal ca. ACHN 


0.1 


Salivary gland 


0.6 


Renal ca. UO-31 


1.1 


Pituitary gland 


0.5 


Renal ca. TK-10 


0.5 


Brain (fetal) 


0.9 


Liver 


0.0 


Brain (whole) 


2.4 


Liver (fetal) 


0.0 


Brain (amygdala) 


2.9 1 


Liver ca. 
(hepatoblast) HepG2 


0.2 


Brain (cerebellum) 


7.0 


Lung 


1.4 , 
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Brain (hippocampus) 


2.1 


Lung (fetal) 


7.1 


Brain (substantia nigra) 


4,5 


Lung ca. (small cell) 
LX-1 


0.2 


Brain (thalamus) 


4.6 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


1.0 


Lung ca. (s.cell van) 
SHP-77 


0.9 


Spinal cord 


7.6 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


2.8 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astroU-118-MG 


5.9 


Lung ca. (non-s.cell) 
NCI-H23 


0.4 


astrocytoma SW1783 


2.7 


Lung ca. (non-s.cell) 
HOP-62 


41.2 


neuro*;met SK-N-AS 


1.1 


Lung ca. (non-sxl) 
NCI-H522 * 


1.6 


astrocytoma SF-539 


100.0 


Lung ca. (squam.) 
SW900 


0.0 


astrocytoma SNB-75 


41.2 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.4 


Mammary gland 


15.9 


glioma U251 


6.7 


Breast ca.* (pl.ef) 

MLr-/ 


3.5 


■ — - 

glioma SF-295 


7.3 


Breast ca.* (pl.ef) 
MDA-MB-231 


2.0 


Heart (fetal) 


5.1 


rsreast ca. (pi.eij 
T47D 


0.3 


Heart 




rjreasi ca. r> l o^y 


n n 
u.u 


Skeletal muscle (fetal) 


4.7 


Breast ca. MDA-N 


0.3 


Skeletal muscle 


2.6 


Ovary 


3.8 


Bone marrow 


0.9 


Ovarian ca. OVCAR- 
3 


0.2. 


Thymus 


0.9 


Ovarian ca. OVCAR- 
4 


0.2 


Spleen 


0.3 


Ovarian ca. OVCAR- 
5 


4.3 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 

o 


0.8 


Colorectal 


2.9 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.8 


Ovarian ca.* (ascites) 
SK-OV-3 


1.5 


Small intestine 


1.0 


! Uterus 


0.6 


Colon ca. SW480 


0.8 


Placenta 


0.9 
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Colon ca.* 
SW620(SW480 met) 


1.9 


Prostate 


3.0 


Colon ca. HT29 


0.3 


Prostate ca.* (bone 


1.7 


Colon ca.HCT-1 16 


0.0 


Testis 


3.9 


Colon ca. CaCo-2 


0.2 


Melanoma 
Hs688(A).T 


32.8 


Colon ca. 
tissue(OD03866) 


13.2 


Melanoma* (met) 
Hs688(B).T 


31.4 


Colon ca. HCC-2998 


A 1 

4.1 


Melanoma UACC-62 


1.8 


vja^inw ca. ^nver mei^ 
NCI-N87 


1.5 


Melanoma M14 


0.0 


Bladder 


3.0 


Melanoma LOX 
IMVI 


0.0 ... 


Trachea 


3.7 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


4.3 



Table CGC. Panel 2D 



Tissue Name 


Rel.Exp.(%) 
Ag3048,Run 
170858352 


Tissue Name 


Rel. Exp.(%) I 
Ag3048, Run 
170858352 


Normal Colon 


4.7 


Kidney Margin 
8120608 


0.5 


CCWelltoModDiff 
(OD03866) 


12.0 


Kidney Cancer 
8120613 


0.1 


CC Margin (OD03866) 


1.5 


Kidney Margin 
8120614 


0.4 


CC Gr.2 rectosigmoid 
(OD03868) 


10.1 


Kidney Cancer 
9010320 


13.1 


CC Margin (OD03868) 


0.8 


Kidney Margin 
9010321 


0.9 


CCMod Diff(ODO3920) 


8.4 


Normal Uterus 


0.5 


CC Margin (ODO3920) 


2.6 


Uterus Cancer 064011 


1.0 


CC Gr.2 ascend colon 
(OD03921) 


13.6 


Normal Thyroid 


3.3 


CC Margin (OD03921) 


2.1 


Thyroid Cancer 
064010 


0.7 


CC from Partial 
Hepatectomy (ODO4309) 
Mets. 


3.1 - 


Thyroid Cancer 
A302152 


4.5 


Liver Margin (ODO4309) 


0.1 


Thyroid Margin 
A302153 


0.8 


Colon mets to lung 
(OD04451-01) 


3.3 


Normal Breast 


20.0 
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Lung Margin (OD0445 1- : 
02) 


1.1 


Breast Cancer 
(OD04566) 


6.5 


Normal Prostate 6546-1 


4.1 


Breast Cancer 
(OD04590-01) 


10.4 


Prostate Cancer 
(OD04410) 


19.8 


Breast Cancer Mets 
(OD04590-03) 


11.4 


Prostate Margin 
(OD04410) 


4.5 


Breast Cancer 

Metastasis 
(OD04655-05) 


1.0 


Prostate Cancer 
(OD04720-01) 


9.7 


Breast Cancer 064006 


23.2 


Prostate Margin 
(OD04720-02) 


7.5 


Breast Cancer 1024 


25.7 


Normal Lung 061010 


3.3 


Breast Cancer 
9100266 


16.4 


Lung Met to Muscle 


24.5 


Breast Margin 
y i uuzo d 


24.0 


iviUdwic iviargin 
(OD04286) 


1.7 


orcasc v^ancer 
A209073 


32.3 


Lung Malignant Cancer 
(OD03126) 


11.6 


Breast Margin 
A2090734 


33.2 


Lung Margin (OD03126) 


3.1 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


22.2 


Liver Cancer 064003 


0.0 


T una IMarcnn fCiT}f\AAf\A.\ 

idling iviargm ^ui/u*T t tu i f ) 


7 0 
/.Z 


i_j ver cancer i vzj 




Lung Cancer (OD04565) 


36.1 


Liver Cancer 1026 


4.2 


Lung Margin (OD04565) 


2.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


12.6 


Liver Tissue 6004-N 


2.9 


Lung Margin (OD04237- 
02) 


6.7 


Liver Cancer 6005-T 


2.2 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.4 


Normal Bladder 


23.8 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


11.8 


Lung Margin (OD04321) 


1.7 


Bladder Cancer 
A302173 


7.2 


Normal Kidney 


0.8 


Bladder Cancer 
(OD04718-01) 


12.7 


Kidney Ca, Nuclear grade 
2 (OD04338) 


0.3 . 


Bladder Normal 
Adjacent (OD047 18- 
03) 


4.9 


Kidney Margin 
(OD04338) 


I- 5 


Normal Ovary 


3.5 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


02 


Ovarian Cancer 
064008 


100.0 
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Kidney Margin 


0.9 


Ovarian Cancer 

(UD04/OO-U /) 


2.5 


Kidney Ca, Clear cell 
type \jJiJ\) £ *jH\j ) 


0.4 


Ovary Margin 

(UD04/Do-Uo) 


8,1 


Kidney Margin 


1.0 


Normal Stomach 


1.5 


Kidney Ca, Nuclear grade 


10.3 


Gastric Cancer 
VUoU3>o 


0.5 


Kidney Margin 


0.4 


Stomach Margin 


0.8 


Kidney Cancer 

(UJUU40ZZ-U1) 


34.4 


Gastric Cancer 
yoo03y5 


7.5 


Kidney Margin 


0.2 


Stomach Margin 


1.9 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


27.0 


Kidney Margin 
(OD04450-03) 


02 


Stomach Margin 
9060396 


1.5 


Kidney Cancer 8120607 


9.0 


Gastric Cancer 
064005 


3.3 



Table CGD. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
172133336 


Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
172133336 


Daoy- Medulloblastoma 


6.5 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


18.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498-CNS 


1.4 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


2.6 


SNB-78- Glioma 


100.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


24.1 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


32.3 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


2.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


3.0 


JM1- pre-B-cell lymphoma 


0.0 
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Cerebellum 


4.2 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


38.4 


TF-1- Erythroleukemia 


2.3 


DMS-114- Small cell 
lung cancer 


1.7 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


3.0 


U937- Histiocytic lymphoma 


0.7 


NCI-H146- Small cell 
lung cancer 


3.3 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


2.5 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


0.0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


1.4 


G401- Wilms 1 tumor 


2.3 


NCI-HI 155- Large cell 
lung cancer 


1.8 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


1.5 


NCI-H1299- Large cell 
lung cancer 


2.8 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727-Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.8 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.7 


LX-1- Small cell lung 
cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


7.3 


MIA PaCa-2- Pancreatic 
carcinoma 


0.7 


KM 12- Colon cancer 


1.6 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


1.7 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


6.7 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
(transitional cell) 


6.1 


SW-48- Colon 
adenocarcinoma 

wvui viiivlllU 


0.6 


5637- Bladder carcinoma 


2.1 


SW1 116- Colon 
adenocarcinoma 


0.8 


HT-1 197- Bladder carcinoma 


1.7 


LS174T- Colon 
adenocarcinoma 


2.5 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.7 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


5.4 
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SW-480- Colon 
adenocarcinoma 


2.7 


HT-1080- Fibrosarcoma 


11.7 


NCI-SNU-5- Gastric 
carcinoma 


1.6 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


1.8 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


7.5 


NCI-SNU-16- Gastric 
carcinoma 


1-4 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


5.1 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


16.6 


RF-48- Gastric 
adenocarc inoma 


0.9 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


5.6 


MDA-MB-468- Breast 
adenocarcinoma 


2.5 


NCI-N87- Gastric 

f 1 a rr» i n nm 5* 
call/iliouia 


2.3 


SCC-4- Squamous cell 
carcinoma oi tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


4.8 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.6 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.5 



Table CGE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
164315038 


Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
164315038 


Secondary Thl act 


0.4 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


1.9 


Secondary Trl act 


0.6 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


\2 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.3 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


3.9 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.2 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.6 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


1.4 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + DLlbeta 


28.7 
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Primary Th2 rest 


0.9 


Small airway epithelium 
none 


39.2 


Primary Trl rest 


0.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


84.7 


CD45RA CD4 
lymphocyte act 


8.6 


Coronery artery SMC rest 


17.9 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


5.5 


CDS lymphocyte act 


0.0 


Astrocytes rest 


6.1 


Secondary CD8 
lymphocyte rest 


0.6 


Astrocytes TNFalpha + 
IL-lbeta 


5.5 


Secondary CD8 
lymphocyte act 


0.6 


KU-812 (Basophil) rest 


0.7 


CD4 lymphocyte none 


0.5 


KU-8 12 (Basophil) 
r im/v ionomycin 


3.8 


2iyThl/Th2/Trl_anti- 


1.3 


CCD 1 106 (Keratinocytes) 
none ■ 


79.0 


LAK cells rest 


0.0 


ccui luo (Js.eratmocytes) 
TNFalpha + IL-lbeta 


97.3 


J-r/vIV CcliS IJLr-Z 


U.U 


Liver cirrhosis 


A 1 

4.3 


Ls/\k\. UCllS W-TaL r llj- J. Z 


n a 

U.U 


Lupus kidney 


•5 A 

3.4 


T AK orIIq TT -74-TT7M 

L-f/vTw CCllo lLi~Z » IriN . 

gamma 


0.0 


NCI-H292 none 


87.1 


LAK cells EL-2+IL-18 


0.0 


NCI-H292 IL-4 


77.4 


PMA/ionomycin 


0.0 


NCI-H292 IL-9 


82.9 


NK Cells IL-2 rest j 


0.0 


NCI-H292 IL-13 


54.0 


Two Wav MT 1? 3 Hnv 


A A 
U.U 


XTPT T-TOOO TT7KT rrammo 

iNd-rizyz iriN gamma 




Two Wav MT R ^ Hnv 


A A 
U.U 


rirAriL, none 


U.7 


Two Way MLR 7 day 


0.8 


nr/vjC/L^ i iNr aipna *r jll-i 
beta 


0.0 


PBMC rest 


0 K 
u.o 


L-ung iiuroDiasi none 




PBMC PWM 


0.0 


Lung fibroblast TNF alpha 

4- TT 1 Koto 

t il-i oeta 


7.0 


PBMC PHA-L 


2.2 


Lung fibroblast IL-4 


46.0 


Kamos (B cell) none 


0.0 


Lung fibroblast IL-9 


38.7 


Ramos (B cell) 
ionomycin 


0.8 


Lung fibroblast IL-13 


26.8 


B lymphocytes PWM 


1.0 


Lung iibroblast IrN 
gamma 


68.8 


B lymphocytes CD40L 
and DL-4 


0.0 


Dermal fibroblast 
CCD1 070 rest 


100.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


59.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 


39.8 
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PMA/innomvcin 




rmifi70IL-1 heta 




Dendritic cells none 


1.8 


Dermal fibroblast TFN 

gamma 


n-r-n^.-.-r.,- ^ . 

17.0 


Dendritic cells LPS 


1.7 


Dermal fibroblast IL-4 


30.6 


Dendritic cells anti- 
CD40 


6.8 


IBD Colitis 2 


0.0 


Monocytes rest 


ii 


IBD Crohn's 


0.7 


Monocytes LPS 


0.0 


Colon 


2.0 


Macrophages rest 


19 


Lung 


36.1 


Macrophages LPS 


1.3 


Thymus 


0.6 


HUVEC none 


0.0 


Kidney 


1.7 


HUVEC starved 


2.9 







Panel 13D Summary: Ag3048 The expression of the NOV92 gene appears to be 
highest in a sample derived from a brain cancer cell line (SF-539) (CT=29.4). In addition, 
there is substantial expression associated with samples derived from another brain cancer cell 
5 line, two melanoma cell lines and a lung cancer cell line. Thus, the expression of this gene 
could be used to distinguish SF-539 cells from other samples in the panel. Moreover, 
therapeutic modulation of this gene, through the use of small molecule drugs, protein 
therapeutics or antibodies might be of benefit in treatment of brain or lung cancer or 
melanoma. 

10 This gene, a leucine-rich repeat homolog, is expressed at low levels in the CNS. The 

leucine-rich repeat region proteins have been implicated in axonal guidance. Therefore, this 
gene may be of therapeutic utility in the treatment of any CNS disorder involving neuronal 
loss, specifically by guiding/enhancing compensatory synaptogenesis and fiber outgrowth, 
including such clinical conditions as Alzheimer's, Parkinson's, or Huntington's diseases, 

1 5 stroke, head and spinal cord trauma, vascular dementia or spinocerebellar ataxia. 

Panel 2D Summary: Ag3048 The expression of the NOV92 gene appears to be 
highest in a sample derived from an ovarian cancer (CT=29). In addition, there appears to be 
substantial expression associated with lung cancer, prostate cancer and colon cancer samples. 
Of note is the differential expression in the lung, colon and prostate cancer samples compared 

20 to their respective normal adjacent tissue. Thus, the expression of this gene could be used to 
distinguish this ovarian cancer sample from other samples in the panel. In addition, the 
expression of this gene could be used to distinguish colon, prostate or lung cancer samples 
from their normal adjacent tissue. Moreover, therapeutic modulation of this gene, through the 
use of small molecule drugs, protein therapeutics or antibodies might be beneficial in the 

25 treatment of ovarian, lung, prostate or colon cancer. 
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Pane! 3D Summary: Ag3048 The expression of the NOV92 gene appears to be 
highest in a sample derived from a brain cancer cell line (SNB-78) (CT=30.2). In addition, 
there appears to be substantial expression associated with other brain cancer cell line samples 
and a lung cancer cell line sample. Thus, the expression of this gene could be used to 
5 distinguish SNB-78 cells from other samples in the panel. Moreover, therapeutic modulation 
of this gene, through the use of small molecule drugs, protein therapeutics or antibodies might 
be beneficial in the treatment of brain or lung cancer. 

Panel 4D Summary: Ag3048 The NOV92 gene, a secreted Ieucine-rich repeat (LRR) 
protein, is expressed selectively at moderate levels (CT range 29-31) in several resting and 
10 cytokine-activated epithelial and connective tissue cells of lung and skin. Therefore, the 

if 

NOV92 gene product may be useful as a therapeutic protein as well as a target for therapeutic 
antibodies and small molecules, each of which may prove beneficial in the reduction or 
elimination of the symptoms in patients with chronic obstructive pulmonary disease, asthma, 
emphysema, or psoriasis. 

1 5 NOV93: IMP DEHYDROGENASE 1 

Expression of gene NOV93 was assessed using the primer-probe sets Ag4520 and 
Ag2904, described in Tables CHA and CHB. Results of the RTQ-PCR runs are shown in 
Tables CHC, CHD, CHE, CHF, CHG, CHH and CHI. 

Table CHA. Probe Name Ag4520 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggtacccatgaggatgacaaat-3 ' 


22 


758 


1360 


Probe 


TET-5 1 -acctggacctgctcacccaggtag-3 ' - 
TAMRA 


24 


783 


1361 


Reverse 


5 1 -cgagtccaagcctatgacatt-3 1 


21 


812 


1362 



20 

Table CHB. Probe Name Ag2904 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID HO: 


Forward 


5 1 -atacttcaacgacggggataag-3 1 


22 


1300 


1363 


Probe 


TET-5 1 -ctccatccaggacaaagggtccatt-3 1 - 
TAMRA 


25 


1348 


1364 


Reverse 


5 1 -aggtagggcacgaacttctg-3 1 


20 


1373 


1365 



Table CHC. AI_comprehensive panel_vl.0 



Tissue Name 


Rel. Exp.(%) 
Ag4520, Run 


Tissue Name 


Rel. Exp.(%) 
Ag4520, Run 



1425 



WO 02/068649 



PCT/US02/02785 





219421380 




219421380 


110967 COPD-F 


11.8 


112427 Match Control 


63.7 


110980 COPD-F 


12.1 


112418 Psoriasis-M 


12.5 


1 10968 COPD-M 


11.0 


1 111 15 Match Control 
Psoriasis-M 


0.3 


1 1 0977 COPD-M 


46.7 


1 12419 Psonasis-M 


10.4 


110989 Emphysema- 

T? 

r 


34.9 


112424 Match Control 
Psoriasis-M 


11.6 


110992 Emphysema- 

x: 
r 


22.7 


112420 Psoriasis-M 


52.1 


110993 Emphysema- 
r 


21.9 


112425 Match Control 
Psoriasis-M 


50.0 


110994 Emphysema- 
r 


13.3 


104689 (MF)OA 
Bone-Backus 


30.8 


110995 Emphysema- 
F 


33.2 


104690 (MF) Adj 
Normal Bone-Backus 


23.0 


110996 Emphysema- 
F 


8.5 


104691 (MF) OA 
Synovium-Backus 


22.4 


110997 Asthma-M 


6.3 


104692 (B A) OA 
Cartilage-Backus 


3.4 


111001 Asthraa-F 


32.5 


104694 (BA) OA 
Bone-Backus 


17.6 


111002 Asthma-F 


22.2 


104695 (BA) Adj 

f 1~R. T ^-^ - - fit T~*\ — . T% f 

Normal Bone-Backus 


24.1 


111003 Atopic 
Astnma-r 


18.9 


104696 (BA) OA 
Synovium-Backus 


13.9 


11 1004 Atopic 
Astnma-r 


20.9 


104700 (SS) OA Bone- 
Backus 


21.3 


ill UUD AtOplC 

Asthma-F 


10.7 


I047UI (Sb) Adj 

"Normal" Bone-Backus 


21.2 


111 UUO AIOplC 

Asthma-F 


3.6 


1 04702 (So) UA 
Synovium-Backus 


42.3 


111417Allergy-M 


13.1 


1 1 /uyj ua cartilage 
Rep7 


27.9 


i izj**/ Aiiergy-M 


Z.o 


1 1 lb 11 UA bone 5 


39.0 


1 1 r )'2AQ XT/ittvi n 1 T unrr 

l iz.jHy iNormai L»ung- 
F 


2.0 


112673 OA Synoviums 


14.6 


1 Mnrmal T nncr- 
i i *j i i wi men jLsiujg 

F 


22.4 


i izo/f oynoviaj 
Fluid cells5 


16.6 


112354 Normal Lung- 
M 


10.4 


117100 OA Cartilage 
Repl4 


5.1 


1 12374 Crohns-F 


11.3 


112756 OA Bone9 


6.8 


112389 Match 
Control Crohns-F 


18.6 


1 12757 OA Synovium9 


14.7 
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112375 Crohns-F 


8.5 


11 2758 OA Synovial 
Fluid Cells9 


12.8 


112732 Match 
Control Crohns-F 


41.8 


117125 RA Cartilage 
Rep2 


10.1 


112725 Crohns-M 


3.2 


113492 Bone2 RA 


28.7 


112387 Match 
Control Crohns-M 


13.5 


113493 Synovium2RA 


13.1 


1 12378 Crohns-M 


1.6 


113494 Syn Fluid Cells 
RA 


26.4 


112390 Match 
Control Crohns-M 


49.7 


1 13499 Cartilage4RA 


29.3 


112726 Crohns-M 


12.8 


113500 Bone4 RA 


41.2 


112731 Match 
Control Crohns-M 


16.7 


113501 Synovium4RA 


30.6 


112380 Ulcer Col-F 


25.5 


1 13502 Syn Fluid 
Cells4RA 


15.9 


112734 Match 
Control Ulcer Col-F 


100.0 


1 13495 Cartilage3RA 


24.1 


112384 Ulcer Col-F 


48.6 


113496 Bone3 RA 


31.9 


1 12737 Match 
Control Ulcer Col-F 


7.4 


113497 Synovium3RA 


13.7 


112386 Ulcer Col-F 


4.4 


1 13498 Syn Fluid 
Cells3 RA 


26.4 


112738 Match 
Control Ulcer Col-F 


12.9 


117106 Normal 
Cartilage Rep20 


6.9 


112381 Ulcer Col-M 


2.0 


1 13663 Bone3 Normal 


2.1 


112735 Match 
Control Ulcer Col-M 


12.0 


113664 Synovium3 
Normal 


12 


112382 Ulcer Col-M 


12.1 


113665 Syn Fluid 
Cells3 Normal 


0.8 


i izjy4 Matcn 
Control Ulcer Col-M 


4.5 


1 1 *71 A*T > T in i 1 

1 171U7 Normal 
Cartilage Rep22 


13.5 


112383 Ulcer Col-M 


26.8 


113667 Bone4 Normal 


18.3 


112736 Match 
Control Ulcer Col-M 


9.5 


113668 Synovium4 
Normal 


17.6 


112423 Psoriasis-F 


11.1 


1 13669 Syn Fluid 
Cells4 Normal 


27.4 



Table CUD. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel.Exp.(%) 
Ag2904,Run 
206485416 


Rel.Exp.(%) 
Ag4520, Run 
206954220 


Tissue 
Name 


Rel. Exp.(%) 
Ag2904,Rnn 
206485416 


Rel. Exp.(%) 
Ag4520,Run 
206954220 


AD 1 Hippo 


29.3 


31.9 


Control 
(Path) 3 
Temporal 


25.0 


28.9 
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Ctx 






AD 2 Hippo 


40.6 


34.2 


. Control 
(Path) 4 
Temporal 
Ctx 


39.0 


30.8 


AD 3 Hippo 


34.2 


26.1 


ADl 
Occipital 
Ctx 


27.4 


23.7 


AD 4 Hippo 


29.5 


19.1 


AD2 
Occipital 

Ctx 
(Missing) 


0.0 


0.0 


AD 5 hippo 


60.7 


58.2 


AD3 
Occipital 
Ctx 


33.0 


24.8 


AD 6 Hippo 


80.7 . 


92.7 


AD4 
Occipital 
Ctx 


24.5 


26.2 


Control 2 
Hippo 


25.0 


30.8 


AD5 
Occipital 
Ctx 


41.5 


30.1 


Control 4 
Hippo 


53.2 


40.9 


AD6 
Occipital 
Ctx 


30.6 


52.1 


Control (Path) 
3 Hippo 


25.0 


22.4 


Control 1 
Occipital 
Ctx 


48.0 


60.3 


AD 1 Temporal 
Ctx 


52.1 


81.2 


Control 2 
Occipital 
Ctx i 


33.4 


41.5 


AD 2 Temporal 
Ctx 


24.3 


21.5 


Control 3 
Occipital 
Ctx 


25.3 


23.3 


AD 3 Temporal 
Ctx 


32.3 


26.1 


Control 4 
Occipital 
Ctx 


30.8 


25.3 


AD 4 Temporal 
Ctx 


40.6 


49.7 


Control 
(Path)l 
Occipital 
Ctx 


49.7 


37.1 


AD 5 Inf 
Temporal Ctx 


52.9 


77.4 


Control 
(Path) 2 
Occipital 
Ctx 


15.5 


12.3 


AD 5 
SupTemporal 
Ctx 


83.5 


100.0 


Control 
(Path) 3 
Occipital 
Ctx 


39.2 


45.1 
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AD6Inf 
Temporal Ctx 


77.4 


85.3 


Control 
(Path) 4 
Occipital 
Ctx 


33.4 


36.1 


AD 6 Sup 
Temporal Ctx 


100.0 


87.1 


Control 1 
Parietal Ctx 


37.4 


29.1 


Control 1 
Temporal Ctx 


29.9 


29.5 


Control 2 
Parietal Ctx 


34.2 


52.1 


Control 2 
Temporal Ctx 


20.0 


19 1 


Control 3 
Parietal Ctx 


14 0 


1 1 0 


Control 3 
TemDoral Ctx 


22.4 


14.3 


Control 
(Path) 1 
Parietal Ctx 


43.2 


30.1 


Control 4 
Temporal Ctx 


26.1 


10.4 


Control 
(Path) 2 
Parietal Ctx 


32.8 


33.4 


Control (Path) 
1 Temporal Ctx 


46.3 


25.7 


Control 
(Path) 3 
Parietal Ctx 


34.9 


% 36.1 


Control (Path) 
2 Temporal Ctx 


34:2 


34.2 


Control 
(Path) 4 
Parietal Ctx 


48.3 


44.4 



Table CHE. General screening .panel vl. 4 



Tissue Name 


Rel. Exp.(%) 
Ag4520,Run 
219274490 


Rel. Exp.(%) 
Ag4520, Run 
219288511 


Tissue Name 


Rel. Exp.(%) 
Ag4520, Run 
219274490 


Rel. Exp.(%) 
Ag4520, Run 
219288511 


Adipose 


26.6 


24.5 


Renal ca. TK-10 


47.0 


42.3 


Melanoma* 
Hs688(A).T 


14.1 


12.9 


Bladder 


412 


37.6 


Melanoma* 
Hs688(B).T 


13.6 


13.3 


Gastric ca. (liver 
met.)NCI-N87 


100.0 


100.0 


Melanoma* 
M14 


8.5 


11.2 


Gastric ca. 
KATOffl 


14.4 


13.5 


Melanoma* 
LOXIMVI 


17.2 


18.6 


Colon ca. SW- 
948 


5.0 


5.3 


Melanoma* 
SK-MEL-5 


10.7 


11.3 


Colon ca. SW480 


18.6 


20.9 


Squamous cell 
carcinoma 
SCC-4 


4.9 


6.6 


Colon ca.* 
(SW480met) 
SW620 


9.0 


9.0 


Testis Pool 


7.5 


7.5 


Colon ca. HT29 


10.1 


10.0 


Prostate ca.* 
(bone met) 
PC-3 


8.0 


7.2 


Colon ca. HCT- 
116 


16.7 


14.6 
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Prostate Pool 


1 1 Q 
13.6 ' 


13.3 


Colon ca. CaCo-2 


ZO.l 


26.4 


Placenta 


18.3 


19.5 


Colon cancer 
tissue 


21.0 


23.8 


Uterus Pool 


9.0 


9.0 


Colon ca. 

C11/1 1 1 /Z 

bWl 1 lo 


1.3 


2.6 


Ovarian ca. 

U V LAK-j 


15.2 


13.5 


Colon ca. Colo- 
one 


3.5 


3.7 


Ovarian ca. 

Qlf r\\r o 


28.9 


28.3 


Colon ca. SW-48 


4.3 


4.5 


Ovarian ca. 

UVLAK-4 


5.9 


6.2 


Colon Pool 


31.9 


30.1 


L/vanan ca. 
OVCAR-5 


49.0 


— — ' — 

49.0 


omali intestine 
Pool 


29.7 


32.5 


uvarian ca. 
IGROV-1 


16.3 


15.0 


Stomach Pool 


15.6 


15.2 


Ovarian ca. 

v-J V LAK-o 


6.9 


6.0 


Bone Marrow 
rool 


13.4 


13.9 


Ovary 


11.3 


10.0 


Fetal Heart 


20.3 


21.8 


Breast ca. 
MCF-7 


27.7 


27.0 


Heart Pool 


11.3 


15.4 


Breast ca. 
MDA-MB- 
Ad i 


37.9 


48.3. 


Lymph Node 
Pool 


26.8 


30.1 


oreasi ca. o i 
549 


52.9 


53.2 


retal Skeletal 
Muscle 


7.5 


10.7 


Breast ca. 


69.7 


74.7 


Skeletal Muscle 
Pool 


20.9 


24.3 


Breast ca. 
MDA-N 


4.7 


6.7 


Spleen Pool 


45.i 


57.8 


Breast Pool 


31.0 


37.1 


Thymus Pool 


31.9 


31.0 


Trachea 


31.0 


28.5 


CNS cancer 
(glio/astro) U87- 
MG 


26.1 


25.9 


Lung 


4.5 


3.7 


CNS cancer 
(glio/astro) U- 
118-MG 


21.3 


24.0 


Fetal Lung 


68.3 


100.0 


CNS cancer 
(neuro;met) SK- 

XT A C 

JN-Aa 


18.9 


21.0 


Lung ca. NCI- 
N417 


0.5 


0.6 


VINO wdllV/vl 

(astro) SF-539 


10.6 


6.9 


Lung ca. LX- 
1 


13.2 


11.9 


CNS cancer 
(astro) SNB-75 


48.0 


47.0 


Lung ca. NCI- 
H146 


4.1 


3.7 


CNS cancer 
(glio) SNB-19 


14.7 


13.4 


Lung ca. j 6.3 


6.3 


CNS cancer 


18.8 | 20.4 
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SHP r 77 ! 




(gho) SF-295 






Lung ca. 
A549 




23.5 


Brain 
(Amygdala) Pool 


2.4 


3.3 


Lung ca. NCI- 
H526 


i .j 


1.7 


Brain 
(cerebellum) 


9.2 


10.1 


Lung ca. NCI- 
H23 


90 7 


29.5 


Brain (fetal) 


7.7 


8.2 


.LAing ca. 
H460 


14.2 


15.1 


Brain 
(Hippocampus) 
Pool 


5.5 


4.8 


Lung ca. 
HOP-62 


17.4 


3.6 


Cerebral Cortex 
Pool 


5.4 


3.7 


Lung ca. NCI- 
H522 


20.3 


25.5 


Brain (Substantia 
nigra/ rooi 


3.8 


3.2 


Liver 


2.0 


2.0 


Brain (Thalamus) 

rUOl 


4.4 


4.7 


Fetal Liver 


14.6 


15.9 


Brain (whole) 


4.0 


4.8 


Liver ca. 
HepG2 


16.2 


18.8 


Spinal Cord Pool 


6.5 


7.0 


Kidney Pool 


36.3 


44.4 


Adrenal Gland 


28.9 


28.3 


Fetal Kidney 


31.9 


29.1 


Pituitary gland 
Pool 


3.3 


2.5 


Renal ca. 786- 
0 


50.7 


54.3 


Salivary Gland 


8.0 


6.0 


Renal ca. 
A498 


28.9 


27.4 


Thyroid (female) 


9.5 


7.6 


Renal ca. 
ACHN 


45.4 


41.2 


Pancreatic ca. 
CAPAN2 


51.4 


48.3 


Renal ca. UO- 
31 


31.9 


31.6 


Pancreas Pool 


35.4 


39.0 



Table CHF. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) Ag2904, 
Run 162556421 


Tissue Name 


Rel. Exp.(%) Ag2904, 
Run 162556421 


Liver adenocarcinoma 


16.0 


Kidney (fetal) 


40.3 


Pancreas 


6.9 


Renal ca. 786-0 


13.3 


Pancreatic ca. CAPAN 
2 


6.4 


Renal ca. A498 


34.2 


Adrenal gland 


9.7 


Renal ca. RXF 393 


16.3 


Thyroid 




Renal ca. ACHN 


21.9 


Salivary gland 


7.4 


Renal ca.UO-31 


6.2 


Pituitary gland 


2.1 


Renal ca. TK-10 


9.5 


Brain (fetal) 


1.6 


Liver 


4.2 


Brain (whole) 


1.5 


Liver (fetal) 


11.3 
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Brain (amygdala) 


" 6 | Liver ca. 

j(hepatoblast) HepG2 


11.3 


Brain (cerebellum) 


6.7 J Lung 


70.7 


Brain (hippocampus) 


7.6 j Lung (fetal) 


42.3 


Brain (substantia nigra) 


9 r | Lung ca. (small cell) 
lA 1 LX-1 


4.7 


Brain (thalamus) 




Lung ca. (small cell) 
NCI-H69 


1.6 


Cerebral Cortex 


10 3 


Lung ca. (s.cell var.) 
SHP-77 


2.3 


Spinal cord 


I J.J 


Lung ca. (large 
cell)NCI-H460 


6.9 


glio/astro U87-MG 


13.3 


Lung ca. (non-sm. 
cell)A549 


3.5 


glio/astroU-118-MG 


3.4 


Lung ca. (non-s.cell) 
NCI-H23 


10.6 


astrocytoma SW1783 


6.1 


Lung ca. (non-s.cell) 
HOP-62 


4.9 


neuro*;met SK-N-AS 


6.4 


Lung ca. (non-s.cI) 
NCI-H522 


6.9 


astrocytoma SF-539 


8.0 


Lung ca. (squam.) 

C\17 onn 


18.2 


astrocytoma SNB-75 


10.9 


Lung ca. (squam.) 


2.4 


_ l* 0*KTT> "t f\ 

glioma SNB-19 


7.3 


Mammary gland 


9.7 


glioma U251 


3.4 


Breast ca.* (pl.ef) 
MCF-7 


18.0 


glioma SF-295 


8.1 


Breast ca.* (pl.ef) 
MDA-MB-231 


10.1 


Heart (fetal) 


9.9 


Breast ca.* (pl.ef) 
T47D 


4.1 


Heart 


24.3 


Breast ca. BT-549 


4 8 ! 

*T.O 


Skeletal muscle (fetal) 


i fin ft 


iireast ca. mua-jn 


3.6 


Skeletal muscle 


9.8 


Ovary 


18.7 


Bone marrow 


35.4 


Ovarian ca. OVCAR- 
3 


2.5 


Thymus 


80.7 


Ovarian ca. OVCAR- 
4 


1.7 


Spleen 


50.0 


Ovarian ca. OVCAR- 
5 


8.2 


Lymph node 


29.1 


Ovarian ca. OVCAR- 
8 


3.5 


Colorectal 


42.3 


Ovarian ca. IGROV- 
1 


4.9 


Stomach 


10.7 


Ovarian ca.* (ascites) 


5.2 
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SK-OV-3 




Small intestine 


25.3 


Uterus 


10.7 


Colon ca. SW480 


2.4 


Placenta 


36.3 


i^oion ca. 
SW620(SW480 met) , 


3.8 


Prostate 


10.5 


Colon ca. HT29 


5.4 


rrostate ca. (bone 
met)PC-3 


1.7 


Colon ca.HCT-1 16 


2.6 


Testis 


7.9 


Colon ca. CaCo-2 


10.7 


Melanoma 
Hs688(A).T 


4.6 


v^oion ca, 
tissue(OD03866) 


21.6 


Melanoma* (met) 
Hs688(B).T 


6.2 


V^OlOn Ca. XT\sKs-Z.yyo 




Melanoma UAL-C-o/ 


u.y 


Gastric ca * fliver mefi 
NCI-N87 


27.2 


Melanoma M14 


2.8 


Bladder 


39.5 


Melanoma LOX 
IMVI 


2.9 


Trachea 


36.9 


Melanoma* (met) 
SK-MEL-5 


2.5 


Kidney 


16.0 


Adipose 


27.4 



Table CHG. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2904, Run 
162345750 


Tissue Name 


Rel. Exp.(%) 
Ag2904,Run 
162345750 


Normal Colon 


58.6 


Kidney Margin 
8120608 


5.3 


CC Well to Mod Diff 
(OD03866) 


12.2 


Kidney Cancer 
8120613 


7.9 


CC Margin (0DO3866) 


9.2 


Kidney Margin 
8120614 


14.2 


CC Gr.2 rectosigmoid 
(OD03868) 


50.0 


Kidney Cancer 
9010320 


31.2 


CC Margin (OD03868) 


5.6 


Kidney Margin 
9010321 


27.2 


CC Mod Diff(ODO3920) 


45.4 


Normal Uterus 


6.5 


CC Margin (ODO3920) , 


27.2 


Uterus Cancer 064011 


26.6 


CC Gr.2 ascend colon 
(0DO3921) 


30.4 


Normal Thyroid 


10.7 


CC Margin (OD03921) 


9.9 


Thyroid Cancer 
064010 


14.2 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


34.2 


Thyroid Cancer 
A302152 


36.3 
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Liver Margin (ODO4309) 


19.1 


Thyroid Margin 


22.4 


Colon mets to lung 

(UJJU44 J 1 -U 1 ) 


52.1 


Normal Breast 


23.5 ; 


Lung Margin (OD04451- 
02) 


43.2 


Breast Cancer 
(OD04566) 


6.8 


Normal Prostate 6546-1 


66.9 


Breast Cancer 
(OD04590-01) 


13.5 • 


Prostate Cancer 
(OD04410) 


22.4 


Breast Cancer Mets 
(OD04590-03) 


29.3 


Prostate Margin 
(OD04410) 


21.9 


Breast Cancer 
Metastasis 

(UJJU40JJ-U3J 


. 18.7 


Prostate Cancer 

[\JlJ\J £ t / ZU-U I ) 


26.2 


Breast Cancer 064006 


27.5 


Prostate Margin 

[KJLJU't / ZU-UZ ) 


28.9 


Breast Cancer 1024 


20.9 


Normal Lung 061010 


100.0 


Breast Cancer 
ylUUzoo 


12.2 


i^ung Met to Muscle 
(OD04286) 


27.2 


Breast Margin 
9100265 


8.8 


Muscle Margin 

lilMUV * W 1'AUI gill 

(OD04286) 


13.6 


Breast Cancer 
A209073 


15.7 


Lung Malignant Cancer 
(OD03126) 


32.5 


Breast Margin 
A2090734 


17.6 


Lung Margin (OD03126) 


63.3 . 


Normal Liver 


13.4 


Lung Cancer (OD04404) 


28.3 


Liver Cancer 064003 


13.9 


Lung Margin (OD04404) 


34.9 


Liver Cancer 1 025 


13 9 


Lung Cancer (OD04565) 


23.3 


Liver Cancer 1026 


6.6 


L/Ung iviargin [KjUKj^t j o j ) 


AO fi 
4V.U 


Liver cancer ouuh- 1 


1 < A 


idling cancer \\Jij\j £ t£ o /- 
01) 


21.0 


Liver Tissue 6004-N 


9.6 


02) 


55.9 


Liver Cancer 6005-T 


6.5 


Onilpr N/TpI TV^At tr\ T iv*»r 
v/wuiai ivici ivici juivcr 

(ODO4310) 


3.6 


Liver Tissue 6005-N 


2.6 


T iv«»r Mnrmn f fYTlf'lA'l 1 (Y\ 

i-ii vcr iviargin ^vjuuhj i u j 




jNormai Diaaaer 


Ati n 
40. / 


Melanoma Mets to Lung 
(OD04321) 


7.0 


Bladder Cancer 1023 


10.4 


Lung Margin (OD04321) 


67.8 


Bladder Cancer 
A302173 


16.0 


Normal Kidney 


47.0 


Bladder Cancer 
(OD04718-01) 


32.1 


Kidney Ca, Nuclear grade 
2 (OD04338) 


39.5 


Bladder Normal 
Adjacent (OD04718- 
03) 


28.3 
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Kidney Margin 
(OD04338) 


45.4 


Normal Ovary 


2.4 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


80.1 


Ovarian Cancer 
064008 


33.2 


Kidney Margin 
(OD04339) 


28.1 


Ovarian Cancer 
(OD04768-07) 


80.7 


Kidney Ca, Clear cell 
type (OD04340) 


67.8 


Ovary Margin 
(OD04768-08) 


11.5 


Kidney Margin 
(OD04340) 


57.4 


Normal Stomach 


27.2 


Kidney Ca, Nuclear grade 
3 (OD04348) 


16.3 


Gastric Cancer 
9060358 


6.9 


Kidney Margin 
(OD04348) 


60.7 


Stomach Margin 
9060359 


10.3 


Kidney Cancer 
(OD04622-01) 


27.5 


Gastric Cancer 
9060395 


18.4 


Kidney Margin 
\yj\J\) t V3lL-\)5 ) 


5.6 


Stomach Margin 


22.8 


Kidney Cancer 
(OD04450-01) 


28.5 


Gastric Cancer 
9060397 


27.4 


Kidney Margin 
(OD04450-03) 


34.4 


Stomach Margin 
9060396 


9.2 


Kidney Cancer 8120607 


3.5 


Gastric Cancer 
064005 


77.9 



Table CHEL Panel 4.1D 



Tissue Name 


Rel. 
Exp.(%) 
Ag4520, 

Run 
198263642 


Rel. 
Exp.(%) 
Ag4520, 
Run 
219310605 


Tissue Name 


Rel. 
Exp.(%) 
Ag4520, 

Run 
198263642 


Rel. 
Exp.(%) 
Ag4520, 
Run 
219310605 


Secondary Thl act 


22.5 


26.1 


HUVEC IL-lbeta 


9.5 


11.3 


Secondary Th2 act 


37.4 


40.1 


HUVECIFN 
gamma 


17.0 


15.6 


Secondary Trl act 


22.8 


22.7 


HUVEC TNF 
alpha + IFN 
gamma 


18.0 


23.8 


Secondary Thl rest 


7.8 


6.3 


HUVEC TNF 
alpha + IL4 


12.0 


12.8 


Secondary Th2 rest 


8.7 


11.7 


HUVEC IL-11 


5.4 


4.6 


Secondary Trl rest 


7.7 


3.8 


Lung 
Microvascular EC 
none 


17.6 


20.7 


Primary Thl act 


35.8 


22.4 


Lung 
Microvascular EC 


19.9 


23.5 
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TNFalpha + IL- 
lbeta 






Primary Th2 act 


36.3 


J 39.0 


Microvascular 
Dermal EC none 


8.1 


8.7 


Primary Trl act 


.34.9 


36.6 


Microsvasular 

Dermal EC 
TNFalpha + IL- 
Ibeta 


9.1 


11.1 


Primary Thl rest 


3.8 


2.6 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


5.3 


5.0 


Primary Th2 rest 


3.4 


3.1 


Small airway 
epithelium none 


1 6 


1 8 


Primarv Trl rest 

* * 1*1.1 1** J X * -I 1 VJW 


4.1 


8.8 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


10 6 


9 7 


CD45RA CD4 
lymphocyte act 


15.1 


14.0 


Coronery artery 
SMC rest 


2.5 


1.9 


CD45RO CD4 
lymphocyte act 


24.3 


26.2 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


3.7 


3.4 


CD8 lymphocyte 
act 


13.4 


16.3 


Astrocytes rest 


1.1 


2.4 


Secondary CD8 
lymphocyte rest 


16.8 


14.1 


Astrocytes 
TNFalpha + EL- 
lbeta 


7.1 


5.8 


Secondary CD8 
lymphocyte act 


9.1 


10.8 


KU-812 
(Basophil) rest 




1 7 


CD4 lymphocyte 
none 


4.2 


3.7 


KU-812 
(Basophil) 
PMA/ionomycin 


8.4 


7.0 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


14.7 


13.1 


CCD1106 
(Keratinocytes) 
none 


1.2 


1.7 


LAK cells rest 


12.4 


11.7 


CCD1106 | 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


6.6 


3.6 ' 


T AK celk TT -? 


14 ^ 
i*t.j 




•Liver cirrnosis 


Z.O 


■a a. 
i.O 


LAK cells IL-2+IL- 
12 


9.7 


6.7 


NCI-H292 none 


2.6 


2.9 


LAK cells IL- 
2+IFN gamma 


9.3 


9.0 


NCI-H292 IL-4 


2.6 


3.6 


LAK cells IL-2+ 
IL-18 


12.8 


11.7 


NCI-H292 IL-9 


3.5 


3.5 
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LAK cells 
PMA/ionomvcin 

a i'U mJ IvilV/lU J will 


25.0 


32.1 


NCI-H292 IL-13 


3.4 


3.1 


NK Cells IL-2 rest 


18.2 


19.6 


NCI-H292 IFN 


3.6 


4.2 


Two Way MLR 3 
day 


20.7 


22.7 


HPAEC none 


5.0 


5.2 


i wo Way JVLLK 5 
day 


11.4 


15.3 


HPAEC TNF 
alpha + IL-I beta 


19.9 


25.0 


1 wo way JVLLK 7 
day 


132 


14.6 


Lung fibroblast 
none 


1.5 


1.6 


PBMC rest 


5.0 


4.7 


T X~1_ 1_ I . 

Lung fibroblast 
TNF alpha + IL-1 
beta 


3.9 


3.2 


PBMCPWM 


18.0 


21.8 


Lung fibroblast 


1.2 


1.4 


PBMC PHA-L 


15.0 


17.4 


Lung fibroblast 


1.1 


2.4 


Ramos (B cell) 
none 


1.9 


2.1 


Lung fibroblast 
IL-13 


1.5 


1.8 


Kamos (d cell) 
ionomycin 


3.0 


3.0 


Lung fibroblast 
IFN gamma 


2.5 


3.1 


B lymphocytes 
rWM 


10.4 


12.7 


Dermal fibroblast 
CCDl 070 rest 


3.4 


1.8 


B lymphocytes 
CD40LandIL-4 


1 H A 
17.0 


OA A 

20.0 


Dermal fibroblast 
CCDl 070 TNF 
alpha 


15.3 


18.3 


EOL-1 dbcAMP 


9.8 


17.0 


Dermal fibroblast 
CCDl 070 IL-1 

DC la 


4.6 


4.6 


EOL-1 dbcAMP 
PMA/ionomvcin 


37.4 


323 


Dermal fibroblast 

1 p\J onmma 
JLTIN gainiild. 


3.3 


2.9 


Dendritic cells 
none 


8.4 


4.8 


Piptmfil "fihr/Vhlnct 

IL-4 


4.2 


2.9 


uenontic cells Lro 


OA l\ 

20.0 


25.2 


Dermal 
Fibroblasts rest 


3.3 


1.8 


Dendritic cells anti- 
CD^ 


U.J 


O.J 


Neutrophils . 
TNFa+LPS • 


31. o 


5o.o 


Monocytes rest 


10.8 


9.8 


Neutrophils rest 


23.2 


26.2 


Monocytes LPS 


100.0 


100.0 


Colon 


1.7 


1.6 


Macrophages rest 


7.6 


6.8 


Lung 


4.0 


1.8 


Macrophages LPS 


29.7 


26.6 


Thymus 


7.9 


8.7 


HUVEC none 


3.1 


4.3 


Kidney 


6.3 


3.6 


HUVEC starved 


10.4 


8.0 
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Table CHI. Panel 4D 



l issue fName 


Rel. Exp.(%) 
Ag2yo4, Kun 
159078059 


Tissue Name 


Rel. Exp.(%) 
Ag2904, Run 
159078059 


oeconuary l n i ac i 


JZ.J 


MUV tic LL,-y Deta 


14.5 


Secondary Th2 act 


65.5 


HUVEC IFN gamma 


32.5 


Secondary Trl act 


58.2 


HUVEC TNF alpha + IFN 
gamma 


52.9 


Secondary Thl rest 


13.1 


HUVEC TNF alpha + IL4 


28.9 


Secondary Th2 rest 


18.8 


HUVEC IL- 11 


8.4 


Secondary Trl rest 


16.6 


Lung Microvascular EC 
none 


22.5 


Primary Thl act 


94.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


39.2 


Primary Th2 act 


66.4 


Microvascular Dermal EC 
none 


20.2 


Primary Trl act 


94.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


30.4 


Primary Thl rest 


53.6 


Bronchial epithelium 
TNFalpha + ILlbeta 


9.2 


Primary Th2rest 


30.1 


Small airway epithelium 
none 


6.1 


Primary Trl rest 


18.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


53.6 


CD45RA CD4 
lymphocyte act 


24.7 


Coronery artery SMC rest 


16.7 


CD45RO CD4 
lymphocyte act 


47.6 


Coronery artery SMC 
TNFalpha + IL-lbeta 


4.6 


CD8 lymphocyte act 


20.4 


Astrocytes rest 


4.8 


Secondary CD8 
lymphocyte rest 


24.8 


Astrocytes TNFalpha + 
IL-Ibeta 


16.2 


Secondary CD8 
lymphocyte act 


27.5 


KU-812 (Basophil) rest 


6.6 


CD4 lymphocyte none 


9.9 


KU-812 (Basophil) 
PMA/ionomycin 


17.3 


2ry Thl/Th2/Trl_anti- 


27.5 


CCD1106 (Keratinocytes) 
none 


2.1 


LAK cells rest 


28.9 


CCD1 1 06 flCeratinncvte^ 
TNFalpha + IL-lbeta 


4.2 


LAK cells IL-2 


33.2 


Liver cirrhosis 


5.6 


LAK cells IL-2+IL-12 


34.9 


Lupus kidney 


6.3 


LAK cells IL-2+IFN 
gamma 


47.3 


NCI-H292none 


16.2 


LAK cells IL-2+IL- 18 


94.0 


NCI-H292 IL-4 


13.4 


LAK cells 


48.3 


NCI-H292 IL-9 


12.9 
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PMA/ionomycin 








NK Cells IL-2 rest 


28.1 


NCI-H292 IL-13 


6.2 


l wu w djr lvuLfix j uay 




NfT-WQl TFN pamma 
iiv-i nA7i xx i^i gamma 


13 6 


Two Wflv A/fT R S Hflv 




TTPAFC none 


14 6 


Two Way MLR 7 day 


22.1 


HP AFC TNF alnha + IT -1 
beta 


35.1 


xDxviy^ rest 




T iin rr "fii^T*r\K»1 cici* tirvn a 
JL/UUg ilOrUUlabL I1UI1C 


J.U 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 

+ TT -1 heta 


6.4 


PBMC PHA-L 


34.9 


Lung fibroblast IL-4 


5.8 


Ramos (B cell) none 


4.5 


Lung fibroblast IL-9 


4.0 


Ramos (B cell) 
ionomycin 


13.0 


Lung fibroblast IL-13 


3.0 


B lymphocytes PWM 


96.6 


Lung fibroblast IFN 
gamma 


8.2 


B lymphocytes CD40L 
and IL-4 


84.7 


Dermal fibroblast 
CCD 1070 rest 


9.5 


EOL-1 dbcAMP 


19.5 


Dermal fibroblast 
CCD1070 TNF alpha 


99.3 


EOL-1 dbcAMP 
x ivi/v l onumy c in 


74.7 


Dermal fibroblast 

fpn 1070 TT -1 Kp>tQ 


15.2 


Dendritic cells none 


13.4 


nprmq] -RhrnMact TT7M 
JL/Cl lllal ilDruDlaSl IT IN 

gamma 


7.5 


Dendritic cells LPS 


52.9 


Dermal fibroblast IL-4 


6.7 


Dendritic cells anti- 
CD40 


12.0 


IBD Colitis 2 


1.3 


Monocytes rest 


22.2 


IBD Crohn's 


3.7 


Monocytes LPS 


67.8 


Colon 


15.7 


Macrophages rest 


18.9 


Lung 


11.3 


Macrophages LPS 


99.3 


Thymus 


28.3 


HUVEC none 


15.3 


Kidney 


31.9 


HUVEC starved 


34.2 







AI_comprehensive paneljvl.O Summary: Ag4520 The NOV93 gene is widely 
expressed among the samples on this panel, with highest expression in normal colon adjacent 
to diseased colon (CT=29). This widespread pattern of expression is consistent with 
5 expression in Panels 4D and 4.1D. Please see Panel 4.1D for discussion of utility of this gene 
in inflammation. 

CNS_neurodegeneration_vl.O Summary: Ag2904/Ag4520 The NOV93 gene, an 

IMP dehydrogenase homolog, shows a small but significant (p=0.02) upregulation in the 

postmortem Alzheimer's brain when compared to nondemented controls. IMP dehydrogenase 

10 is involved in purine metabolism, and has been implicated as a drug target for supressing the 
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immune response, inflammation, and cerebral edema. The observed increase in the expression 
of this gene is in concordance with the evidence for the role of neuroinflammation in 
Alzheimer's disease. Therefore, the inhibition of this molecule may be of therapeutic benefit in 
Alzheimer's disease, head or spinal cord trauma, stroke, cerebral edema, or viral infections of 
5 theCNS. 

References: 

Hall IH, Wyrick SD. Cytoxicity of [(5,6-dichloro-9a-n-propyl-2,3,9,9a-tetrahydro-3- 
oxo-lH fluoren-7-yl)oxy]acetic acid, an agent known to reduce brain edema. Biomed 
Pharmacother 1996;50(l):19-23 

10 A known agent, [(5,6-dichloro-9a-n-propyl-2,3,9,9a-tetrahydro-3-oxo-lH fluoren-7- 

yl)oxy]acetic acid, which blocks brain edema, was also shown to be a potent cytotoxic agent in 
leukemia cells. The major site of action of the agents appears to be in the de novo purine 
synthetic pathway in L1210 leukemic cells. Both PRPP amido transferase and IMP 
dehydrogenase activities were suppressed by the agent. The inhibition of both regulatory 

1 5 enzymes of the pathway along with the reduction of dihydrofolate reductase activity would 
account for the observed suppression of DNA and RNA syntheses and subsequent cancer cell 
death. 

Senda M, Natsumeda Y. Tissue-differential expression of two distinct genes for human 
IMP dehydrogenase (E.C.I. 1.1. 205). Life Sci 1994;54(24): 191 7-26 

20 Human IMP dehydrogenase (E.C .1.1.1 .205) is recently regarded as a potent targeting- 

enzyme for immunosuppressive drugs. Tissue differential expressions of human type I and 
type II IMP dehydrogenase were investigated in sixteen human adult organs (heart, brain, 
placenta, lung, liver, skeletal muscle, kidney, pancreas, spleen, thymus, prostate, testis, ovary, 
small intestine, colon, peripheral blood leukocytes) and five human fetal organs (heart, brain, 

25 lung, liver, kidney) using Northern blot analysis. In all tissues examined in this study, the sizes 
ofmRNAs of each isoform were identical, respectively. The 2.3 kb type II mRNA was shown 
predominantly, and the 3.5 kb type I mRNA level was lower than type II in most human 
tissues examined. In contrast, type I IMPDH gene expressed higher than type II in peripheral 
blood leukocytes, uniquely. We also demonstrated that both type I and type II IMPDH genes 

30 are widely distributed among various species by Southern blot analysis. Interestingly, type I 
IMPDH gene may have multiple gene families in primates, [dstone, 17-Jan-02] 

General_screening_panel_vl.4 Summary: Ag4250 Two experiments with the same 
probe and primer set produce results that are in excellent agreement, with highest expression 
of the NOV93 gene in a gastric cancer cell line and fetal lung tissue. In addition, there appears 
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to be substantial expression associated with breast cancer cell lines, lung cancer cell lines and 
renal cancer cell lines. Thus, the expression of this gene could be used to distinguish NCI-N87 
and fetal lung tissue from the other samples in the panel. Moreover, therapeutic modulation of 
this gene, through the use of small molecule drugs, antibodies or protein therapeutics might be 
5 beneficial in the treatment of breast, lung or kidney cancer. 

Among tissues with metabolic function, this gene has low-to-moderate levels of 
expression in adipose, liver, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. 
Thus, this gene product may be a small molecule target for the treatment of metabolic and 
endocrine diseases, including obesity and Type 2 diabetes. This encodes a putative IMP 

10 dehydrogenase, which is involved in purine metabolism and has been implicated as a target for 
suppressing the immune response. Thus, this gene product may also be a treatment for Type 1 
diabetes, in which insulin-secreting beta cells are destroyed by the autoimmune response 
against them. In addition, this gene appears to be differentially expressed in fetal (CT values = 
30) vs adult liver (CT value - 33) and in fetal (CT values -27-28) vs. adult lung (CTs - 32- 

15 33), and may be useful for the differentiation between the two sources of these tissues. 

This molecule is also expressed at moderate to low levels in all CNS regions examined. 
Please see panel CNS_Neurodegeneration for a discussion of utility of this gene in the central 
nervous system. 

Panel 1.3D Summary: Ag2904 Expression of the NOV93 gene is higher overall in 
20 normal tissues, with highest expression in fetal skeletal muscle. Furthermore, this gene is 
expressed at higher levels in fetal skeletal muscle (CT-29) when compared to expression in 
adult skeletal muscle (CT=32). Thus, expression of this gene could be used to differentiate 
between fetal skeletal muscle and other samples on this panel and between fetal and adult 
skeletal muscle. 

25 Expression in the CNS is consistent with expression in previous panels. Please see 

CNS_neurodegeneration for discussion of utility of this gene in the central nervous system. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal muscle, heart 
and liver. This widespread expression among these tissues suggests that this gene product may 

30 be useful for the diagnosis and/or treatment of metabolic disease, including obesity and 
diabetes. 

Panel 2D Summary: Ag2904 The expression of the NOV93 gene appears to be 
highest in a sample derived from normal lung tissue. In addition, there appears to be 
substantial expression in most of the samples in the panel. Of note is the expression associated 
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with normal lung tissue when compared to adjacent lung cancer tissue. Thus, the expression of 
this gene could be used to distinguish this sample of normal lung tissue from other samples in 
the panel. In addition, the expression of this gene could be used to distinguish normal lung 
tissue adjacent to cancer tissue. Moreover, therapeutic modulation of this gene, through the 
5 use of small molecule drugs, antibodies or protein therapeutics might be beneficial in the 
treatment of lung cancer. 

Panel 4D/4.1D Summary: Ag2904/Ag4520 The NOV93 gene, a novel IMP 
dehydrogenase-like protein, is differentially expressed, as displayed in Panels 4. ID and 4D, in 
activated T cells, activated B cells, activated monocytes, activated macrophages, and activated 

1 0 dendritic cells. Small molecuie antagonists of the previously characterized IMP dehydrogenase 
have been found to be useful in the treatment of several immunopathological states (See 
Allison and Eugui, 2001). Therefore, small molecule antagonists of the NOV93 gene product 
may reduce or eliminate the symptoms of autoimmune and inflammatory diseases, including 
Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, 

15 asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 
References: 

Allison AC, Eugui EM. Mycophenolate mofetil and its mechanisms of action. 
Immunopharmacology 2000 May;47(2-3):85-l 1 8 

Mycophenolate mofetil (MMF, CellCept(R)) is a prodrug of mycophenolic acid 

20 (MPA), an inhibitor of inosine monophosphate dehydrogenase (IMPDH). This is the rate- 
limiting enzyme in de novo synthesis of guanosine nucleotides. T- and B-lymphocytes are 
more dependent on this pathway than other cell types are. Moreover, MPA is a fivefold more 
potent inhibitor of the type II isoform of IMPDH, which is expressed in activated 
lymphocytes, than of the type I isoform of IMPDH, which is expressed in most cell types. 

25 MPA has therefore a more potent cytostatic effect on lymphocytes than on other cell types. 

This is the principal mechanism by which MPA exerts immunosuppressive effects.Three other 
mechanisms may also contribute to the efficacy of MPA in preventing allograft rejection and 
other applications. First, MPA can induce apoptosis of activated T-lymphocytes, which may 
eliminate clones of cells responding to antigenic stimulation. Second, by depleting guanosine 

30 nucleotides, MPA suppresses glycosylation and the expression of some adhesion molecules, 
thereby decreasing the recruitment of lymphocytes and monocytes into sites of inflammation 
and graft rejection. Third, by depleting guanosine nucleotides MPA also depletes 
tetrahydrobiopterin, a co-factor for the inducible form of nitric oxide synthase (iNOS). MPA 
therefore suppresses the production by iNOS of NO, and consequent tissue damage mediated 
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by peroxynitrite. CellCept(R) suppresses T-lymphocytic responses to allogeneic cells and 
other antigens. The drug also suppresses primary, but not secondary, antibody responses. The 
efficacy of regimes including CellCept(R) in preventing allograft rejection, and in the 
treatment of rejection, is now firmly established. CellCept(R) is also efficacious in several 
5 experimental animal models of chronic rejection, and it is hoped that the drug will have the 
same effect in humans. 

NOV94 

Expression of gene NOV94 was assessed using the primer-probe set Ag2905, 
described in Table CIA. Results of the RTQ-PCR runs are shown in Tables CIB, CIC and 
10 CID. 

Table CIA. Probe Name Ag2905 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ZD NO: 


Forward 


5' -gcagaataccacgatgacttct-3 ' 


22 


1468 


1366 


Probe 


TET-5 1 -agtcagcttacgtcgctgcctctgag-3 1 - 
TAMRA 


26 


1496 


1367 


Reverse 


5 1 -gttcctggtgctgtaatgca-3 1 


20 


1523 


1368 



Table CIB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2905, 
Run 161374149 


Tissue Name 


Rel.Exp.(%)Ag2905, 
Run 161374149 


Liver adenocarcinoma 


20.4 


Kidney (fetal) 


1.4 


Pancreas 


0.1 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


4.9 


Renal ca. A498 


7.4 


Adrenal gland 


0.2 


Renal ca.RXF 393 


9.0 


Thyroid 


0.9 


Renal ca. ACHN 


2.4 


Salivary gland 


0.6 


Renal ca.UO-31 


5.5 


Pituitary gland 


0.4 


Renal ca. TK-10 


13.7 


Brain (fetal) 


0.4 


Liver 


0.3 


Brain (whole) 


0.2 


Liver (fetal) 


0.8 


Brain (amygdala) 


12 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.2 


Lung 


0.6 


Brain (hippocampus) 


1.0 


Lung (fetal) 


0.9 


Brain (substantia nigra) 


0.3 


Lung ca. (small cell) 
LX-1 


15.9 


Brain (thalamus) 


1.0 


Lung ca. (small cell) 
NCI-H69 


I- 2 
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Cerebral Cortex 


2.3 


Lung ca. (s.cell var.) 
SHP-77 


27.7 


Spinal cord 


1.5 


Lung ca. (large 
cell)NCI-H460 


2.0 


glio/astro U87-MG 


11.7 


Lung ca. (non-sm. 
cell) A549 


8.0 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


2.8 


astrocytoma SW1783 


0.2 


Lung ca. (non-s.cell) 
HOP-62 


1.9 


neuro*; met SK-N-AS 


5.8 


Lung ca. (non-s.cl) 
NCI-H522 


5.4 


astrocytoma SF-539 


5.3 


Lung ca. (squam.) 
SW 900 


2.6 


astrocytoma SNB-75 


1.7 


Lung ca. (squam.) 
NCI-H596 


1.6 


glioma SNB-19 


0.1 


Mammary gland 


2.2 


glioma U251 


0.0 


Breast ca.* (pl.ef) 

MPF 7 


40.6 


glioma SF-295 


0.3 


rsreasx ca. yji.eij 
MDA-MB-231 


6.3 


Heart (fetal) 


0.2 


ore as i ca. yji.ei^ 
T47D 


0.0 


riCaJl 




D rooc f DT KAQ 


1 0 


Skeletal muscle (fetal) 


3.4 


Breast ca. MDA-N 


10.1 


Skeletal muscle 


0.7 


Ovary 


1.9 i 


Bone marrow 


1.0 


Ovarian ca. OVCAR- 
3 


1.6 


Thymus 


20.2 


Ovarian ca. OVCAR- 
4 


0.3 


Spleen 


0.8 


Ovarian ca. OVCAR- 
5 


7.8 


Lymph node 


0.2 


Ovarian ca. OVCAR- 
c 

o 


102 


Colorectal 


1.0 


Ovarian ca. IGROV- 

i 
i 


1.5 


Stomach 


1.9 


Ovarian ca.* (ascites) 

OlV-U V J 


9.4 


Small intestine 


2.3 


Uterus 


0 1 


Colon ca. SW480 


6.0 


Placenta 


1.6 


Colon ca.* 
SW620(SW480 met) 


92 


Prostate 


3.5 


Colon ca. HT29 


30.8 


Prostate ca.* (bone 
met)PC-3 


9.3 


Colon ca.HCT-1 16 


112 


Testis 


100.0 
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Colon ca. CaCo-2 


11.3 


Melanoma 
Hs688(A).T 


1.2 


Colon ca. 
tissue(OD03866) 


2.4 


Melanoma* (met) 
Hs688(B).T 


2.0 


Colon ca. HCC-2998 


7.2 


Melanoma UACC-62 


1.5 


Gastric ca.* (liver met) 
NCI-N87 


40.6 


Melanoma M 14 


1.0 


Bladder " 


2.0 


Melanoma LOX 
IMVI 


0.6 


Trachea 


4.0 


Melanoma* (met) 
SK-MEL-5 


1.0 


Kidney 


1.0 


Adipose 


0.4 



Table CIC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2905,Run 
161374481 


Tissue Name 


Rel.Exp.(%) 
Ag2905,Run 
161374481 


Normal Colon 


15.3 


Kidney Margin 
8120608 


1.1 


CC Well to Mod Diff 
(OD03866) 


5.4 


Kidney Cancer 
8120613 


0.5 


CC Margin (OD03866) 


i.i 


Kidney Margin 
8120614 


0.0 


CC Gx2 rectosigmoid 
(OD03868) 


3.1 


Kidney Cancer 
9010320 


42.3 


CC Margin (OD03868) 


1.1 


Kidney Margin 
9010321 


2.0 


CC Mod Diff (ODO3920) 


26.6 


Normal Uterus 


1.8 


CC Margin (ODO3920) 


3.6 


Uterus Cancer 064011 


8.2 


CC Gr.2 ascend colon 
(OD03921) 


14.2 


Normal Thyroid 


9.2 


CC Margin (OD03921) 


6.2 


Thyroid Cancer 
064010 


16.0 ' 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


44.8 


Thyroid Cancer 
A302152 


10.2 


Liver Margin (ODO4309) 


8.4 


Thyroid Margin 
A302153 


15.1 


Colon mets to lung 
(OD04451-01) 


11.0 


Normal Breast 


9.7 


Lung Margin (OD04451- 
02) 


0.9 


Breast Cancer 
(OD04566) 


1.2 


Normal Prostate 6546-1 


10.1 


Breast Cancer 
(OD04590-01) 


3.5 


Prostate Cancer 


52.9 


Breast Cancer Mets 


2.0 
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(OD04410) 




(OD04590-03) 




Prostate Margin 
(OD04410) 


23.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


54.7 


Prostate Cancer 
(OD04720-01) 


18.7 


Breast Cancer 064006 


8.1 


Prostate Margin 
(ODU4720-02) 


332 


Breast Cancer 1024 


26.1 


Normal Lung 061010 


12.9 


Breast Cancer 
9100266 


30.6 


Lung Met to Muscle 
fOD04286^ 


29.3 


Breast Margin 


17.4 


Muscle MarfHn 
(OD04286) 


2.7 


or cool ^oiiLrCr 

A209073 


62.0 


Lung Malignant Cancer 
(OD03126) 


3.4 


Breast Margin 
A2090734 


20.7 


Lung Margin (OD03 126) 


7.5 


Normal Liver 


11.7 


Lung Cancer (OD04404) 


35.6 


Liver Cancer 064003 


6.7 


Lune Mar pin fODfl44ft4^ 


4 0 


ver cancer l uzd 




Lung Cancer (OD04565) 


0.7 


Liver Cancer 1026 


8,2 


Lung Margin (OD04565) 


22 


Liver Cancer 6004-T 


3.3 


Lung Cancer (OD04237- 
01) 


51.8 


Liver Tissue 6004-N 


15.3 


Lung Margin (UD04237- 

„ 02 > 


0.4 


Liver Cancer 6005-T 


10.4 


Ocular Mel Met to Liver 
(ODO4310) 


41.8 


Liver Tissue 6005-N 


02 


Liver Margin (UJLHJ4J1U) 


A A 

4.4 


Normal Bladder 


16.7 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


6.3 


Lung Margin (OD04321) 


4.7 


Bladder Cancer 
A302173 


40.6 


Normal Kidney 


7.1 


Bladder Cancer 
(OD04718-01) 


1.0 


Kidney Ca, Nuclear grade 
2 (OD04338) | 


16.5 


Bladder Normal 
Adjacent (OD04718- 
03) 


5.7 


Kidney Margin 
(OD04338^ 


7.4 


Normal Ovary 


1.3 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


12.9 


Ovarian Cancer 
064008 


100.0 


Kidney Margin 
(OD04339) 


4.6 


Ovarian Cancer 
(OD04768-07) 


18.3 


Kidney Ca, Clear cell 
type (OD04340) 


5.8 


Ovary Margin 
(OD04768-08) 


0.7 
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Kidney Margin 
(OD04340) 


3.5 


Normal Stomach 


5.8 


Kidney Ca, Nuclear grade 
3 (OD04348) 


4.0 


Gastric Cancer 
9060358 


0.9 


Kidney Margin 
(OD04348) 


4.4 


Stomach Margin 
9060359 


4.8 


Kidney Cancer 
(OD04622-01) 


0.7 


Gastric Cancer 
9060395 


13.2 


Kidney Margin 


1.3 


Stomach Margin 


8.3 


Kidney Cancer 
(OD04450-01) 


11.9 


Gastric Cancer 
9060397 


14.7 


Kidney Margin 
(OD04450-03) 


8.8 


Stomach Margin 
9060396 


2.1 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


17.1 



Table CD). Panel 4D 



Tissue Name 


Rel. Exp.(%) 
AplOOS Run 
159772697 


1 lwUc IK^tlllt? 


Rel. Exp.(%) 

ArrtQfWh Dim 

Agzyvj) ivun 
159772697 


Secondary Thl act 


12.8 


HUVEC IL-lbeta 


8.2 


Secondary Th2 act 


6.5 


HUVEC IFN gamma 


9.1 


Secondary Trl act 


8.7 


HUVEC TNF alpha + IFN 
gamma 


11.6 


Secondary Thl rest 


4.7 


HUVEC TNF alpha + IL4 


12.2 


Secondary Th2 rest 


6.0 


HUVEC IL-11 


9.0 


Secondary Trl rest 


6.7 


Lung Microvascular EC 
none 


1.4 


Primary Thl act 


12.5 


Lung Microvascular EC 
TNFalpha* IL-lbeta 


3.1 


Primary Th2 act 


10.6 


Microvascular Dermal EC 
none 


24.3 


Primary Trl act 


16.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


13.7 


Primary Thl rest 


24.5 


Bronchial epithelium 
TNFalpha 4- ILlbeta 


1.0 


Primary Th2 rest 


10.2 


Small airway epithelium 
none 


6.7 


Primary Trl rest 


17.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


29.1 


CD45RA CD4 
lymphocyte act 


3.8 


Coronery artery SMC rest 


9.9 


CD45RO CD4 
lymphocyte act 


8.9 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.0 
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j. i 


Astrocytes rest 


A 1 

9.1 


Secondary CD8 

1 vmnVi npvfp rpct 


8.2 


Astrocytes TNFalpha + 

TT 1 koto 

il- i beta 


5.4 


Secondary CD8 

Ivmnhnovtp apt 


7.6 


KU-8] 2 (Basophil) rest 


10.7 


CD4 lymphocyte none 


1.0 


ivU-oiz (JDasopnilJ 
PMA/i onom vein 


21.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


8.9 


CCD 1 106 (Keratinocytes) 
none 


30.4 


LAK cells rest 


4.7 


CCD 1 106 (Keratinocvtes^ 
TNFalpha + IL-lbeta 


1.1 


LAK cells IL-2 


10.0 


Liver cirrho^te 


1 i 


LAK cells IL-2+IL-12 


11.2 


Ludus kidnev 


1 0 


LAK cells EL-2+IFN 
gamma 


14.7 


NCI-H292 none 


35.4 


LAK cellsIL-2+IL-18 


11.7 


NCI-H292 IL-4 


29.7 


LAK cells 
PMA/ionomycin 


2.8 


NCI-H292 IL-9 


49.0 


NK Cells IL-2 rest 


5.0 


NCI-H292 IL-13 


17.9 


Two Way MLR 3 day 


5.3 


NCI-H292 IFN gamma 




Two Way MLR 5 day 


5.8 


HPAEC none 


& R 

*T.O 


Two Way MLR 7 day 


3.4 


HPAEC TNF aloha + IL-1 
beta 


5.0 


PBMC rest 


' 0.8 


Lun2 fibroblast none 


7 1 


PBMC PWM 


14.4 


Lung fibroblast TNF alpha 
+ IL-1 beta 


3.9 


PBMC PHA-L 


9.0 


Lung fibroblast IL-4 


8.4 


Ramos fR rell^ nnnf* 


17 4 


l^ung iiDroDiast LL-y 


10.9 


Ramos (B cell) 
ionomvcin 

1VUV11I If Will 


100.0 


Lung fibroblast IL-13 


8.8 


B lymphocytes PWM 


31.2 


Lung fibroblast IFN 
gam ma 


I7.l 


B lymphocytes CD40L 
and IL-4 


15.5 


Dermal fibroblast 
^y^u i u / u rest 


31.9 


EOL-1 dbcAMP 


0.0 


ju/crmai iiDroDiasi 
CCD1070 TNF alpha 


39.2 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


uermai IiDroDiast 
CCD1070EL-1 beta 


12.2 


Dendritic rpllc nnnp 


A O 


Dermal fibroblast IFN 
gamma 


■—■■"■"»— — — 

6.4 


Dendritic cells LPS 


3.0 


Dermal fibroblast IL-4 


18.2 


Dendritic cells anti- 
CD40 


4.9 


IBD Colitis 2 


0.7 


Monocytes rest 


2.0 


IBD Crohn's 


0.0 j 


Monocytes LPS 


2.0 


Colon 


5.4 i 
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Macrophages rest 


3.6 | 


Lung 


5.2 


Macrophages LPS 


1.8 


Thymus 


3.0 


HUVEC none 


192 


Kidney 


20.0 


HUVEC starved 


24.0 







Panel 1.3D Summary: Ag2905 The expression of the NOV94 gene appears to be 
highest in a sample derived from normal testis tissue (CT=28.9). In addition, there is 
substantial expression associated with samples derived from colon cancer cell lines, lung 
5 cancer cell lines and breast cancer cell lines. Thus, the expression of this gene could be used to 
distinguish normal testis tissue from other samples in the panel. Moreover, therapeutic 
modulation of this gene, through the use of small molecule drugs, protein therapeutics or 
antibodies could be beneficial for the treatment of colon, lung or breast cancer. 

In addition, this gene appears to be differentially expressed in fetal (CT value - 37) vs 
1 0 adult heart (CT value = 32), and may be useful for the differentiation between the two sources 
of heart tissue. 

Panel 2D Summary: Ag2905 The expression of the NOV94 gene appears to be 
highest in a sample derived from an ovarian cancer (CT=30.5). In addition, there appears to be 
substantial expression associated with breast cancers, lung cancers, gastric cancers, prostate . 

15 cancers and colon cancers. Thus, the expression of this gene could be used to distinguish this 
ovarian cancer sample from others in the panel. Moreover, therapeutic modulation of this 
gene, through the use of small molecule drugs, protein therapeutics or antibodies might be 
beneficial in the treatment of ovarian, breast, lung, gastric, prostate or colon cancer. 

Panel 4D Summary: Ag2905 Low but significant expression of the NOV94 transcript 

20 is found predominantly in activated B cell lymphoma cell line (Ramos) and activated B cells 
(CTs=32-34). It is also found in HUVEC, keratinocytes, lung fibroblasts and the muco- 
epidermoid cell line H292. Therefore, targeting of this gene product with a small molecule 
drug therapeutic may modulate the functions of B cells and lead to the improvement of 
symptoms in autoimmune diseases such as lupus erythematosus, rheumatoid arthritis, 

25 hyperglobulinemia and other B cell disorders. 

NOV95 

Expression of gene NOV95 was assessed using the primer-probe set Ag3060, 
described in Table CJA. Results of the RTQ-PCR runs are shown in Tables CJB and CJC. 
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Table CJA. Probe Name Ag3060 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -caaagattgcagcaatcgatag-3 1 


22 


189 


1369 


Probe 


TET-5 ' -agtatacacgaggctttggccatcca-3 1 - 
TAMRA 


26 


219 


1370 


Reverse 


5 1 -aggacagagctttcacaagtga-3 » 


22 


245 


1371 



Table CJB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3060, 
Run 168016485 


Tissue Name 


Rel Fxn (%\ AcrtOfiO 
Run 168016485 


Liver adenocarcinoma 


0.2 


Kidnev (fetall 


b 3 


Pancreas 


0.2 


Renal ca. 786-0 


0.1 


Pancreatic* ra PAPA\T 
2 


0.0 


Renal ca. A498 


0.1 


AHrptiftl olnnH 
aui vital gla.UU 


n 1 

U.l 


Pan«l T>"VT? 1 C\1 

Kenai ca. KAr yy5 


0.1 


Thyroid 


0.0 


Renal ca. ACHN 


0.1 


Salivary gland 


0.0 


Renal ca.UO-31 * 


o.i 


Pituitary gland 


0.1 


Renal ca.TK- 10 


0.1 


Brain (fetal) 


0.1 


Liver 


0.0 


Brain (whole) 


0.1 


Liver (fetal) 


0.0 | 


Brain (amygdala) 


0.1 


Liver ca. 
(hepatoblast) HepG2 


0.1 


Brain (cerebellum) 


0.1 


Lung 


0.0 


Brain (hippocampus) 


. 02 


Lung (fetal) 


0.1 


Brain (substantia nigra) 


0.1 


Lung ca. (small cell) 
LX-1 


0.1 


Brain (thalamus) 


0.1 


Lung ca. (small cell) 
NCI-H69 


ft 1 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.2 


Spinal cord 


0.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell)A549 


0.3 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.1 


astrocytoma SW1783 


0.2 


Lung ca. (non-s.cell) 
HOP-62 


0.1 


neuro*;metSK-N-AS 


0.1 


Lung ca. (non-s.cl) 
NCI-H522 


0.2 


astrocytoma SF-539 


0.1 


Lung ca. (squam.) 
SW 900 


0.1 


astrocytoma SNB-75 


0.2 


Lung ca. (squam.) 


0.1 
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NCI-H596 




glioma SNB-19 


0.1 


Mammary gland 


0.1 


glioma U251 


0.3 


T» . sfc / 1 _ X\ 

Breast ca.* (pl.ef) 
MrF-7 


0.1 


glioma SF-295 


0.2 


MDA-MB-231 


0.1 


Heart (fetal) 


. 3.3 


T47D 


100.0 


Heart 


0 ft 


DiCaaL wa. £> 1-j*t7 


ft ft 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0' 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.2 


Thymus 


0.1 


Ovarian ca. OVCAR- 
4 


0.2 


Spleen 


0.1 


Ovarian ca. OVCAR- 
5 


0.6 


Lymph node 


0.1 


Ovarian ca. OVCAR- 

0 

o 


0.1 


Colorectal 


0.0 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 

oJV-U V«0 


0.2 


C rn nil int<=»o+i np 
OlXutJl lillCMIIlC 


ft ft 


uterus 


U.x 


Colon ca. SW480 


0.1 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.2 


Prostate 


0.0 


Colon ca. HT29 


0.1 


Prostate ca.* (bone 
incijrv^-j 


0.1 


Colon ca. HCT-116 


0.1 


Testis 


3.1 


Colon ca. CaCo-2 


0.1 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.1 


Colon ca. HCC-2998 


0.2 


Melanoma UACC-62 


0.0 


vja^irio ca. \iiver roeij 
NCI-N87 


0.1 


Melanoma M14 


0.1 


Bladder 


0.1 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.1 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


o.i 


Adipose 


0.1 
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Table CJC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3060, Run 
164317425 


Tissue Name 


Rel. Exp.(%) 
Ag3060, Run 
164317425 


secondary Inl act 


22.7 


TTTTirP/* 1 TT 11 a _ 

HUVEC IL-lbeta 


7.2 


Secondary Th2 act 


25.7 


HUVEC IFN gamma 


15.1 


Secondary Trl act 


40.3 


HUVEC TNF alpha + IFN 
gamma 


12.3 


Secondary Thl rest 


6.1 


HUVEC TNF alpha + IL4 


10.4 


Secondary Th2 rest 


9.3 


HUVEC IL-11 


4.8 


Secondary Trl rest 


11.7 


Lung Microvascular EC 
none 


6.0 


Primary Thl act 


29.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


9.7 


Primary Th2 act 


24.1 


Microvascular Dermal EC 
none 


11.8 


Primary Trl act 


33.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


9.6 


Primary Thl rest 


52.9 


Bronchial epithelium 
TNFalpha + ILlbeta 


20.0 


Primarv Th2 rest 


26 2 


Small airway epithelium 
none 




Primary Trl rest 


17.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


18 4 


CD45RA CD4 
lymphocyte act 


13.8 


Cornnerv artprv RMY** 1 rpct" 


12 1 


CD45RO CD4 
lymphocyte act 


23.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


8 7 


CD8 lymphocyte act 


26.4 


Astrocytes rest 


9.0 


Secondary CD8 
lymphocyte rest 


25.9 


Astrocytes TNFalpha + 
IL-lbeta 


5.6 


Secondary CDS 
lymphocyte act 


15.1 


KU-812 (Basophil) rest 


22,7 


CD4 lymphocyte none 


7.1 


-KU-812 (Basophil) 
PMA/ionomycin 


67.8 


2ry Thl/Th2/Trl_anti- 


13.4 


CCD1106 (Keratinocytes) 
none 


10.3 


LAK cells rest 


15.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


5.8 


LAK cells IL-2 


18.7 


Liver cirrhosis 


2.6 | 


LAK cells IL-2+IL-12 


14.5 


Lupus kidney 


1.0 


LAK cells IL-2+IFN 
gamma 


24.1 


NCI-H292 none 


22.1 


LAK cells IL-2+IL-18 


19.5 


NCI-H292IL-4 J 21.8 


LAK cells 


11.7 


NCI-H292 IL-9 j 28.7 
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PMA/ionomycin 








NK Cells IL-2 rest 


14.2 


NC1-H292IL-13 


15.5 


x wu vv ay jlyjll<x\. j uay 


14 0 


XJP I WOO TPM tjammu 

jN^i-nz^z iriN gamma 


97 4 


Twn Wflv MT R S Hav 
i wu vv ay ivjj_»xv »; uay 


14 S 


ixr/vcv^ none 


7 1 


Two Way MLR 7 day 


13.7 


HP AFP TMF alr»hn + TT -1 

beta 


11.5 




4 9 


i^ung iioroDiasL none 


ft 0 


PBMC PWM 


38.2 


Lung fibroblast TNF alpha 

-4- TT -1 U^f« 
» 11^-1 Dc la 


10.8 


PBMC PHA-L 


21.3 


Lung fibroblast IL-4 


14.6 


Ramos (B cell) none 


14.4 


Lung fibroblast IL-9 


13.9 


Ramos (B cell) 
ionomycin 


67.8 


Lung fibroblast IL-1 3 


11.0 


B lymphocytes PWM 


100.0 


Lung fibroblast IFN 
gamma 


20.9 


B lymphocytes CD40L 

J TT A 

and IL-4 


27.2 


Dermal fibroblast 
CCD1070rest 


16.3 


EOL-1 dbcAMP 


13.4 


Dermal fibroblast 
CCD 1070 TNF alpha 


32.3 


EOL-1 dbcAMP 
rivi/v ionomycin 


14.8 


Dermal fibroblast 
ui^JLiiu/u oeta 


7.3 


Dendritic cells none 


11.4 


uermai iioroDiasi itjn 
eamma 


10.2 


Dendritic cells LPS 


15.9 


Dermal fibroblast IL-4 


182 


Dendritic cells anti- 
CD40 


10.8 


IBD Colitis 2 


0.5 


Monocytes rest 


10.6 


D3D Crohn's 


1.2 


Monocytes LPS 


7.0 


Colon 


6.8 


Macrophages rest 


12.2 


Lung 


9.5 


Macrophages LPS 


10:9 


Thymus 


14.3 


HUVEC none 


7.0 


Kidney 


28.9 


HUVEC starved 


17.7 







Panel 1.3D Summary: Ag3060 Results from one experiment with this gene are not 
included. The amp plot indicates that there were experimental difficulties with this run (data 
not shown). 

5 Panel 4D Summary: Ag3060 High expression of the NOV95 transcript (CTs= 26.3- 

26.9) is found in activated B cells and B cell lymphoma (Ramos). B cells generate antibody 
response and lead to activation of T cell mediated response as antigen presenting cells and are 
central to the function of the immune response. Therefore, targeting of this gene product with 
a small molecule drug therapeutic may modulate the functions of B cells and lead to the 
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improvement of symptoms of autoimmune diseases such as lupus erythematosus, rheumatoid 
arthritis, hyperglobulinemia and other B cell disorders. 

In addition, moderate expression of this gene is also found in a wide range of cell types 
of significance in the immune response in health and diseases. This suggests the broader 
5 involvement of the protein encoded by this gene in many inflammatory and autoimmune 
diseases. 

NOV96a, NOV96b, and NOV96c 

Expression of gene NOV96a and full length clones NOV96b and NOV96c was 
assessed using the primer-probe set Ag4532, described in Table CKA. Results of the RTQ- 
10 PCR runs are shown in Table CKB. 

Table CKA. Probe Name Ag4532 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -actcacctctctcctccatcat-3 1 


22 


626 


1372 


Probe 


TET-5 1 -cgttacactgttgccctcaccctgat-3 • - 
TAMRA 


26 


660 


1373 


Reverse 


5 1 -agggaatgaagtagccagtgtt-3 ' 


22 


687 


1374 



Table CKB. General^screening panel vl.4 



Tissue Name 


Rel. Exp.(%) Ag4532, 
Run 222735297 


Tissue Name 


Rel. Exp.(%) Ag4532, 
Run 222735297 


Adipose 


2.0 


Renal ca.TK-10 


31.9 


Melanoma* 
Hs688(A).T 


3.6 


Bladder 


3.7 


Melanoma* 
Hs688(B).T 


5.0 


Gastric ca. (liver met.) 
NCI-N87 


26.6 


Melanoma* Ml 4 


18.2 


Gastric ca. KATO III 


33.7 


Melanoma* 

Loxnvrvi 


5.4 


Colon ca. SW-948 


15.0 


Melanoma* SK- 
MEL-5 


2.7 


Colon ca. SW480 


31.2 


Squamous cell 
carcinoma SCC-4 


6.3 


Colon ca.* (SW480 
met) SW620 


18.4 


Testis Pool 


1.0 


Colon ca. HT29 


4.2 


Prostate ca.* (bone 
met) PC-3 


7.3 


Colon ca.HCT-1 16 


20.6 


Prostate Pool 


1.5 


Colon ca. CaCo-2 


17.1 


Placenta 


9.0 


Colon cancer tissue 


20.4 


Uterus Pool 


0.8 


Colon ca. SW1116 


6.0 
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Ovarian ca. 
OVCAR-3 


10.7 


Colon ca. CoIo-205 


12.3 


Ovarian ca. SK-OV- 

3 


2.5 


Colon ca. SW-48 


13.1 


Ovarian ca. 
OVCAR-4 


9.0 


Colon Pool 


3.4 


Ovarian ca. 
OVCAR-5 


29.5 


Small Intestine Pool 


2.7 


Ovarian ca. IGROV- 
i 


4.0 


Stomach Pool 


1.1 


Ovarian pa 

OVCAR-8 


14.6 


Bone Marrow Pool 


1.3 


Ovary 


1.7 


Fetal Heart 


1.7 


Breast ca. MCF-7 


17.1 


Heart Pool 


2.1 


Breast ca. MDA- 
MB-231 


20.0 


Lymph Node Pool 


2.7 


Breast ca. BT 549 


32.3 


Fetal Skeletal Muscle 


0.7 


RrpflQt ra T47D 

JJJ WOOL \^d. I *T / X-J 


UO.O 


oKeieiai iviuscie r ooi 




Breast ca. MDA-N 


13.5 


Spleen Pool 


5.2 


Breast Pool 


2.3 


Thymus Pool 


4.0 


Trachea 


7.3 


CNS cancer (glio/astro) 
U87-MG 


47.3 


Lung 


0.2 


CNS cancer (glio/astro) 
U-118-MG 


4.0 


Fetal Lung 


8.4 


CNS cancer 
(neuro;met) SK-N-AS 


1.4 


Lungca. NCI-N417 


1.0 


CNS cancer (astro) SF- 
539 


13.6 


Lung ca. LX-1 


21.2 


CNS cancer (astro) 


16.5 


Lungca.NCI-H146 


0.2 


Cancer y*uoj 
SNB-19 


4.4 


Lung ca. SHP-77 


13.8 


v^ino cdncer \^giioj jr- 
295 


9.3 


Lung ca. A549 


44.4 


Brain (Amygdala) Pool 


0.5 


Lungca NCI-H526 


3 1 


Dlalll ^VCIGUCIIUIIIJ 


A 0 


Lungca NCI-H23 


19 5 


DYalU \LCuxi) 


J. 5 1 


Lung ca. NCI-H460 


1.5 


ordin ^xiippo^ampusj 
Pool 


l.i 


Lung ca. HOP-62 


17.4 


Cerebral Cortex Pool 


0.5 


Lungca.NCI-H522 


26.6 


Brain (Substantia nigra) 
Pool 


1.1 


Liver 


23.3 


Brain (Thalamus) Pool 


1.3 


Fetal Liver 


34.2 


Brain (whole) 


1.3 


Liver ca. HepG2 


32.8 


Spinal Cord Pool 


1.3 
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Kidney Pool 


4.6 


Adrenal Gland 


2.8 


Fetal Kidney 


3.8 


Pituitary gland Pool 


0.6 


Renal ca. 786-0 


9.3 


Salivary Gland 


6.3 


Renal ca. A498 , 


10.2 


Thyroid (female) 


6.0 


Renal ca. ACHN 


100.0 


Pancreatic ca. CAPAN2 


32.1 


Renal ca.UO-31 


22.2 


Pancreas Pool 


2.4 



General screening panel vl.4 Summary: Ag4532 The expression of the NOV96a 
gene appears to be highest in a sample derived from a renal cancer cell line 
(ACHN)(CT=26.4). In addition, there is substantial expression associated with other renal 
5 cancer cell lines as well as gastric cancer cell lines, colon cancer cell lines, lung cancer cell 
lines, and breast cancer cell lines. Thus, the expression of this gene could be used to 
distinguish ACHN cells from other samples in this panel. Moreover, therapeutic modulation of 
this gene, through the use of small molecule drugs, protein therapeutics or antibodies might be 
of benefit in the treatment of kidney, gastric, colon, lung or breast cancer. 

10 Among metabolic tissues, this gene has low-to-moderate levels of expression in 

adipose, liver, heart, skeletal muscle, adrenal, pituitary, thyroid, and pancreas. Thus, this gene 
product may be a small molecule target for the treatment of endocrine and metabolic diseases, 
including obesity and Types 1 and 2 diabetes. The direction of therapeutic modulation for this 
gene product would, of necessity, be tissue- or organ-specific. The consequences of altered 

1 5 lactate/monocarboxylate/ketone body transport would differ dramatically between tissues. 

In addition, this gene, a monocarboxylate transporter homolog, is expressed at low to 
moderate levels in all CNS regions examined. The monocarboxylate transporters have been 
implicated in post-ischemic neuronal loss in stroke, such that blockade of these transporters 
increase stroke-related damage. Thus, this gene is an excellent drug target, such that increasing 

20 its activity may decrease postishemic damage in stroke/cerebral infarct. 
References: 

Schurr A, Payne RS, Miller JJ, Tseng MT, Rigor BM. Blockade of lactate transport 
exacerbates delayed neuronal damage in a rat model of cerebral ischemia. Brain Res 2001 Mar 
23;895(l-2):268-72 

25 Studies over the past decade have demonstrated that lactate is produced aerobically 

during brain activation and it has been suggested to be an obligatory aerobic energy substrate 
postischemia. It has been also hypothesized, based on in vitro studies, that lactate, produced by 
glia in large amounts during activation and/or ischemia/hypoxia, is transported via specific 
glial and neuronal monocarboxylate transporters into neurons for aerobic utilization. To test 
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the role of lactate as an aerobic energy substrate postischemia in vivo, we employed the 
cardiac-arrest-induced transient global cerebral ischemia (TGI) rat model and the 
monocarboxylate transporter inhibitor alpha-cyano-4-hydroxycinnamate (4-CIN). Once 4-CIN 
was establish to cross the blood-brain barrier, rats were treated with the inhibitor 60 min prior 
5 to a 5-min TGI. These rats exhibited a significantly greater degree of delayed neuronal damage 
in the hippocampus than control, untreated rats, as measured 7 days post-TGI. We concluded 
that intra-ischemically-accumulated lactate is utilized aerobically as the main energy substrate 
immediately postischemia. Blockade of lactate transport into neurons prevents its utilization 
and, consequently, exacerbates delayed ischemic neuronal damage. 

10 NOV97candNOV97d 

Expression of gene NOV97c and variant NOV97d was assessed using the primer-probe 
set Ag3697, described in Table CLA. Results of the RTQ-PCR runs are shown in Table CLB. 

Table CLA. Probe Name Ag3697 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -catctggattgacactggaatt-3 ' 


22 


585 


1375 


Probe 


TET-5 1 -actcccgggagtggatcacccat-3 1 - 
TAMRA 


23 


608 


1376 


Reverse 


5 • -aatcttattggcagtccagatg-3 ' 


22 


S39 


1377 



15 Table CLB. General screening panel vl,4 



Tissue Name 


Rel. Exp.(%) Ag3697, 
Run 218253566 


Tissue Name 


Rel.Exp.(%)Ag3697, 
Run 218253566 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.3 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMV1 


0.0 


Colon ca. SW-948 


1.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


8.2 


Squamous cell 
carcinoma SCC-4 


0.9 


Colon ca.* (SW480 
met)SW620 


6.9 


Testis Pool 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.3 


Colon ca.HCT-1 16 


0.8 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.3. 
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Placenta 


7.8 


Colon cancer tissue 


0.0 


U LCI lid iUUl 


ft ft 


<~"r>lr>n r<i CM/1 1 1 £L 

coion ca. owiiio 


U.U 


Ovarian ca. 

V_/ V Lnl\ J 


0.6 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 


1.3 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


. 4.1 


Colon Pool 


0.7 


Ovarian f»a 

OVCAR-5 


0.0 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
] 


0.8 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR^8 


2.0 


Bone Marrow Pool 


0.0 


Ovary 


0.2 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.5 


Heart Pool 


0.7 


Breast ca. MDA- 
MB-231 


1.2 


Lymph Node Pool 


0.2 


Breast ca. BT 549 


0.4 


Fetal Skeletal Muscle 


4.5 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0 7 


Breast ca. MDA-N 


0.0 | Spleen Pool 


1.0 


Rrpact Pnnl 
; caol r \J\Jl 


0.3 


Thymus Pool 


1 o 

1.8 


Trachea 


1.4 


CNS cancer (glio/astro) 
U87-MG 


1.1 


Lung 


0.9 


CNS cancer (glio/astro) 
U-118-MG 


4.7 


Fetal Lung 


1.2 


CNS cancer 
(neurojmet) SK-N-AS 


0.2 


Lung ca.NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca. LX-1 


2.5 


CNS cancer (astro) 
SNB-75 


1.7 


Lung ca. NCI-H146 


1.4 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


28.9 | 


CNS cancer (glio) SF- 
295 


0.9 


Lung ca. A549 


0.5 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0 2 


Lung ca. NCI-H23 


0.8 


Brain (fetal) 


0.0 


Lungca.NCI-H460 


0.7 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


1.4 


Cerebral Cortex Pool 


0.5 


Lungca.NCI-H522 


0.0 


Brain (Substantia nigra) 
Pool 


0.3 


Liver j 0.0 


Brain (Thalamus) Pool 


1.6 
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Fetal Liver 


0.5 


Brain (whole) 


0.7 


Liver ca. HepG2 


0.3 


Spinal Cord Pool 


0.3 


Kidney Pool 


0.0 


Adrenal Gland 


0.7 


Fetal Kidney 


0.0 


Pituitary gland Pool 


1.1 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.3 


Pancreatic ca. CAPAN2 


0.4 


Renal ca.UO-31 


4.1 


Pancreas Pool 


2.5 



CNS_neurodegeneration_vl.O Summary: Ag3697 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General screening panel vl.4 Summary: Ag3697 Expression of this gene is highest 
5 in and almost exclusive to testis (CT = 30.7). Therefore, expression of this gene could be used 
to distinguish testis from the other samples on this panel. Moreover, therapeutic modulation of 
the activity of this gene or its protein product using protein therapeutics, antibodies or small 
molecule drugs could be of benefit in the treatment of infertility. 

Panel 4.1D Summary: Ag3697 Expression of this gene is low/undetectable (CTs > 
10 35) across all of the samples on this panel (data not shown). 

NOV98: AGRIN 

Expression of gene NOV98 was assessed using the primer-probe set Ag3974, 
described in Table CMA. Results of the RTQ-PCR runs are shown in Tables CMB, CMC and 
CMD. 

1 5 Table CMA. Probe Name Ag3974 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gacaccaggatcttctttgtga-3 ' 


22 


274 . 


1378 


Probe 


TET-5 ■ -catacctgtggccagcccacaag-3 ' - 
TAMRA 


23 


308 


1379 


Reverse 


5 1 -gagttgagcatcagctcgtt-3 1 


20 


331 


1380 



Table CMB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3974, 
Run 212348647 


Tissue Name 


Rel. Exp.(%) Ag3974, 
Run 212348647 


AD 1 Hippo 


28.7 


Control (Path) 3 
Temporal Ctx 


15.4 


AD 2 Hippo 


36.3 


Control (Path) 4 
Temporal Ctx 


46.0 
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AD 3 Hippo 


19.2 


i AD 1 Occipital Ctx 


28.9 ' 


AD 4 Hippo 


21.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


80.7 


AD 3 Occipital Ctx 


19.5 


AD 6 Hippo 


42.6 


AD 4 Occinital Ctx 


21 0 


Control 2 Hippo 


42.3 


AD 5 Occipital Ctx 


47.6 


v^uniroi *t jtiippo 




AD o Occipital Ctx 


14.7 


Control (Path) 3 

jnippu 


12.4 


Control 1 Occipital 
Ctx 


20.4 


AD 1 Temporal Ctx 


32.1 


Control 2 Occipital 
Ctx 


55.1 


AD 2 Temporal Ctx 


31.2 


Control 3 Occipital 
ctx 


22.5 


AD 3 Temporal Ctx 


20.2 


Control 4 Occipital 
Ctx 


22.2 


AD 4 Temporal Ctx 


24.0 


Control (Path) 1 
Occipital Ctx 


71.2 


AD 5 Inf Temporal 


100.0 


Control (Path) 2 
Occipital Ctx 


17.4 


AD 5 Sup Temporal 

V^IX 


58.6 


Control (Path) 3 
Occipital Ctx 


15.4 


AD 6 Inf Temporal 


40.6 


Control (Path) 4 
Occipital Ctx 


38.7 


AD 6 Sup Temporal 


35.1 


Control 1 Parietal 
Ctx 


18.2 


Control 1 Temporal 


19.8 


Control 2 Parietal 
Ctx 


67.4 


Control 2 Temporal 


48.3 


Control 3 Parietal 
ctx 


21.2 


Control 3 Temporal 

V-/LA 


17.8 


Control (Path) 1 
ranetal Ctx 


51.4 


Control 3 Temporal 
Ctx 


25.0 


Control (Path) 2 
Parietal Ctx 


32.3 


Control (Path) 1 
Temporal Ctx 


63.7 


Control (Path) 3 
Parietal Ctx 


11.9 


Control (Path) 2 
Temporal Ctx 


43.5 


Control (Path) 4 
Parietal Ctx 


58.6 


Table CMC. General_screening_panel vl .4 


Tissue Name 


ReI.Exp.(%)Ag3974, 
Ran 217508632 


Tissue Name 


ReI.Exp.(%) Ag3974, 
i Run 217508632 


_ Adipose 


1.5 


Renal ca. TK-10 


16.4 


Melanoma* 
Hs688(A).T 


3.2 


Bladder 


9.0 
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Melanoma* 
Hs688(B).T 


4.2 


Gastric ca. (liver met.) 
NCI-N87 


80.7 


Melanoma* M14 


6.4 


Gastnc ca. KATO III 


17.7 


Melanoma* 
LOXIMVI 


4.0 | 


Colon ca. SW-948 


7.8 


Melanoma* SK- 


4.2 


Colon ca. SW480 


323 


squamous ceii 
carcinoma SCC-4 


8.4 


L/OlOlJ Ca. ^oWW 

met) SW620 


rrwu. ,,. -- - - r - - - 

4.6 


Testis Pool 


1.1 


Colon ca. HT29 


30.6 


iTOState ca. ^oone 
met) PC-3 


24.8 


Colon ca.HCT-1 16 


5.8 


xTOSlaLc rOOl 


n 9. 

U.O 


v^OiOn Ca. v»/av^O"Z 


10 A 


Placenta 


1.3 


Colon cancer tissue 


10.0 


Uterus Pool 


0.4 


Colon ca. SW1116 


3.6 


Ovarian ca. 
OVCAR-3 


66.9 


Colon ca. Colo-205 


1.5 


Ovarian ca. SK-OV- 
3 


36.3 


Colon ca. SW-48 


0.7 


Ovarian ca. 
OVCAR-4 


12.7 


Colon Pool 


1.3 


Ovarian ca. 

UVLAKO 


44.4 


Small Intestine Pool 


1.0 


Ovarian ca. IGROV- 

i 
i 


27.7 


Stomach Pool 


12 


ovarian ca. 
OVCAR-8 


14.9 


Bone Marrow Pool 


0.5 


Ovary 


1.9 


Fetal Heart 


1.0 


Breast ca. MCF-7 


9.7 


Heart Pool 


0.8 


Breast ca. MDA- 
MB-231 


31.2 


Lymph Node Pool 


2.0 


Breast ca. BT 549 


10.1 


Fetal Skeletal Muscle 


0.5 




i fin n 


C1 1/- •fro 1 lVAnctfOo T^/\/\1 
oKclClal IViuSCIS JTOOI 


n ^ 

V.J 


Breast ca. MDA-N 


42 


Spleen Pool 


0.7 


Breast Pool 


1.6 


Thymus Pool 


2.2 


Trachea 


2.6 


CNS cancer (glio/astro) 
U87-MG 


6.0 


Lung 


0.1 


CNS cancer (glio/astro) 
U-H8-MG 


11.2 


Fetal Lung 


8.3 


CNS cancer 
(neuro;met) SK-N-AS 


0.9 


Lungca.NCI-N417 


0.7 


CNS cancer (astro) SF- 
539 


5.0 1 


Lung ca. LX-1 


11.0 


CNS cancer (astro) 
SNB-75 


32.3 
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Lung ca. NCI-H146 


0.1 


CNS cancer (glio) 
SNB-19 


20.2 


Lung ca. SHP-77 


0.8 


CNS cancer (glio) SF- 
295 


38.2 


Lung ca. A549 


10.4 


Brain (Amygdala) Pool 


1.3 


Lung ca. NCI-H526 


1.6 


Brain (cerebellum) 


1.0 


Lung ca. NCI-H23 


20.6 


Brain (fetal) 


2.8 


Lung ca. NCI-H460 


9.3 


Brain (Hippocampus) 
Pool 


0.9 


Lung ca. HOP-62 


23.0 


Cerebral Cortex Pool 


6.9 


Lung ca. INd-rijZZ 




Brain (Substantia nigra) 
Pool 


1.7 


Liver 


0.6 


Brain (Thalamus) Pool 


1.6 


Fetal Liver 


1.4 


Brain (whole) 


1.1 


Liver ca. HepG2 


12.6 


Spinal Cord Pool- 


1.4 


Kidney Pool 


2.5 


Adrenal Gland 


0.4 


Fetal Kidney 


4.6 


Pituitary gland Pool 


0.2 


Renal ca. 786-0 


39.5 


Salivary Gland 


1.3 


Renal ca. A498 


7.9 


Thyroid (female) 


3.7 


Renal ca. ACHN 


15.9 


Pancreatic ca. CAPAN2 


27.7 


Renal ca. UO-31 


38.7 


Pancreas Pool 


4.1 



Table CMD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag3974,Run 
170739806 


Tissue Name 


Rel. Exp.(%) 
Ag3974,Run 
170739806 


Secondary Thl act 


1.2 


HUVECIL-lbeta 


18.9 


Secondary Th2 act 


8.0 


HUVEC IFN gamma 


16.7 


Secondary Trl act 


3.5 


HUVECTNF alpha + IFN 
gamma 


34.9 


Secondary Thl rest 


0.7 


HUVECTNF alpha + IL4 


31.4 


Secondary Th2 rest 


0.2 


HUVEC IL-11 


13.9 


Secondary Trl rest 


1.2 


Lung Microvascular EC 
none 


100.0 


Primary Thl act 


3.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


97.9 


Primary Th2 acf 


2.0 


Microvascular Dermal EC 
none 


48.3 


Primary Trl act 


2.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


47.0 


Primary Thl rest 


0.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


90.1 


Primary Th2 rest 


0.2 


Small airway epithelium 


32.5 
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none 




Primary Trl rest 


0.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


93.3 


CJL/4DKA CD4 
lymphocyte act 


22.7. 


Coronery artery SMC rest 


28.5 


/T"\/| rnA fT\A 

CIJhDKxJ CJJ4 
lymphocyte act 


5.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


28.7 

- - ■ ■ y - ■- 


LJDo lymphocyte act 


3.3 


Astrocytes rest 


C C 1 

55.1 


Secondary CD8 1 
lymphocyte rest 


3.3 


Astrocytes TNFalpha + 

TT 1 1— _ j 

IL-lbeta 


66.4 


Secondary CD8 
lymphocyte act 


3.5 


KU-812 (Basophil) rest 


1.9 


CD4 lymphocyte none 


0.1 


T^T T Ol^ /T~v ■ • *\ 

KU-812 (Basophil) 

AIyXtV/ lUUUllljr will 


2.8 


2iyThl/Th2/Trl_anti- 
CD95 CH11 


0.4 


CCDl 1 06 (Keratinocytes) 

nunc 


82.4 


LAK cells rest 


6.4 


CCDl 1 06 flCeratinoevte^ 
TNFalpha + IL-lbeta 


72.7 


LAK cells IT -2 


1 7 


±-i vcr L/irrnubia 


1 A A 


LAK cells IL-2+IL-12 




i>i \^%.~rxjLy^ nunc 


ft 


LAK cells IT -2+IFN 
gamma 


1.1 


NCI-H292 IL-4 


78.5 


LAK cells TL-2+ TT -1 X 


1 (\ 




7Q £ 


LAK cells 

Juni\ wviij 

PMA/ionomycin 


4.6 


NCI-H292 IL-13 


59.9 


NK Cells IL-2 rest 


1 0 


iNv^i-nz^z iriN gamma 


71 7 


Two Wav MLR 1 dav 


1? 4 


rxi/vCv^ none 




Two Way MLR 5 day 


5.3 


TTPAFP TNFF alnha + TT -1 
Xi.iT/Tj_/v_/ i INF aijjna i ii_/—j 

beta 


45.4 


Two Way MLR 7 day 


4.0 


Lung fibroblast none 


29.3 


PBMC rest 


0.6. 


T lino flHrnhlact TNTF alnha 

+ IL-l beta 


87.7 


PBMC PWM 


4 9 


T nn<* "fiVirnKlact TT -A 


4.D.J 


PBMC PHA-L 


3.4 


Lung fibroblast IL-9 


30.4 


Kamos (h> cell; none 


0.4 


T _C 1_ _ t 1 a. TT -1 o 

Lung fibroblast IL-l 3 


36.6 


Ramos (B cell) 
lonomycin 


0.2 


Lung fibroblast IFN 
gamma 


29.7 


B lymphocytes PWM 


3.0 


Dermal fibroblast 
x^k^u i u / u rest 


27.2 


B lymphocytes CD40L 
and IL-4 


3.7 


Dermal fibroblast 
CCD1070 TNFalpha 


20.6 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 
CCD1070 IL-l beta 


22.4 


EOL-1 dbcAMP 
PMA/ionomycin 


8.0 


Dermal fibroblast IFN 
gamma 


10.3 
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Dendritic cells none 


9.0 


Dermal fibroblast IL-4 


8.0 


Dendritic cells LPS 


32.8 


Dermal Fibroblasts rest 


6.3 


Dendritic cells anti- 
CD40 


o o 

8.8 


Neutrophils TNFa+LPS 


0.9 


Monocytes rest 


1.4 


Neutrophils rest 


1.0 


Monocytes LPS 


81.2 


Colon 


5.8 


Macrophages rest 


9.7 


Lung 


23.3 


Macrophages LPS 


43.8 


Thymus 


7.3 


HUVEC none 


12.6 


Kidney 


33.2 


HUVEC starved 


25.2 







CNSjaeurodegeneration_vl.O Summary: Ag3974 This panel does not show 
differential expression of the NOV98 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system, 

General_screeningjpanel_vl.4 Summary: Ag3974 The expression of the NOV98 
gene appears to be highest in a sample derived from a breast cancer cell line (T47D) 
(CT=22.5). In addition, there appears to be substantial expression in other samples derived 
from breast cancer cell lines, ovarian cancer cell lines, kidney cancer cell lines, lung cancer 
10 cell lines, colon cancer cell lines and brain cancer cell lines. Thus, the expression of this gene 
could be used to distinguish T47D cells from other samples in the panel. Moreover, 
therapeutic modulation of this gene, through the use of small molecule drugs, protein 
therapeutics, or antibodies could be of benefit in the treatment of breast, ovarian, kidney, lung, 
colon or brain cancer. 

15 Among metabolic tissues, this gene has low-to-moderate levels of expression in 

adrenal, pituitary, adult and fetal heart, adult and fetal liver, adult and fetal skeletal muscle, 
and adipose. This gene product has high levels of expression (CT values = 27) in pancreas and 
thyroid. Thus, this gene product may be important for the pathogenesis, diagnosis, and/or 
treatment of metabolic and endocrine diseases, including obesity, Types 1 and 2 diabetes and 

20 thyroidopathies. It has recently been reported that an agrin minigene rescued dystrophic 

symptoms in a mouse model of muscular dystrophy. Therefore, this gene product may also be 
used as a treatment or cure for congenital muscular dystrophies. 

This gene is also expressed at moderate to high levels in all regions of the CNS. This 
molecule is a homolog of agrin, which has been implicated in the formation of senile plaques 

25 in Alzheimer's disease and in the acetylcholine synapse/neuromuscular junction. This gene is 
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therefore an excellent drug target in AD or in any disease involving the neuromuscular 
junction or the acetylchpoline system. 
References: 

Moll J, Barzaghi P, Lin S, Bezakova G, Lochmuller H, Engvall E, Muller U, Ruegg 
5 MA. An agrin minigene rescues dystrophic symptoms in a mouse model for congenital 
muscular dystrophy. Nature. 2001 Sep 20;413(6853):302-7. 

Congenital muscular dystrophy is a heterogeneous and severe, progressive muscle- 
wasting disease that frequently leads to death in early childhood. Most cases of congenital 
muscular dystrophy are caused by mutations in LAMA2, the gene encoding the alpha2 chain 
10 of the main laminin isoforms expressed by muscle fibres. Muscle fibre deterioration in this 
disease is thought to be caused by the failure to form the primary laminin scaffold, which is 
necessary for basement membrane structure, and the missing interaction between muscle 
basement membrane and the dystrophin-glycoprotein complex (DGC) or the integrins. With 
the aim to restore muscle function in a mouse model for this disease, we have designed a 
15 minigene of agrin, a protein known for its role in the formation of the neuromuscular junction. 
Here we show that this mini-agrin-which binds to basement membrane and to alpha- 
dystroglycan, a member of the DGC-amends muscle pathology by a mechanism that includes 
agrin-mediated stabilization of alpha-dystroglycan and the laminin alpha5 chain. Our data 
provides in vivo evidence that a non-homologous protein in combination with rational protein 
20 design can be used to devise therapeutic tools that may restore muscle function in human 
muscular dystrophies. 

PMID: 11565031 

Liyanage Y, Hoch W, Beeson D, Vincent A. The agrin/muscle-specific kinase 
pathway: New targets for autoimmune and genetic disorders at the neuromuscular junction. 

25 Muscle Nerve 2002 Jan£5(l):4-16 

The increasing understanding of the structural complexity of the neuromuscular 
junction (NMJ), and the processes that are important in its development, suggests many 
possible new disease targets. Here, we summarize briefly the genetic and autoimmune 
disorders that affect neuromuscular transmission, and the identified targets, including new 

30 evidence that antibodies to muscle-specific receptor tyrosine kinase (MuSK) are involved in 
the pathogenesis of acetylcholine receptor (AChR) antibody-negative myasthenia gravis. We 
then review the development of the NMJ, focusing on the important roles of nerve-derived 
agrin and MuSK in clustering of AChRs and other essential components of the NMJ. 
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van Horssen J, Otte-Holler I, David G, Maat-Schieman ML, van den Heuvel LP, 
Wesseling P, de Waal RM, Verbeek MM. Heparan sulfate proteoglycan expression in 
cerebrovascular amyloid beta deposits in Alzheimer's disease and hereditary cerebral 
hemorrhage with amyloidosis (Dutch) brains. Acta Neuropathol (Berl) 2001 Dec; 102(6) :604- 
5 14 

Cerebrovascular deposition of amyloid beta protein (A beta) is a characteristic lesion 
of Alzheimer's disease (AD) and hereditary cerebral hemorrhage with amyloidosis of the 
Dutch type (HCHWA-D). Besides A beta, several other proteins and proteoglycans 
accumulate in cerebral amyloid angiopathy (CAA). We have now analyzed the expression of 

10 the heparan sulfate proteoglycan (HSPG) subtypes agrin, perlecan, glypican-1, syndecans 1-3 
and HS glycosaminoglycan (GAG) side chains in CAA in brains of patients with AD and 
HCHWA-D. Hereto, specific well-characterized antibodies directed against the core protein of 
these HSPGs and against the GAG side chains were used for immunostaining. Glypican-1 was 
abundantly expressed in CAA both in AD and HCHWA-D brains, whereas perlecan and 

1 5 syndecans- 1 and -3 were absent in both. Colocalization of agrin with vascular A beta was 
clearly observed in CAA in HCHWA-D brains, but only in a minority of the AD cases. 
Conversely, syndecan-2 was frequently associated with vascular A beta in AD, but did not 
colocalize with vascular A beta deposits in HCHWA-D. The three different syndecans, agrin, 
glypican-1 and HS GAG, but not perlecan, were associated with the majority of senile plaques 

20 (SPs) in all brains. Our results suggest a role for agrin in the formation of SPs and of CAA in 
HCHWA-D, but not in the pathogenesis of CAA in AD. Both syndecan-2 and glypican, but 
not perlecan, may be involved in the formation of CAA. We conclude that specific HSPG 
species may be involved in the pathogenesis of CAA in both AD and HCHWA-D, and that the 
pathogenesis of CAA and SPs may differ with regard to the involvement of HSPG species. 

25 Panel 4AD Summary: Ag3974 The NOV98 gene is expressed at moderate levels 

(CT=29-32) in a wide range of cell types of significance in the immune response in health and 
disease. Therefore, targeting of this gene product with a small molecule drug or antibody 
therapeutic may modulate the functions of cells of the immune system as well as resident 
tissue cells and lead to improvement of the symptoms of patients suffering from autoimmune 

30 and inflammatory diseases such as COPD, emphysema, asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 
Based on its homology to agrin, this gene product may also be beneficial to the treatment of 
multiple sclerosis as suggested by the referene below. 
References: 
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Liyanage Y, Hoch W, Beeson D, Vincent A. The agrin/muscle-specific kinase 
pathway: New targets for autoimmune and genetic disorders at the neuromuscular junction. 
Muscle Nerve 2002 Jan;25(l):4-1 6 

The increasing understanding of the structural complexity of the neuromuscular 
5 junction (NMJ), and the processes that are important in its development, suggests many 
possible new disease targets. Here, we summarize briefly the genetic and autoimmune 
disorders that affect neuromuscular transmission, and the identified targets, including new 
evidence that antibodies to muscle-specific receptor tyrosine kinase (MuSK) are involved in 
the pathogenesis of acetylcholine receptor (AChR) antibody-negative myasthenia gravis. We 
1 0 then review the development of the NMJ, focusing on the important roles of nerve-derived 
agrin and MuSK in clustering of AChRs and other essential components of the NMJ. 

Example 3. SNP analysis of NOVX clones 

SeqCallingTM Technology: cDNA was derived from various human samples 

1 5 representing multiple tissue types, normal and diseased states, physiological states, and 

developmental states from different donors. Samples were obtained as whole tissue, cell lines, 
primary cells or tissue cultured primary cells and cell lines. Cells and cell lines may have been 
treated with biological or chemical agents that regulate gene expression for example, growth 
factors, chemokines, steroids. The cDNA thus derived was then sequenced using CuraGen's 

20 proprietary SeqCalling technology. Sequence traces were evaluated manually and edited for 
corrections if appropriate. cDNA sequences from all samples were assembled with themselves 
and with public ESTs using bioinformatics programs to generate CuraGen's human SeqCalling 
database of SeqCalling assemblies. Each assembly contains one or more overlapping cDNA 
sequences derived from one or more human samples. Fragments and ESTs were included as 

25 components for an assembly when the extent of identity with another component of the 

assembly was at least 95% over 50 bp. Each assembly can represent a gene and/or its variants 
such as splice forms and/or single nucleotide polymorphisms (SNPs) and their combinations. 

Variant sequences are included in this application. A variant sequence can include a 
single nucleotide polymorphism (SNP). A SNP can, in some instances, be referred to as a 

30 "cSNP" to denote that the nucleotide sequence containing the SNP originates as a cDNA. A 
SNP can arise in several ways. For example, a SNP may be due to a substitution of one 
nucleotide for another at the polymorphic site. Such a substitution can be either a transition or 
a transversion. A SNP can also arise from a deletion of a nucleotide or an insertion of a 
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nucleotide, relative to a reference allele. In this case, the polymorphic site is a site at which 
one allele bears a gap with respect to a particular nucleotide in another allele. SNPs occurring 
within genes may result in an alteration of the amino acid encoded by the gene at the position 
of the SNP. Intragenic SNPs may also be silent, however, in the case that a codon including a 
5 SNP encodes the same amino acid as a result of the redundancy of the genetic code. SNPs 

occurring outside the region of a gene, or in an intron within a gene, do not result in changes in 
any amino acid sequence of a protein but may result in altered regulation of the expression 
pattern for example, alteration in temporal expression, physiological response regulation, cell 
type expression regulation, intensity of expression, stability of transcribed message. 

10 Method of novel SNP Identification: SNPs are identified by analyzing sequence 

assemblies using CuraGen's proprietary SNPTool algorithm. SNPTool identifies variation in 
assemblies with the following criteria: SNPs are not analyzed within 10 base pairs on both 
ends of an alignment; Window size (number of bases in a view) is 10; The allowed number of 
mismatches in a window is 2; Minimum SNP base quality (PHRED score) is 23; Minimum 

1 5 number of changes to score an SNP is 2/assembly position. SNPTool analyzes the assembly 
and displays SNP positions, associated individual variant sequences in the assembly, the depth 
of the assembly at that given position, the putative assembly allele frequency, and the SNP 
sequence variation. Sequence traces are then selected and brought into view for manual 
validation. The consensus assembly sequence is imported into CuraTools along with variant 

20 sequence changes to identify potential amino acid changes resulting from the SNP sequence 
variation. Comprehensive SNP data analysis is then exported into the SNPCalling database. 

Method of novel SNP Confirmation: SNPs are confirmed employing a validated 
method know as Pyrosequencing (Pyrosequencing, Westborough, MA). Detailed protocols for 
Pyrosequencing can be found in: Alderborn et al. Determination of Single Nucleotide 

25 Polymorphisms by Real-time Pyrophosphate DNA Sequencing. (2000). Genome Research. 10, 
Issue 8, August. 1249-1265. In brief, Pyrosequencing is a real time primer extension process 
of genotyping. This protocol takes double-stranded, biotinylated PGR products from genomic 
DNA samples and binds them to streptavidin beads. These beads are then denatured producing 
single stranded bound DNA. SNPs are characterized utilizing a technique based on an indirect 

30 bioluminometric assay of pyrophosphate (PPi) that is released from each dNTP upon DNA 
chain elongation. Following Klenow polymerase-mediated base incorporation, PPi is released 
and used as a substrate, together with adenosine 5-phosphosulfate (APS), for ATP sulforylase, 
which results in the formation of ATP. Subsequently, the ATP accomplishes the conversion of 
luciferin to its oxi-derivative by the action of luciferase. The ensuing light output becomes 
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proportional to the number of added bases, up to about four bases. To allow processivity of the 
method dNTP excess is degraded by apyrase, which is also present in the starting reaction 
mixture, so that only dNTPs are added to the template during the sequencing. The process has 
been fully automated and adapted to a 96-well format, which allows rapid screening of large 
5 SNP panels. The DNA and protein sequences for the novel single nucleotide polymorphic 
variants are reported. Variants are reported individually but any combination of all or a select 
subset of variants are also included. In addition, the positions of the variant bases and the 
variant amino acid residues are underlined. 

Results 

1° Variants are reported individually but any combination of all or a select subset of 

variants are also included as contemplated NOVX embodiments of the invention. 

NO VI a SNP data: 

NOV la has two SNP variants, whose variant positions for its nucleotide and amino 
15 acid sequences is numbered according to SEQ ID NOs: 1 and 2, respectively. The nucleotide 
sequence of the NOVla variant differs as shown in Table 101 . 1 



Table 101. cSNP and Coding Variants for NOVla 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


393 


T 


c 


130 


S->P 


420 


T 


c 


139 


W->R 


431 


T 


c 


142 


No change 


501 


C 


T 


166 


L->F 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


420 


T 


c 


20 


0.100 



NOVlb SNP data: 

NOVlb has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:3 and 4, respectively. The nucleotide 
sequence of the NOVlb variant differs as shown in Table 102. 



Table 102. cSNP and Coding Variants for NOVlb 


NT. Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


393 


T 


c 


130 


S->P 


420 


T 


c 


139 


W->R 
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NOV3a SNP data: 

NOV3a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:l 1 and 12, respectively. The nucleotide 
sequence of the NOV3a variant differs as shown in Table 103. 



Table 103. cSNP and Coding Variants for NOV3a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


212 


T 


C 


54 


C->R 


439 


A 


G 


149 


No Change 



NOV4a SNP data: 

NOV4a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:17 and 18, respectively. The nucleotide 
sequence of the NOV4a variant differs as shown in Table 104. 



Table 104. cSNP and Coding Variants for NOV4a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


229 


G 


A 


73 


No change 


390 


G 


A 


127 ' 


W->End 


631 


G 


C 


207 


Q->H 



NOV5a SNP data: 

NOV5a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:23 and 24, respectively. The nucleotide 
sequence of the NOV5a variant differs as shown in Table 105. 



Table 105. cSNP and Coding Variants for NOV5a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


79 


C 


T 


19 


A->V 


204 


T 


C 


61 


No change 


658 


A 


T 


212 


Q->L 


884 


C 


G 


287 


No change 


1149 


G 


T 


376 


D->Y 



NOV6 SNP data: 
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NOV6 has four SNP variants, whose variant positions for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:33 and 34, respectively. The nucleotide 
sequence of the NOV6 variant differs as shown in Table 106. 



Table 106. cSNP and Coding Variants for NOV6 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


543 


T 


c 


156 


V->A 


549 


T 


c 


158 


I->T 


660 


C 


T 


195 


A->V 


734 


A 


G 


220 


T->A 


782 


G 


A 


236 


A->T 



NOV7a SNP data: 

NOV7a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:35 and 36, respectively. The nucleotide 
sequence of the NOV7a variant differs as shown in Table 107. 



Table 107. cSNP and Coding Variants for NOV7a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
Position 


Amino Acid 
Change 


168 


T 


c 


24 


V->G 


459 


C 


T 


121 


A->V 


815 


T 


C 


240 


S->P 


896 


A 


G 


N/A 


No change 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


428 


G 


A 


8 


0250 



NOV7c SNP data: 

NOV7c has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:39 and 40, respectively. The nucleotide 
sequence of the NOV7c variant differs as shown in Table 108. 



Table 108. cSNP and Coding Variants for NOV7c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


383 


c 


T 


5 


0.400 



NOV7d SNP data: 
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NOV7d has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:41 and 42, respectively. The nucleotide 
sequence of the NOV7d variant differs as shown in Table 109. 



Table 109. cSNP and Coding Variants for NOV7d 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
Position 


Amino Acid 
Change 


260 


G 


A 


86 


I->E 



NOV7e SNP data: 

NOV7e has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:323 and 324, respectively. The 
nucleotide sequence of the NOV7d variant differs as shown in Table 1 10. 

10 



Table 110. cSNP and Coding Variants for NOV7e 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
Position 


Amino Acid 
Change 


304 


A 


G 


102 


T->A 



NOV9a SNP data: 

NOV9a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:45 and 46, respectively. The nucleotide 
1 5 sequence of the NOV9a variant differs as shown in Table 111. 



Table 111. cSNP and Coding Variants for NOV9a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


264 


G 


A 


N/A 


No change 


391 


C 


A 


8 


P->T 


438 


G 


A 


23 


No change 


550 


T 


G 


61 


F->V 


672 


C 


T 


101 


No change 


1286 


T 


C 


306 


L->S 


1338 


G 


A 


323 


No change 



NOV10SNP data: 

NOV10 has one SNP variant, whose variant position for its nucleotide and amino acid 
20 sequences is numbered according to SEQ ID NOs:49 and 50, respectively. The nucleotide 
sequence of the NOV10 variant differs as shown in Table 1 12. 
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Table 112. cSNP and Coding Variants for NOV10 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


335 


G 


A 


10 


0.400 



NOV13b SNP data: 

NOV 1 3b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:57 and 58, respectively. The nucleotide 
5 sequence of the NOV 13b variant differs as shown in Table 113. 



Table 113. cSNP and Coding Variants for NOV13b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


362 


G 


A 


11 


0.455 



NOV15b SNP data: 

NOV 15b has one SNP variant, whose variant position for its nucleotide and amino acid 
10 sequences is numbered according to SEQ ID NOs:63 and 64, respectively. The nucleotide 
sequence of the NOV15b variant differs as shown in Table 1 14. 



Table 114. cSNP and Coding Variants for NOVlSb 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


388 


T 


G 


17 


0.294 



NOV16b SNP data: 

1 5 NOV 1 6b has one SNP variant, whose variant position for its nucleotide and amino acid 

sequences is numbered according to SEQ ID NOs:67 and 68, respectively. The nucleotide 
sequence of the NOV16b variant differs as shown in Table 115. 



Table 115. cSNP and Coding Variants for NOV16b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


463 


A 


T 


16 


0.125 


465 


C 


T 


16 


0.125 


535 


T 


c 


15 


0.133 


735 


C 


T 


12 


0.167 
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814 



11 



0.182 



NOV21a SNP data: 

NOV21a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:87 and 88, respectively. The nucleotide 
sequence of the NOV21a variant differs as shown in Table 116. 



Table 116. cSNP and Coding Variants for NOV21a 


NT Position 
of CSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


121 


T 


C 


41 


W->R 


170 


T 


c 


57 


V->A 


364 


C 


T 


122 


No change 


415 


G 


A 


139 


E->K 


535 


G 


A 


182 


R->H 


630 


G 


A 


210 


No change 



10 



NOV21b SNP data: 

NOV21b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:89 and 90, respectively. The nucleotide 
sequence of the NOV2 1 b variant differs as shown in Table 1 1 7. 



15 



Table 117. cSNP and Coding Variants for NOV21b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


485 


G 


A 


65 


0246 J 


616 


G 


A 


44 


0.136 


714 


G 


A 


36 


0.083 



NOV22a SNP data: 

NOV22a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ED NOs:91 and 92, respectively. The nucleotide 
sequence of the NOV22a variant differs as shown in Table 118. 



Table 118. cSNP and Coding Variants for NOV22a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


669 


A 


G 


223 


No change 


725 


C 


T 


242 


T->M 
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NOV22c SNP data: 

NOV22c has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:95 and 96, respectively. The nucleotide 
sequence of the NOV22c variant differs as shown in Table 1 19. 



Table 119. cSNP and Coding Variants for NOV22c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


85 


A 


G 


30 


0.067 


288 


T 


C 


31 


0.065 


! 484 


A 


G 


37 


0.054 


540 


C 


T 


29 


0.241 



NOV24a SNP data: 

NOV24a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:103 and 104, respectively. The nucleotide 
10 sequence of the NOV24a variant differs as shown in Table 120. 



Table 120. cSNP and Coding Variants for NOV24a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


539 


c 


T 


511 


No change 



NOV24b SNP data: 

NOV24b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:105 and 106, respectively. The nucleotide 
sequence of the NOV24b variant differs as shown in Table 121. 



Table 121. cSNP and Coding Variants for NO V24b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


437 


A 


G 


143 


N->S 


664 


T 


G 


219 


F->V 


1150 


G 


T 


381 


A->S 


1210 


G 


T 


401 


E->End 


1770 


C 


T 


587 


No change 


2011 


A 


G 


N/A 


No change 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


329 


c 


T 


11 


0.364 
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491 



13 



0.154 



NOV25 SNP data: 

NOV25 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:107 and 108, respectively. The nucleotide 
sequence of the NOV25 variant differs as shown in Table 122. 



Table 122. cSNP and Coding Variants for NOV25 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


221 


G 


A 


54 


No change 


462 


C 


T 


135 


L->F 



10 



NOV26a SNP data: 

NOV26a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 109 and 1 10, respectively. The nucleotide 
sequence of the NOV26a variant differs as shown in Table 123. 



Table 123. cSNP and Coding Variants for NOV26a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change . 


67 


A 


G 


1 


M->V 


98 


c 


T 


11 


P->L 


128 


A 


G 


21 


E->G 


176 


A 


G 


37 


A->T 


233 


A 


G 


56 


Q->R 


243 


T 


C 


59 


No change 


252 


A 


G 


62 


No change 


260 


A 


G 


65 


D->G 


296 


A 


G 


77 


K->R 


316 


A 


G 


84 


N->D 


369 


G 


A 


101 


M->I 


395 


A 


G 


110 


Q->R 


465 


G 


A 


N/A 


No change 



NOV26b SNP data: 

1 5 NOV26b has one SNP variant, whose variant position for its nucleotide and amino acid 

sequences is numbered according to SEQ ID NOs:l 1 1 and 1 12, respectively. The nucleotide 
sequence of the NOV26b variant differs as shown in Table 124. 
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Table 124. cSNP and Coding Variants for NOV26b 


NT Position 
of cSNP 


Wild Tvne 
NT 


Varianh KTT 

* J. O.XI U XX X 


ueptn 


Putative 

Allele 
Frequency 


133 


A 


G 


41 


0.049 


268 


A 


G 


41 


0.049 


324 


A 


G 


41 


0.049 


372 


A 


G 


41 


0.049 


376 


A 


* 


41 


0.049 


456 


T 


C 


40 


0.050 


488 


A 


G 


32 


0.344 



NOV27a SNP data: 

NOV27a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:l 13 and 1 14, respectively. The nucleotide 
5 sequence of the NOV27a variant differs as shown in Table 125. 



Table 125. cSNP and Coding Variants for NOV27a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


158 


A 


T 


36 


No change 


491 


C 


T 


147 


No change 


562 


T 


C 


171 


L->P 


858 


C 


T 


270 


No change 


1750 


C 


T 


567 * 


P->L 



NOV27b SNP data: 

NOV27b has one SNP variant, whose variant position for its nucleotide and amino acid 
1 0 sequences is numbered according to SEQ ID NOs: 1 1 5 and 1 16, respectively. The nucleotide 
sequence of the NOV27b variant differs as shown in Table 126. 



Table 126. cSNP and Coding Variants for NOV27b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


131 


G 


A 


43 


0233 


1072 


G 


A 


14 


0214 


1368 


A 


G 


24 


0.125 


1439 


G 


A 


42 


0.071 


1733 


G 


A 


43 


0.047 


! 1772 


T 


A 


43 


0.442 


1787 


G 


A 


42 


0.286 
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NOV29c SNP data: 

NOV29c has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:125 and 126, respectively. The nucleotide 
sequence of the NOV29c variant differs as shown in Table 127. 



Table 127. cSNP and Coding Variants for NOV29c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


760 


A 


G 


254 


T->A 


923 


T 


C 


308 


G->D 



NOV30 SNP data: 

NOV30 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:127 and 128, respectively. The nucleotide 
10 sequence of the NOV30 variant differs as shown in Table 128. 



Table 128. cSNP and Coding Variants for NOV30 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Ami.no Acid 
Change 


103 


A 


G 


28 


I->V 


207 


T 


C 


62 


No change 


225 


C 


T 


68 


No change 


233 


A 


G 


71 


D->G 


267 


T 


C 


82 


No change 


318 


A 


G 


99 


No change 


392 


T 


C 


124 


L->P 


431 


T 


C 


137 


M->T 


464 


A 


G 


148 


E->G 


479 


T 


A 


153 


V->E 



NOV33 SNP data: 

NOV33 has one SNP variant, whose variant position for its nucleotide and amino acid 
15 sequences is numbered according to SEQ ID NOs:133 and 134, respectively. The nucleotide 
sequence of the NOV33 variant differs as shown in Table 129. 





Table 129. cSNP and Coding Variants for NOV33 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


5097 


T 


c 


1699 


No change 


6012 


C 


T 


2004 


No change 
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NOV36aSNPdata: 

NOV36a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 139 and 140, respectively. The nucleotide 
sequence of the NOV36a variant differs as shown in Table 1 30. 



Table 130. cSNP and Coding Variants for NOV36a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


351 


T 


c 


102 


No change 


737 


A 


G 


231 


D->G 



NOV38 SNP data: 

NOV38 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 145 and 146, respectively. The nucleotide 
1 0 sequence of the NOV38 variant differs as shown in Table 131. 



Table 131. cSNP and Coding Variants for NOV38 



NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


566 


c 


T 


188 


H->R 


658 


A 


G 


219 


H->R \ 


844 


G 


A 


281 


C->Y ! 


892 


C 


T 


297 


A->V 


910 


T 


C 


303 


V->A 


1009 


G 


A 


336 


S->N 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


95 


T 


c 


14 


N/A 



NOV39a SNP data: 

NOV39a has one SNP variant, whose variant position for its nucleotide and amino acid 
1 5 sequences is numbered according to SEQ ID NOs: 147 and 148, respectively. The nucleotide 
sequence of the NOV39a variant differs as shown in Table 132. 



Table 132. cSNP and Coding Variants for NOV39a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


1095 


T 


c 


11 


N/A 



NOV39b SNP data: 
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NOV39b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:149 and 150, respectively. The nucleotide 
sequence of the NOV39b variant differs as shown in Table 1 33. 



Table 133. cSNP and Coding Variants for NOV39b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


933 


c 


T 


9 


0.222 



NOV42c SNP data: 

NOV42c has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:161 and 162, respectively. The nucleotide 
sequence of the NOV42c variant differs as shown in Table 134. 



Table 134. cSNP and Coding Variants for NOV42c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


330 


G 


A 


103 


R->Q 


783 


A 


G 


254 


D->G 


903 


G 


A 


294 


R->H 


1389 


A 


T 


456 


E->V 


1389 


A 


G 


456 


E->G 


1394 


G 


A 


458 


A->T | 


1642 


C 


T 


540 


No change 


1656 


T 


C 


545 


V->A 


1658 


G 


A 


546 


A->T 



NOV43 SNP data: 

NQV43 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ IDNOs:165 and 166, respectively. The nucleotide 
sequence of the NOV43 variant differs as shown in Table 135. 



Table 135. cSNP and Coding Variants for NOV43 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


378 


G 


A 


121 


No change 


496 


G 


A 


162 


R->Q 



NOV46b SNP data: 
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NOV46b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:173 and 174, respectively. The nucleotide 
sequence of the NOV46b variant differs as shown in Table 136. 



Table 136. cSNP and Coding Variants for NOV46b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


500 


c 


T 


163 


No change 


NT Position 
of cSNP 


. Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


486 


c 


T 


16 


0.125 



NOV48a SNP data: 

NOV48a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ IDNOs:181 and 182, respectively. The nucleotide 
sequence of the NOV48a variant differs as shown in Table 137. 



Table 137. cSNP and Coding Variants for NOV48a 


NT Position 
of cSNP 


Wild Type 
NT . 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


370 


A 


G 


1 


M->V 


436 


C 


T 


23 


L->F 


539 


A 


! G 


57 


D->G 


650 


A 


G 


94 


E->G 


1012 


c 


T 


215 


Q->End 


1922 


A 


G 


518 


K->R 


! 2057 


A 


G 


563 


Q->R 


2066 


C 


T 


566 


A->V 


2198 


C 


T 


610 


P->L 


2618 


A 


G 


750 


D->G 


2656 


G 


A 


N/A 


No change 



NOV50b SNP data: 

NOV50b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:189 and 190, respectively. The nucleotide 
15 sequence of the NOV50b variant differs as shown in Table 138. 



Table 138, cSNP and Coding Variants for NOV50b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


797 


A 


G 


265 


Q->R 
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NOV52 SNP data: 

NOV52 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:193 and 194, respectively. The nucleotide 
sequence of the NOV52 variant differs as shown in Table 139. 



Table 139- cSNP and Coding Variants for NOV52 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


318 


T 


c 


48 


No change 


351 


c 


T 


59 


R->C 


1961 


G 


T 


595 


M->I 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


70 


c 


T 


. 54 


0.056 



NOV56a SNP data: 

NOV56a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:203 and 204, respectively. The nucleotide 
sequence of the NOV56a variant differs as shown in Table 140. 



Table 140. cSNP and Coding Variants for NOV56a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


118 


A 


c 


39 


No change 


435 


c 


T 


135 


P->L 


439 


T 


C 


146 


No change 


473 


A 


G 


158 


T->A 


588 


T 


C 


196 


V->A 


596 


G 


A 


199 


G->R 


614 


A 


G 


205 


M->V 


631 


T 


C 


210 


No change 


637 


A 


G 


212 


No change 


642 


T 


C 


214 


M->T 


732 


G 


T 


244 


W->L 


902 


A 


T 


301 


M->L 



NOV57 SNP data: 

NOV57 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:207 and 208, respectively. The nucleotide 
sequence of the NOV57 variant differs as shown in Table 141 . 
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Table 141. cSNP and Coding Variants for NOV57 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


939 


T 


A 


N/A 


No change 



NOV58b SNP data: 

NOV58b has one SNP variant, whose variant position for its nucleotide and amino acid 
5 sequences is numbered according to SEQ ID NOs:21 1 and 212, respectively. The nucleotide 
sequence of the NOV58b variant differs as shown in Table 142. 



Table 142. cSNP and Coding Variants for NOV58b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


88 


T 


c 


11 


0.273 


377 


A 


G 


18 


0.111 


500 


T 


C 


18 


0.111 


509 


A 


G 


18 


0.111 


570 


T 


C 


17 


0.118 


647 


C 


T 


9 


0.222 



NOV60a SNP data: 

NOV60a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:215 and 216, respectively. The nucleotide 
sequence of the NOV60a variant differs as shown in Table 1 43 . 



Table 143. cSNP and Coding Variants for NOV60a 


NT Position 
of c'SNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


341 


T 


c 


52 




401 


G 


c 


79 




411 


T 


c 


79 




444 


c 


T 


79 




644 


C 


T 


59 




653 


T 


A 


59 




670 


T 


C 


59 




707 


T 


c 


33 





15 NOV60b SNP data: 
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NOV60b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:217 and 218, respectively. The nucleotide 
sequence of the NOV60b variant differs as shown in Table 144. 



Table 144. cSNP and Coding Variants for NOV60b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


162 


T 


G 


54 


0.259 


192 


A 


G 


54 


0.056 


229 


A 


G 


81 


0.025 


246 


- A 


T 


80 


0.025 


255 


G 


A 


79 


0.038 


263 


A 


G 


77 


0.039 


342 


G 


A 


103 


0.019 


389 


A 


G 


105 


0.019 - 



Example 4. In-frame Cloning 
NOV7c 

For NOV7c the cDNA coding for the DOMAIN of NOV7c from residues 1 to 230 was 
targeted for "in-frame" cloning by PCR. The PCR template was based on the previously 
10 identified plasmid, when available, or on human cDNA(s). 



Table 145. Oligonucleotide primers used to clone the target cDNA sequence: 



Primers 


Sequences 


Fl 


5 


- AGATCTCCCACC ATGGAACTTCAGGACCTGGAACTGC -3' (SEQ ID NO: 1382) 


Rl 


5 


- CTCGAG TCCACTTACAATTTCCCGTCTGATTTCC -3' (SEQ ID NO: 1385) 


SF1 


5' 


- TCCTCCTGGAGAAAGCTCAGAATCTGTTTT -3 ' (SEQ ID NO: 1387) 


SF2 


5« 


- CTCCAGATTTGGAAAGTTCTGAGGAA -3 • (SEQ ID NO: 1388) 


SRI 


5' 


- ATTTCTCCAAGTCCCAGGCCC -3» (SEQ ID NO: 1389) ! 


SR2 


5' 


- GAGCCTGTTCTAGAAGGAGCTGTTG -3-' (SEQ ID NO: 1390) j 



For downstream cloning purposes, the forward primer includes an in-frame Hind III 

restriction site and the reverse primer contains an in-frame Xho I restriction site. 

1 5 Two parallel PCR reactions were set up using a total of 0.5-1 .0 ng human pooled 

cDNAs as template for each reaction. The pool is composed of 5 micrograms of each of the 

following human tissue cDNAs: adrenal gland, whole brain, amygdala, cerebellum, thalamus, 

bone marrow, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, liver, lymphoma, 

Burkitt's Raji cell line, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

20 gland, skeletal muscle, small Intestine, spleen, stomach, thyroid, trachea, uterus. 

When the tissue of expression is known and available, the second PCR was performed 

using the above primers and 0.5ng~1.0 ng of one of the following human tissue cDNAs: 
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skeleton muscle, testis, mammary gland, adrenal gland, ovary, colon, normal 
cerebellum, normal adipose, normal skin, bone marrow, brain amygdala, brain hippocampus, 
brain substantia nigra, brain thalamus, thyroid, fetal lung, fetal liver, fetal brain, kidney, heart, 
spleen, uterus, pituitary gland, lymph node, salivary gland, small intestine, prostate, placenta, 
5 spinal cord, peripheral blood, trachea, stomach, pancreas, hypothalamus. 

The reaction mixtures contained 2 microliters of each of the primers (original 
concentration: 5 pmol/ul), 1 microliter of lOmM dNTP (Clontech Laboratories, Palo Alto CA) 
and 1 microliter of 50xAdvantage-HF 2 polymerase (Clontech Laboratories) in 50 microliter- 
reaction volume. The following reaction conditions were used: 
10 PCR condition 1: 

a) 96°C 3 minutes 

b) 96°C 30 seconds denaturation 

c) 60°C 30 seconds, primer annealing 

d) 72°C 6 minutes extension 

15 

Repeat steps b-d 15 times 

e) 96°C 15 seconds denaturation 

f) 60°C 30 seconds, primer annealing 

g) 72°C 6 minutes extension 

20 

Repeat steps e-g 29 times 

e) 72°C 10 minutes final extension 

PCR condition 2: 
25 a) 96°C 3 minutes 

b) 96°C 15 seconds denaturation 

c) 76°C 30 seconds, primer annealing, reducing the temperature by 1 °C per 
cycle 

d) 72°C 4 minutes extension 

30 

Repeat steps b-d 34 times 

e) 72°C 10 minutes final extension 



35 
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Example 5: SAGE Analysis 

Hs.181638 : ESTs, Weakly similar to SSR1_HUMAN SOMATOSTATIN RECEPTOR TYPE 1 [H. 

SAGE library data and reliable tag summary: 
Reliable tags found in SAGE libraries: 

TTGTCGATAT 



i t > ^- • ~, * ' ^. 3U •* 




: ^ ecu 












p otaltags 


SAGE Chen LNCaP 


16 




1 


62267 


SAGE Chen Normal Pr 


30 


**** 


2 


66193 


SAGE Chen Tumor Pr 


14 




1 


68384 


SAGECAPAN1 


26 


■mm 


1 


37926 


SAGE Panel 


80 






24879 


SAGE Duke H54 EGFRvIII 


34 






57164 


SAGE CPDR LNCaP-T 


22 


^> 




44122 


SAGE 293-IND 


40 






24481 


SAGE PR317 normal prostate 


16 






59419 


SAGE PR317 prostate tumor 


15 






65109 


SAGE BB542 whitematter 


10 






94806 


SAGE Pane 96-6252 


27 






35745 


SAGE SciencePark MCF7 
Control Oh 


16 






61079 


SAGE SciencePark MCF7 
estradiol lOh 


16 






60435 


SAGE Duke H566 


15 






65728 


SAGEOVT-6 


23 






42336 


SAGE mammary epithelium 


20 






49167 


SAGE ML10-1Q 


35 




2 


56943 


SAGE DukeH1043 


13 




1 


76673 
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OTHER EMBODIMENTS 

Although particular embodiments have been disclosed herein in detail, this has been 
done by way of example for purposes of illustration only, and is not intended to be limiting 
with respect to the scope of the appended claims, which follow. In particular, it is 
5 contemplated by the inventors that various substitutions, alterations, and modifications may be 
made to the invention without departing from the spirit and scope of the invention as defined 
by the claims. The choice of nucleic acid starting material, clone of interest, or library type is 
believed to be a matter of routine for a person of ordinary skill in the art with knowledge of the 
embodiments described herein. Other aspects, advantages, and modifications considered to be 
10 within the scope of the following claims. 
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WHAT IS CLAIMED IS: 

1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2n, wherein n is an integer between 1 and 162; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2n, wherein n is an integer between 1 and 162, 
wherein one or more amino acid residues in said variant differs from the amino 
acid sequence of said mature form, provided that said variant differs in no more 
than 15% of the amino acid residues from the amino acid sequence of said mature 
form; . 

(c) an amino acid sequence selected from the group consisting SEQ ID NOS:2n, 
wherein n is an integer between 1 and 162; and 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162, wherein one or more amino 
acid residues in said variant differs from the amino acid sequence of said mature 
form, provided that said variant differs in no more than 15% of amino acid 
residues from said amino acid sequence. 

2 The polypeptide of claim 1, wherein said polypeptide comprises the amino acid sequence 
of a naturally-occurring allelic variant of an amino acid sequence selected from the group 
consisting SEQ ID NOS:2n, wherein n is an integer between 1 and 162. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino acid 
sequence that is the translation of a nucleic acid sequence differing by a single nucleotide 
from a nucleic acid sequence selected from the group consisting of SEQ ID NOS:2n-l, 
wherein n is an integer between 1 and 162. 

4. The polypeptide of claim 1, wherein the amino acid sequence of said variant comprises a 
conservative amino acid substitution. 
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5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding a 
polypeptide comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2n, wherein n is an integer between 1 and 162; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2n, wherein n is an integer between 1 and 162, 
wherein one or more amino acid residues in said variant differs from the amino 
acid sequence of said mature form, provided that said variant differs in no more 
than 15% of the amino acid residues from the amino acid sequence of said mature 
form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2n, 
wherein n is an integer between 1 and 162; 

(d) a variant of an amino acid sequence selected from the group consisting SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162, wherein one or more amino 
acid residues in said variant differs from the amino acid sequence of said mature 
form, provided that said variant differs in no more than 15% of amino acid 
residues from said amino acid sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide comprising an 
amino acid sequence chosen from the group consisting of SEQ ID NOS:2n, 
wherein n is an integer between 1 and 162, or a variant of said polypeptide, 
wherein one or more amino acid residues in said variant differs from the amino 
acid sequence of said mature form, provided that said variant differs in no more 
than 15% of amino acid residues from said amino acid sequence; and 

(f) a nucleic acid molecule comprising the complement of (a), (b), (c), (d) or (e). 

6. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises the 
nucleotide sequence of a naturally-occurring allelic nucleic acid variant. 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule encodes a 
polypeptide comprising the amino acid sequence of a naturally-occurring polypeptide 
variant. 
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8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
ID NOS:2n-l, wherein n is an integer between 1 and 162. 

9. The nucleic acid molecule of claim 5 3 wherein said nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID NOS:2n- 1 , 
wherein n is an integer between 1 and 162; 

(b) a nucleotide sequence differing by one or more nucleotides from a nucleotide 
sequence selected from the group consisting of SEQ ID NOS:2n-l, wherein n is 
an integer between 1 and 162, provided that no more than 20% of the nucleotides 
differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b). 

1 0. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule hybridizes 
under stringent conditions to a nucleotide sequence chosen from the group consisting 
SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or a complement of said 
nucleotide sequence. 

1 1 . The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence comprising a coding sequence differing by one or 
more nucleotide sequences from a coding sequence encoding said amino acid 
sequence, provided that no more than 20% of the nucleotides in the coding 
sequence in said first nucleotide sequence differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first polynucleotide; . 
and 

(c) a nucleic acid fragment of (a) or (b). 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13. The vector of claim 12, further comprising a promoter operably-linked to said nucleic 
acid molecule. 
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14. A cell comprising the vector of claim 12. 

15. An antibody that binds immunospecifically to the polypeptide of claim 1 . 

1 6. The antibody of claim 1 5, wherein said antibody is a monoclonal antibody. 

17. The antibody of claim 15, wherein the antibody is a humanized antibody. 

1 8. A method for determining the presence or amount of the polypeptide of claim 1 in a 
sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

19. A method for determining the presence or amount of the nucleic acid molecule of claim 5 
in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid molecule; and 

(c) determining the presence or amount of the probe bound to said nucleic acid 
molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said sample. 

20. The method of claim 19 wherein presence or amount of the nucleic acid molecule is used 
as a marker for cell or tissue type. 

21. The method of claim 20 wherein the cell or tissue type is cancerous. 

22. A method of identifying an agent that binds to a polypeptide of claim 1, the method 
comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 
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23. The method of claim 22 wherein the agent is a cellular receptor or a downstream effector. 

24. A method for identifying an agent that modulates the expression or activity of the 
polypeptide of claim 1, the method comprising: 

(a) providing a cell expressing said polypeptide; 

(b) contacting the cell with said agent, and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent modulates 
expression or activity of said polypeptide. 

25. A method for modulating the activity of the polypeptide of claim 1, the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a 
compound that binds to said polypeptide in an amount sufficient to modulate the activity 
of the polypeptide. 

26. A method of treating or preventing a NOVX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the 
polypeptide of claim 1 in an amount sufficient to treat or prevent said NOVX-associated 
disorder in said subject. 

27. The method of claim 26 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

28. The method of claim 26 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

29. The method of claim 26, wherein said subject is a human. 

30. A method of treating or preventing a NOVX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the nucleic 
acid of claim 5 in an amount sufficient to treat or prevent said NOVX-associated disorder 
in said subject. 
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3 1 . The method of claim 30 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

32. The method of claim 30 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

33. The method of claim 30, wherein said subject is a human. 

34. A method of treating or preventing a NOVX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the antibody 
of claim 15 in an amount sufficient to treat or prevent said NOVX-associated disorder in 
said subject. 

35. The method of claim 34 wherein the disorder is diabetes. 

36. The method of claim 34 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

37. The method of claim 34, wherein the subject is a human. 

38. A pharmaceutical composition comprising the polypeptide of claim 1 and a 
pharmaceutically-acceptable carrier. 

39. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 and a 
pharmaceutically-acceptable carrier. 

40. A pharmaceutical composition comprising the antibody of claim 15 and a 
pharmaceutically-acceptable carrier. 

41. A kit comprising in one or more containers, the pharmaceutical composition of claim 38. 

42. A kit comprising in one or more containers, the pharmaceutical composition of claim 39. 
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43. A kit comprising in one or more containers, the pharmaceutical composition of claim 40. 

44. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the polypeptide of claim 1 in a first mammalian subject, the method 
comprising: 

(a) measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to the amount 
of the polypeptide present in a control sample from a second mammalian subject 
known not to have, or not to be predisposed to, said disease; 

wherein an alteration in the expression level of the polypeptide in the first subject as compared to 
the control sample indicates the presence of or predisposition to said disease. 

45. The method of claim 44 wherein the predisposition is to a cancer. 

46. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the nucleic acid molecule of claim 5 in a first mammalian subject, the 
method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first mammalian 
subject; and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to the amount 
of the nucleic acid present in a control sample from a second mammalian subject 
known not to have or not be predisposed to, the disease; 

wherein an alteration in the level of the nucleic acid in the first subject as compared to the 
control sample indicates the presence of or predisposition to the disease. 

47. The method of claim 46 wherein the predisposition is to a cancer. 

48. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to alleviate the 
pathological state, wherein the polypeptide is a polypeptide having an amino acid 
sequence at least 95% identical to a polypeptide comprising an amino acid sequence of at 
least one of SEQ ID NOS:2n, wherein n is an integer between 1 and 162, or a 
biologically active fragment thereof. 
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49. A method of treating a pathological state in a mammal, the method comprising 

administering to the mammal the antibody of claim 15 in an amount sufficient to alleviate 
the pathological state. 
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